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Second English Edition 


Foreword to the 
First English Edition 


The Mathematical Society of Japan takes 
pleasure in presenting this second edition of 
our Encyclopedic Dictionary of Mathematics 
to every researcher and user of mathematics. 
It is intended to be a compact, up-to-date 
source of information comprising, as com- 
pletely as possible, all significant results in all 
fields of our Science, pure and applied, from 
the elementary to the advanced level. The 
success of the first editton owed much to the 
kind assistance given by the American Mathe- 
matical Society. As described in the preface, 
the members of our Society have taken re- 
sponsibility for compiling this new edition. 
We hope that it will be as useful to the mathe- 
maticians of today as the first edition was 

to the mathematicians of yesterday. We also 
hope that this edition will be followed in 
years to come by subsequent ones incorporat- 
ing the future development of our Science. 


Hikosaburo Komatsu 
President 1985-1987 
Mathematical Society of Japan 


The American Mathematical Society welcomes 
the publication of the Encyclopedic Dictionary 
of Mathematics. For many years we have been 
fascinated by the publication in Japanese, 
Iwanami Sugaku Ziten, because we saw that 
this was an encyclopedia that contained effec- 
tive and penetrating information about all 

the fields of advanced mathematical research. 
We were also frustrated because we could not 
read Japanese and so we could not really 
reach out to this expert and effective source 

of information. We now welcome the fact that 
the second Japanese edition has been trans- 
lated into English and we look forward to the 
fascination which we can now have in getting 
at this rich mine of information. 


Saunders MacLane 
President 1973-1974 
American Mathematical Society 


Preface to the 
Second English Edition 


This second edition of the Encyclopedic Dic- 
tionary of Mathematics is in substance an 
English version of the third edition of Iwanami 
Sugaku Ziten (in Japanese). We shall explain 
how these two versions are related to each 
other and how the present edition differs from 
the first English edition. For the sake of sim- 
plicity we abbreviate Encyclopedic Dictionary 
of Mathematics and Iwanami Sugaku Ziten to 
EDM and ISZ, respectively, and indicate the 
numbers of editions beyond the first by Arabic 
numerals. 

The prefaces of the previous editions will 
clarify how ISZ, its augumented edition, ISZ2, 
ISZ3, and EDM came into existence in 1954, 
1960, 1968, 1985, and 1977, respectively. EDM, 
published nine years later than ISZ2, consisted 
of its English translation and some new mate- 
rials added to update its contents. In retro- 
spect, it was felt that this procedure was not 
adequate to cope with the recent rapid pro- 
gress of mathematics, and a suggestion was 
raised in the Mathematical Society of Japan 
that ISZ3 and EDM2 be produced simulta- 
neously. The favorable reaction of the mathe- 
matical public to EDM encouraged us greatly. 

In 1978 an agreement was made among the 
Society, Iwanami Shoten, Publishers, and The 
MIT Press for the publication of ISZ3 and 
EDMz2. An editorial committee was estab- 
lished in the Society with the members named 
in the preface to ISZ3. Manuscripts were pre- 
pared simultaneously in Japanese and English 
by each contributor, with Yuji Ito acting as 
linguistic consultant. 

I should mention that we benefited greatly 
from the kind comments on EDM by the 
following mathematicians: J. F. Adams, M. 
Atiyah, A. Borel, H. Cartan, K. Chandrase- 
kharan, S. S. Chern, J. Dieudonné, E. Hewitt, 
F. Hirzebruch, O. Lehto, J. L. Lions, L. Markus, 
and J. P. Serre. In particular, we are deeply 
grateful to J. Dieudonné for his many detailed 
comments. 

Compared with ISZ3 and EDM, EDM2 has 
the following characteristics: 

(1) EDM2 contains many more new mathe- 
matical results than EDM. For the details we 
refer the reader to the main points of revision 
mentioned in the preface to ISZ3, because 
EDM2 has the same mathematical content 
as ISZ3 and the additions made to ISZ2 and 
subsequently in EDM are of relatively minor 
importance. 

(2) The Japanese textbooks listed in the 
article references of ISZ3 are replaced in 
EDM2 by references to standard textbooks 


written in English. Such replacement was not 
done in compiling EDM. 

(3) Years of birth and death are included in 
the Name Index of EDM2 as far as possible. 
This information was given in ISZ2 but was 
removed in compiling EDM. 

(4) The Subject Index of EDM2 is so de- 
signed that every concept consisting of two or 
more words can be traced from each of these 
component words. (This principle was adopted 
in ISZ2 and EDM but was not rigidly fol- 
lowed in ISZ3 due to the lack of space.) 

(5) While all editions of ISZ were in one 
volume and EDM was in two volumes, EDM2 
is in four volumes: I. Forewords, prefaces, 
introduction, text A-E; II. Text F—N; III. 
Text O-Z; IV. Appendices, indexes and other 
backmatter. The Systematic List of Articles 
appears in volume IV, page 1857. 

While EDM2 is more voluminous than 
ISZ3, we hope that, being written in English 
and printed with generous margins, it can 
easily provide readers with information on 
every significant result of today’s mathematics 
and so will be useful to a large number of 
mathematicians. I am responsible for any 
shortcomings that may exist despite all our 
efforts, and I would appreciate any remarks 
from the readers. I hope to have occasion to 
remedy any such shortcomings in the future. 

Finally I would like to repeat my thanks to 
all the collaborators named in my preface to 
ISZ3. Also I express my gratitude to Yuji Ito 
for his tremendous effort to polish the English 
of all manuscripts, to Shigeru Iitaka for his 
laborious assistance in the final stage of the 
work, to Mrs. M. Nawata for her excellent 
secretarial work, and to the staff of the De- 
partment of Mathematics at Gakushuin Uni- 
versity for providing me with an office for 
compilation for eight years. 


Kiyosi Ité 
December 1986 


Preface to the 
Third Japanese Edition 


The first edition of Iwanami Stgaku Ziten was 
published in 1954, and a revised and aug- 
mented edition appeared in 1960. Extensive 
revision of the work was carried out sub- 
sequently, and the result was published in 1968 
as the second edition, which has retained its 
valuable and useful life for the past 17 years. 
The English translation of this second edition 
published by The MIT Press has achieved 
international recognition. In the meantime, 
remarkable progress has been made in mathe- 
matical sciences: recognition of the interrela- 
tionships amongst various branches within 
mathematics has increased significantly, and 
the formulation of mathematics as a synthetic 
entity is in the making. Furthermore, advanced 
mathematical theories have been utilized fre- 
quently in the physical, biological, and social 
sciences, and expectations for mathematics 

to be the basis of all the sciences have been 
increasing. In order to cope with these devel- 
opments, it was decided that the second edition 
of Sugaku Ziten should be further revised and 
an updated version be published as the third 
edition. The English translation of this third 
edition will also be published by The MIT 
Press. 

The main points of the revision are as 
follows: 

(1) On the Size and Scope of the Encyclo- 
pedia: The total number of articles has in- 
creased only slightly from that of the second 
edition, from 436 to 450. However, in view 
of the rapid and extensive development in 
mathematics in recent years, many old articles 
have been reorganized and unified, and much 
new material has been added. Consequently, 
the mathematical content of the encyclopedia 
has increased a great deal over that of the 
second edition, and this, together with the 
authors’ efforts to render the articles more 
readable than before, necessitated a 50% in- 
crease in the number of pages devoted to the 
main text. The account of computer science 
has been taken out of the subject area of 
Numerical Analysis, and a new subject area 
has been created for it; thus mathematics is 
now Classified into 21 subject areas altogether 
instead of 20. 

(2) On the Arrangement of the Articles: In 
the second edition, the title of each article 
was spelled out in Romanized Japanese, and 
articles were arranged alphabetically. In the 
third edition, however, the titles of the articles 
are arranged in the order of the Japanese syl- 
labary (katakana) and each article is numbered 


accordingly. To each article a parenthetical 
number ts also attached, giving the subject 
area to which the article belongs and its place 
in that list. 

(3) On the Text: Sections within each arti- 
cle are headed by letters A, B, C,...; the ini- 
tial section of each article gives an outline 
and is devoted to an introductory and general 
account of the topic with which the article 
is concerned. In the actual exposition of 
the articles, every effort has been made to 
pay close attention to the interrelationships 
amongst various fields of mathematics. Even 
when there was no need to revise the original 
article, changes were made in the bibliography 
to list items that would be more accessible to 
the readers. 

(4) On the Appendices: The appendices have 
been revised and augmented along with the 
text. Numerical tables were deleted when- 


_ever it was felt that scientific calculators or 


microcomputers could easily reproduce their 
contents. 

(5) On the Indexes: The listing of mathe- 
matical terms in Japanese has been changed, 
as in the case of the arrangement of articles, 
from the alphabetical order to the order of 
the Japanese syllabary. However, terms in 
Japanese that start with a European word 
have been separated out, and, in order to 
facilitate finding, have been listed indepen- 
dently in an Index for Mathematical Terms in 
Japanese with European Headings. The loca- 
tion of a term listed in the indexes is indicated 
by the article number and the letter heading 
of the section in the article where the term 
appears. 


We here outline briefly how the process of 
compilation of this edition was organized. In 
the summer of 1978 the Mathematical Society 
of Japan decided, following a proposal by 
Professors Shékichi Iyanaga and Yukiyosi 
Kawada, the chief editors of the previous 
editions of Sugaku Ziten, to undertake the 
compilation of the third edition. Following 
a resolution by the members of the Soci- 
ety, Kunthiko Kodaira, Sigeru Mizohata, 
Itiro Tamura, Nagayoshi Iwahori, Tosihusa 
Kimura, and myself have joined Iyanaga and 
Kawada to form an editorial committee and 
have formulated basic guidelines for the com- 
pilation of the third edition. The work of 
editing and compiling articles in each subject 
area was delegated to the editors designated 
for the areas, and I was asked to undertake the 
responsibility of putting the entire volume in 
order. 

The subject areas and their editors are as 
follows: 


xi 


I Logic and Foundations: Shéji Maehara 


IT Sets, General Topology, and Cate- 
gories: Shôji Maehara, Hikosaburo 
Komatsu, Masayoshi Nagata 


Ili Algebra: Masayoshi Nagata 
IV Group Theory: Nagayoshi Iwahorti 
V Number Theory: Takayoshi Mitsui 


VI Euclidean and Projective Geometry: 
Itiro Tamura 


VII Differential Geometry: Morio Obata, 
Shigeru Iitaka, Itiro Tamura 


VIII Algebraic Geometry: Shigeru litaka 
IX Topology: Itiro Tamura 
X Analysis: Sigeru Mizohata, Kiyosi Itô 


XI Complex Analysis: Yukio Kusunoki, 
Shigeru litaka 


XII Functional Analysis: Hikosaburo 
Komatsu 


XIIE Differential, Integral, and Functional 
Equations: Tosthusa Kimura, Sigeru 
Mizohata 

XIV Special Functions: Sin Hitotumatu 

XV — Numerical Analysts: Masaya Yamaguti 


XVI Computer Science and Combinatorics: 
Sin Hitotumatu 


Probability Theory: Kiyosi Itô 
XVIII Statistics: Ket Takeuchi 


XIX Mathematical Programming and 
Operations Research: Shigeru Furuya 


XX Mechanics and Theoretical Physics: 
Huzihiro Araki 


XXI History of Mathematics: Shokichi 
lyanaga 
Appendices: Sin Hitotumatu 

Before the actual process of compilation 
started, the editorial committee had met sev- 
eral times, and two meetings of all the subject- 
area editors had been called. Furthermore, 
editors for each area consulted with other 
experts in their fields as they selected the 
titles of articles to be included. In areas where 
remarkable development had taken place, such 
as differential geometry, functional analysis, 
probability theory, and applied mathemat- 
ics, extensive revision and insertion of many 
new articles were proposed by the respective 
editors. For this reason, the original articles 
were thoroughly reorganized and systematized. 
We resolved to keep the whole work in one 
volume, even though a number of new articles 
have had to be added. 

The selection of articles and of their re- 
spective authors was finally completed in the 
spring of 1980, and 197 colleagues were asked 
to write the articles. The names of these con- 
tributors and those of the previous editions are 
listed elsewhere in this volume. To all of the 
authors we express our sincere gratitude. 

Editing of the entire manuscript has been 
carried out since the summer of 1982, and we 
are deeply indebted to the following colleagues 
for their painstaking efforts in checking cross 
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references and tn proofreading: Nobuyosi Moto- 
hashi (Foundations, Set Theory), Takeo Yoko- 
numa (Algebra, Group Theory), Takayoshi 
Mitsui (Number Theory), Tetsuro Kawasaki 
(Geometry, Topology), Shigeru litaka, Isao 
Wakabayashi (Algebraic Geometry), Morio 
Obata, Koichi Ogiue, Osamu Kobayashi (Dif- 
ferential Geometry), Seiz6 Ito, Hisashi Oka- 
moto (Analysis), Hiroaki Aikawa, Makoto 
Ohtsuka (Computer Analysis), Hikosaburo 
Komatsu, Akihiko Miyachi (Functional 
Analysis), Kazuo Okamoto, Datsuke Fujiwara 
(Differential Equations), Sin Hitotumatu 
(Special Functions), Teruo Ushiyima (Nu- 
merical Analysis), Hideo Wada (Computer 
Science), Yasunori Okabe (Probability Theory), 
Mituaki Huzii, Yoshihiro Yajima (Statistics), 
Shigeru Furuya (Mathematical Programming), 
Koichi Nakamura (Theoretical Physics), 
Shichi Okamoto (History of Mathematics), 
Kosaku Okutsu (History of Mathematics, 
Number Theory). 

On those occastons when it became neces- 
sary to rapidly revise manuscripts in order to 
unify the presentation, we were forced to go 
through the revision without consulting the 
authors of the manuscripts. I am responsible 
for all such revisions and hereby express my 
apologies to the authors concerned. 

As for the indexes, we received assistance 
from Takeo Yokonuma, Koichi Yano, Hiroaki 
Aikawa, and Hisashi Okamoto in all aspects 
of the work, which lasted for two years, 
through the final proofreading. Preliminary 
work on the Name Index was carried out by 
staff members of the University of Tokyo, and 
Seizé Ito supervised tts final compilation. We 
also asked many other colleagues, and in par- 
ticular Nobuyuki Ikeda and Huzihiro Araki, 
to participate in the preparation of the Name 
Index, using resources available at different 
universities. To all of these people goes our 
sincere gratitude for their assistance. 

Last August, when the completion of the 
work was drawing near, it became necessary 
for me to leave Japan for some time, and this 
made it imperative that I entrust Shigeru 
litaka with responsibility for supervising the 
work at the final stage of editing. I am most 
grateful to Iitaka for having agreed to take 
over this responsibility and for having brought 
the work to completion under the pressure of 
time. 

From the beginning of this project, we have 
received an unlimited amount of assistance 
from the members of the Dictionary Depart- 
ment of Iwanami Shoten, Publishers. Messrs. 
Ikutaro Sasaki, Hirotomo Ushida, Kazuhiko 
Uetake, and Nagao Sato in particular, have 
made supreme efforts and have come up fre- 
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quently with innovative ideas to make this 
dictionary polished and perfect. To them and 
also to those at Dai-Nippon Printing Co., who 
typeset with the use of computers the entire 
text, including the complicated mathematical 
formulas, and who have cooperated with us 
fully in the process of editing the indexes, goes 
our deep gratitude. 


Kiyost Itô 
October 1985 
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Preface to the 
First English Edition 


The first and second editions of Iwanami 
Sugaku Ziten (in Japanese) were published, 
respectively, in April 1954 and June 1968 by 
Iwanami Shoten, Publishers, Tokyo. Begin- 
ning in the late 1950s, a number of unsuccess- 
ful attempts were made to arrange for translat- 
ing the Sūgaku Ziten into European languages. 
Finally an agreement for an English transla- 
tion was made between The MIT Press and 
the Mathematical Society of Japan in July 
1968. The discussions were carried on first 

by Professor Kôsaku Yosida, then presi- 

dent of the Mathematical Society of Japan, 
and later by Yukiyosi Kawada, who suc- 
ceeded him in April 1968. Throughout these 
initial negotiations, which lasted from 1966 to 
1968, we received the kindest assistance 

from Dr. Gordon Walker, Executive Director 
of the American Mathematical Society, 

and from Professors W. T. Martin and Shizuo 
Kakutani. 

The agreement for the project was shortly 
followed by the establishment of a commit- 
tee for the English edition of Sugaku Ziten 
within the Mathematical Society of Japan, 
with the following membership: Professors 
Yasuo Akizuki, Shigeru Furuya, Sin Hito- 
tumatu, Masuo Hukuhara, Isao Imai, Sh6- 
kichi lyanaga, Yukiyosi Kawada, Kunihiko 
Kodaira, Atuo Komatu, Hirokichi Kudo, 
Shdji Maehara, Yukio Mimura, Kiyoshi 
Noshiro, Shigeo Sasaki, Shoji Ura, Nobuo 
Yoneda, and Késaku Yosida. This committee 
requested the original authors of the articles 
and other members of the Society to translate 
the work. A list of translators will be found at 
the end of this work. 

In November 1968, an advisory committee 
for the project was formed with the following 
membership: Professor Edwin Hewitt (chair- 
man), Dr. Sydney H. Gould, Professor Shizuo 
Kakutant, Professor Kenneth O. May, and 
Professor Isaac Namioka. 

As the translating began, we were immedi- 
ately faced with problems concerning unifica- 
tion of terminology and style, some of which 
were inherent in the differences between the 
structures of our two languages—for example, 
the fact that the Japanese language makes no 
distinction between singular and plural forms 
of nouns. 

In August 1969, Professor Hewitt kindly 
arranged a meeting at the University of Wash- 
ington, Seattle, that included the members 
of the Japanese and American committees 
and a representative of The MIT Press. It was 
agreed during this meeting that the transla- 
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tion should be faithful, with only a minimum 
number of changes, such as the correction of 
mathematical errors; whereas the references 
to each article might be augmented consider- 
ably for the conventence of Western readers. 
Professor Kenneth O. May volunteered to 
review the entire translated manuscript, and 
Professors Isaac Namioka and Shizuo Kaku- 
tani, who are proficient in both Japanese and 
English, proposed to read through some of the 
manuscript of the translated articles. It was 
also agreed that the Systematic List of Arti- 
cles should appear in French, German, and 
Russian, as well as in English. 

We owe very much to the American com- 
mittee: Professor Hewitt organized the whole 
work, and Professor May revised the entire 
manuscript and gave us important advice con- 
cerning the appendices, according to which we 
deleted some of the numerical tables which 
may be easily found in readily accessible West- 
ern books. Professor Namioka reviewed a 
great part of the manuscript, transmitting his 
views to Professor May, who forwarded them 
to us with his comments. All of this assistance 
helped us greatly in making our final decisions. 
Professor Kakutani gave us very detailed and 
important advice on the choice of reference 
works. 

We were also assisted concerning English 
terminology and reference books by the fol- 
lowing Japanese mathematicians working in 
American universities: Tadatoshi Akiba, Pro- 
fessors Kiyosi It6, Tatsuji Kambayashi, Tosio 
Kato, Teruhisa Matsusaka, Katsumi Nomizu, 
Ichiro Satake, Michio Suzuki, and Gaisi 
Takeutt. 

The Mathematical Society of Japan estab- 
lished the following double reviewing system: 
group A with its twenty subgroups, each 
headed by the members listed below, reviewed 
their respective subjects; while group B re- 
viewed the whole manuscript, mainly from the 
linguistic standpoint. 

Group A 

1. Foundations of mathematics: Shôji 
Maehara 

2. Set theory: Atuo Komatu and Shôji 
Maehara 

3. Algebra: Akira Hattori, Masayoshi Nagata, 
and Hideyuki Matsumura 

4. Group theory: Shingo Murakami, Mitsuo 
Sugiura, and Reiji Takahashi 

5. Number theory: Yukiyosi Kawada and 
Tomio Kubota 

6. Geometry, 7. Differential geometry: Sh6- 
kichi lyanaga, Shigeo Sasaki, and Kentaro 
Yano 

8. Algebraic geometry: Yasuo Akizuki and 
Kunthiko Kodaira 

9. Topology: Atuo Komatu 


10. Real analysis: Sin Hitotumatu, Shunji 
Kametant, and Shigeki Yano 
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11. Complex analysis: Kiyoshi Noshiro 


12. Functional analysis: Yukio Mimura and 
Kosaku Yosida 


13. Differential equations: Masuo Hukuhara 
and Sigeru Mizohata 


14. Special functions: Sin Hitotumatu 
15. Numerical analysis, 16. Probability 
theory: Kiyosi Itô 

17. Statistics: Hirokichi Kudo 


18. Information theory: Tosio Kitagawa and 
Hirofumi Uzawa 


19. Theoretical physics: Isao Imai and 
Kazuhiko Nishijima 


20. History of mathematics: Tamotsu Murata 
Group B 
Kenichi Iyanaga and Mitsuyo Iyanaga 

Professor Sin Hitotumatu also assisted us in 
translating the titles of Japanese books given 
in the references and the explanations attached 
to the lists of formulas and numerical tables in 
the appendices. We are also grateful for the 
generous cooperation offered to us by our 
colleagues in the Department of Mathemat- 
ics, Faculty of Science, University of Tokyo: 
Hiroshi Fujita, Shigeru Furuya, Akio Hat- 
tori, Seiz6 It6, Nagayoshi Iwahori, Tosihusa 
Kimura, Kunihiko Kodaira, Hikosaburo 
Komatsu, Akihiro Nozaki, and Itiro Tamura. 
We are indebted as well to Professors Walter 
L. Bailey of the University of Chicago and 
Yuji Ito of Brown University for many valu- 
able consultations concerning both mathe- 
matical and linguistic questions. 

In translating the Systematic List of Articles 
into French, German, and Russian, we were 
assisted by Professor Hideya Matsumoto in 
Paris, Professor Emanuel Sperner in Hamburg, 
Professor Katsuhiro Chiba in Tokyo, and 
Professor Arkadti Maltsev in Moscow. 

We began to send the manuscript to The 
MIT Press in March 1970 and finished send- 
ing it in July 1972. The manuscript was edited 
there, then sent to Professors May, Namioka, 
and Kakutant, and finally was sent back to us 
with their comments and questions. All of the 
references were carefully checked by Laura 
Platt. 

Iwanami Shoten, Publishers, have always 
been cooperative with us. In the office of 
the Mathematical Society of Japan, Yoko 
Endo, Reiko Nagase, and Chieko Sagawa 
helped us with their efficient secretarial 
work. 

The fruition of this project was made pos- 
sible only by the gracious assistance offered to 
us by many people, including those already 
mentioned. We should like to express our 
most sincere gratitude to all those who have 
helped us so kindly. 


Shékichi Iyanaga, Yukiyosi Kawada 
August 1973 
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Addition made in January 1976 


After the procedures described above, the 
whole manuscript of this Encyclopedia was 
sent to The MIT Press in August 1973. It was, 
however, toward the end of November 1975 
that the final decision was made by The MIT 
Press to send the manuscript to composition 
in early 1976 in order that the work be pub- 
lished in 1977. 

At the same time, we were asked to review 
and update the manuscript up to the end of 
February 1976. We are now making our best 
effort to this effect with the kind help, especially 
from the linguistic viewpoint, of Dr. E. J. 
Brody. 

In so doing, we have noticed that perhaps 
too much emphasis has been given to results 
obtained by Japanese mathematicians and that 
there are still many things in this book which 
should be improved. 

We hope that ongoing revisions will be 
carried out in subsequent editions. 


xiv 


Preface to the 
Second Japanese Edition 


Seven and a half years have passed since the 


-revised and augmented edition of Iwanami 


Sugaku Ziten was published. The nature and 
purpose of this book remain the same as de- 
scribed in the preface to the first edition: It is 
an encyclopedic dictionary with articles of 
medium length aimed at presenting the whole 
of mathematics in a lucid system, giving exact 
definitions of important terms in both pure 
and applied mathematics, and describing the 
present state of research in each field, together 
with historical background and some per- 
spectives for the future. However, mathemat- 
ical science is in rapid motion, and the “pre- 
sent state of research” changes constantly. 
The present updated second edition has been 
published to remedy this situation as far as 
possible. 

The main points of revision are as follows: 

(1) On the Articles and the Size of the Ency- 
clopedia: From the articles of the last edition, 
we have removed those whose importance has 
diminished recently (e.g., Geometry of Tri- 
angles), while we have added new articles in 
domains of growing importance (e.g., Cate- 
gories and Functors; K-Theory). Many articles 
concerning applied mathematics in the first 
edition were short; in this edition, they have 
been combined into articles of medium length 
to save space and to systematize the presenta- 
tion. The number of articles, 593 in the first 
edition, has thus diminished to 436. We have 
made every effort to keep the size of the ency- 
clopedia as it was, but the substantial aug- 
mentation of content has necessarily brought 
about an enlargement of about 30%. 

(2) On the Text: When the title of an article 
has remained the same as in the first edition, 
we have reviewed the whole text and revised 
whenever necessary. Especially for the fun- 
damental ideas, we have endeavored to give 
thorough explanations. In the first edition we 
gave English, French, and German transla- 
tions of article titles; in the present edition, we 
have also added Russian. The bibliographies 
at the ends of articles have been updated. 

(3) On Terminology: In previous editions we 
endeavored to unify the terminology of the 
whole encyclopedia so that the reader would 
have no difficulties with cross-references. Here 
we have done this once again with the hope of 
attaining results more perfect than before. 

(4) On the Appendices: The appendices were 
designed to supplement the text efficiently. 
Some overlapping of the appendices with the 
text found in previous editions has been re- 
moved. Also deleted in this edition are elemen- 
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tary formulas in analytic geometry and tables 
of statistical distributions, which can be easily 
found in other books. However, we have 
added some formulas in topology, the theory 
of probability, and statistics, as well as tables 
of characters of finite groups, ete. 

(5) On the Indexes: Important terms are 
listed multiply in the indexes to facilitate find- 
ing, e.g., the term transcendental singularity 
appears under both transcendental and singu- 
larity. Both names in the text and those in the 
references are included in the Name Index of 
this edition. The numbers of items in the Index 
of Mathematical Terms in Japanese and in 
European Languages and tn the Name Index in 
this edition are 17740, 10124, and 2438, com- 
pared with 8254, 8070, and 1279, respectively, in 
the previous edition. 


The compilation of this edition was organized 
as follows. In the spring of 1964 we began to 
select the titles of articles with the aid of the 
following colleagues: in set theory and foun- 
dations of mathematics, Shôji Maehara; in 
algebra and number theory, Yasuo Akizuki, 
Yukiyosi Kawada; in differential geometry, 
theory of Lie groups, and topology, Yozo 
Matsushima, Atuo Komatu; tn analysis, 
Masuo Hukuhara, Késaku Yosida, Shunji 
Kametani, Sin Hitotumatu; in probability 
theory, statistics, and mathematics for pro- 
gramming, Kiyosi It6, Hirokicht Kudo, and 
Shigeru Furuya; in theoretical physics, Isao 
Imai; and for the appendices, Sin Hitotumatu. 
I have participated in compiling the articles 
on geometry and the history of mathematics. 
Kawada and Hitotumatu undertook the re- 
sponsibility of putting the volume in order. 

The work of selecting titles was completed 
in the summer of 1964. We then asked 173 
colleagues to contribute articles. The names of 
these contributors and those of the previous 
editions are listed elsewhere. To all of them 
goes our most sincere gratitude. 

In editing the manuscript, we were assisted 
by the following colleagues in addition to 
those mentioned already: in set theory and 
foundations of mathematics, Setsuya Seki and 
Tsurane Iwamura; in algebra and number 
theory, Masayosi Nagata, Akira Hattori, 
Hideyuki Matsumura, Ichiro Satake, Tikao 
Tatuzawa; in geometry, theory of Lie groups, 
and topology, Singo Murakami, Hideki Ozeki, 
Noboru Tanaka, Kiiti Morita, Hirosi Toda, 
Minoru Nakaoka, Masahiro Sugawara, Shôrô 
Araki; in analysis, Kiyoshi Noshiro, Yûsaku 
Komatu, Seizo Ito, Hiroshi Fujita, Shige- 
Toshi Kuroda, Sigeru Mizohata, Masaya 
Yamaguti, Tosiya Saito, Tosihusa Kimura, 
Masahiro Iwano; in probability theory, sta- 
tistics, and mathematics of programming, 
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Nobuyuki Ikeda, Tadashi Ueno, Masashi 
Okamoto, Haruki Morimoto, Kei Takeuchi, 
Goro Ishii, Tokitake Kusama, Hukukane 
Nikaido, Toshio Kitagawa; and in theoretical 
physics, Ryogo Kubo, Hironari Miyazawa, 
Yoshihide Koza. 

After the summer of 1965, we entered into 
the period of finer technical editing, in which 
we were assisted by the following colleagues: 
in algebra, Keijiro Yamazaki, Shin-ichiro 
Ihara, Takeshi Kondo; tn geometry, Tadashi 
Nagano, Mitsuo Sugiura, Ichiro Tamura, 
Kiyoshi Katase; in analysis, Nobuyuki Suita, 
Kotaro Oikawa, Kenkiti Kasahara, Tosinobu 
Muramatsu, Hikosaburo Komatsu, Setuzd 
Yosida, Hiroshi Tanaka; and tn history, 
Tamotsu Murata. 

We were also assisted by Katsuhiro Chiba 
in the Russian-language translation of the 
article titles, and by Osamu Kôta and Kiyoshi 
Katase in the indexing. 

Proof sheets began appearing in the spring 
of 1966. In proofreading, Kaoru Sekino, 
Osamu K6ta, Kiyoshi Katase, and Teruo 
Ushijima helped us, as well as Mrs. Rieko 
Fujisaki. Miss Yoko Endo worked with all of 
us throughout the entire revision at the office 
of the Mathematical Society of Japan. She 
helped us especially in looking for and check- 
ing references and preparing the Name Index. 

Yukiyosi Kawada supervised the whole 
work, succeeding me tn the role I had played 
in the compilation of the first and augmented 
editions. Sin Hitotumatu collaborated with 
him throughout, especially on the appendices. 
The second and the third proof sheets of the 
text were read by Kawada; the fourth proof by 
Hitotumatu; the proof sheets of appendices by 
both Kawada and Hitotumatu. 

The Editorial Committee of the Mathe- 
matical Society of Japan asked me to write 
this preface. Having edited the previous edi- 
tions and realizing full well the difficulty of the 
task, I would like to express my particular 
gratitude to Kawada. In the Dictionary De- 
partment of Iwanami Shoten, Publishers, 
Messrs. Hiroshi Horie, Tetsuo Misaka, Shi- 
geki Kobayashi, and Toshio Kouda were very 
cooperative in their collaboration with us. 

To them and also to those who typeset and 
printed this book at Dai-Nippon Printing Co. 
and Shaken Co. goes our gratitude. 


S. lyanaga 
March 1968 


Preface to the Revised and 
Augmented Japanese Edition 


Six years have passed since the publication of 
the first edition of this encyclopedia. This re- 
vised and augmented edition incorporates the 
achievement of these years. It contains, to- 
gether with the correction of errors found in 
the first edition, some new articles such as 
Abelian Varieties, Automata, Sheaves, Homo- 
logical Algebra, Information Theory, and also 
supplements to articles in the first edition such 
as Complex Multiplication, Computers, and 
Manifolds. These additional items render the 
new edition 93 pages longer than the previous 
one. Each article has been thoroughly re- 
vised, and the indexes have been completely 
rewritten. 

We were assisted by the following colleagues 
in selecting articles, writing, and proofreading: 
In set theory and foundations of mathematics, 
Sigekatu Kuroda, Setsuya Seki; in algebra 
and number theory, Tadao Tannaka, Tsuneo 
Tamagawa: in real analysis, Shunji Kametani, 
Késaku Yosida; in function theory, Kiyoshi 
Noshiro, Sin Hitotumatu; in theory of dif- 
ferential and functional equations, Masuo 
Hukuhara, Masahiro Iwano, Ken Yamanaka; 
in functional analysis, Kôsaku Yosida, Seiz6 
Itô; in geometry, Shigeo Sasaki, Nagayoshi 
Iwahori; in topology, Atuo Komatu, Itiro 
Tamura, Nobuo Yoneda; in theory of proba- 
bility, Kiyosi 1t6, Seiz6 116; in statistics, Toshio 
Kitagawa, Sigeiti Moriguti, Tatsuo Kawata; 
in applied mathematics, Sigeiti Moriguti; and 
in mechanics and theoretical physics, Taka- 
hiko Yamanouchi, Isao Imai. The revision 
and augmentation of the articles concerning 
the history of mathematics was done by 
myself. Miles. Yôko Tao, Eiko Miyagawa, and 
Mutsuko Nogami worked in the office of the 
Society. 

The names of authors who contributed to 
the completion of this edition have been added 
to the original list of contributors. 

The project of editing this edition started 
in the summer of 1958. We acknowledge our 
deep gratitude to all those who have col- 
laborated with us since that time. 


S. Iyanaga 
August 1960 
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This encyclopedia, Iwanami Sugaku Ziten, was 
compiled by the Mathematical Society of 
Japan at the request of the Iwanami Shoten, 
Publishers, who have hitherto published a 
series of scientific dictionaries such as Iwanami 
Rikagaku-Ziten (Iwanami Encyclopedia of 
Physics and Chemistry) and Iwanami Tetugaku- 
Ziten (Iwanami Encyclopedia of Philosophy). 
As mentioned in the prefaces to these volumes, 
the importance of such encyclopedias in clari- 
fying the present state of each science is obvious 
if we observe the rapid pace of contemporary 
research. Mathematics is also in rapid motion. 
As a fundamental part of exact science, it 
serves as a basis of all sctence and technology. 
It also retains its close contact with philo- 
sophy. Therefore, the significance of having 

an encyclopedia of mathematics cannot be 
overemphasized. 

Mathematics have made remarkable pro- 
gress in the 20th century. As for the situation 
toward the end of the 19th century, we quote 
the following passage from the article Mathe- 
matics in the 19th Century of this encyclo- 
pedia: “Toward the end of [the 19th] century, 
the subjects of mathematical research became 
highly differentiated. Branches split into more 
specialized areas of study, while unexpected 
relations were found between previously 
unconnected fields. The situation became 
so complicated that it was difficult to view 
mathematics as a whole. It was in these cir- 
cumstances that in 1898, at the suggestion of 
Franz Meyer and under the sponsorship of the 
Academies of Gottingen, Berlin, and Vienna, a 
project was initiated to compile an encyclo- 
pedia of the mathematical sciences. Entitled 
the Enzyklopädie der mathematischen Wissen- 
schaften, it was completed in 20 years....” 

One of the characteristics of 20th-century 
mathematics is the conscious utilization of the 
axiomatic method and of general concepts 
such as sets and mappings, which serve as 
foundations of different theories. Indeed, 
mathematics is being reorganized on the basis 
of topology and algebra. One such example of 
reorganization is found in Bourbaki’s Eléments 
de mathématique; some fifteen volumes of this 
series have been published since 1939, and 
more are coming. This encyclopedia, with 
its limited size, cannot contain proofs for 
theorems. However, we intend to present a 
lucid view of the totality of mathematics, 
including its historical background and future 
possibilities. 

Each article of this encyclopedia is of me- 
dium length— sufficiently short to permit 
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the reader to find exact definitions of notions, 
and sufficiently long to contain explanations 
clarifying how important concepts in the same 
field are related to each other. The problem of 
choosing adequate titles required some de- 
liberation. The Systematic List of Articles, 
classified according to specific fields, shows 
those we have chosen. The Index of Terms 
contains detailed references for each notion. 
The appendices, including formulas and tables, 
supplement the text, and will be particularly 
useful for applied mathematicians. 

The project of compiling this encyclopedia 
was proposed in the spring of 1947 by the 
Steering Committee of the Mathematical 
Society of Japan. It was promptly adopted, 
and the selection of articles in specific fields 
was started by the sectional committees of the 
Society. After seven years, our encyclopedia 
is finally appearing. We shall not give a de- 
tailed description of how our work proceeded 
through all these years. We list simply the 
names of those who assisted us greatly and 
to whom we should like to express our deep 
gratitude. 

The president of the Society at the start of 
this project was the late Professor Tadahiko 
Kubota; but our work has been supported also 
by Professors Teiji Takagi, Zyoiti Suetuna, and 
Masatsugu Tsuji as well as by other leading 
members of the Society. 

At the stage of selecting articles, we were 
assisted by the following colleagues: in history 
and the foundations of mathematics, Sige- 
katu Kuroda, Motokiti Kondo; in algebra 
and number theory, Kenjiro Shoda, Tadasi 
Nakayama, Masao Sugawara, Yukiyosi 
Kawada, Kenkichi Iwasawa; in geometry, 
Kentaro Yano, Asajiro Ichida; in function 
theory, Kiyoshi Noshiro, Yûsaku Komatu; in 
the theory of differential and functional equa- 
tions, Masuo Hukuhara, Shigeru Furuya; in 
topology, Atuo Komatu, Ryoji Shizuma; in 
functional analysis, Yukio Mimura, Shizuo 
Kakutani, Kôsaku Yosida; in the theory of 
probability and statistics, Tatsuo Kawata, 
Toshio Kitagawa, Junjiro Ogawa; and in 
applied mathematics, Ayao Amemiya, Isao 
Imai, Kunihiko Kodaira, Shigeiti Moriguti. 

We asked 190 colleagues to contribute arti- 
cles, which were collected in 1949. Since then 
we have spent an unexpectedly long time edit- 
ing them. Terminology had to be unified 
throughout the encyclopedia so that the reader 
would have no trouble with cross-references. 
Repetitions had to be eliminated and gaps 
filled. Part of the manuscript thus had to be 
rewritten a number of times. We have made 
out utmost effort in this editing work, but we 
are not completely without apprehension that 
our result has still left something to be desired. 
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For any shortcomings in the work, I take 
complete responsibility, as I have acted as the 
editor-in-chief. Also, since we have rewritten 
the manuscript, as already mentioned, we have 
refrained from printing the name of the original 
author of each article; for this, I must request 
the understanding of the contributors. 

In the stage of editing and proofreading, 
we were assisted by the following colleagues: 
Yukio Mimura, Yukiyosi Kawada, Kazuo 
Matsuzaka, Sin Hitotumatu, Setsuo Fuku- 
tomi, Setsuya Seki, Shoji Irie, Shigeo Sasaki, 
Tatsuo Kawata, Sigekatu Kuroda, Yûsaku Ko- 
matu, Ayao Amemiya, Isao Iami, Tosio Kato, 
Tsurane Iwamura, Morikuni Goto, Késaku 
Yosida, Jir6 Tamura, Yasuo Akizuki, Kiyoshi 
Noshiro, Motosaburo Masuyama, Sigeiti 
Moriguti, Osamu Kôta, Nobuo Yoneda, 
Tsuneo Tamagawa, Jun-ichi Hano; and more 
particularly, in the foundations of mathematics, 
Sigekatu Kuroda, Tsurane Iwamura; in alge- 
bra and number theory, Kazuo Matsuzaka, 
Yukiyosi Kawada; in algebraic geometry, 
Yakuo Akizuki; in real analysis, Tatsuo Ka- 
wata; in complex analysis, Yisaku Komatu, 
Sin Hitotumatu, Jir6 Tamura; in functional 
analysis, Késaku Yosida; in topology, Setsuo 
Fukutomi, Nobuo Yoneda; in the theory of 
probability and statistics, Motosaburo Masu- 
yama, Sigeiti Moriguti; and in applied mathe- 
matics, Ayao Amemiya, Isao Imai, Tosio 
Kato, Sigeiti Moriguti. 

The portraits of Abel and Riemann were 
kindly loaned to us by Torataro Shimomura. 

The formulas in the appendices were com- 
piled by Isao Imai, Sin Hitotumatu, and Sigeiti 
Moriguti; the Subject Index (in Japanese and 
European languages) by Osamu Kôta and 
Mrs. Hiroko Ide; the Name Index and Com- 
ments on Journals and Serials by Setsuo 
Fukutomi. Setsuo Fukutomi has taken an 
active part in our work ever since 1948 and 
given much effort to collecting and rewriting 
the manuscript and to unifying terminology. 
The editorial staff of Iwanami Shoten, Pub- 
lishers, has always been cooperative. Without 
their generous support, this encyclopedia 
could never have been published. 

I should like to express my sincere gratitude 
to all those who have collaborated with us 
directly or indirectly. 


S. lyanaga 
March 1954 


Introduction 


The text of this Encyclopedic Dictionary con- 
sists of 450 articles arranged alphabetically, 
beginning with 1 Abel and ending with 450 
Zeta Functions. Most of these articles are 
divided into sections, indicated by A, B, C,..., 
AA, BB, .... Cross-references to articles, e.g., 
to the second article, are of the form: (— 2 
Abelian Groups) or (— 2 Abelian Groups A), 
according as the whole article or a specific 
section is being referred to. Citations in the 
indexes are also given in terms of article and 
section numbers. 

Key terms accompanied by their definitions 
in the text are printed in boldface. All of these 
terms are found in the Subject Index at the 
end of the volume. 

The sign * means that the term preceded by 
it can be found in the index. A list of special 
notations used throughout the work (with 
explanations of their meanings) appears after 
the appendices. 

A Systematic List of Articles, showing the 
general structure of the work, will be found on 
p. 1857 (vol. IV). The number in parentheses 
after each article title refers to this systematic 
classification into subject areas; e.g., “Abelian 
Varieties (VIII.5)” means that the article on 
Abelian varieties is the fifth article in Section 
VIII of the systematic list. 

Books and articles in journals are cited in the 
text by numbers in brackets: [1], [2],....At 
the end of each article there is a section of 
references in which, for books, the name of the 
author or authors, title, name of the publisher, 
year of publication, and the number of the 
edition are given; for journals, the name of the 
author, title of the article, name of the journal, 
and the volume numbers and inclusive page 
numbers are given in this order. (The names of 
journals and publishers are abbreviated as 
indicated in the lists at the end of the work.) 


The Cyrillic alphabet is transliterated as 


follows: 


Cyrillic 
Alphabet 


MSRUPUAP ERB SCOKXMSEKANVIAOTESSASNYRTEME WIS 


MBC TP FEES EX eK HOUR ODE HARE OR GOTH oR 


(i) 

(i kratkoe) 
(ka) 

(el) 


(tvérdyi znak) 
(yery) 
(myagkii znak) 
(e) 

(yu) 

(ya) 


Transliteration 


oo amne oP 


zh 


— p = N 


“aoe SB tp Oo Ss g 


yu 
ya 


1 
Abel, Niels Henrik 


1 (XXI.12) 
Abel, Niels Henrik 


Niels Henrik Abel (August 5, 1802—April 6, 
1829) was born a son of a poor pastor in the 
hamlet of Findé in Norway. In 1822, he en- 
tered the University of Christiania; however, 
he studied mathematics almost entirely on his 
own. He was recognized as a promising stu- 
dent by his senior, Holmboe, and after gradu- 
ation he studied abroad in Berlin and Paris. 
In Berlin he met and was aided by A. Crelle, 
the founder of the Journal fiir die Reine und 
Angewandte Mathematik, and participated in 
the founding of this journal. Although he did 
brilliant work in Paris, he did not gain the 
fame he deserved. He returned to Norway in 
May 1827, but, unable to find a job, he was 
obliged to fight poverty while continuing his 
research. He died at twenty-six of tuberculosis. 

His best-known works are the result that 
algebraic equations of order five or above 
cannot generally be solved algebraically, the 
result that tAbelian equations can be solved 
algebraically, the theory of *binomial series 
and of series in general, the theory of tellip- 
tic functions and more generally of talge- 
braic functions, the introduction of tAbeltan 
integrals, and the establishment of *Abel’s 
theorem. His work in both algebra and analy- 
sis, written in a style conducive to easy com- 
prehension, reached the highest level of attain- 
ment of his time. 
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2 (IV.2) 
Abelian Groups 


A. General Remarks 


A tgroup G is called an Abelian group (or 
commutative group) if G satisfies the commuta- 
tive law ab = ba for all a, be G. In this article, G 
always denotes an Abelian group. Every tsub- 
group of G is a ‘normal subgroup, and all the 
elements of finite order in G form a subgroup 


T, for which the *factor group G/T has no 
elements of fintte order except the identity e. 

T is called the (maximal) torsion subgroup of G. 
If G=T, then the Abelian group G is called a 
torsion group (or periodic group). On the other 
hand, if T= {e}, then G is called torsion-free; if 
T#G, T# {e}, then G is called mixed. If the 
order of every element of a torsion group G is 
a power of a fixed prime number p, then G is 
called an Abelian p-group (or primary Abelian 
group). An Abelian torsion group is the fdirect 
sum of primary Abelian groups. Thus the 
study of torsion groups is reduced to that of 
primary Abelian groups. 


B. Finite Abelian Groups 


The following fundamental theorem on finite 
Abelian groups was established by L. Kro- 
necker, G. Frobenius, and L. Stickelberger in 
the 1870s. An Abelian group G of order p”, 
where p is a prime number, is a direct product 
of tcyclic subgroups Z,,...,Z,:@=Z, x... x 
Z,. If Z; is of order při, then n=n,+...+n,, 
and we can assume that n,;2>n;,,. A direct 
product decomposition of G, as above, is not 
unique, but n,,...,, are determined uniquely 
by G. The system {p",...,p} or {m,,...,n,} is 
called the system of invariants (or type) of G, 
and a system of tgenerators {z,,...,z,} of 
Zi,- Z is called a basis of G. An Abelian 
group of type (p, p, ..., p) is called an elemen- 
tary Abelian group. The decomposition of a 
finite Abelian group into a ‘direct sum of 


subsets (not necessarily of subgroups) was 


considered by G. Hajos (1942) and applied 


successfully to a problem of number theory 
(— 151 Finite Groups). 


C. Finitely Generated Abelian Groups 


The theory of finitely generated Abelian 
groups, i.e., Abelian groups generated by a 
finite number of elements, is as old as that of 
finite Abelian groups. The direct product of 
‘infinite cyclic groups is called a free Abelian 
group. A finitely generated Abelian group G is 
the direct product of a finite Abelian group 
and a free Abelian group. The finite factor is 
the torsion subgroup of G. The free factor of the 
group G is not necessarily unique; however, 
the number of infinite cyclic factors of the free 
factor is uniquely determined and is called the 
rank of G. Two finitely generated Abelian 
groups are isomorphic if they have isomorphic 
maximal torsion subgroups and the same 
rank. This theory can be extended to the 
theory of tmodules over a ‘principal ideal 
domain (— 67 Commutative Rings K). 


D. Torsion Groups 


The structure of Abelian p-groups is relatively 
well known, compared with other infinitely 
generated Abelian torsion groups. In the 
1920s, H. Priifer made the first important 
contribution to the study of Abelian p-groups, 
and H. Ulm and L. Zippin completed the 
theory for countable groups in the 1930s. The 
uncountable case was first treated by L. Kuli- 
kov in the 1940s, but the study of this case is 
still in progress. 

An Abelian p-group G # {e} is called divis- 
ible (or complete) if for any aeG there is an 
element xeG satisfying x? =a. A divisible 
group is a tdirect sum of Abelian groups of 
type p” (Prüfer). Here a group of type p” is 
isomorphic to the tmultiplicative group of all 
the p”th roots of unity (n=1,2,...) in the 
complex number field. Let G be any Abelian 
p-group. The maximal divisible subgroup 
V of G is a direct product factor of G: G = 
Vx R, where R has no divisible subgroups. 
An Abelian p-group without a divisible sub- 
group is called a reduced Abelian group. 

An element x of an Abelian p-group G is 
said to have infinite height if for any n there is 
an element y,€G satisfying x = y?”. The ele- 
ments of infinite height form a subgroup G' of 
G. If G' = {e} and G is countable, then G is 
decomposed uniquely into the direct sum of 
cyclic groups. This assertion fails if the hy- 
pothesis of countability is dropped. By ‘trans- 
finite induction we can define G* as follows. If 
ß is an fisolated ordinal number, then G’ = 
(G*~')!: if B is a tlimit ordinal number, then 
G? =(),<,G*. For the least ordinal number t 
such that G'=G""*!, G" is the maximal divis- 
ible subgroup of G. If G is reduced, then G'= 
{e}. We call z the type of an Abelian p-group 
G. For a <t, G* = G*/G**! is called the Ulm 
factor of G, and the sequence G°, ..., G% ... 
(x< t) is called the sequence of Ulm factors 
of G. Each Ulm factor G” has no element of 
infinite height, and if «<t—1, then G* has an 
element of arbitrarily large order. Let t be a 
countable ordinal number; assume that for 
any ordinal number æ <t there is given a 
countable Abelian p-group A, such that A, 
has no element of infinite height, and that 
for a#t—1 A, has an element of arbitrarily 
large order. Then there ts a reduced coun- 
table Abelian p-group which is of type t with 
a sequence of Ulm factors isomorphic to Ag, 
A,,-+-, Ay, --. («<t) (Zippin). Two reduced 
countable Abelian p-groups A and B are iso- 
morphic if they have the same type qt; and for 
any «<t the Ulm factors A* and B* are iso- 
morphic. The assertion fails if the hypothesis 
of countability ts dropped. 
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E. Torsion-Free Groups 


In Abelian groups, the group operation is 
often written a+b, using the additive notation; 
an additively written group, called an additive 
group, is generally assumed to be Abelian. In 
the rest of this article we consider exclusively 
additive Abelian groups, of which the additive 
group Z of rational integers is the most primi- 
tive example. In such a group the identity 
element is called the zero element and is de- 
noted by 0; the inverse of a is denoted by —a, 
and we write a+(—b)=a—b. The direct sum 
of additive groups A, (AeA) is called a free 
additive group if each A, ts isomorphic to Z. 
An additive group G ts regarded as a tmodule 
over the tring Z, to which the notion of linear 
independence is applicable (— 277 Modules). 
Elements a,,...,a, of G are linearly dependent 
if there are integers n,,...,n, not all of which 
are zero such that n,a,+...+n,a,=0. Those 
that are not linearly dependent are termed 
linearly independent. An infinite set of elements 
of G is called linearly independent if the ele- 
ments of any finite subset are linearly inde- 
pendent. If there are N elements of G that are 
linearly independent, but if any N + 1 elements 
of G are linearly dependent, then N is called 
the rank of G. Such a system of N linearly 
independent elements is called a maximal 
independent system. A torsion-free additive 
group G is not necessarily free if G is not 
finitely generated. 

The first important work on torsion-free 
additive groups was done by F. W. Levi (1917). 
A. G. Kurosh (1937) completed the theory in 
the case of finite rank. In the general case little 
is known, and I. Kaplansky, J. Rotman, and 
others are continuing the investigation. 

The additive group Q of rational numbers ts 
of rank 1, and conversely any additive group 
of rank 1 is isomorphic to some subgroup of 
Q. An additive group G is called divisible (or 
complete) if for any ae G and for any integer n 
there is an element x,,€G such that nx,=a. A 
divisible torsion-free additive group is isomor- 
phic to a direct sum of some copies of Q. For 
any torsion-free additive group G there is a 
divisible torsion-free additive group containing 
G. A minimal additive group F among these 
groups is uniquely determined up to tsomor- 
phism and has the rational number field Q as 
an toperator domain. Let Q = {a/b|(a, b)= 
1, p{b} be the ring of p-integers in Q, and let 
G, be the smallest Q’-subgroup of F contain- 
ing G. Let Q, be the *p-adic number field and 
Z, the ring of tp-adic integers. Extending the 
operator domain Q to Q, we obtain naturally 
a Q,-module F, from F. Let G, be the natural 
closure of G, in F,. Then G, has Z, as an 
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operator domain and thus becomes a Z,- 
module. A Z,-module of rank N is isomorphic 
to the direct sum of x, copies of Q, and N—x, 
copies of Z,:G,~D,Q,0n+ Ln ZpWm (n= 
ee ee ., N-k,). Here x, is called 
the p-rank of G. As the invariants of G, Kurosh 
gives the rank, the p-ranks for all primes p, 
and a certain equivalence class of the sequence 
of the matrices M,. Here p ranges over all 
primes, and W, is the matrix of coefficients 
when the elements of a maximal independent 
system of F are written as linear combinations 
of (v 


n? Wn): 


F. General Abelian Groups 


An Abelian group is, in general, an textension 
of a torsion group by a torsion-free group. A 
torsion group T is called bounded if there is an 
integer n such that ¢"=1 for all te T. Suppose 
there is a torsion group T. Then T is a direct 
summand of an Abelian group G which con- 
tains T as its maximal torsion subgroup if and 
only if T is the direct product of a divisible 
group and a bounded group (R. Baer and S. 
Fomin). 


G. Characters 


A character y of an Abelian group G is a func- 
tion which assigns to each ae G a complex 
number y(a) of absolute value 1 and satisfies 
x(ab) = x(a) x(b) for all a, beG. The product y= 
XıX2 Of two characters y; and x, 1s defined 

by x(a)=7,(a)z>(a), and y is also a character 
of G. Thus all the characters of G form an 
Abelian group C(G), which is called the char- 
acter group of G. The identity element of the 
character group is the identity character (or 
principal character) y such that y(a)= 1 for all 
aeG. If G is finite, then G = C(G). This implies 
the duality G=C[C(G)]. This fact was ex- 
tended by L. S. Pontryagin to tlocally compact 
ttopological Abelian groups (— 422 Topo- 
logical Abelian Groups B-D). For additive 
Abelian groups with operator domains — 277 
Modules. 
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A. History 


Except for C. F. Gauss, whose work on this 
subject saw the light many years after his 
death, N. H. Abel was the first to consider 
talgebraic functions as functions of complex 
variables and to discover double periods of 
telliptic functions such as x = x(u), which is the 
inverse function of an elliptic integral 


| * dx 

u= ; 

V fa(x) 

where f,(x) is a polynomial in x of degree 4. C. 
G. Jacobi expressed elliptic functions explicitly 
as ratios of theta series. As a natural generali- 
zation of elliptic functions, which are tAbelian 
functions of genus 1, Jacobi and his successors 
studied the inverse function of the thyperellip- 
tic integral, or more generally, the inverse 
function of the Abelian integral. By investigat- 
ing hyperelliptic integrals of the first kind of 
genus 2, 


i ax; | 2 X dX3 

V fe(x1) heey) 
where f,(x) is a polynomial of degree 6 in x, 
Jacobi obtained multiple-valued functions 
with quadruple periods. He then discovered 


the remarkable fact that if we consider sums of 
two integrals 


*1 dx, -| 2 dx, 
V fo(%1) V fol x2) 
“x EA *2 x,dx 


I H). 


the elementary symmetric functions s, =x, 
+x, and s, =x; x, of x, and x, are single- 
valued functions of u; and u, with quadruple 
periods. He also conjectured that these func- 
tions sı and s, might be expressed explicitly in 
terms of theta series of u, and u,; this conjec- 
ture was later confirmed by J. G. Rosenhain 
and A. Gopel. 

In the latter half of the 19th century, the 
general theory of Abelian functions was es- 
tablished. The central subject was Jacobi’s 
inverse problem (— Section L), which is a 
natural generalization of the above results. B. 
Riemann solved this problem by expressing 
elementary Abelian functions (— Section M) 
as rational functions of theta functions. 

The theory of functions with multiple 
periods was developed further by H. Poincaré, 
G. Frobenius, and E. Picard. In the 20th cen- 
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tury, the importance of the theories of Abelian 
functions and Abelian varieties has become 
more obvious with the development of the 
theory of functions of several complex vari- 
ables and algebraic geometry. In particular, 
problems intimately related to number theory 
have given rise to the development of a purely 
algebraic theory of Abelian varieties. Valuable 
contributions have been made by S. Lefschetz, 
C. L. Siegel (tSiegel modular forms), A. Weil 
(algebraic theory of Abelian varieties, syste- 
matization of the theory of theta functions), 
and by D. Mumford (tmoduli theory, algebraic 
theory of theta functions). 


B. Algebraic Theory 


When a tgroup variety is tcomplete as a 
variety, the group law is commutative; such a 
group variety is called an Abelian variety (— 
12 Algebraic Varieties H.). Let B be a sub- 
variety of an Abelian variety A, and assume 
that B is a subgroup of A as an abstract group. 
Then B has the structure of an Abelian variety 
whose law of composition is induced by that 
of A, and B is called an Abelian subvariety of 
A. More generally, when an algebraic subset 8 
is a subgroup of A, then the component B of B 
containing the identity element is an Abelian 
subvariety, and % is a union of B and a finite 
number of cosets of B. When A is defined over 
a field k, then any Abelian subvariety of A is 
defined over a finite tseparable extension of k 
(W. L. Chow’s theorem). An Abelian variety 

A is called simple if A has no Abelian sub- 
varieties other than A itself and 0. 

Every frational mapping of an algebraic 
variety V into an Abelian variety is defined at 
each simple point of V. This implies that an 
Abelian variety is tabsolutely minimal. 


C. Homomorphisms 


A rational mapping of an Abelian variety A 
into an Abelian variety B is called a rational 
homomorphism (or simply homomorphism) if f 
is a group homomorphism. Let F be a rational 
mapping of A into B; then F can be uniquely 
expressed as follows: F(x) = Fo(x)+ F(0) (xe A), 
where Fp is a homomorphism and F(0) is the 
image of the unit element 0 of A. Hence the 
structure of an Abelian variety (as a group 
variety) is uniquely determined by the underly- 
ing algebraic-variety structure. 

When a rational homomorphism f is bira- 
tional, f is called a birational isomorphism (or 
simply isomorphism). It is clear that a rational 
isomorphism is an abstract isomorphism, but 
the converse is not necessarily true. Let A, B 
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be two Abelian varieties. We say that A is 
isogenous to B if the dimension of A is equal to 
that of B and there exists a surjective homo- 
morphism of A onto B, or equivalently, if there 
exists a surjective homomorphism of A onto B 
whose kernel is finite. The relation of isogeny 
is an equivalence relation. For an Abelian 
variety A and an Abelian subvariety X of A 
there is an Abelian subvariety Y of A such 
that the natural homomorphism X x Y> 
A:(x, y)>x+y is an isogeny (Poincaré’s com- 
plete reducibility theorem). In particular, every 
Abelian variety is isogenous to a product of 
simple Abelian varieties that are determined 
uniquely up to isogeny and order. 

Let A, B be two Abelian varieties; we denote 
by Hom(A, B) the additive group of rational 
homomorphisms of A into B. When a rational 
homomorphism A is surjective, then the degree 
v(A) of 4 is defined by A(A)= v(A)B as talgebraic 
cycles. If 4 is an isogeny, then v(A) 40, and the 
order of the kernel {t|te A, At=0} is at most 
v(A) and equal to v(A) if and only if å is fsepa- 
rable. The additive group Hom(A, B) is shown 
to be free of rank <4dim A-dim B by the l-adic 
representation (— Section E). If A =B, then 
Hom(A, A) has a ring structure; it is called the 
ring of endomorphisms (or endomorphism ring) 
of A and is denoted by (A). The tensor prod- 
uct Y(A) = Y(A) © Q, where Q is the field 
of rational numbers, is an tassociative alge- 
bra over Q. If A is simple, then 2[,(A) is a 
‘division algebra. More generally, AW, (4) is iso- 
morphic to a direct product of some ttotal 
matrix algebras over division algebras; thus 
Y9(A) is fsemisimple. In particular, if A is 1- 
dimensional (in other words, if A is an telliptic 
curve), the types of 2{,(A) are well known; 
when the characteristic p=0, then Y,(A) is 
either the field of rational numbers or an 
timaginary quadratic field. When p> 0, aside 
from these two fields, we have a tquaternion 
algebra over Q as a possible type of (A). 

Let k be a finite field with g elements. An 
algebraic integer is called a Weil number for g 
if every conjugate of it has absolute value Ja 
If A is an Abelian variety defined and simple 
over k, the qth power endomorphism of A: 
x—x‘ determines a conjugacy class of Weil 
numbers for q, as Weil showed (— 450 Zeta 
Functions). Moreover, we have the following 
classification theorem (J. Tate, T. Honda): 
There is a one-to-one correspondence between 
the set of all k-isogeny classes of k-simple 
Abelian varieties over k and the set of all con- 
jugacy classes of Weil numbers for q. Tate also 
determined the structure of the division alge- 
bra Wg (4) over Q, which is described in terms 
of the decomposition of the qth power endo- 
morphism into prime ideals. 
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D. Divisors 


Let © be the additive group of tdivisors on an 
Abelian variety A and ©, be the subgroup of 
divisors that are talgebraically equivalent to 0. 
Then the factor group ©/G, has no ‘torsion 
part; this implies that for an Abelian variety 
tnumerical equivalence coincides with falge- 
braic equivalence. We denote this relation 
by =. Given an element a of A, the translation 
T,:A3x->x+aeéA gives a tbirational trans- 
formation, which is everywhere fbiregular, on 
the underlying variety of A; we denote by X, 
the image of a divisor X on A. Then X =O if 
and only if X, is flinearly equivalent to X for 
each point a of A. The *Albanese variety of an 
Abelian variety A is A itself, and the tPicard 
variety A of A is isogenous to A. In particular, 
for the ‘Jacobian variety J of an algebraic 
curve, J is isomorphic to J itself. The Picard 
variety A of Ais isomorphic to A (duality 
theorem). Let X be a divisor on A; the map- 
ping a—the ‘linear equivalence class of the 
divisor X,— X, ac A, is a rational homomor- 
phism of A into A, and we denote it by gy. If 
~y=0, then X =0, and vice versa; hence we 
have an additive map of G/, into Hom(A, A). 
If ox is surjective, we say that X is nondegener- 
ate. A tpositive divisor X is nondegenerate if 
and only if X is tample, and then nX is tvery 
ample for n>3. There always exist positive 
nondegenerate divisors on an Abelian variety; 
therefore an Abelian variety is a tprojective 
variety. For a given divisor X on A, we can 
find n suitable points u,,...,u,, where n is the 
dimension of A, so that the fintersection prod- 
uct X,, °...:X,, is defined. We denote by (Xx) 
the tdegree of the zero cycle X „t... Xu If X 
is positive nondegenerate, then the dimension 
(X) of the tdefining module of the tcomplete 
linear system determined by X is equal to 
(X)/n! (Poincaré’s theorem). Furthermore, the 
degree v(py) of py, where X is any divisor on 
A, is given by the formula v(py)=((X)/n!)? 
(Frobenius’s theorem). 

For a nondegenerate divisor X on A there is 
a unique integer i=i(X), 0<i<dim A, called 
the index of X, such that H?(A, ©(X))=0 for 
p#iand H(A, O(X)) 40. We have i(—X)= 
dim A —i(X) and i(X)=0 if and only if X is 
ample. 


E. l-adic Representations 


Let A be an Abelian variety of dimension n. 
For a given prime number l, let G,(A) denote 
the group of points on A whose order is a 
power of l. If / is different from the character- 
istic of the base field of A, then the group 
,(A) is isomorphic to the direct product of 2n 


factor groups Q,/Z;,, where Q, is the field of l- 
adic numbers and Z, is the group of l-adic 
integers (— 439 Valuations). We call such an 
isomorphism the /-adic coordinate system of 
©,(A). Now let 4 be a rational homomorphism 
of A into an Abelian variety B of dimension m. 
Then we can see that 4 induces a homomor- 
phism of ,(A) into G,(B). This shows that by 
placing /-adic coordinate systems on ©,(A) and 
,(B) respectively, we get a matrix representa- 
tion M,(A) of å with 2m rows and 2n columns. 
The representation A— M,(A) is faithful, and 
M,(A) is called the l-adic representation of 2. 
When A =B, then 2-+M,(A) is a faithful repre- 
sentation of the ring of endomorphisms YA). 
This representation can be naturally extended 
to the representation of the algebra W, (A); 

the characteristic polynomial of the /-adic 
representation M,(A) (where J is an element 

of Xo (4)) is a polynomial with coefficients in 
Q. Moreover, the polynomial does not depend 
on the choice of the prime number /. When 

AE UA), then v(A) is equal to det M,(/). The 
trace of M,(A) ts usually written as o(A). 

Let 4 be a rational homomorphism of A 
into B and Y be a divisor on B. Then by the 
correspondence cl(Y)—cl(A7!(Y)), we obtain a 
rational homomorphism of B into A, called the 
transpose of 4 and denoted by ‘4, where cl 
means the linear equivalence class (— 16 Alge- 
braic Varieties M). If X is a nondegenerate 
divisor on A, then the composition map B:A- 
A of 'py:A—A and the canonical isomorphism 
AA satisfy the equality Bo py =v(px)d 
(6 =the identity mapping of A). We denote by 
py! the element (1/v(gy)) B in Hom(A, A) Q Q. 
The correspondence >a’, a’ =y' o'wo@y 
(xEW(A)) is an tinvolution of Q,(A) and 
is of order 1 or 2. If «40, then a(a’oa)>0 
(Castelnuovo’s lemma). A. Weil was the first to 
recognize the importance of this theorem in 
connection with tRiemann’s hypothesis on 
tcongruence zeta functions. 


F. Differential Forms 


A ‘differential form œ on an Abelian variety A 
of dimension n is called invariant if T*% =a 
for every point ae A, where T, is the trans- 
lation by a (— Section D). The differential form 
tof the first kind is invariant, and conversely, 
every invariant differential form is of the first 
kind. Let K be the tuniversal domain and 
K(A) be the tfunction field of A. The set of the 
linear differential forms of the first kind on A 
is a linear space over K of dimension n, and its 
basis becomes a basis over K(A) of the linear 
space consisting of all linear differential forms 
on A. An invariant derivation on A is a deriva- 
tion D in K(A) satisfying (Df) o T, = D(f oT, 


for any element f of K(A) and every point a of 
A. For a linear differential form w=» f;du; 
and a derivation D, we put <œ, D> =È f; Du;. 
Then <@, D> is a bilinear form in œ and D. A 
derivation D is invariant if and only if <œ, D> 
is a constant function for every invariant linear 
differential form œ. Similarly, a linear differen- 
tial form q@ is invariant if and only if <œ, DY is 
a constant function for every invariant deriva- 
tion D. The linear space of invariant linear 
differential forms and that of invariant deriva- 
tions are dual to each other with respect to 
the bilinear form <w, DY. 

Now consider the case when the character- 
istic p of the universal domain is positive. The 
automorphism aa’, ae K, of the universal 
domain K induces a group isomorphism of A; 
we denote by A? the image of A and by x? the 
image of a point x of A. The image A? is an 
Abelian variety, and the group isomorphism 
m:x—x?, x€ A, is an isogeny of A onto A”. Let 
B be another Abelian variety and let 4 be an 
isogeny of A onto B. If there is an isogeny x: 
B- A? such that 7 = uo À, then we say that A is 
of height 1. The function field K(B) of B can be 
considered as a subfield of the function field 
K(A) of A by the mapping 4. If 4 is of height 1 
and v(A)= p, there exists an invariant deriva- 
tion D of K(A) with the constant field K(B), 
uniquely determined up to constants. More- 
over, we can choose D so that D’ =D or D? =0. 
In the first case å is said to be of type (i,); in 
the second case it is said to be of type (i,). An 
isogeny whose degree is a prime different from 
the characteristic p is said to be of type (s,), 
and a tseparable isogeny whose degree is p 
is said to be of type (s,). Any isogeny can be 
written as a product of isogenies of these four 
types. 


G. Polarized Abelian Varieties 


Let X be a divisor on an Abelian variety A; we 
denote by ¥ the class of divisors X’ such that 
mX =m X’ for suitably chosen positive inte- 
gers m, m'. When the class ¥ contains positive 
nondegenerate divisors, we say that X deter- 
mines a polarization on A, and the couple 

(A, ¥) is called a polarized Abelian variety. In 
particular, if A is a Jacobian variety whose 
polarization X is determined by a theta divi- 
sor, we call (A, ¥) the canonically polarized 
Jacobian variety. If an endomorphism « of A 
keeps the polarization invariant, i.e., if the 
class determined by «7 '(X) coincides with the 
class X, then « is called an endomorphism of 
the polarized Abelian variety (A, X). In partic- 
ular, if « is an automorphism of A, then we 
say that « is an automorphism of (A, X). The 
group of all automorphisms of a polarized 
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Abelian variety is finite. In particular, the 
group of automorphisms of a canonically 
polarized Jacobian variety is finite. Hence 
follows the famous theorem concerning the 
finiteness of the group of automorphisms of an 
talgebraic curve of genus not less than 2. 

On the other hand, the algebraic equiva- 
lence class of a nondegenerate divisor is called 
an inhomogeneous polarization. (The above 
polarization is then sometimes called a homo- 
geneous polarization.) An inhomogeneous 
polarization X determines an isogeny Qy: 
A— Â uniquely. If gy is an isomorphism, 
the polarization is called principal. An endo- 
morphism of an inhomogeneously polarized 
Abelian variety can be defined similarly. 


H. Analytic Theory 


For the rest of this article we take the complex 
number field C as the universal domain, and in 
this case we can utilize analytic and topolog- 
ical methods. 

Let C” be an n-dimensional vector space 
over C. In a natural way, the space C” be- 
comes a 2n-dimensional vector space R?” 
over the real field R, and the mapping J:z-> 
al z, zeC”, is an R-linear automorphism 
of R?” such that J? = —1. Conversely, if for an 
even-dimensional R-vector space R°” such 
a mapping J is given, then by putting (a+ 
J-i b)x=ax+bJx (xeR?";a,beR), we can 
introduce an n-dimensional complex linear 
structure into R?”. We then say that J deter- 
mines a complex structure on R2"; we denote 
by C” =(R?”, J) the space having the complex 
structure determined by J. Let @,,...,@»,, be 
2n R linearly independent points on C” = 
(R2", J). Then the tlattice D generated by 
these points is discrete, and the factor group 
T"=C’"/D is a complex torus of dimension n. 
We fix a basis of C” and introduce a complex 
coordinate system on C”. Utilizing the basis 
(1, ---, @2, Of R?”, we also introduce a real 
coordinate system on R2". We then obtain an 
n x 2n matrix Q=(@,), where the (@,;, ..., Opi) 
are the complex coordinates of œ;; the ma- 
trix Q is called the period matrix of T”. Let 
(Z,,---,2,) be the complex coordinates of 
a point ze C” and (x,,...,X»,) be the real 
coordinates. Then we have ‘(z,,...,Z,)= 
Q(X, .--, Xn). If we let the same symbol J 
stand for the representation matrix of the 
linear transformation J with respect to the 
basis w,,...,@»,, then we have J-1 Q=QJ. 
The underlying real Lie group of T” is a 2n- 
dimensional (real) ttorus group (R/Z)?". Hence 
the tPoincaré polynomial of T” is (1 + x)?". 
Any tHermitian metric on C” (as a vector 
space) induces a ‘flat "Kahler metric; hence the 
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Hodge numbers (— Kahler Manifolds C) are 
hra = (D: 


I. Theta Functions 


A holomorphic function f(z) on C"=(R", J) is 
called a theta function if for every de D we 
have f(z+d)=f(z)exp(/,(z)+c,), where l(z) is 
a linear form on C” which, as for c4, depends 
on d. The set of zeros of a theta function f, 
which we write as (f), determines an analytic 
divisor on T”. Conversely, for every effective 
analytic divisor X on T”, there exists a theta 
function f such that (f)=X. With respect to 
the real coordinate system x,,...,X 2, deter- 
mined by the basis @,,...,@2,, we can find 2n 
x 2n matrices A, Ag and a 1 x 2n matrix b, 
with elements in C, such that the transforma- 
tion formula f(x + a) = f(x) exp(2x,/—1(taAx 
+3'aA,a+t'‘ba)) (where A =A, (mod Z), ‘Ay 

= Ap) holds for every 1 x 2n matrix a whose 
elements are rational integers. Moreover, if 
we put E=A—'A, then E is an talternating 
matrix whose elements are rational integers, 
and S= EJ is a tpositive semidefinite sym- 
metric matrix. Conversely, if there exists such 
an alternating matrix E we can find a theta 
function. (There does exist, however, a com- 
plex torus on which no theta function exists 
other than trivial ones, i.e., ones of the form 
exp((z)), where (z) is a polynomial of degree 
at most 2.) 

A theta function f is called nondegenerate if 
it cannot be a function of n— 1 complex vari- 
ables, and f is nondegenerate if and only if the 
matrix $= EJ is positive definite. A complex 
torus has the structure of an Abelian variety if 
and only if there exists a nondegenerate theta 
function, i.e., if and only if there exists an 
alternating matrix E whose elements are ra- 
tional integers such that EJ is a positive de- 
finite symmetric matrix. The latter condition 
is satisfied if and only if there exists an alter- 
nating matrix E whose elements are rational 
integers such that Q'E""Q=0, J/-i Q'E "OQ 
>0 (positive definite Hermitian matrix). In 
particular, a period matrix Q satisfying these 
conditions is called a Riemann matrix, and the 
rational matrix ‘E~! is called the principal 
matrix belonging to Q. 

Determining a polarization on an Abelian 
variety can be reduced to designating a class of 
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T” = C”/D, and let f be a theta function such 
that (f)=X. Then the divisor X is nondegen- 
erate if and only if the theta function f is non- 
degenerate, and the latter statement holds if 
and only if the alternating matrix E obtained 


from f is tinvertible. For a given principal 
matrix 'E~', we can choose suitable coordinate 
systems of C” and D so that Q=(.,, F) and E = 


—A 0 
a diagonal matrix whose elements are telemen- 
tary divisors of E. In these situations, AF is 
symmetric and its imaginary part positive 
definite, i.e., AF is a point of the tSiegel upper 
half-space S,,. Thus to the polarized Abelian 
variety (where the polarization is determined 
by 'E~') there corresponds a point of G,. This 
gives a one-to-one correspondence between 
the isomorphism classes of Abelian varieties 
inhomogeneously polarized by a principal 
matrix with given elementary divisors and the 
points of the factor space G,,/T,(A), where 
T,(A) is a subgroup that is tcommensurable 
to the Siegel tmodular group of degree n and 
operates on S, discontinuously. The case 
of principal polarization corresponds to the 
case when A= 1,, and then I,(A) is the Siegel 
modular group itself. S,,/T,(A) is the coarse 
tmoduli space of Abelian varieties, polarized 
as above; the projective embedding of S,,/T,(A) 
is given by means of tSiegel modular forms. 


0 A 
( ) where J, is a unit matrix and A is 


J. Abelian Functions 


Meromorphic periodic functions on C” with 
periods @,...,@5, (i.€., meromorphic func- 
tions on T=C"/D, D = Za, @... ® Za,,) are 
called Abelian functions with periods D (or on 
T). The quotient of two theta functions having 
the same periods and the same transforma- 
tion formulas is an Abelian function, and con- 
versely any Abelian function can be written as 
such. All Abelian functions on T form a field 
C(T) called the Abelian function field. If T has 
the structure of an Abelian variety, then C(T) 
coincides with the field of rational functions on 
T. In general, for any complex torus T there 
are an Abelian variety T’ (possibly of dim 0) 
and a surjective homomorphism 4:T >T 
which induces an isomorphism 4*:C(T’)s 
C(T). Such a pair (T’, 4: TT’) is unique up 

to isomorphism, and is called the talgebraic 
reduction of T. 


K. Homomorphisms 


Let C" =(R?", J) (i=1, 2) be complex linear 
-ya wm 2 BD l j 


i=1, 2. If a C-linear mapping A:C"+C™ 
satisfies A(D,) < D,, then A induces a com- 
plex analytic homomorphism of T, =C"/D, 
to T, =C"/D,. Conversely, any complex 
analytic homomorphism of T, to T, is ob- 
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tained in this way. Let T, and T, be Abelian 
varieties, and let Q, =(w”, ..., oS) and Q, 
=(o)’), ..., wf.) be their Riemann matrices. 
Then for a homomorphism 1:T, >T,, we 

can find a representation matrix W(A) with 
complex coefficients; and with respect to the 
real coordinate systems («(", ..., af) and 

(wi, ..., a4), we can find a representation 
matrix M(A) with coefficients in Z such that 
W(A)Q, =Q, M(A). Conversely, if for a complex 
matrix W there is a matrix M, with coefficients 
in Z, satisfying the relation given above, then 
W gives a homomorphism of T, to T,. The 
above equation is called Hurwitz’s relation. 
The notion of l-adic coordinate system, which 
is valid for a general characteristic, corre- 
sponds to that of the lattice group, and the l- 
adic representation M,(A) of å is the abstrac- 
tion of the integral representation M (2). 


L. Abelian Integrals 


Let R be a compact *Riemann surface of genus 
g21(— 11 Algebraic Functions) and let w be 
a sum of tAbelian differentials of the first kind 
or of the second kind. Then the ‘period of œ 
along a cycle y depends only on the thomology 
class of y. The set of all differentials of the first 
kind forms a complex linear space of dimen- 
sion g; we denote it by Dy. Let P be a point 
and P, a fixed point of R; then we denote by 
u(P) the vector integral (fp 01, -~ f, ©) 
where (@,,...,@,) is a basis of Dy and the path 
from P, to P is common to every integral. The 
correspondence P—u(P) is not a single-valued 
mapping; the totality of differences of values 
of u(P) coincides with the group D consisting 
of periods (fo, sata w), where y varies over 
all cycles. Let a set of cycles {y;, .-. ,Y2g7 be a 
basis of the homology group, with coefficients 
in Z; then 2g column vectors of the g x 2g 
matrix Q=(c,), @; J,,0;, are linearly inde- 
pendent over R. Since the group D coincides 
with the set of linear combinations, with co- 
efficients in Z, of column vectors of Q, this D 
is a flattice of rank 2g, and the matrix Q is a 
period matrix of the complex torus T? = C9/D 
of dimension g. 

For a basis of the homology group with 
coefficients in Z, we take tnormal sections 
Xis- Xg, gs -> X2g Of R, and let the same 
symbol Q stand for the period matrix («,)), 
w= fa, %; then we have 


/-1 QE'Q>0 


(positive definite Hermitian matrix), where 


QE'O=0, 


0 L , ; : 
E= ] 0 (with J, the unit matrix). 


This implies that E is a principal matrix be- 
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longing to Q; we call the equality and inequal- 
ity just given Riemann’s period relation and 
Riemann’s period inequality, respectively. Fur- 
thermore, we can choose a suitable basis of Do 
so that the period matrix Q is of the form 

(,, F) with Fe S, (— Section I). We consider 
the function 9(u), u=(u,,...,u,), defined by an 
infinite series 


G(u)= PV exp(2x,/ —1 (m'u +4mF'm)), 


where the sum is taken over all row vectors 
m=(m,,...,m,) with coefficients in Z. If u is in 
a bounded region, then the series for (u) is 
uniformly and absolutely convergent. Hence 
3(u) is a holomorphic function of u. This is 

a theta function corresponding to the princi- 


0 7 
pal matrix E -( i a and is called the 
~*g 


Riemann theta function. As 3(u) is nondegen- 
erate, the complex torus T? = C9/D has the 
structure of an Abelian variety. If we regard 
the Riemann surface 9R as an algebraic curve, 
this Abelian variety T’ is precisely the Jaco- 
bian variety of the curve R. The collection of 
zeros of &u), a divisor on TY, defines the ca- 
nonical polarization on the Jacobian variety 
(— Section G). 

The correspondence P—u(P), pe induces 
a well-defined mapping ọ of R into T? = 
C/D. Moreover, if we set p(A)= 25-1 e(P) — 
È- P(Q,) for any divisor A=P,... P./Q, ...Q, 
of degree 0, then (A) is a point on T? = C/D, 
which is represented by the vector (£f, Soi W, 
ESTAN fg) of C%, and the mapping A> 
(A) is a homomorphism of the group o of 
divisors of degree 0 onto T? = C?/D. The kernel 
of this homomorphism coincides with the 
group ©, of tprincipal divisors (Abel’s theo- 
rem). Hence a divisor A =P, ... P./Q, ...Q, of 
degree 0 is a divisor of some function if and 
only if we have X- foo: =0 (mod D) (i= 
1,...,g), or equivalently, the left-hand side is 
0 for a suitable path. 

Given g fixed points P,,...,P, on R and 
given (u,,...,U,) as any vector of C%, the prob- 
lem of finding g points Q,,...,Q, satisfying 
relations È$- [P!@;=u; (mod D), i=1,...,g, is 
called Jacobi’s inverse problem. To solve the 
problem, we take a divisor A of degree 0 such 
that the class (A) (mod D) is represented by 
(u,,...,u,); then, by virtue of the *Riemann- 
Roch theorem, there exists a divisor Q; ...Q, 
satisfying 


A=Q, ...Q,/P, ... P, (mod G)). 


Abel’s theorem implies that the set of points 
{Q;,...,Q,} is a solution of Jacobi’s problem. 
Moreover, for general (u,,...,u,), the solution 
is unique; i.e., there exists a subvariety ¥ of 
dimension g—2 on C such that the solution ts 
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unique up to order if and only if (u,, ...,u,) 
does not lie on &. In particular, if every point 
P, coincides with the fixed point P, that ap- 
peared in the definition of u(P), the subvariety 
on T?=C4/D determined by & is obtained in 
the following way: Let W,+...+W,,-, bea 
canonical divisor on ®, and put c=@(W,)+... 
+ p(W,,~2). Then the locus of points c— @(R,) 
~...—@(R,_2) (where g—2 points Ry,...,R,_» 
are taken independently over all points of 9R) is 
the desired subvariety. 

In terms of the theory of complex manifolds 
the above result can be rewritten as follows: Let 
SYR) be the symmetric product of R and X the 
subvariety of T?=C*/D induced from ¥. Then 
the holomorphic map S%@): SR) T? induced 
from g:R-T* is bimeromorphic and isomor- 
phic outside that X whose elements corre- 
spond to effective divisor classes of degree g 
contained in canonical divisors on R. 


M. Elementary Abelian Functions 


Let z be a nonconstant meromorphic func- 
tion on R. Then for any u=(u,,...,u,)eC? 
that does not lie on &, there exist g points 
Q,,...,Q, (uniquely determined up to order) as 
the solution of Jacobi’s problem. Therefore 

the elementary symmetric functions 


g 
-> Salu; z)= | | 2(Q;) 
i<j j=l 
are well defined if u lies outside the variety ¥ 
of dimension g —2. Each function s;(u; z), re- 
garded as a function of u, can be extended 
uniquely to an Abelian function in the whole 
space C%; the function is denoted by the same 
symbol s,(u; z). The Abelian functions s; (u; z), 

s,(u; z) are called the elementary Abelian 
functions obtained from z. 

Now let K be the field of Abelian functions 
on C"/D and k the field of meromorphic func- 
tions on R; then the dimension of K over C is 
g, and [K : C(s; (u; z), ..., 8y(u; z))] =r, where r 
is the degree of the function z that is given by 
[k:C(z2)]. Moreover, if we take any function w 
such that k = C(z, w), then we have 


K=C(s,(u;z),..., (u; w),..., 


S,(u; w)), 


where s; (u; w), ...,5,(u; w) are the elementary 
Abelian functions obtained from w. 

We can write any elementary Abelian func- 
tion as a rational function of Riemann theta 
functions; therefore any Abelian function can 
be written as a rational function of Riemann 
theta functions. Furthermore, if u and v are 
variable vectors, then s,(u+ v; z) can be repre- 


54(U; z); Sy 
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sented as an algebraic function of s,(u;z),..., 
S,(U; z), sı (0; Z), ...,5,(v; Z); i.e., we can choose 
a suitable polynomial H,(Z; X,,...,X,3 Yi, 

., Y,) with coefficients in C so that H,(s,(u+ 
v; Z); S1 (U; Z), ...5,(U; Z); $1 (0; Z), -..,5,(v;Z)) =9. 
This algebraic addition formula with respect 
to the elementary Abelian functions s;(u; z), i= 
1,...,g, is a function-theoretic interpretation 
of the fact that the addition map X x X > 
X :(x,y)>x+y is a tmorphism of algebraic 
varieties. 

As the study of tAbelian integrals of the first 
kind led us to the theory of Jacobian varieties, 
*Abelian integrals of the second and the third 
kind give rise to the theory of tgeneralized 
Jacobian varieties (— 9 Algebraic Curves). 

The theory of Abelian varieties has signifi- 
cant applications to number theory, as shown 
by the following examples: the theory of tun- 
ramified Abelian extensions with respect to a 
function field of several variables defined over 
a finite field (S. Lang), the theory of heights of 
points on an Abelian variety (Weil, A. Néron, 
J. Tate), and the theory of complex multipli- 
cation (— 73 Complex Multiplication) in the 
case of higher dimensions (Y. Taniyama, G. 
Shimura). 


N. Some Recent Results 


(1) Level Structure, Moduli of Abelian 
Varieties. Let A be an Abelian variety over k 
of dimension g and n a positive integer which 
the characteristic of k does not divide. A level 
n structure on A is defined to be a set of 2g 
points o,,...,02, ON A which form a basis for 
the group of points of order n on A. 

Let A(g, d, n; k) be the set of triples: (i) an 
Abelian variety A over k of dimension g, 
(ii) an inhomogeneous polarization X on A 
with v(py)=d?, and (iii) a level n structure 
Ci, .++,0, Of A, all up to isomorphisms. Simi- 
larly we can define A(g, d, n; S) for Abelian 
schemes over a scheme S. The correspondence 
S— A(g, d, n; S) defines a functor £ (g, d, n). 
D. Mumford has shown that there exists 
the tcoarse moduli scheme A(g, d, n) quasi- 
projective over Spec(Z[1/n]), and that it is even 
fine if n>3[6]. He used the technique of fHil- 
bert schemes and ‘stable points (— 16 Alge- 
braic Varieties). One of the key steps of his 
proof is to show that for an embedding ¢: A> 
P” of degree r (i.e., the degree of (A) in P”) 
over an algebraically closed field k and a posi- 
tive integer n such that char(k) does not divide 
nand n>,/(m+ 1)r, the point (A(x;));-1 229 
in (P”)"”’ is stable with respect to the action 
of PGL(m), where the x; are the points of order 
n on A (with an arbitrary order). Mumford 
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later showed another method of constructing 
the moduli of polarized Abelian varieties by 
using algebraic theta constants [7]. 


(2) Néron Minimal Models, Good and Stable 
Reduction. Let R be a discrete valuation ring 
with residue field k and quotient field K. For 
an Abelian variety A over K, there exists a 
smooth group scheme .o of finite type over S 
=Spec(R), called the Néron minimal model of 
A, such that for every scheme S’ smooth over S 
there is a canonical isomorphism 


Hom,(S’,.)= Hom ,(S,, A), 


where Sx is the pullback of S’ by Spec(K)—> 
Spec(R) (A. Néron, M. Raynaud). In particu- 
lar, we have £g = A. Denote by Ag the fiber of 
W over the closed point of S. 

If . is proper over S, we say that A has a 
good reduction at R. If the connected compo- 
nent Aj of Ay containing 0 has no unipotent 
radical (or equivalently, Aj is an extension of 
an Abelian variety by an algebraic torus over 
a finite algebraic extension of k), we say that A 
has a stable reduction. If there is a finite sepa- 
rable extension K’ of K with a prolongation R’ 
of R to K' such that A x g K’ has good (stable) 
reduction, we say that A has a potential good 
(stable) reduction at R. Let K, be a separable 
closure of K and R a prolongation of R to 
K,. For a prime number /#char(k), we have 
a canonical homomorphism p:Gal(K,/K)—> 
Aut(6,(A)), called a monodromy. Then A 
has potential good reduction if and only if the 
image of the finertia group I(R) by p is a finite 
group (J.-P. Serre and Tate). Every Abelian 
variety A over K has potential stable reduc- 
tion at R (stable reduction theorem, A. Gro- 
thendieck [16]). 


(3) Graded Ring of Theta Functions. If f is a 
theta function with a period system D, f” is 
also a theta function with the same period 
system for every positive n. We denote by S, 
the vector space of the theta functions with the 
period system D subject to the same trans- 
formation law as f”. If we denote by X the 
effective divisor on T=C*/D defined by f, then 
S, can be naturally considered as the fdefining 
module of the ‘complete linear system of nX, 
and the dimension of $, (=[(nX)) is equal to 
the product of the nonzero diagonal elements 
of nA (— Section I) (Frobenius). If ge S,, and 
heS,, then gheS,,.,; hence S= @,505, 1s a 
*graded ring, which is normal and finitely 
generated. For m>2 and n È 3, the product 
map S,, X Sp > Smin is surjective (D. Mumford, 
S. Koizumi). If the elementary divisors of E 
can be divided by an integer > 4, the kernel of 
the natural graded mapping S(S,)—S (where 
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S(S,) denotes the tsymmetric algebra over S,) 
is generated by the quadratic relations (1e., the 
part of degree 2) for sufficiently large degrees 
(Mumford). Geometrically this means that if 
X =(f), feS,, is nondegenerate, then, with the 
projective embedding i: T-+P* defined by the 
complete linear system of X, i(T) is an inter- 
section of quadrics in P“ containing i(T). 

Mumford has developed a theory of alge- 
braic theta functions that also works for the 
positive characteristic case [7] and has proved 
the above results in general. 
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A. General Remarks 


Let N be the set of natural numbers, and let A, 
B,... be subsets of N. The sum C=A+ Bis 
defined as the set {c|ce A, ce B, or c=a+b, 
ae A, be B}. A finite sum of subsets of N is 
defined similarly. If N is the sum A+...+A 

(r times), then we say that A is a basis of order 
rin N. Let A(x) denote the number of integers 
in A that do not exceed x. The density of A 

is defined as inf A(x)/x. If A(n)> an, B(n)> 

pn (O<a, B <1) for all neN, then we have 
(A+ B)(n)>(min(1, «+ B))n. This result was 


stated by E. Landau (1937) without proof; A. J. 


Khinchin had given a proof (1932) for the case 
a=, and H. B. Mann (1942) and E. Artin and 
P. Scherk (1943) succeeded in proving the 
statement for the general case. Suppose that 
the densities of A and B are, respectively, « 
and 0. If B is a basis of finite order in N, then 
the density of A + B is greater than that of A 
(Khinchin and P. Erdés, 1936). Let P be the 
set of all prime numbers. Though the density 
of P is 0, the density of P+ P is positive (L. G. 
Shnirel’man, 1930). Hence P is a basis of finite 
order in N; in other words, there exists a posi- 
tive integer r such that every natural number 
can be expressed as a sum of at most r primes. 
Though the density of the set Q of the kth 
powers of natural numbers is 0, there exists a 
positive integer s(k) such that the sum Q +... 
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+ Q (s(k) times) is a set of positive density. L. 
K. Hua (1956) gave a simple proof of this fact, 
based on Yu. V. Linnik’s idea. It follows, 
therefore, that any natural number n can be 
expressed as a sum n=ai+...+a*, where 
a;E N, t>s(k). This result had already been 
shown by D. Hilbert (1909). 

An ancient method of finding prime num- 
bers is fEratosthenes’ sieve. V. Brun (1920) 
devised a new sieve method to express an 
arbitrary integer n as the sum of two integers 
n=a+b, where the number of prime factors 
of a and b is as small as possible. This method 
was improved by H. A. Rademacher (1924), 
Landau (1931), A. Bustab (1937), and others. 
Among these the method found by A. Selberg 
(1952) is notable (— 123 Distribution of Prime 
Numbers E). 


B. Farey Sequences 


Let t be a positive integer. We arrange in 
increasing order the set of all positive irreduc- 
ible fractions lying between 0 and 1 whose 
denominators do not exceed t. This sequence 
is called the Farey sequence of order t. For 
example, the Farey sequence of order 5 con- 
sists of 


Í 1 1 2 ł 
Í PPPE 


=| © 


4 1 
sae 


fiw 


2 
> 3° 


in| Ge 


A necessary and sufficient condition that a 
fraction a/b be directly followed by a fraction 
c/d in the Farey sequence of order n is b+d> 
n+1, bc—ad=1. In this case the fraction 
(a+c)/(b+4) is called the mediant of a/b and 
c/d. Interpolating the Farey sequence of order 
n with such mediants (a+c)/(b + d) satisfying 
b+d=n-+1, we obtain the Farey sequence 

of order n+ 1. 

Let a/q be a fraction in the Farey sequence 
of order t, and a’/q’, a”/q” be adjacent mem- 
bers of a/q in the sequence such that a’/q’< 
a/q <a" /q’. The interval [(a’ + a)/(q' +q), 
(a+a")/(q+q")] is known as the Farey are 
surrounding a/q. In particular, if a/q=0/1, 
then we set [—1/n, 1/n] to be the Farey arc 
surrounding 0/1, where n=[t]4+1([ ] is the 
tGauss symbol). We can thus decompose the 
interval [—1/n, 1—1/n] into a disjoint union 
of Farey arcs. If « is contained in the Farey 
are surrounding a/q, then |a~a/q| <1/qt. 
Therefore, for a given tÈ 1 and a real a, we 
can prove the existence of a/q such that 


(la|,q)=1, |a—a/q|<1/qr. 


Let f(Q=Z 04,6" be a power series which 
is convergent in the disk |¢| < 1. Then, by 
Cauchy’s integral formula, 


O0<q<t, 


13 


1 
1 FQ) «=f flere At da. 


i nae. +1 
2ni Jiao” 0 


To estimate such an integral, we sometimes 
utilize the decomposition of the interval [0, 1] 
into a disjoint union of Farey arcs as men- 
tioned above. This method is called the circle 
method, and the subdivision of the interval is 
known as the Farey dissection. 

Given a positive number c, a Farey arc 
around a/q is usually called a major are (or 
basic interval) if q does not exceed the given 
bound c; otherwise, it is called a minor are (or 
supplementary interval). Usually, the principal 
part of the previously mentioned integral is 
derived from the integral over the major arcs, 
and the residual part is provided by the in- 
tegral over the minor arcs. 


C. Goldbach’s Problem 


Goldbach’s problem is found in letters (1742) 
he exchanged with L. Euler. In them he stated 
that every positive integer can be expressed as 
the sum of primes. More precisely, he conjec- 
tured that any even integer not smaller than 6 
can be expressed as the sum of two odd primes 
and that any odd integer not smaller than 9 
can be expressed as the sum of three odd 
primes. 

I. M. Vinogradov (1937) proved that every 
sufficiently large odd integer can be expressed 
as the sum of three primes. Let N be a suffi- 
ciently large odd integer. If we write 


A(q, N)= Hla) X exp( -25N ), 


3 
o` (q) \<a<q 
(a,q)=1 


then the series S(N) = 72, A(q, N) is ab- 
solutely convergent and is equal to 


1 1 

u(i n= mae ey, 
It is known that S(N)> 6/x? for all N. If we 
denote by r(N) the number of solutions of 
N=p,+P2+Ps, then r(N)~(N*/2(log N)”)- 
S(N). To prove this, Vinogradov used the cir- 
cle method. He employed the tprime number 
theorem for arithmetic progressions to esti- 
mate the integrals over the major arcs and 
devised an ingenious method to estimate the 
series 1, <y€xp(2ziap) in the computation of 
the integrals on the minor arcs. A finite or 
infinite sum of exponential functions such as 
this is called a trigonometric sum. More gener- 
ally, we consider trigonometric sums of several 
variables. Vinogradov provided detailed re- 
marks and calculations [6]. 

In the case of even integers, the problem is 
still unsolved, although J. G. van der Corput, 
T. Estermann, and N. G. Chudanov proved 
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simultaneously (1938) that almost all even 
integers (i.e., except a set of density 0) can be 
expressed as the sum of two primes. For these 
problems, Linnik (1946) and Chudanov (1947) 
introduced function-theoretic methods. They 
obtained the density theorem concerning the 
zeros of L-series, and A. Zulauf (1952) con- 
tinued along the same lines. These methods 
had been suggested by G. H. Hardy and J. E. 
Littlewood (c. 1919), although they had as- 
sumed that the textended Riemann hypothesis 
held. 

We denote by P, a number having at most j 
prime factors counted with their multiplicity. 
Using his sieve method, Brun (1919) proved 
that every sufficiently large even integer 2n can 
be expressed in the form 2n = P, + Py. Radema- 
cher (1924) improved this result and obtained 
2n=P,+ P}. Applying hts new sieve method to 
this problem, Selberg (1950) proved that 2n= 
P, + P}. On the other hand, using the tlarge 
sieve, A. Rényi proved that 2n = P, + P, for 
some k. Afterwards, combining Richert’s sieve 
with.a large sieve, Chen Jing-Run (1973) 
proved that 2n = P, + P, for large 2n [11]. 


D. Polygonal Numbers 


Let m be an integer greater than 3, and let 
a,=1, a,.,-—a,=(m—2)n+1 (n=1,2,...). 
The sequence {a,} forms the system of polyg- 
onal numbers of order m. The general term 

of {a,} is given by n+4(m—2)(n? —n) (n= 
1,2,...). Such a, are said to be triangular 
numbers if m = 3, square numbers if m = 4, and 
pentagonal numbers if m= 5. 

P. Fermat (1636) stated that every natural 
number can be expressed as the sum of m 
polygonal numbers of order m. This conjecture 
was proved by A. M. Legendre (1798) for m= 
3, by J. L. Lagrange (1772) for m=4, and by 
A. L. Cauchy (1813) for the general case. With 
regard to Lagrange’s result, Legendre noticed 
that in order that a positive integer n be ex- 
pressed as the sum of three squares, it is neces- 
sary and sufficient that n not be of the form 
4°(8m +7). 

Given a positive integer n, the number of 
integral solutions of the equation xj +x3+... 
+x?2=n is denoted by r,(n). For example, r,(5) 
= 8. The tgenerating function Lz r,(n)n“* 
can be expressed as 07%. _ „(m° +n7)~*, where 
the term corresponding to m=n=0 is omitted; 
the function is equal to 4¢,(s), where €,(s) is 
the tDedekind zeta function of the Gaussian 
number field K = Q(./=1). The equation 
40,(s)=4€(s) L(s, x) (where x(n) =(—4/n)) leads 
to 
ra(n)=4}), (10, 


m|n 


4E 
Additive Number Theory 


where >’ means the sum over all odd factors 
m of n. This result was obtained by C. G. J. 
Jacobi (1829). He also obtained the following 
formula: 


ra(n)=8 Y m, 


min. 4m 


where >’ means the sum over all divisors m of 
n not divisible by 4 (Hardy and E. M. Wright, 
C. L. Siegel, 1964). Let q =exp(2zit) (Imt > 0), 
and 


fg=t+ ¥ ring" 
n=1 


=(1 +2q +244 +24? +... Y. 


Hardy (1920) considered the variation of f(q) 
for q=aexp(2zih/k) (with 0<a < 1) as a—>1; he 
obtained 


S 5 1 —s/2 
fi~”? (==) log (*) 
a 


where S, «= Èt- exp(2zi(h/k)j*). Furthermore, 
he constructed the singular series 


3/2 nisi?) a 


eT A ASE 
; ro ee ^ 
2 l h 
Ask ¥ (S,grexp(—2nifn ‘ 
Ishak i k 
(h,k}=1 


and showed that r,(n) = p,(n) for s= 5, 6, 7, and 
8. P. T. Bateman (1951) proved the same equa- 
tion for s=3 and 4. If s>9 then r.(n)= p(n) + 
O(n*'*) (Hardy, Littlewood, and S. Rama- 
nujan, c. 1919). The detailed exposition of this 
result is in the notes of A. Z. Walfisz (1952) 
and Rademacher [4]. H. D. Kloostermann 
(1926) and Estermann (1962) studied the equa- 
tion ax? + bx} +cx3 +dxz=n, which led toa 
new field of study concerning the tKlooster- 
mann sum. For instance, estimates such as 


poi xtd 
L exp (2x Š =) <2/p 


are obtained by using the theory of zeta func- 
tions in algebraic function fields in one vari- 
able (A. Weil, 1948) (— 450 Zeta Functions). 


E. Waring’s Problem 


The first formulation of Waring’s problem is 
found in E. Waring, Meditationes algebraicae 
(1770), in which he discusses the problem of 
expressing an arbitrary positive integer as the 
sum of at most nine cubes or as the sum of at 
most nineteen biquadratics. Hilbert proved (— 
Section A) that there exists a positive integer 
s(k) such that, for any integer N, the equation 


xt + xš +... +x =N 
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has a nonnegative integral solution if s > s(k). 
We denote by g(k) the least value of s(k), and 
by G(k) the least value of s(k) for which the 
equation is solvable with at most finitely many 
exceptions of N. Research concerning g(k) and 
G(k) received its initial impetus from the circle 
method considered by Hardy and Littlewood, 
and it underwent considerable development in 
the works of H. Weyl and Vinogradov. 

Let r,(N) be the number of solutions of the 
above equation. We then have 


1 S 
an=] ( > exp(2niax) ) 
o \xsnilk 


x exp(—2niNa)da. 


If we make the Farey dissection, translating 
the interval [0, 1] slightly, then the main term 
of r,(N) is provided by the integrals over major 
arcs, and the residual term is derived from the 
integrals over minor arcs. Let p be a prime, 
and let M(N, p’) denote the number of solu- 
tions of the congruence equation 


xtaxk+...+x'=N(mod p’). 


Then lim)... M(N, p/p’ =x,(N) is not zero, 
and the infinite product [], 7,(N)=S(N) con- 
verges for s >4k, where S(N) is larger than a 
positive constant which is determined inde- 
pendently of the choice of N. On the other 
hand, let 


q-1 a 
Slag) = ¥. exp (2zi : 
q 


x=0 


A(g,N)=q7"° © Sla, avexp( ~22iN ), 


l<a<q 


(a,q)=1 


Then 22, A(q, N) is absolutely convergent, 
and the sum is equal to S(N). According to 
Hua (1959) we have 
r(N)~ s) SED. ys, 

T(s/k) 
provided that s>2k?(2logk +loglogk +c). 
Next, if we denote by V(N, ô) the volume of the 
closed region satisfying N<x{+x5+...+ 
x* <N +ô in the s-dimensional Euclidean 
space, then lim;_,, V(N, 6)/6 = Xa (N) exists and 
is equal to ((1 + 1/k)5/P(s/k)) N**"!. Hence we 
can show that the principal part of r,(N) is 
equal to the infinite product ĮI, x,(N), where p 
runs over all finite and infinite prime spots in 
Q. This is a generalization of the singular 
series studied by Hardy. 

With regard to g(k), there are studies by L. 
E. Dickson (1936) and others. It is easy to see 
that g(k) > 2* + (3/2)* —2 and that G(k)>k+1. 
It has been shown that g(3)=9, G(3)<7 (G. L. 
Watson, Linnik, 1947) and that g(4) < 37, 
G(4)= 16 (H. Davenport, 1939). More gener- 
ally, Vinogradov (1959) proved that 
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G(k)<2klogk + 4k loglogk 


+ 2klogloglogk + ck. 


To prove this, Vinogradov (1934) introduced 
the following integral, which is closely related 
to the fprime number theorem: 


1 1 
TEES 
o 0 


Y exp(2ni(a,xX+a,Xx7 +... 


x<P 


2s 


+ 0%,x*)) 


Hua (1949) improved Vinogradov’s result and 
proved that if s>4k(k+1)+ Ik, then 


I(P)<(5s)5" (log P)?! P25~ KR + 12 +8, 


where 6 =4k(k + 1)(1—1/k)'"'. Concerning 
I(P), another notable approach was made by 
A. A. Karacuba and N. M. Korobov (1963). 
Further investigation proved that I(P)= 
Cres P2s— kk +1)/2 + o( P2575 +12) if s > ck? logk 
(Vinogradov, Hua, 1959). The result is called 
the Vinogradov mean value theorem. 

There are many variations and generaliza- 
tions of this theorem. Vinogradov and Hua 
(1944) studied the problem of representing an 
arbitrary N as N=p*+p3+...+p* (with p; 
prime). Hua (1937) and others also considered 
the problem of representing N as N = f(x,)+ 
f(xa)+ ...+ f(x), where f(x) is a given poly- 
nomial. Also, let 


n n n 
C(%1,X 25-0 Xn) = Dy X. D C jp XiXjXx 


j=1 k=1 


ll 
= 
Sw 
li 


be a homogeneous polynomial of degree 3 
with integral coefficients. Davenport (1963) 
proved that if n> 16, then the equation 
C(x,,X>,...,X,)=0 has at least one nontrivial 
integral solution. There are further develop- 
ments of the theory of representations of in- 
tegers by forms in many variables by V. A. 
Tartakovskii, H. Davenport, B. J. Birch, and 
D. J. Lewis. 


F. Additive Problems in an Algebraic Number 
Field 


Siegel (1922) considered the generalization of 
Hardy’s square sum problem to the case of an 
algebraic number field. He later (1945) studied 
the generalized Waring’s problem in an alge- 
braic number field K of finite degree: Let I 

be the principal order of K, and let J, be the 
subring of I generated by the kth powers of 
integers in K. It is easily seen that the index 

(I: J,) is finite. Hence our concern regarding 
s(k) must be restricted to integers contained in 
J,. Another question is how to extend the 
Farey dissection to an algebraic number field. 
Siegel succeeded in solving these difficulties. 


da, ... day. 
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His ingenuity is seen in his way of dealing with 
the minor arcs, which provided a stimulus to 
the research of T. Mitsui (1960). 

A generalization of Goldbach’s problem to 
the case of an algebraic number field was 
obtained by Mitsui (1960) and O. Korner 
(1961). 

As another extension of the Vinogradov 
three primes theorem, Mitsui (1971) proved 
the following theorem: Let K be an algebraic 
number field of degree n. Let C be the prin- 
cipal ideal class generated by a totally positive 
number in K, and P be the set of prime ideals 
of degree 1 contained in C. Let N be a positive 
integer and J,(N) be the number of representa- 
tions of N as the sum of the norms of s prime 
ideals belonging to P, 

I,(N)= y. 1, peP (I<i<s). 
N=Np; +... + Np 

If N is sufficiently large and s È> 3, we have the 

asymptotic formula 


Ns} _, loglogN 
+0( N), 

(log N} ( (log N ) 
where A, is a positive constant depending on s 
and K independent of N, and S(N) denotes the 
singular series. If s= N (mod 2D), where D is 
the discriminant of K, then S(N)>c>0, where 
c is a constant. Later this problem was ex- 
tended by Mitsui and T. Tatuzawa (1981). 


I,(N)=A,S(N) 
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A. General Remarks 


The study of sums of tindependent random 
variables has been one of the main topics in 
modern probability theory (— 250 Limit 
Theorems in Probability Theory). Combin- 
ing the ideas of this study with the considera- 
tion of tstochastic processes with continuous 
time parameter, we get the notion of additive 
processes. 


B. Definitions and Fundamental Properties 


A real-valued tstochastic process {X;}o <:<u> 
denoted by X(t) (O<t<<c) for the rest of this 
article, where for simplicity we assume that 
X(0)=0, is called an additive process (or pro- 
cess with independent increments) if for any to < 
ty<...<t,, X(t;)—X(t;-1) (= 1, 2,...,n) are 
tindependent. An additive process is essentially 
the same as a tspatially homogeneous tMar- 
kov process (i.e., a Markov process on R! that 
is invariant under translations). When, for any 
h>0 and t>s, X(t+h)—X(s+h) and X(t)— 
X (s) have the same law, i.e., the distribution 
law of X(t)— X (s) depends only on t—s, we 
call the additive process X(t) temporally homo- 
geneous. This is essentially the same notion as 
a temporally and spatially homogeneous Mar- 
kov process. 

Let X(t) be a given additive process. If f(t) is 
a function of t only, then clearly Y(t)= X(t)— 
f(t) is also an additive process, and we can 
choose f(t) in such a way that for every t>0 
and for every sequence s,,ft (s,|t), Y(s,) con- 
verges with probability one. Here, lim Y(s,) 
is independent of a particular choice of s,,, 
and we denote it by Y(f—) (Y(t+)). We call 
such Y(t) a central process and also say that 
Y(t) is obtained from X(t) by centering. This 
f(t) is given, for example, by the condition 
E(arctan(X (t)— f(t)))=0. 
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Let Y(t) be a centered additive process. 
Then Y¥(t—)= Y(t)= Y(t+) for all t>0, except 
on an at most countable t-set S, and tes is 
called a fixed point of discontinuity of Y(t). 
Then Y,(t)=lim,_,,. U,(t) exists with proba- 
bility 1, where 
U= $} (¥(s,+)— ¥(s;—)) 


O<i<n 
S;<t 


+ ¥(t)—¥(t—)—G, 


nox 


C? being a constant determined by E(arctan 
U,(t))=0, and S={s,} (j=1,2,...). Y(O= 
Y(t)— Y, (t) is a centered additive process 
without any fixed point of discontinuity. Fur- 
thermore, Y,(t) and Y,(t) are independent. 
Thus we have a decomposition of Y(t): Y(t)= 
Y (t)+ Y (t), where Y,(t) and Y,(t) are mutu- 
ally independent additive processes. The struc- 
ture of Y, (t) is simple, and it is not worthwhile 
to study its behavior in more detail. On the 
other hand, since Y,(t) is a centered additive 
process without any fixed point of discontinu- 
ity, it is an additive process that is tcontinuous 
in probability. Let Y,(t) be a separable modi- 
fication of Y,(t). Then the discontinuities of 
almost all sample functions of Y,(t) are of the 
first kind. If we set Y*(t)= ¥,(t+), Y(t) is a 
‘modification of Y,(t), and almost all sample 
functions of Y;*(t) are right continuous and 
have a left-hand limit at every t. In the study 
of the process Y,(t), it is always convenient to 
take such a modification. Thus we give the 
following general definition: An additive pro- 
cess is called a Lévy process if it is continuous 
in probability and almost all sample functions 
are right continuous and have a left-hand limit 
at every te[0, 00) [3,9]. 

The notions of additive processes and Lévy 
processes can also be considered for R%-valued 
processes. 


C. Additive Processes and Infinitely Divisible 
Distributions 


Let X(t) be a Lévy process and ®,, (s <t) be 
the fdistribution of X (t)— X (s). Then ®,, is an 
‘infinitely divisible distribution {(— 341 Proba- 
bility Measures G). Conversely, for a given 
infinitely divisible distribution ® we can con- 
struct an essentially unique temporally homo- 
geneous Lévy process X(t) such that ® coin- 
cides with the distribution of X (1). If X(t) is 
temporally homogeneous, the ‘characteristic 
function @,,(z)= E(e!"*- *) of the distri- 
bution ®,, is given in the form @,, =exp((t— 
s)w(z)); hence the law of the process X(t) is 
completely determined by the function w(z). 
By the *Lévy-Khinchin canonical form, (z) is 
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written in the form 
W(z)= imz —52* -| (e —1- =) md, 
(1) 


where m, ve R, v>0, and n(du) is a nonnegative 


x 


w u? 


+u? 


measure on R — {0} such that | n(du) 


< oo. These m, v, and n(du) are uniquely deter- 
mined by w(z). 


D. Basic Additive Processes 


Wiener Process. When almost all sample func- 
tions of a Lévy process X(t) are continuous, 
the distribution of X (t)— X (s) is a ‘normal 
distribution. If, further, X(t) is temporally 
homogeneous, (z) has the form y(z)= imz — 
3vz?. In particular, if m=0 and v=1, then 

X(t) is called a Wiener process or Brownian 
motion. This stochastic process was introduced 
by N. Wiener (1923) as a mathematical model 
for the random movement of colloidal par- 
ticles first observed by a British botanist, R. 
Brown. This is one of the most fundamental 
and important stochastic processes in modern 
probability theory (— 45 Brownian Motion). 


Poisson Processes. When almost all sample 
functions of a Lévy process are increasing step 
functions with only jumps of size 1, the distri- 
bution of X(t)— X (s) is a Poisson distribution. 
If, further, X (t) is temporally homogeneous, 
w(z) in (1) has the form w(z)=A(e” — 1) (A>0), 
and X(t) is called a Poisson process. Let X (t) 
be a Poisson process, and let Tj, T+ 7,, Ty + 
T, + T),... be successive jumping times of a 
sample function X(t). Then 7, Ti, T5,... is 

a sequence of mutually independent ran- 
dom variables with the common exponential 
distribution P(Tedt)=e “dt. Conversely, 
given such a sequence {7,}, if we define X (t) = 
inf{n| To + T, +... + T >t}, then X(t) is a 
Poisson process. Thus, for example, the num- 
ber of telephone calls at a switchboard is a 
Poisson process when the intervals between 
successive calls can be regarded as inde- 
pendent and having a common exponential 
distribution. 


E. The Structure of the General Lévy Process 
[8,9] 


In this section we restrict ourselves for sim- 
plicity to temporally homogeneous Lévy pro- 
cesses. As we noted, the probability law of the 
process X (t) is determined by the function 
y(z). The Levy-Khinchin formula (1) ina 
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certain sense shows that y is a combination of 
a Wiener process and Poisson processes. This 
fact can be seen more clearly from the Lévy-Ité 
theorem, which states that the sample function 
of X (t) itself can be expressed as a composite 
of those of a Wiener process and Poisson 
processes. The Lévy-It6 theorem actually 
implies formula (1) and, moreover, clarifies its 
probabilistic meaning. 

The Lévy-It6 theorem can be summarized 
as follows: Let U be a Borel subset of R which 
has a positive distance from the origin, and let 
N(t, U) be the number of s such that X(s)— 
X(s—)eU,s<t. Then N(t, U) is a Poisson 
process. The expectation E(N(t, U)) can be 
written in the form tn(U), where n(U) defines 
a nonnegative Borel measure on R — {0}; fur- 
thermore, it satisfies 


æ% u? 
| Ear se 


Next, we set 


| uN(t, du) 
|u|>e 
»  (X(s)—X(s—)). 


sst 
|X(s)—X(s—)i >e 


e(t) 


I 


Generally, S,(t) diverges as ¢]0. However, with 
probability one, a centered process 


am u 

S,(t) = S,.(t)— | and 

converges uniformly in t on every finite inter- 
val as ¢|0. Furthermore, X (t)—lim, o S,(t) is 
continuous with probability one. However, a 
Lévy process X(t) of which almost all sample 
functions are continuous has the form mt + ./v 
B(t), where m, v>0 are constants and B(t) is a 
Wiener process. Hence we have 


X(t)=mt+./v Bit) 


elO 


sim (uve d= ptm) ) (2) 
lu|>e 


Furthermore, we can show that if 
Ui Cuore Ce 

are disjoint, then 

Bit), N(t, U,), N(t, Uz), ..., N(t, U,) 


are mutually independent Lévy processes. In 
particular, in (2) the terms are mutually inde- 
pendent. The m, v, and n(du) in (2) correspond, 
of course, to those in (1). Conversely, given m, 
v, and n(du), we can construct B(t) and N(t, U) 
with the properties above, and then (2) defines 
a Lévy process which corresponds to y (z) 
given by (1). The measure n(du) is called the 
Lévy measure of X (t). 
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F. Examples of Lévy Processes 


Compound Poisson Processes. A temporally 
homogeneous Lévy process is called a com- 
pound Poisson process if almost all sample 
functions are step functions, namely, if w({z) in 
(1) is given by 


y= | ` (e™—1)n(du), 


= 00 


A A n(du)< œ. 
If we set D(du)=(1/A)n(du), then (du) is a 
probability distribution on R. ® is the distri- 
bution of the size of jumps when they occur. 
A compound Poisson process is constructed 
in the following way: Let 75, T,, T,,...; U;, 
U,,... be mutually independent random vari- 
ables such that 


P(T,edt)=4e “dt, t>0, 


P(U,€du)= O(du), 

and let 

N(Q)=U,+ U,+...+ Unos 
where 

n=inf{n| +T, +... + T, >t}. 


Then X(t) is a compound Poisson process. 
Thus the number of jumps of X(t) follows a 
Poisson process, and the size of each jump 
obeys the distribution ®. 


Stable Processes. A temporally homogeneous 
Lévy process X(t) is called a stable process if 
for every a>0 we can find b>0 and c real 
such that the processes X,(t)=X(at) and X,(t) 
=hX(t)+ ct are equivalent in law. It is called a 
strictly stable process if in the above c can be 
chosen to be 0. X(t) is a stable process (resp. a 
strictly stable process) if and only if the corre- 
sponding infinitely divisible distribution is a 
*quasistable distribution (resp. tstable distri- 
bution). The texponent « (0<a<2) of the 
quasistable distribution is called the exponent 
(or index) of the stable process. y/(z) in (1) cor- 
responding to a stable process is given as 
follows: 


o d 
vo=imz+c, | (ei — 1) 
é u 
0 
| d 
+e f ED gawi 
es ul' 


cadet ae izu \du 
=imz+C ie Wego | 
w(z)=imz 3 (e ae 


LC Q TE? izu \ du 
hee 1+u? Jul?’ 
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W(z)=imz+C, : (e =La ae 
mY , du 
+C_ (eh — izu) ee l<a<2, 
=i u 


W(e)=im2 52°, 


a=2. 
Here m, C}, C , and v are real constants such 
that C, >0, C- >0 with C, +C_>Oand v>0. 
W(z) corresponds to a strictly stable process if 
and only if m=0 when «41 and C}, =C_ 
when «= 1. The above w(z) is also expressed as 
follows: 

(i) ifO0<a<2, 741, then 


W(z)=imz—co|z|" 1—iftan— — 
2 |z| 


where 


and 
na 
co= —(C, Peleecis 


Cc. 


Pee: 


if a2; 


(ti) if «= 1, then 


m |z] 


vo=iz=eok(i +126 ogiz); 


where 


y=m+a(C,+C_) 


When «= 2, it is thus essentially a Wiener 
process: X(t)=mt+/v B(t), where B(t) is a 
Wiener process. When a=1, it is called a 
Cauchy process, a symmetric Cauchy process if 
m=Oand C,=C_ or equivalently y= f=0, or 
an asymmetric Cauchy process if f 40. Gener- 
ally it is called a symmetric stable process if 
m=0 and C,=C_ or a=2 and m=0. In par- 
ticular, for the symmetric Cauchy process cor- 
responding to w(z)= —|z|, we have 


pf 4 
paoa T mad 
Next, when 0<a<1 and m=C_ =0, almost 
all sample functions of X (t) are purely dis- 
continuous increasing functions (i.e., sums of 
positive jumps). In this case, X(t) is called a 
one-sided stable process of the exponent « (or 
subordinator of the exponent x). Now, if X(t) is 
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a symmetric stable process of the exponent 
B(O0<f<2) and Y(t) is a subordinator of the 
exponent « which is independent of X(t), then 
Z(t)=X(Y(t)) is a symmetric stable process 
of the exponent af. This operation is called 
a subordination and is closely related to the 
theory of tfractional powers of tinfinitesimal 
generators of semigroups (— 261 Markov 
Processes) [2]. 

A stable process X(t) is defined in a simi- 
lar way when X(t) takes values in an N- 
dimensional space R”. In particular, if X(t) is 
a symmetric stable process of the exponent « 
(0<a<2) given by 


Efex) = ear, zeR“, 


then for every bounded measurable function 
J (x) with compact support, we have 


(f fos X(t) 


r((N —0)/2) 


= — yN 
2n X21 (&/2) Pe yl f(y) dy. 


The right-hand side is the tRiesz potential of 
order « (— 338 Potential Theory). This fact 
is a generalization of a well-known rela- 

tion between Brownian motion and Newton- 
ian potential (— 45 Brownian Motion), and 
through this relation we can study several 
properties of sample functions and also com- 
pute various quantities related to stable pro- 
cesses [1]. 


G. Sample Path Properties of Lévy Processes 


Let X(t) be a temporally homogeneous Lévy 
process. For a Borel set B in R, the hitting 
time az is defined by 


d,=inf{t>0|X (te B}. 


Recurrence. X (t) is called recurrent if op < 0 
a.s. for every nonempty open set B. Otherwise 
it is called transient. X(t) is recurrent if and 

1 
only a. ay 
Ornstein). If, in particular, E(X(1)) exists, it is 
recurrent if and only if E(X(1))=0. When X(t) 
is a stable process, it is recurrent if and only if 
a>1,ora=1 and f=0. 


dz =o (Port and Stone, 


Hitting Probabilities for Single Points. If B= 
{a}, dg is denoted by a,. OER is said to be 
regular for X(t) if ¢7 =O a.s. Set C={xeR| 
P(o, < 0«©)>0}. The following result is due to 
Kesten [10]: 

(i) If v 40, then C=R and 0 is regular; 

(ii) If v=0 and f+, |uln(du) = œ, then either C 
=R and 0 is regular, or C= Ø and 0 is not 
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1 
regular according to whether | Re———-dz 
j R AYE) 


<æ for all 2>0 or = æ for some 1 >0; 
(iii) If v=0 and f}; |u|n(du) < œ, then the 
following four cases are possible, where 
a=m—Jpu(1+u’)'n(du) and S(n) is the sup- 
port of the Lévy measure n: 
(a) C= Ø when a=0, 
(b) C=(0, œ) when a>0 and S(n)<(0, œ), 
(c) C=(—oo, 0) when a<0 and S(n)<(—«o, 0), 
and 
(d) C=R in the remaining case. 

For further properties of sample functions 


> [4]. 


H. Generalization of Additive Processes 


A temporally homogeneous Lévy process is, as 
we have seen, essentially a temporally homo- 
geneous Markov process on R which is homo- 
geneous in space (i.e., invariant under transla- 
tions of the space). Thus, on a homogeneous 
space when homogeneity in space makes sense, 
we can generalize the notion of additive pro- 
cesses. Let M be a thomogeneous space with 
transformation group G. A temporally homo- 
geneous Markov process X(t) is called an 
invariant Markov process, (or homogeneous 
Markov process) if its system of ttransition 
probabilities {P(t, x, E)} satisfies P(t, x, E) = 
P(t, gx, gE) for all ge G. Thus an additive pro- 
cess is exactly an invariant Markov process 
on R“ when G is the group of translations. G. 
A. Hunt determined all invariant Markov 
processes when M is a ‘Lie group or a factor 
space of a Lie group [6]. 

Let G be a Lie group and A = A(G) be the 
(left-invariant) tLie algebra of G. Let G, = 
GU {a} be a tone-point compactification of 
G, and C be the set of all continuous functions 
on G,. We can define Yf(YeEA, fEC) as usual by 


n(t)=expty, R,f(t)=f(to), 


when the limit exists uniformly. Let C, = 

{ feC| Y(Xf) exists for every X, YEA}. Let 
X,,X>,...,X,4 be a basis of A(G), and let x,, 
X5,...,Xq be functions in C, such that x,(e)= 
0 and X;(x;)(e)= ô; (i j=1,2, ...,d; e is the 
unit element of G). Take a neighborhood of e, 
and define a function o(g)= È$; x7 (g) for g 
contained in the neighborhood, and extend 
this function to G, in such a way that ọe C, 
and o >k >0 (k= constant) outside of the 
neighborhood of e. Then ge G defines a trans- 
formation of G, by 1,0 = go, t,@ =a, and in 
this way G, is supplied with the structure of a 
thomogeneous space with the transformation 
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group G. Now let X(t) be an invariant Markov 
process on G, which is *continuous in proba- 
bility. Then the tsemigroup 7; (which is a 
tstrongly continuous semigroup on C) of the 
process X(t) is characterized as follows: The 
domain of the infinitesimal generator A of T, 
contains C,, and for feC, 


ARD aX N+ Y aX AKIO) 


J= 


+ | if (to) — f(t) 
G.— {e} 
-È XSA: xo) nido) 


where a;, a; are real numbers (i, j= 1, 2, ...,d) 
such that (a;;) is a symmetric nonnegative 
definite matrix, and n(do) is a nonnegative 
measure on G,— {e} such that fe,- te; (0). 
n(do)< œ. Conversely, given such a,, a; and 
n(do), there exists one and only one invariant 
Markov process on G, whose infinitesimal 
generator is given as above. 

A similar result is obtained when M is a 
factor space of a Lie group by its compact 
subgroup. Furthermore, for more concrete 
homogeneous spaces such as spheres or Loba- 
chevskii spaces (more generally, tsymmetric 
Riemannian spaces) the canonical form of the 
invariant Markov processes and infinitely 
divisible laws is obtained by making use of 
harmonic analysis [5]. 
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Adeles and Ideles 


A. Introduction 


The concept of idele was first introduced by C. 
Chevalley (J. Math. Pures Appl., (9) 15 (1936); 
Ann. Math., (2) 41 (1940)), for talgebraic num- 
ber fields. Later on, this concept and the allied 
concept of adele were defined for fsimple alge- 
bras and also for falgebraic groups over 
algebraic number fields, and the two concepts 
became important in the arithmetical theory of 
these objects. We shall first explain the general 
concept of restricted direct product, by means 
of which adeles and ideles will be defined. 


B. Restricted Direct Product 


Let I be an index set. Suppose we are given, 
for each pel, a tlocally compact group G,, and 
for each p except for a given finite set, say 

Pis P2, -+> Pr a compact open subgroup U, of 
G,. Let G be the subgroup of the direct prod- 
uct Į I,e; G, consisting of elements (g,) whose 
G,-components g, lie in U,, except for a finite 
number of p. Put U =I Li- G, x I, ,,, Up- 
Then U is a locally compact group with re- 
spect to the fproduct topology. The group G 
can be supplied naturally with a topology with 
respect to which G is a locally compact group 
and the quotient space G/U is discrete. The 
group G together with this topology is called 
the restricted direct product of {G,} with re- 
spect to {U,}. 


C. Adeles and Ideles 


Let k be an talgebraic number field of finite 
degree and / be the totality of finite and in- 
finite tprime divisors of k. For each pel we 
denote by k, and k,* the fcompletion of k with 
respect to p and the multiplicative group of 
nonzero elements of k,, respectively. Further- 
more, for each finite prime divisor p, we de- 
note by o, and u, the ring of *p-adic integers 
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of k and the multiplicative group of tunits of 
D,, respectively. 

(1) Since o, is a compact open subgroup of 
k, as an additive group, we can construct the 
restricted direct product A, of {k,} with re- 
spect to {o,}. Then A, is a locally compact ring 
with respect to the componentwise ring oper- 
ations. We call A, the adele ring (or ring of 
valuation vectors) of k, and an element of A, an 
adele (or valuation vector) of k. The element of 
the direct product I], k, whose p-component 
is a fixed element of k for all p is an adele. We 
call such an adele a principal adele. Since u, is a 
compact open subgroup of k,” for each finite 
prime p, we can construct the restricted direct 
product J, of {k,*} with respect to {u,}. We 
call J, the idele group of k and an element of J, 
an idele of k. The element of the direct product 
II k,” whose p-component is a fixed element 
of k for all p is an tdele. We call such an idele a 
principal idele. Each element b of J, induces an 
automorphism f, of the additive group A, 
defined by f,(a)=b-a (ac A,). Thus J, can be 
regarded as a subgroup of the automorphism 
group Aut(A,) of the additive group A,. The 
topology of J, coincides with the relative 
topology of J, as a subgroup of Aut(A,). We 
note, however, that the topology of J, is dif- 
ferent from the relative topology of J, as a 
subspace of A,, and the former is stronger 
than the latter. Finally, for a tfunction field 
in one variable over a ‘finite field, the adele 
ring and the idele group can be defined 
similarly. 

(2) Let R be a tnormal simple algebra over k 
and O be a tmaximal order of R. For each 
pel put R,=R Q ,k,, and for each finite 
prime divisor p put D,=0,:©. Then ©, is a 
compact open additive subgroup of ®,. By the 
adele ring Ax of R we mean the restricted 
direct product of {,} with respect to {©,}. 
Let R,” and U, be the multiplicative group of 
nonzero divisors of R, and the multiplicative 
group of the units of ©,, respectively. (W, can 
be defined only if p is a finite prime divisor.) 
By the idele group Jy of R we mean the re- 
stricted direct product of {R,*} with respect to 
{U,}. The notion of principal adele (or idele) of 
R can be defined similarly, as in (1). The struc- 
tures, as topological groups, of Ay and Jg do 
not depend on the choice of a maximal order 
©. The adele ring A, and the idele group J, 
described in (1) are special cases of Ay and Jy, 
respectively. 

(3) Let G be a linear talgebraic group de- 
fined over k, and let G, be the set of k,- 
‘rational points of the group for each pel. For 
each finite prime divisor p, let U, be the set of 
elements « of G, such that the coordinates of 
both a and a ! are p-adic integers. We can 
then construct the restricted direct product of 
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{G,} with respect to {U,}, which is called the 
idele group (or adele group) of G. 

In the following section we focus on describ- 
ing the fundamental properties of adeles and 
ideles of an algebraic number field k. We shall 
start, however, by observing more generally 
those adeles and ideles of a normal simple 
algebra R over k. (For the properties of the 
adele group of algebraic groups — [7]; 13 
Algebraic Groups.) 


D. The Structures of the Adele Ring and Idele 
Group 


Let R be a normal simple algebra over an 
algebraic number field of finite degree k. We 
identify the totality of principal adeles of R 
(principal ideles of R) with R (R*), and denote 
it by the same letter R (R~). Then R (R*) is a 
discrete subgroup of Ax (Jy). The quotient 
group A,,/R is compact. Denoting by |2,|, 
(a,€k,) and N,(«,) (x€ R,) the tnormalized 
valuation of k, and the treduced norm from R, 
to k, respectively, we define, for aE Jy, a posi- 
tive number: V(a)=TT,-7|N,(%,)l>, where a= 
(x). We call V(a) the volume of a. If a is a prin- 
cipal idele, we have V(a)=1 by the fproduct 
formula on valuations. Denote by J}, the set 
of ideles a with V(a)=1 and put C2 =J9/R*. 
Then C9 has finite volume with respect to 

the tHaar measure of Jy. Furthermore, C$ is 
compact if and only if is a tdivision algebra. 
In particular, C? is compact. Let Q be the field 
of rational numbers. For each rational prime 
p, we define a character 4, of the completion 
Q, of Q with respect to the p-adic topology 
(by a character of Q,, we mean a continuous 
homomorphism from Q, to the 1-dimensional 
torus R/Z): If p=p,, is the infinite prime of Q, 
then we put Ap = —xmodZ (xeEQ,). If p is 
finite, then we let 4, be the composite of the 
following three canonical homomorphisms, 
namely, the one from Q, to Q,/Z,, the one 
from Q,/Z, to Q/Z, and the one from Q/Z to 
R/Z. We define a character 4, of R, as follows: 
4, =4,0tt(R,/Q,), where p is the rational 
prime divisible by p and tr(®,/Q,) denotes the 
treduced trace from R, to Q,. For x, yeR,, 
put (x, y), =exp(2zid,(xy)). Then the additive 
group R, is self-dual relative to (x, y),. Fur- 
thermore, if we put <a, b>=T1,(a,, bp), for 
a=(a,) and b=(b,)e Ag then Ax is self-dual 
relative to <a, b>. The tannihilator of the 
group of principal adeles with respect to <a, b> 
is R. Hence it follows from *Pontryagin’s 
duality theorem that A,,/ is compact. Hence- 
forth let R=k. We call the quotient group 

C, =J,/k* (an element of C,) the idele class 
group of k (an idele class). If a character x of J, 
satisfies the condition y(«)= 1 for all xek (i.e., 
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if y is a character of C,), we call such a charac- 
ter a Gréssencharakter (or Hecke character). 
Gréssencharakters were introduced by E. 
Hecke as characters of a certain type of the 
tideal group of k (Math. Z., 1 (1918), 5 (1920)), 
but they are essentially the same as the ones 
defined above [1]. Let D, be the connected 
component of the identity element of C,. Then 
C,/D, is totally disconnected and compact. 
Hence a Grossencharakter y is of finite order if 
and only if 7(D,)= 1. We can prove by tclass 
field theory that C,/D, is canonically isomor- 
phic to the Galois group over k of the maxi- 
mal Abelian extension of k (— 59 Class Field 
Theory). For the structure of D,, the following 
fact is known: Let r; and r, be the number of 
treal infinite prime divisors and timaginary 
infinite prime divisors of k, respectively. Then 
the dual group of D, is isomorphic to R x 

Q" tr"! x Z”, where R is the additive group of 
real numbers with the usual topology and 

Q (Z) is the additive group of rational numbers 
(rational integers) with the discrete topology. 
Let F be a function field in one variable over a 
finite field Fo. The properties of the adele ring 
and idele group of F are similar to the prop- 
erties of A, and J,, while the group Cp has 

a simpler structure than C,. To explain the 
structure of Cp, let F be the maximal Abelian 
extension of F, G be the Galois group of F/F, 
and Gp be the subgroup of G consisting of the 
elements ø such that a(a)=«”" for all «eF, 
(=the talgebraic closure of F,), where q is the 
number of elements of the finite field Fy and n 
is a given rational integer. Also, let GP be the 
subgroup of G; consisting of the elements 
inducing the identity map on F,. G? is a com- 
pact group with respect to the tKrull topology. 
G, can be naturally supplied with a topology 
such that the group Gr is a locally compact 
group and the quotient group G,/G$ is dis- 
crete. Then class field theory implies that Gp is 
isomorphic to C, as a topological group. 

The following characterization of the adele 
ring of a number field or function field in one 
variable over a finite field is the work of K. 
Iwasawa (Ann. Math., (2) 57 (1953)). Let A be a 
tsemisimple commutative and locally compact 
topological ring with unity 1. Assume that A is 
neither discrete nor compact, and moreover 
that A contains a discrete subfield k31 and 
A/k is compact. Then k is an algebraic number 
field or a function field in one variable over a 
finite field, and A is isomorphic to the adele 
ring of k as a topological ring. 


E. Ideles and Cohomology 


Let K be a Galois extension of finite degree of 
an algebraic number field k, and © be the 
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Galois group of the extension K/k. © operates 
naturally on the idele group J, and the idele 
class group Cx of K. The structures of the 
tcohomology groups of © with the coefficient 
groups J, and Cx were investigated by G. 
Hochschild, T. Nakayama, E. Artin, J. Tate, 
and others. In particular, we have H'(G, Cx)= 
{0} and H?(G, Co) = Z/nZ (cyclic group of 
order n), where n=[K:k]. These facts play an 
important role in one of the proofs of class 
field theory (— [3]; 59 Class Field Theory). 
Furthermore, A. Weil introduced the so-called 
Weil group, which is a tgroup extension of a 
certain type of Cy by ©. He defined the most 
general L-functions, which include both 
tArtin L-functions and tHecke L-functions 
with Gréssencharakters (— [2]; 450 Zeta 
Functions). 


F. Fourier Analysis on the Adele Group 


*Dedekind zeta functions and Hecke L- 
functions are meromorphic on the whole 
complex plane and satisfy functional equa- 
tions of certain types. This can be proved by 
methods of Fourier analysis on the adele 
group A, (Artin, Iwasawa, Tate [1,8,9]). Fora 
continuous complex-valued function g(a) on 
A, Satisfying suitable conditions, we define the 
Fourier transform of g(a) as follows: 


=| p(a)<a, b> db, 
Ay 


where db denotes the Haar measure on A,. 
By normalizing db suitably and applying 
*Poisson’s summation formula, we get, for 
each idele a of k, 

$ plax)=V(a)? $ pa a). 

aek ack 

This is called the ©-formula. Consider the 
following integral on J,: 


«=| V(aFx(a)o(a)d*a, 
Jk 


where d*a denotes the Haar measure on J}, 

s is a complex number, and x is a Grössen- 
charakter of k, namely, a character of C,. This 
integral converges if s>1, and by using the 
@-formula one can show that &(s) is meromor- 
phic on the whole complex plane and satis- 
fies a functional equation of a certain type. 
When the function ¢ is of special type, then 
the above integral can be explicitly expressed 
as the product of L-functions, I -functions, and 
exponential functions. This method of express- 
ing L-functions by integrals on J, and apply- 
ing the ©-formula can be applied to investi- 
gate tHey zeta functions and L-functions of 
various types defined for a simple algebra (— 
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450 Zeta Functions) (G. Fujisaki [6]; T. Tama- 
gawa, Ann. Math., 77 (1963)). 
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Affine Geometry 


A. Construction of Affine Spaces 


An affine space A is constructed as follows: Let 
V be a tvector space over a ffield K, and let A 
be a nonempty set. For any vector ae V and 
any element p of A, suppose that an addition p 
+acA is defined so as to satisfy the following 
three conditions: (i) p+ 0=p (0 being a zero 
vector); (ii) (p +a) +b=p+(a+b) (a, be V); and 
(iii) for any qe A there exists a unique vector 
acV such that q =p +a. (Condition (i) follows 
from (ii) and (iii).) Then we call A an affine 
space, V the standard vector space of A, and K 
the coefficient field of A. Each element of A is 
called a point. 

If we fix an arbitrary point o€ A, there is a 
one-to-one correspondence between A and V 
given by the mapping sending pe A to aeV 
such that p=o +a. Such an element a of V is 
called a position vector of p with the initial 
point o and is denoted by op. We say that r+1 
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points p, (0<a«<r) of A are independent if r 
vectors a; = Pop; (1 <i<r) are linearly inde- 
pendent in V; otherwise, they are said to be 
dependent. This definition of dependence of 
points p, is independent of the choice of the 
initial point among them. If V is of dimension 
n, we say that A is of dimension n, dim A =n; in 
this case, we sometimes write A” instead of A 
and V” instead of V. The affine space A is of 
dimension n if and only if the maximum num- 
ber of independent points in A is n+1. 

Next, for any vector subspace V* of V” and 
an arbitrary point pe A”, we put A¥ = {qe A" | 
q=p+x,xeV*} and call it a subspace of A”. It 
is an affine space of dimension k: Conversely, 
every subset of A” that is an affine space under 
the affine space structure of A can be expressed 
in this form. A!, A*, and A”"! in A” are called 
a line, plane, and hyperplane, respectively. A set 
that consists of only one point is also consid- 
ered as a subspace A®. For subspaces A’ and 
A’ of A”, we denote by A” N A® the intersection 
(i.e., the set-theoretic intersection) of A’ and A’, 
and by A’U AS the join of A’ and A’ (i.e., the 
intersection of all subspaces that contain both 
A’ and A‘). Then A”N A’ is the affine space of 
highest dimension contained in A’ and 4°, and 
A’ UA‘ is the affine space of lowest dimension 
that contains A’ and A‘. Ifr+1 points are 
given in A”, there always exists a subspace A” 
that contains all these points. In particular, if 
the points are independent, then such an A’ ts 
unique. Moreover, if A'N A: # Ø (Ø is the 
empty set), then we have r+s=dim(A'U A*)+ 
dim(A’f A’). This is called the dimension 
theorem (or intersection theorem) of affine 
geometry. 

Next suppose that r+ 1 points p, (Q<a<r) 
in A” are independent, and put p,,, =Po. Let 
q, be an arbitrary point on p,Up,,, that differs 
from p, and p,.,. If å% are elements of K such 
that 4%: Pada =9,P,1;, then qo, ...,q, are de- 
pendent if and only if 2°4'...2"=(— 1)". And 
ifr>2 and o,=4,U Pa+2U -U Patr (Pir = Pi- 
if i2 1), then do, ...,6, have a point in common 
if and only if 4°24! ... A" =1. The former is 
called Menelaus’s theorem, and the latter 
Ceva’s theorem. 

The set L(A) of all subspaces (including @ 
considered as an affine space of dimension —1) 
constitutes a tlattice with respect to the inclu- 
sion relation. 


B. Parallelism in Affine Spaces 


Let A’ and A* be subspaces of A”. We say 
that A” and A' are parallel in the wider sense 
if either of the following conditions holds: 
(i) A> A’ or ASDA’; or (it) A'N A= Ø 

and dim(A’U A") <r + s. Next, let A’ and 
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B” be subspaces of A” of the same dimension. 
If A’ and B" coincide, or A" N B" = Ø and 
dim(A’U B’)=r-+ 1, then they are said to be 
parallel in the narrower sense (or simply paral- 
lel), and we denote the relation by A’ // B”. If 
r=s= 1, the definitions of parallelism in the 
narrower sense and wider sense are equivalent. 
If r>1, parallelism in the narrower sense im- 
plies parallelism in the wider sense. For two 
sets (a,) and (b,) (O<a<r) of r+1 independent 
points, let V” and W” be vector spaces with 
bases dod; and bob; (1 <i<r), respectively. 
Then A”=a,U...Ua, and B’=b,)U...Ub, are 
parallel if and only if V’ = W”; and for an 
arbitrary point p, there exists a unique r- 
dimensional subspace that is parallel to A” and 
passes through point p. If A’ and B* are paral- 
lel in the wider sense, then there exist sub- 
spaces A‘ and B' (t>1) of A’ and B* that are 
parallel to each other. Moreover, if neither A’ 
nor B* is contained in the other and if t is the 
largest integer with the property just given, 
then we have t=r+s+1—dim(A’U B’). 
Parallelism between subspaces of A" is an 
equivalence relation. Specifically, the equiva- 
lence class of a 1-dimensional subspace A! is 
called a point at infinity and is denoted by A®. 
Given a subspace A’ of A”, the set of points at 
infinity 4°, represented by lines A! con- 
tained in A’ is denoted by A’, '; we have 
A’ // B! if and only if A"71=B",!. A’! is 
called a space at infinity. In particular, the set 
A"! is called the hyperplane at infinity. The 
set-theoretic sum A” U A"! = A" is supplied 
with the structure of a tprojective space; the 
“points” in A” are elements of A”, and the 
“lines” in A" are A! U A® and A}. 


C. Coordinates of Affine Spaces 


If we fix a point o in A” and a basis {e,,...,e,$ 
of the standard vector space V”, then any 
point p in A” ts uniquely expressed as 


n 


p=o+) xie; 


i=1 


xek. (1) 


The system %} =(0;e,,...,¢,) is called an affine 
frame (simply the frame) of A"; the point o 

is called its origin, and e; is called the ith 

unit vector. The mapping sending p to (x?, 
...,X") gives a tbijection of A" to K"; we call 
(x',...,x") affine coordinates of p with respect 
to %, and x’ the ith affine coordinate. In partic- 
ular, if K is a topological field (e.g., the real 
number field R or the complex number field 
C), this bijection A” K” induces a topology of 
A”, which can be shown to be independent of 
the choice of %. For the remainder of this 
article, by “coordinates” we mean affine co- 
ordinates unless otherwise stated. Putting 
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a;=o+e; (1 <i<n), we sometimes call (0; a,, 
...,a,) an affine frame. Further, putting /;=o0U 
a;,™j=oUa,U...Ua,_,Ua;,,U...Ua,, we call 
a;, l;, and 7; the ith unit point, the ith coordi- 
nate axis, and the ith coordinate hyperplane, 
respectively. 

Assume that subspaces A’ and A’ (r,s >0,r + 
s=n) are not parallel in the wider sense. For 
a point p of A", denote by A’(p) the subspace 
that passes through p and is parallel to A’, and 
put q= A’(p)N A’. A mapping g: A" A de- 
fined by o(p)=q is called a parallel projection 
on A’ with respect to A’. In particular, if A" = 
m; and A*=1; (r=n—1,s=1), we write ọ(p)= p; 
Then the ith coordinate x! of p is an element 
of K such that op;=x'dd;. Hence such coordi- 
nates are also called parallel coordinates (or 
Cartesian coordinates). 

Suppose that we are given r+ 1 points 
bo, ---, b, of A” and r + 1 elements 4°, ..., 4" of 
K such that ÈX} -o4°=1. We fix a point o of A”. 
If a point p in A” satisfies op = Dr-o 4*0b,, then 
bop = Li, 4'bob;; hence p is contained in the 
subspace b,U...Ub,. Conversely, if a point p is 
contained in the latter subspace, then there 
exists a system (4°, ..., å”) such that 


Op=Tt_yAtob, and ar ena 


The system (4°, ..., 4”) has a geometric mean- 
ing since we also have o'p= Elo 4"0'b, if we 
replace the point o by any other point o’ of A". 
The elements 4°, ..., 4” are called barycentric 
coordinates of p with respect to {bo,...,b,}. In 
particular, if {bo, ...,b,} are independent, 

then the barycentric coordinates (4°, ..., A") 
are uniquely determined by the point p on 

bo U... Ub,. Furthermore, let (y', ..., y”) be 
affine coordinates of p with respect to an affine 
frame Ẹ, and let (x}, ..., x”) be affine coordi- 
nates of b, (x=0, ...,r). Then p belongs to the 
subspace A”=b,U... Ub, if and only if y'= 
Dio A“ x} (i=1,...,n). In this case we say 

that the system of the linear equations y'= 

> Axi (È A* = 1) gives a parametric represen- 
tation of the subspace A’ (by parameters 4°). 
Specifically, if r=n—1, the solvability of the 
system of equations y'= E?$ Atxi (i=1,...,n), 
1 =) 4° implies the equation E}; y u;= Up for 
some nontrivial constants fo, ..., H„. Hence the 
latter equation represents the hyperplane z= 
A"! If a point p has barycentric coordi- 
nates 2? =4! =... =4"=(r+ 1) with respect 
to {bo,...,6,}, it is called the barycenter of bo, 
b,,...,b,. The barycenter is uniquely deter- 
mined by the set {bo, ...,5,} and is denoted 

by g(bo, ...,5,). Specifically, the barycenter 

of two points bọ and b, is called the midpoint 
(or middle point) of bọ and b,. If we divide 
B={bo,...,5,} into two sets of points and if 
g, and g, are barycenters of these two sets 

of points, respectively, then g, Ug, passes 
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through the barycenter of B. More generally, a 
point with barycentric coordinates (4°, ..., 4’) 
with respect to {bo, ...,5,} is called a barycen- 
ter of bo, ...,5, with weights 1°, ..., A’. 


D. Affine Spaces over Ordered Fields 


Suppose the coefficient field K is an tordered 
field (e.g., the real number field R). Given a 
hyperplane z of A”, we take an affine frame for 
which z is the nth coordinate hyperplane. If we 
denote coordinates of points with respect to 
this frame by (x!,...,x"), then the equation of 
n is given by x"=0. Let A”, and A” be sets of 
points whose nth coordinates are positive and 
negative, respectively. They are called half- 
spaces of A" divided by z. The union of z and 
a half-space is called a closed half-space. A 
half-space of a subspace A’ of A” (divided by 
some A’! on A’) is called a half-space of di- 
mension r. For a point p of A” that does not lie 
on z, the half-space containing p is called the 
side of p with respect to z. In particular, when 
n=1, let p and q be two points on a line }. The 
closed side of q with respect to p is called the 
(closed) half-line (or ray) from p to q. The inter- 
section of the closed half-lines emanating from 
p to q and from q to p is called the segment 
joining p and q and is denoted by pq. Clearly 
pq =qp. A subset C of A” is called a convex set 
if the segment joining two arbitrary points of 
C ts contained in C. Each half-space of each 
dimension is convex. For any family C, of 
convex sets, (\,C, is also convex. Therefore, 
for any subset D in A” there exists a minimal 
convex set that contains D. It is called the 
convex closure (or convex hull) of D. The con- 
vex closure C(P) of a finite set of points P= 
{Dos +++; Px} in A” is called a convex cell, and 
dim(pg U...Up,) is called the dimension of the 
convex cell. In particular, when po, ..., Pp are 
independent, C(P) is called a k-dimensional 
simplex with vertices po, ...,p,. The 1-dimen- 
sional simplex having two distinct points p 
and q as vertices is the segment pq, and the 
vertices p and q are called ends of the segment. 
A point is regarded as a 0-dimensional sim- 
plex. Each 2-dimensional or 3-dimensional 
simplex is called a triangle or tetrahedron, 
respectively. A k-dimensional simplex S with 
vertices Po, --., Py 1s a Set of points whose 
barycentric coordinates 4* (O<a<k, Y47=1) 
with respect to the vertices satisfy A*>0. On 
the other hand, if we put A* =p) U ... U p, and 
Ny =Po U -U Pa-1 U Par U-.. U Pps and denote 
by A* the side of p, in A* and by A* the closed 
side of p, in A* with respect to z,, then the 
simplex S is given by (\k_) 4%, and [f-o A} is 
called an open simplex. A” has the structure of 
a ttopological space in which the set of open n- 
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dimensional simplexes forms a tbase of topen 
sets. In particular, if K is R, the topology of A” 
thus defined is compatible with the one that is 
naturally induced by the topology of R. With 
respect to this topology, A” is a tHausdorff 
space. The terms open and closed used before 
for n-dimensional simplexes agree with the 
corresponding notions with respect to this 
topology. 

A subset of A” is said to be bounded if it is 
contained in some simplex. A bounded set 
obtained through a finite process of construct- 
ing intersections and unions from a finite 
number of closed half-spaces is called a poly- 
hedron. The points of a convex polyhedron are 
characterized by several linear inequalities 
satisfied by their coordinates. A set of points 
whose coordinates (x',...,x") satisfy hi<x'< 
k! for kê, hře K is called a parallelotope; it is a 
polyhedron whose tinterior is called an open 
parallelotope. A simplex is a polyhedron, and 
polyhedra admit tsimplicial decompositions. 
A polyhedron can also be defined as the set- 
theoretic union of a finite number of simplexes. 

Let P be a finite set of points, and let its 
convex closure C(P) be a convex cell of dimen- 
sion m. Then C(P) is contained in an m- 
dimensional subspace A”. The tboundary of 
C(P) in the topological space A” is called the 
boundary of a convex cell C(P). We can take a 
subset Q of P so that dim C(Q)=m-—1 and 
C(Q) is the intersection of the boundary of 
C(P) and an (m—1)-dimensional subspace. 
Such a C(Q) is called a face of C(P), and 
we denote this relation by C(P)>C(Q). If 
C(P)>C(P,)>...>C(P,), then C(P,) is called 
an (m—s)-dimensional face of C(P). A 0- 
dimensional face is called a vertex, and a 1- 
dimensional face is called an edge. Suppose 
that C(P)>C(Q) for P={po,...,p,} and Q 
= {Pigs -++>Pi,_,}- Then F=p,,U...Up,,_, isa 
hyperplane of E=p )U...Up,, and C(P) is 
contained in a closed side of E divided by F. 
Therefore, if C(P) has d (m—1)-dimensional 
faces, then C(P) is expressed as the intersection 
of d m-dimensional closed half-spaces. This 
shows that any convex cell is a polyhedron. 


E. Affine Transformations 


A mapping g: A"—> A” is an affine mapping if 
there is a linear mapping @: V"—V™ of the 
standard vector spaces of A” and A” such that 
~(p+x)=(p)+0(x) holds for any pe A” and 
any xe V". An affine mapping of A” into itself 
is called an affine transformation (or affinity) 
of A”. Specifically, a bijective affine transforma- 
tion is called a regular (or proper) affine trans- 
formation. An affine transformation ¢ of A” ts 
characterized by each one of the following 
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properties: (1) Let ọe A" be a fixed point. Then 
gy is a mapping of A” onto itself that can be 
expressed as 


g(o+x)=0+at f(x), (2) 


where a is a fixed vector of V” and f is a 
linear transformation of V”, (ii) The mapping 
p: A"+A" is a mapping such that @(a)p(b)= 
2-(p)e(q) if ab = 4: pġ (2e K). Moreover, if 
the ‘characteristic of K is not equal to 2, an 
affine transformation is also characterized as 
follows: (iii) ọ is a mapping that sends lines 
into lines and preserves the ratio of each pair 
of parallel segments. 

The set 2(A”) of all regular affine transfor- 
mations of A” constitutes a group that we call 
the group of affine transformations. If the 
linear mapping f associated with a regular 
affine transformation ø is the identity map- 
ping, then ¢ is called a translation. The set 
%(A”) of all translations is a normal subgroup 
of Q{(A") and is called the group of translations. 
The group of translations is isomorphic to V 
regarded as an additive group. The vector 
group B(A”) G.e., an additive group of a linear 
space) acts ‘simply transitively on A”. We see 
that Y(A")/B(A")=GL(n, K), where GL(n, K) 
denotes the *general linear group. The set of 
all regular affine transformations that leave a 
point 9 of A” invariant constitutes a subgroup 
(A") of Y(A"); it is called an tisotropy group 
at ọ and is isomorphic to GL(n, K). Let § = 
(0; e, , , e,) be an affine frame of A” with 
origin o, and let ọ be a regular affine trans- 
formation of A” given by (2); put x =} x'e,, 
olo +x)=0 +} X'e; a=} aʻe; and f(e) = 
È aje;. Then ¢ is expressed with respect to Ẹ 
by the following equation: 


X'=a'+ ¥ ajx*, det(ai))#0, 1 <i<n. (3) 
k=] 

Conversely, a transformation that is given by 

(3) is a regular afftne transformation. Elements 

of B(A") and (A") are expressed with respect 

to & by 


1<ign, (4) 


¥'= Vajx*,  det(a/)40, 1<i<n, (5) 
1 


k= 
respectively. Hence A(4”) is represented as a 
tsemidirect product group of %(A") and (F>(A”). 
In particular, a regular affine transformation 
that is represented by Xx’ = ax! (1 <i<n) for 
some q€ K (a £0) is called a similarity with the 
origin 0 as center. 

According to F. Klein, the objects we deal 
with in affine geometry are the properties 
(parallelism, barycenters, etc.) that are invar- 
iant under regular affine transformations. 
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Subsets S, and S, of A” are called affinely 
congruent if there exists a regular affine trans- 
formation g sending S$, onto S,. For a fixed 
k, two k-dimensional simplexes are affinely 
congruent. Now we fix an affine frame § in A” 
and denote by x; the coordinates of n + 1 
points p,(0<a<n)in A”. Then the quantity 


1 
V(Po, +> Py) =— 
FU aces 


E (6) 
MG Xi Gee X 


is called the volume with respect to & of the 1- 
dimensional simplex with vertices po, , Pa. If 
ọ is a regular affine transformation given by 
(3), we have V=det(a/) V. Hence the ratio of 
volumes of two n-dimensional simplexes is 
independent of the choice of coordinate sys- 
tems, and is invariant under regular affine 
transformations. 

A regular affine transformation given by (3) 
satisfying det(a}) = | is called an equivalent 
affnity. The set of all equivalent affinities 
constitutes a subgroup of Y{(A"). The geometry 
belonging to this group is called affine geome- 
try in the narrower sense. For instance, the 
concept of volume is an invariant in affine 
geometry in the narrower sense. 


F. Relation to Projective Geometry 


Let P" be a projective space over a coefficient 
field K (= 343 Projective Geometry). If we fix 
a hyperplane 7z, in P”, then the set of projec- 
tive transformations that leave z,, invariant 
constitutes a subgroup of the group of tprojec- 
tive transformations of P”; this subgroup is 
isomorphic to a group of regular affine trans- 
formations. Actually, if we use a projective 
frame [d), a,, ,a,, U] such that a,, , a, are 
points on z,,, then each projective transfor- 
mation leaving 7, invariant is expressed by 
equations of the same form as (3) with respect 
to the tinhomogeneous projective coordinates. 
The point set A” complementary to 7,, in P” 

is an affine space, and z, coincides with the 
hyperplane at infinity. Moreover, two distinct 
lines in P” are parallel in A” if they meet on the 
hyperplane at infinity. Hence, denoting by 
(0:7). <, /") the +homogeneous projective co- 
ordinates of the intersection of a line ļ in A” 
and z,,, we call (1, , /") the direction ratio of 
1. A projective transformation leaving each 
point of z, invariant induces a translation. 
The ‘principle of duality that holds in projec- 
tive geometry does not hold in affine geome- 
try. The tpole of the hyperplane at infinity, 
with respect to a quadric hypersurface, is 
called the center of the quadric hypersurface. A 
regular quadric hypersurface is called central 
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or noncentral according as its center belongs to 
A" or is a point at inlinity. Quadric hypersur- 
faces in an affine space are classilied in several 
ways, by taking account of their relations with 
the hyperplane at infinity (= 78 Conic Sec- 
tions, 350 Quadric Surfaces). 
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8 (111.1) 
Algebra 


Though the word “algebra” usually refers to 

a field of mathematics as wil] be explained 
below, the word may also denote specific 
mathematical structures such as tassociative 
algebras, ‘Jordan algebras, *Clifford algebras, 
etc. The first concepts concerning “unknowns” 
in algebra originated in India, whence came 
also our decimal positional system of numer- 
ation. These ideas were introduced to Europe 
through Arabia in the Renaissance period. F. 
*Viéte systematized them into a symbolic 
method, called algebra, representing numbers 
by letters. The first problem of algebra was 
solving equations. Before Viéte, G. Cardano 
and L. Ferrari had solved algebraic equations 
of degrees 3 and 4; the solution of equations of 
lower degree had been known from antiquity. 
The effort to solve equations of higher degree 
remained unresolved until the middle of the 
19th Century, when N. H. tAbel and E. *Galois 
proved the nonexistence of algebraic solutions 
of such equations. They considered not only 
individual roots of these equations but also 
any rational transforms of their roots at the 
Same time, and thus were led to the concept of 
fields. They also noticed that the problem of 
algebraic solution could be characterized by 
properties of permutation groups of the roots. 
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After the discovery of the Galois group, group 
theory and group-theoretical considerations 
maintained the central position in algebra for 
some time (-» 172 Galois Theory). These de- 
veloped into the “abstract algebra” of this 
Century in the general atmosphere of arithme- 
tization and of axiomatization of mathematics. 
At the turn of the century the monumental 
textbook in three volumes by H. Weber [1] 
was considered the standard work on algebra. 
Then there appeared in 1910 an epoch-making 
paper [2] by E. Steinitz on the abstract theory 
of lields. 

The main objects of algebra today are falge- 
braic systems of various kinds, such as ‘groups, 
‘rings, ‘fields, and *modules. Another fun- 
damental concept of algebra is that of fiso- 
morphism or of homomorphism. The col- 
lection of algebraic systems of a given kind, 
together with the homomorphisms among 
them, gives rise to the notion of ‘category; a 
functor is a sort of homomorphism between 
categories ( 52 Categories and Functors). 
These notions were first used in thomological 
algebra, created in the 1940s by methods 
transferred from topology to algebra; now 
they are of basic significance to the whole of 
mathematics. 

An important branch of algebra with wide 
applications is the theory of fvector spaces, or 
more generally that of tmodules over a ring. 
This branch is called linear algebra. Homo- 
morphisms between finitely generated modules 
can be represented by tmatrices. Another 
branch of algebra, called trepresentation 
theory, is concerned with representations of 
groups or rings by matrices. The methods of 
modern algebra provide useful and powerful 
tools for the whole of mathematics, in partic- 
ular for the theory of numbers and algebraic 
geometry. 

The present development of algebra owes 
much to the activity of the German school in 
the late 1920s represented by E. Noether, E. 
Artin, W. Krull, and B. L. van der Waerden. 
The book by van der Waerden [3] has had a 
great impact on mathematics. N. Bourbaki [4] 
has been influenced by van der Waerden but 
gives accounts of more recent developments, 
particularly in linear algebra. In Japan, M. 
Sono, who worked at about the same period 
as E. Noether, was a forerunner in this field; 
after him, algebraists of the Kyoto School, Y. 
Akizuki, M. Nagata, and their followers, did 
notable research, especially in algebraic geom- 
etry. On the other hand, K. Shoda studied 
with E. Noether toward 1930 in Germany; his 
school includes such algebraists as T. Naka- 
yama, K. Asano, and G. Azumaya. Finally 
K. Morita and his disciples have made signif- 
icant contributions to homological algebra. 
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9 (VIII.2) 
Algebraic Curves 


A. General Remarks 


An ‘algebraic variety of dimension 1 is called 
an algebraic curve (for analytic theory — 11 
Algebraic Functions). The theory of algebraic 
curves has two aspects, the geometry of 1- 
dimensional complex manifolds in projective 
spaces and the theory of function fields of 
transcendence degree 1 (— 3 Abelian Varieties, 
16 Algebraic Varieties). The number-theoretic 
study of algebraic function fields concerns the 
latter theory (- 73 Complex Multiplication, 
450 Zeta Functions). In this article, the geo- 
metric aspect of the theory is emphasized. 

We denote the tuniversal domain by K. 


B. Classical Results on Plane Algebraic Curves 


Let f(X, Y) be a polynomial of degree m in 
two variables X and Y. A point set in an affine 
two-space defined by f(X, Y) = 0 is called the 
plane algebraic curve of degree m. If we set 
F(Xo, X1,X2)= Xo f(X1/Xo, X2/Xo), the 
homogeneous polynomial F defines an alge- 
braic curve of degree min a projective plane 
P?, The curve is called irreducible if the poly- 
nomial f(X, Y) is irreducible. A curve of degree 
1 is said to be a line. Some results of this sec- 
tion are valid only in the case where the char- 
acteristic of K is zero. 

Let C be a plane curve defined by the equa- 
tion f(X, Y) = 0. A point P = (a, b) on C is 
called an r-ple point if f(X +a, Y +b) has no 
term of degree < rin X and Y. At an r-ple 
point there are r tangent straight lines (count- 
ing multiplicity). An r-ple point with r> 1 is 
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called a multiple point (or singular point). If 
these tangents are distinct, P is called ordi- 
nary. An ordinary double point is called a 
node; e.g., the origin for X? +Y? -3X Y = 0. 

An algebraic curve can be transformed into a 
plane curve that has only ordinary multiple 
points by a finite number of plane {Cremona 
transformations (fquadratic transformations of 
the projective plane into itself). 

Let C be anvirreducible plane curve of de- 
gree m> | in a projective plane. The set of all 
tangent lines at nonsingular points of C deter- 
mines a set of points in the dual projective 
plane, and its closure is an algebraic curve C, 
that is called the dual curve of C. The dual 
curve of C becomes the original C. The degree 
m of Ĉis said to be the class of C, which is 
equal to the number of tangent lines to C 
drawn from a general point. A nonsingular 
point P is called a point of inflection (or a flex) 
if the tangent line at P has contact of order 
> 2. If C is defined by an irreducible homoge- 
neous polynomial F(X9, X,, X,), the curve 
defined by det(é? F/0X;6X )=0 is said to be the 
Hessian of C. A nonsingular point P is a point 
of inflection if and only if P is contained in the 
Hessian. A singular point P is said to be a cusp 
if C is detined by an equation Y? = X> + higher 
terms, in terms of suitable affine coordinates X 
and Y where P = (0,0). Whenever the singular 
points of C are only v nodes and y cusps, the 
effective genus g is given by the formula g = 
(m= 1) (m=2)/2 =v -y. In addition, suppose 
that the dual curve has only v’ nodes and y’ 
cusps as singularities. Then C has y’ points 
of inflection, and m and y’ are given by m' = 
mm 1)--2v—3y and y =3m(m—2)—6v—8y. 
Moreover, m=m(m- 1)—2v'— 3y and y = 
3m'(m'—2) 6y' 8y' hold. These formulas are 
called Pliicker formulas. 

For example, a nonsingular plane curve of 
degree 3 is an elliptic curve, i.e., g = 1, of class 
6, which has 9 points of inflection. A non- 
singular plane curve of degree 4 has, in gen- 
eral, 24 points of inflection and 28 bitangents. 
This results from the Pliicker formulas. 


C. Fundamental Notions 


In what follows, by a curve we mean an alge- 
braic variety of dimension 1. Let T be a non- 
singular complete irreducible curve. An ele- 
ment of the free Abelian group generated by 
points of T is called a divisor. A divisor is 
written in the form a = È n;P,, with n;e Z. The 
integer n= > n; is called the degree of a and is 
denoted by deg a. The expression for a divisor a 
is said to be reduced if P, # P, for į #j. A divisor 
whose reduced expression has only positive 
coefficients is called a positive divisor (or effec- 
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tive divisor, or integral divisor), and this is 
denoted by a>0. The group of divisors on T is 
denoted by G(T), and the subgroup consisting 
of divisors of degree 0 is denoted by G(T). Let 
P be a point of I. The subset of the function 
field K(T) of T consisting of functions regular 
at P forms a valuation ring Rp for a fdiscrete 
valuation of K(T). A prime element t of Rp is 
called a local parameter at P. Let vp be the 
‘normalized valuation of K(I°) defined by Rp; 
the integer vp(f) is called the order off at P. 
The point P is a zero off if vp( f )> 0; it is a 
pole off if vp(f)< 0. There are only a finite 
number of poles and zeros of a given function 
f. The divisor È vp( f) P is called the divisor of 
the function f and is denoted by (f). The set of 
divisors of functions forms a subgroup G, of 
G,. Any divisor a in G is called a principal 
divisor (we also say that a is tlinearly equiva- 
lent to zero and write a ~ 0). 

Let a be an arbitrary divisor. The set of all 
positive divisors that are linearly equivalent to 
a forms a complete linear system |a| deter- 
mined by a. We set L(a) ={ fe K(r) (£) + 
a>0} U {0}. Then L(a) is a finite-dimensional 
vector space over K, and 1-dimensional sub- 
spaces of L(a) correspond bijectively to the 
elements of aj. We set J(a) = dim, L(a) and 
dim |a|=/(a)— 1. Then dim|a| is called the 
dimension of a]. For any divisor a, the integer 
deg a = dim|a is nonnegative and bounded. 
The supremum g of such integers is called the 
genus of I’. The nonnegative integer i(a) = 
g ~dega +dim a is called the speciality index 
of a. 

Let œ be a tdifferential form on I’, P be a 
point of J’, and f be a local parameter at P. 
Then œ can be written in the form w = fdt. We 
now set vp(w) = vp( f) and (w) = È vp(w) P. Then 
(w) is a well-defined divisor, and the class of 
(w) in G/G, is independent of the choice of w. 
This divisor class is called the canonical class; 
any divisor in this class is called a canonical 
divisor (or differential divisor) and is denoted 
by t. We have J(f) = y, deg [= 2g ~ 2. Given a 
divisor a, the index i(a) is equal to the number 
of linearly independent differentials œ such 
that (@)>a, i(a)=l(f—a). The equality J(a)= 
deg a —g + 1 + i(a) is called the Riemann-Roch 
theorem. 

For any irreducible algebraic curve I’, there 
exists a birationally equivalent nonsingular 
curve [ that is unique up to isomorphism. The 
genus of [ is called the effective genus of Į, A 
curve whose effective genus is zero is called a 
rational (or unicursal) curve. An elliptic curve is 
a curve whose effective genus is 1. An irreduc- 
ible curve with genus > 2 is called a hyper- 
elliptic curve if K(T) is a quadratic extension of 
a field K(t) for some t. 

A positive divisor a on a nonsingular com- 
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plete irreducible curve F is called a special 
divisor if i(a) > 0. In this case, 2 (J(a) = 1) < deg a, 
where equality holds if and only if a = 0 or a w 
f, or Į is a hyperelliptic curve (Clifford’s 
theorem). Let I be a nonsingular complete 
irreducible curve and k be a sublield of K such 
that I’ is defined over k. Denoting by k the 
algebraic closure of k, we cal] a divisor p = 

2 n;P, on T a prime rational divisor over kif 

p satisfies the following three conditions: (i) p 
is invariant under any automorphism g of k/k; 
(ii) for any j, there exists an automorphism 

o; of k/k such that P, = P% Gii) n; = . =n, = 
[k(P,):k];. An element in the subgroup of 
G(T) generated by prime rational k-divisors is 
called a k-rational divisor. Let k(T`) be the 
subset of K(T) consisting of functions f defined 
over k. Then k(I) is a subfield of K(I), and the 
quotient field of k(T)®,K = K(T). k(T)is 
called the function field of T over k. Let p be a 
prime rational k-divisor, and let P be a point 
of p. Then Rp N k(T) is a valuation ring of k(T) 
uniquely determined by p and independent of 
the choice of the point P in p. We call this 
valuation ring the valuation ring determined 


by r. 


D. Algehraic Function Fields 


Let k be a field, and let K be a finite separable 
extension of a purely transcendental extension 
k(x) of k such that k is maximally algebraic in 
K. Then K is called an algehraic function field 
over k of dimension | (or of transcendence 
degree 1). The equivalence class of fexponen- 
tial valuations of K that are trivial over k is 
called a prime divisor of K /k, An element of the 
free Abelian group generated by prime divisors 
is called a divisor of K/k. The group opera- 
tion in the divisor group of K/k is usually 
denoted multiplicatively. Let Rp be the 
valuation ring of the prime divisor P, and 

let Mp be the maximal ideal of R,. The de- 
gree deg P of the prime divisor P is defined by 
[(Rp/Mp): K. If we replace the terms: curve T 
by function field K/k; K(T) by K; K by k and 
points on T by prime divisors of K/k, we can 
develop the theory of the function field K/k, 
which is similar to the theory of nonsingular 
curves T (- Sections B, C). Thus we define the 
genus of the function field K/k. 

Suppose we are given an algebraic function 
field K/k of dimension 1. An algebraic curve I’ 
defined over kis called a model of K/k if k(T) 
and K are k-isomorphic. For any function field 
of dimension 1, there always exist two ele- 
ments x and y in K such that K= k(x, y). Let 
f(X, Y) be an irreducible polynomial such that 
f(x,y) = 0. Then the plane curve defined by the 
equation f(X, Y) = 0 is a model of K/k. Among 
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the models of K/k there exists a tnormal model 
T, over k that is unique up to isomorphism 
(and the uniqueness of the normal model of 
the function field within the birational equiva- 
lence class of varieties holds only for curves). In 
particular, if k is the complex number tield, the 
normal model F, is the *Riemann surface of 
the function lield K/k. If Ty has no singular 
point, the theory of the curve I and the 
theory of the function tield K/k are essentially 
identical. (This occurs, for example, when & is 
tperfect.) In that case the genus of Tọ is equal 
to that of K /k, In general, the genus of the 
function field is not less than the genus of the 
normal model Į}, and it is greater than the 
latter if [| has a singular point. If the genus of 
K/k is zero, we Can take a plane quadratic 
curve as a model of K/k. Moreover, K/k has a 
prime divisor of degree 1 if and only if K is a 
purely transcendental extension of k. A func- 
tion lield K/k of genus 1 is called an elliptic 
function field. If K has a prime divisor of de- 
gree 1, an elliptic function field K has a model 
of a plane cubic curve. Moreover, if the char- 
acteristic of the lield & is different from 2, we 
can take as the model T} the curve defined by 
anequationoftheform Y*=4X%—g,X —q3. 
This is called Weierstrass’s canonical form. 
The number j=(g,3—27937) g? #O)is a 
birational invariant of J}. 


E. Jacobian Varieties 


Let [ be a nonsingular curve. A tgroup variety 
J is called the Jacobian variety of T if it has the 
following four properties (we fix an algebrai- 
cally closed ‘field k of definition for Į and J): 
(i) There exists an isomorphism @ (of-abstract 
groups) of Go(I)/G,(I) into J. (ii) ® is contin- 
uous in the following sense: Let ii, b be elements 
of Go(I)/G,(T) represented by a, b. If bis a 
specialization of a over a field K( > k), then 
(b) is also a specialization of ®(a) over K. (iii) 
If there exists a K-rational divisor in the class 
a, then the point ®(a) is also K-rational. (iv) 
For any €¢J, there exists a k(¢)-rational divi- 
sor a in Go such that O(a mod G,) = ë. A group 
variety J satisfying these conditions is neces- 
sarily a complete variety, hence an fAbelian 
variety, and is determined uniquely up to 
isomorphism. The construction of Jacobian 
varieties Over a lield of arbitrary characteristic 
is due to A. Weil [27] (for analytic construc- 
tion > 11 Algebraic Functions ©). 

Let P be a tgeneric point of F over k, and let 
P, be a fixed k-rational point. Then g(P)= 
®(P— P,) defines a rational mapping of F into 
J, and @, which is an isomorphism of Į and its 
image ọ(T}, is determined uniquely by ® up to 
translation on J. This mapping ø is called the 


30 


canonical function on J, The dimension of J is 
equal to the genus g of Į. If P,,..., P, are 
independent generic points of T over k, then 
k(P,,...,P4)s is the function field of J over k, 
where k(P,,., P,)s is the subfield invariant 
under the group of g! automorphisms (P,,.. , 
P,)—>(P,,5 > P,). The Jacobian variety of T is 
also the +Picard variety of T, and it is equal to 
the tAlbanese variety of F (- 16 Algebraic 
Varieties P). Hence for any function f on [I 
with values in an Abelian variety A, there 
exists a unique homomorphism 4 of J into A 
such that f=A0 ọ+ const. This / is called the 
linear extension of f. 

Let © be the set of points on J that can be 
written as p(P,) ++ @(P,-1). Then @ is an 
irreducible subvariety of codimension 1. The 
divisor @ is called the canonical divisor of J. 
The Jacobian variety that is polarized by the 
divisor © is called the canonically polarized 
Jacobian variety (— 3 Abelian Varieties G). If 
two curves Į and J” are birationally equiva- 
lent, the canonically polarized Jacobians of I 
and I” are isomorphic. Conversely, if the 
canonically polarized Jacobian varieties J of T 
and J’ of [’ are isomorphic, then Į and J” are 
birationally equivalent (Torelli’s theorem). Let 
r be any integer such that 1 <r < g, and let W, 
be the set of points that are written in the 
form 


O(P,)+...+0(P,) 


(W, = (T), W,_, = ©, W, = J). Then we have 
©” =r! W,_, (fnumerically equivalent) and 
(Q) = y!, where ©" is the class of intersec- 
tions of r copies of @, The existence of a divi- 
sor @ is characteristic for Jacobian varieties. 
Actually, if A is an Abelian variety of dimen- 
sion n that hasan irreducible subvariety x"~! 
of codimension 1 and a positive l-cycle C such 
that (X)=n! and X" )=(n_ 1)! C, then C 
is a nonsingular irreducible curve, A is the 
Jacobian variety of C, and X is the canonical 
divisor. The canonical divisor © is defined 
by a ftheta function in the classical case. 
For a divisor a = È n;P, we define o(a) to be 
En;o(P)eJ. For lixed numbers 1 <r<g, 0<d, 
welet Wf denote {p(a)eJ|I(a)>d+1, dega= 
Y}. Then Wf € W, and is a tsubscheme of J. 
Wecall the number p=g—({d+ 1)(g—r+d) 
the Brill-Noether number. Then dim W‘ > p. 
Moreover, if I is a general curve, then dim we 
= p. In particular, if p < 0, then W?= @. This 
result has been verified recently by S. Kleiman, 
D. Laksov, P. Griffiths, and J. Harris [9, 14]. 
Let J be a nonsingular curve, and let œ be a 
differential form on T. If the divisor (w) is >0, 
the œ is called a regular 1-form or differential 
form of tbe first kind. Let Q be the tsheaf of 
germs of regular differential forms. A differen- 
tial form of the first kind is an element of 


H (T , Q), and vice versa. Let f be a canonical 
divisor. Then we have a natural isomorphism 
H°(T, Q) = L(f), and the number of linearly 
independent differential forms of the first kind 
is equal to the genus g of I’. The tresidue of a 
differential can be defined as in the classical 
case. A differential that has nonzero residues is 
called a differential of the third kind. The re- 
sidue theorem > Resp œ = 0 holds for any dif- 
ferential (y, The form œ% is called a differential 
form of the second kind if for any Pe T there 
exists a rational function fp such that w—dfp is 
regular at P. The set of differential forms of the 
second kind forms a linear space G, over the 
universal domain and contains the subspace 
G, consisting of the differential forms of the 
first kind. The quotient space G,/dk(I) has 
dimension 29 or y according as the character- 
istic of the universal domain is 0 or not. 

When the characteristic p of the universal 
domain is positive, we have what is called the 
Cartier operator. Let [ be a curve defined over 
a perfect tield k, let L = k(I), and let t be an 
element of L that is transcendental over k and 
such that L/k(t) is separable. Then any dif- 
ferential œ of L/k is written uniquely as @ 
=( JE + fPt+...+ f? t” ')dt, where fe L. 
Then the Cartier operator C given by Cœ 
= f,-1 dt is well defined and independent of 
the choice of t and leaves G, invariant. Hence 
given a basis œw, ,@, of G,, we obtain a 
matrix (a,) with coefficient in L by Co, = 
bs G;,(0;( 1 <i<yy). This gx g matrix A is called 
the Hasse-Witt matrix of Į. The class of 
A modulo the transformations of the form 
SPAS is a birational invariant of ĮI and plays 
an important role in the theory of unramified 
cyclic p-extensions of the algebraic function 


field. 


F. Generalized Jacobian Varieties 


The notion of linear equivalence of divisors on 
a nonsingular curve can be extended to a more 
general situation. Such attempts have been 
made by M. Noether, F. Severi, and M. Rosen- 
licht, who succeeded in obtaining sucha gen- 
eralization [22]. 

Let I" be an algebraic curve, and let 
P, , , P, be singular points of T, Let Dp. be 
the flocal ring of P,. We set O = (\j_, Dp, and 
I'= = {P,, i P}. An element of the free 
Abelian group G(T) generated by points of Į” 
is called a Į -divisor. Let a be a ’-divisor and 
set L(a)={feD|(f)+a>0}U {0}. Then L(a) 
is a finite-dimensional linear space (over the 
universal domain). The dimension of L(a) is 
denoted by 1(a), and we set dimja|=((a)— 1. 
The upper bound z of deg(a) = dim| a|is a 
nonnegative integer and is called the O-genus 
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of I, We call i(a) =z — dega +dimjaļ the O- 
speciality index of the divisor a. Let C be a 
nonsingular curve birationally equivalent 

to Į, and letQ,,...,@Q, be points of C that 
correspond to singular points of Į. An O- 
differential w is a differential form on C (of 
K(T) = K(C)) such that X;_, Resg, fo = 0 for 
any fe. Then i(a) is equal to the number of 
linearly independent ©-differentials w such 
that (@)>a in J’. The equality l(a) = dega — 

m + 1 + i(a) is called the generalized Riemann- 
Roch theorem. An ©-differential œ is called an 
-differential of the first kind if œ is regular 
everywhere on Į”, The number of linearly 
independent ©-differentials of the first kind is 
equal to the O-genus x. Let g be the effective 
genus of [, i.e., the genus of C. Then we have 
the equality 7 —g = dim,(©/) = ô, where Dıs 
the integral closure of 8 in K(T). The set of O- 
differentials forms an D-module that is in 
general not of rank 1. Hence in this case, we 
do not have the “canonical divisor.” Let c be 
the conductor of O/D. Then c determines a 
divisor in a natural way. If we denote the de- 
gree of this divisor by d, we have the tnequal- 
ity d+1<d<26. We have d=20 if and only 
if the set of O-differentials forms an D-module 
of rank 1. This case occurs, for example, if Į is 
a curve on a nonsingular surface or a complete 
intersection. Two [-divisors a and b are said 
to be D-linearly equivalent if there exists a unit 
f of © such that a-b=(f). The set of T- 
divisors that are O-linearly equivalent to zero 
forms a subgroup G(T) of G(T). There exists a 
group variety Jo, unique up to isomorphism, 
that satisfies the four conditions required for 
Jacobian varieties (- Section E) with respect 
to the class group G,(I")/G,(I). The variety Jy 
is called the generalized Jacobian variety. The 
generalized Jacobian variety is not complete, 
in general. If J is the Jacobian variety of C, 
then J, is an extension of J by a connected 
‘linear algebraic group I.. Any Abelian exten- 
sion of the function tield of I can be obtained 
by the tisogentes of the generalized Jacobian 
variety of I’ [22]. This fact plays an important 
role in class field theory over algebraic func- 
tion fields (= 59 Class Field Theory G). The 
theory for nonsingular curves is considered as 
the special case in which © = K(I). 

Suppose that I’ is situated in a projective 
space of dimension n. Let p be the prime ideal 
in k[Xo, Xi, > X,] deiining Į and y(p, m) be 
the number of linearly independent homoge- 
neous polynomials of degree m modulo p. Then 
y(p, m) is a polynomial in m for large m. This 
polynomial is called the Hilbert polynomial of 
p (or T`). Let c be the constant term of the 
Hilbert polynomial. The number p,(T) = 1 —c 
is called the arithmetic genus of Į and is equal 
to the O-genus of I. Let I be a nonsingular 
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irreducible curve in P? of degree d. If I is 
contained in a plane, then p,(T)= (d 1): 
(d-2)/2. Otherwise, p,(I')<d?7/4—d+ 1 when 
dis even, and p,(I')<(d?— 1)/4-d+ 1 when d 
is odd [11]. 


G. Sheaf Theory 


Let Į be an irreducible curve and 0, be the 
local ring of p. Then 0; = x 0, is an talgebraic 
coherent sheaf, which is called the structure 
sheaf of I’, and dim, H! (I, @,) is equal to the 
arithmetic genus 7 of I, Let a be a I-divisor, 
and let @,-(a) be the tsheaf of germs of rational 
functions f such that (f)+a>0 and feXgQ for 
every singular point Q of T (- 383 Sheaves 
D). Then dim, H+ (T, O,(a)) is equal to the 
speciality index i(a), and dim, H°(T, ©p(a)) 

is equal to I(a). When I has no singular 
point, the Riemann-Roch theorem is deduced 
naturally from tSerre’s duality theorem: 
HU, O-(a))S HL, O —a)). 


H. Algebraic Correspondence 


Let I be a nonsingular curve. A divisor of 

the product variety Į x T is called an alge- 
braic correspondence of Į [26,27]. Let Dy 

be the subgroup consisting of divisors that 

are linearly equivalent to degenerate divisors 
ax [ +T x b. Then the class group ¢(I) = 

G(I x I)/Dp is called the group of classes of 
algebraic correspondences. We write X =0 if 

X is an element of D}. Let X be an algebraic 
correspondence, k a fjeld of delinition for T 
over which X is rational, and P a generic 
point of I over k. Then X(P)=pr,[.X(P x T)] 
is rational over KP). The composite X ,0 X, 
of two correspondences X, and X, 1s defined 
by (X, 0 X,)(P)=X,(X,(P)) whenever they 
have meaning. The composite X, o X, deter- 
mines an element of @(I) that depends only on 
the classes of X, and X,. This multiplication 
supplies the group @(I) with the structure of 
an associative ring. This ring is called the 
correspondeuce ring of F, The correspondence 
ring (T`) and the ring .» of endomorphisms of 
the Jacobian variety J are isomorphic, and the 
isomorphism is given by the following rule: Let 
č be an element of G(T), and let X be a divisor 
in č, Let P be a generic point of I with re- 
ference to kover which X is rational. Let Py be 
a k-rational point of I’, Then the class of 

X(P) — X(P,) modulo GI) is independent of 
the choice of a divisor X in the given class. We 
set Y(P)=@(X(P)— X(P))) andlet 4 be the 
linear extension of Y. The correspondence 
€—A is an anti-isomorphism of @(I°) and Z. 
Now we set = £ ® Q. Then .¥, contains 
an automorphism 1 of order 2 called an in- 
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volution. Let 1 be a rational prime different 
from the characteristic p. Then A has a faithful 
representation by 2g x 2g matrices with co- 
efficients in /-adic integers. The ftrace g of 

this representation has the property that 

o( 80 B) >0 if P 40 (Castelnuovo’s lemma). £g 
is an algebra of finite tank over Q, and of is a 
finitely generated Abelian group. Based on 
these results A. Weil proved the Riemann 
hypothesis for congruent (-functions on a non- 
singular curve (- 450 Zeta Functions P), 


I. Coverings 


Let I and C be nonsingular curves such that 
there exists a regular mapping z: [>C. Then 
there is an injection of the function field K(C) 
into K(T). If K(f) is separably algebraic over 
K(C), then ĮI is called a covering (curve) of C. 
The integer [K(I): K(C)] = n is called the de- 
gree of covering. Let P be a point of T and let 
Q=n(P). Let t, s be local parameters at P on I 
and at Q on C, respectively. The nonnegative 
integer vp(ds/dt) is called the differential index 
at P and is denoted by mp. The index mp is 
zero except for a finite number of points. The 
divisor È mpP is called the branch divisor. The 
covering Į is called an unramified covering if 
the branch divisor is zero. If we denote the 
branch divisor by a, we have the formula 
2g(I)—2 =n(2g(C)—2)+dega, where g(T) 
and g(C) are genera of [` and C, respectively. 
This is called the Riemann-Hurwitz formula. 
This formula yields at once that a rational 
curve has no nontrivial unramified covering 
and that I’ can be an unramified covering of 
itself if and only if Į is an elliptic curve. 


J. Theory of Moduli 


Let „M (k) be the set of isomorphy classes of 
complete nonsingular irreducible curves (here 
simply called “curves”) of genus g defined over 
a field k. We can endow .@,(k) with a structure 
of an algebraic variety over k with the prop- 
erty that for any smooth family (over K x: € —> 
S of curves of genus y the map T:S>.@,(k) 
sending s to the isomorphy class of the fiber 
m '(s) is a morphism. This algebraic variety is 
called the (coarse) moduli space of curves of 
genus g. Furthermore, the moduli space over 
Spec(Z) exists (D. Mumford [16]). -4,(k) is 
tnormal, irreducible and fquasiprojective (but 
not complete for g > 0) of dimension 0 (g = 0), 1 
(g=1), 39 —3 (g >2) (=3g —3 + dim Aut(C)) 
with only quotient singularity [5]. 

Since when g = 1 any elliptic curve C over 
an algebraically closed lield k is isomorphic to 
a plane curve with the Weierstrass canonical 
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form y? =4x°? — gax — g, (if char(k) #2; the case 
of char(k) = 2 needs a slight modification), the 
correspondence C-j(g, g3) (— Section D) 
defines an isomorphism .#/, ~ A, (affine line). 
In the case of g > 2 and k= C we have another 
function-theoretic construction of .@, due to 
Teichmüller (- 11 Algebraic Functions F). 

Let ./, be the coarse moduli space (over Z) 
of principally polarized Abelian varieties of 
dimension g (-+ 3 Abelian Varieties 1). For a 
curve C we denote the Jacobian variety of C 
by J(C) (- Section E). The correspondence 
C-J(C) defines a morphism i:.4,->.%, which 
is injective (Theorem of Torelli). It is even an 
timmersion (F. Oort, J. Steenbrinck). If g= 1, 
2, 3, then i is an open immersion whose image 
we can describe. 

For k=C we can express the above map 
i by using periods of curves. Namely, let 
Xis, Xg, By,, Êg be a (canonical) basis of 
H (C, Z) detined by *normal sections of C 
(considered as a real surface). Let w,,.., w, be 
a basis of tdifferential forms of the first kind 
on C with Sp, co; = 0; Then the matrix Q = 
(fa) is symmetric and has a positive definite 
imaginary part, i.e., is an element of the tSie- 
gel upper half-space S, of degree y. With the 
identification y, = S,/Sp(g, Z) (- 3 Abelian 
Varieties 1) the map i:.@,—>.%, is nothing but 
the one sending the isomorphy class of C to 
Rmod Sp(g, Z). For g =4 the closure of i(.#,) 
is a principal divisor detined explicitly with 
theta constants (Schottky, J.-I. Igusa). 


K. Stable Curves 


A reduced connected complete curve C over k 
is called a stable curve of genus g ( > (0) ifi) C 
has only ordinary double points as possible 
singularities; (ii) when I is a smooth rational 
irreducible component of C, then F intersects 
the other components in more than 2 points; 
(iii) dim, H' (C, Co) =g. 

There exists a tcoarse moduli space S, of 
stable curves of genus g which contains -4, 
as a Zariski open subset. The space .% detined 
over Spec(Z) is tcomplete and even tpro- 
jective; hence Y, gives a compactification of 
M,.The completeness of -% follows from the 
stable reduction theorem as follows: let R be a 
discrete valuation ring with quotient field K 
and C be a smooth connected curve over K. 
Then there exists a finite separable algebraic 
extension L of K such that the curve C x x L 
extends to a flat family of stable curves over 
Spec(R,), where R, denotes the integral clo- 
sure of R in L. In this case we say that the 
curve C x x L has a stable reduction in R,. 
With the above notation a curve C over K 
has a stable reduction in R if and only if its 
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Jacobian variety J(C) has a ‘stable reduction 
in R (- 3 Abelian Varieties N). 

Over C, -¥, has the tSatake compactification 
,, which is a disjoint union of .%,,,0 <g' <g, 
as a set. The injection i: H >s, (- Section J) 
extends to a morphism j:.%—./, that sends 
the isomorphy class of a stable curve C = 
C, UU C, (C, irreducible) to that of J (C,) 

x x J(Gea,, g'=Z genus(C,), where Č, is 
the tnormalization of C;. In particular the 
closure of the image of j in . is the set of 
products of Jacobian varieties. 
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Algebraic Equations 


A. General Remarks 


Let F (XiX) AKG, © Xm) be r 
tpolynomials in m variables X,,., Xm Over a 
tfield k. Then the equations 


F, =0,. ..) F,=0 


are called algebraic equations in m unknowns. 
When we consider these equations simulta- 
neously, where r > 2, we call them a system of r 
equations or simultaneous equations. (For r = 1, 
a system of one equation means the single 
equation F, = 0.) Coefficients of F,,, F, are 
called coefficients of the system, and the great- 
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est of the degrees of F}, . . , F, is called the 
degree of the system. 

To solve a system of equations (henceforth 
in this article we shall omit the word “alge- 
braic”) means to find the common tzero points 
(in an talgebraically closed field containing k) 
of elements F,,.., F, of the tpolynomial ring 
k[X,, » X,]. If there exist no common zero 
points, the system is said to be inconsistent; if 
there exists a finite number of such points, it 
is said to be regular; and if there are an infinite 
number of such points, it is called indetermin- 
ate. The telimination method allows us to 
reduce the problem of solving a system of r 
equations to the case r = 1. In particular, any 
regular system of equations can be reduced to 
the case m=r= 1. 


B. Equations in One Unknown 


For the above reason, it is important to con- 
sider an equation of the form f(X ) = 0, where 


Ay £0. (1) 


This gives the general form of an algebraic 
equation in one unknown. 

According as f(X ) is reducible or not in the 
tpolynomial ring k[X], the equationf(X)=O 
is called reducible or irreducible (- 337 Poly- 
nomials). In some talgebraic extension field K 
of k, f(X) can be factored: 


J(X) = ag(X — 04)(X = ap)... (X — an). (2) 


f(X)=ayX"+a,X"' +... +a, 


Xis- 5%, are called the roots of the equation 


f (X) =O. Hence, any algebraic equation of 


degree n has exactly n roots (Kronecker’s 
theorem). Now, ( —1)‘a;/ag is equal to the tele- 
mentary symmetric function of degree į of 
%15+++5a,. Some of the roots a,,.--,, may be 
identical. If x appears p times in a,,., a, We 
say that & is a p-tuple root, and p is called the 
multiplicity of the root g. When p = 1, a is 
called a simple root, and when p > 2, « is called 
a multiple root. Let B,,..., B, be all the distinct 
roots among a,,. ,%,, and let p; be the multi- 
plicity of B{i= 1, , v). Then 


F(X) =ao(X =p}. (XB), (2) 
pt.: +p, =n. 

If py,., p, are not divisible by the *character- 
istic of k, the greatest common divisor g off 
and 

f'=na,X""'+(n—Na,X"7+...4a,_, 


is (X — fp) 1 ...(X —B,)’» +. Thus we can 
reduce the multiplicity of every root to 1 by 
dividing f by g. Any irreducible equation over 
a field of characteristic 0 has no multiple roots. 
Equation (1) has multiple roots if and only if 
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its tdiscriminant D is equal to 0 (- 149 Fields; 
172 Galois Theory). 


C. Equations of Special Type 


In Sections C and D we assume that the char- 
acteristic of k is zero. 


Binomial Equations. An equation of the type 
X™ =a =Q is called a binomial equation. It is 
solved by root extraction. Let n/a (mtb root of 
u) be one of the roots (if a is a positive real 
number, %/a usually denotes a positive real 
root). Then n/a multiplied by 1, č, €7,, E"! 
are the roots of X"—a=O, where ¢ is a tprim- 
itive mth root of unity. 


Reciprocal Equations. An equation a, X” + 
a,X" '+...+a,=0 is called a reciprocal 
equation if dy) =a, ar =4p-1, 42 = lp-2 3; A 
reciprocal equation of an odd degree n = 

2m + thas a root X = = 1, and dividing the left 
side by X + 1 we get a reciprocal equation of 
degree 2m, A reciprocal equation of degree 

n = 2m is reduced to an equation of degree m 
in Y= X + X `! and the quadratic equation 
X?—-XY+1=0. 


D. Equations of Lower Degree 
(- Appendix A, Table 1) 


(1) A linear equation @)X + a,=0 hasa single 
root —d,/dy.(2) the roots of a quadratic equa- 
tion aX? +a, X +a, =0 are given by (—a, 

+./a? —4a9a,)/2ay. (3) To solve a cubic 
equation a, X °? +a; X? +a, X +a,=0, we set 
A =9doa,a,—2a3 —27aza,, Ay=a? —3ao43, 
and solve the quadratic equation T? A, T 

+ A}=0. Let t, and t, be the roots of this 
quadratic equation, and let œ be any cube root 
of |, Then (-a, + w/t, +0? IGNI aa is a 
root of the original cubic equation. (Cardano’s 
formula). If we apply this method to a cubic 
equation aX? + bX? +cX +d=0 with real 
coefficients, we need to use complex cube 
roots even if the roots of the equation are real. 
In fact, it has been proved that it is not pos- 
sible to solve this equation within the real 
numbers in this case; i.e., if the cubic equation 
is irreducible over the extension Qta, b, c,d) of 
the rational number field Q, and if all of its 
roots are real, it is impossible to find the roots 
only by rational operations and with real 
radicals. This is called the casus irreducibilis. 
(4) A quartic equation ao X4 +a, X 2+ a,X 2 
+a, X +a, = 0 can be solved by means of 
reduction to a cubic equation (L. Ferrari) (— 
Appendix A, Table 1). Generally, the proce- 
dure of solving an algebraic equation, 1.€., 
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finding the roots of a given equation from its 
coefficients by means of a finite number of 
rational operations and extractions of radicals, 
is called a solution by radicals (or algebraic 
solution). The tgeneral algebraic equation 
whose degree is > 5 cannot be solved by 
radicals (N. H. Abel) (= 172 Galois Theory). 


E. Analytic Tbeory 


In this section, k denotes the field R of real 
numbers or the field C of complex numbers. 
These cases have been studied for a long time, 
for practical reasons. 

Concerning the case k = C, the field C is 
talgebraically closed; i.e., every equation with 
coefficients from C has a root in C (Gauss’s 
theorem, called the fundamental theorem of 
algebra). Accordingly, in the field C, we always 
have equations (2) and (2’). 

Let «,,, x, be the roots of equation (1). 
Then each 2; is a continuous function of coeffi- 
cients a,, a,, , a, Concerning the location of 
roots of f(X)=0 and f(X )=0 on the complex 
plane, we have the following theorems: 

(1) Any convex polygon on the complex 
plane containing the roots of f(X)=0 also 
contains the roots off’ (X) = 0 (Gauss). 

(2) Let C be a rectifiable ‘Jordan curve not 
passing through a root off(X) = 0. Then 
the number (C, f) of the roots of f(X )=0 
lying in the region enclosed by C is equal to 
(1/2) (e(f'(2/ fx) dz, where the multiplicity 
of the roots is taken into account. 

(3) Let C be a Jordan curve on the complex 
plane. If | f(z)|>|g(z)| at every point z on C, 
then the equations f=0and f+ g = 0 have the 
same number of roots (counting multiplicity) 
within the region enclosed by C (Roucbé’s 
theorem). 

(4) The absolute value of a root of equation 
(1) is less than 


M=max(|a,/do|, ..., |@,/dg|)+ 1. 


(5) Let D be the tdiscriminant of f, and as- 
sume that |x;| <M (i=1,...,n). Then |æ; — xl? 
> D/(2M)"""')2 =E, Since the value of 
|D| is known fromf’and one value of M is 
given by theorem (4), we have one value of E. 
If we draw a circle on the complex plane with 
center at the origin and with radius M, and if 
we cover it with a net whose meshes have 
diameters less than JE /2, then the interior of 
each mesh contains at most one root of f=0. 

When k=R,1¢., fERLX ], let B,,... . f, 
denote the distinct roots off= 0, and recall 
equation (2°). Suppose that f,,, pe R and 
the others ¢R. Then v—a is an even integer 2x, 
and we can renumber f,,,, ,f, sO that f,,, 
=Biictissees Pare = B, (B denotes the con- 
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jugate of P) and Patt =Patkti> aa Pat = Py- 
In this case, f,,., $, are the real roots of 


equation (2), and the other f}’s are called the 
imaginary roots. 

(6) IffER[X ] and ay>a,>.. >a,>0, then 
the absolute value of any root of equation (1) 
is less than | (Kakeya-Enestrém theorem). 

Concerning the real roots of an equationf 
= 0, where feR[X ], we have the following 
theorems: Let N(a, b)(a, be R) denote the 
number of real roots in the interval (a, b). 
Furthermore, let V(c,, c2, - » €p) denote the 
number of changes of sign in the sequence 
C1,€,, Cp of real numbers, which is defined 
as follows: Suppose that we have the sequence 
Cy rely, after deleting the terms c; = 0 from 
the sequence ¢,, €2,, Cp. Then 


142! 
„€ =z 2 (1—sgne,.c, |). 

(7) NCO, œ)= V (ag, a, , a,) (mod 2) and 
N < V(Descartes’s theorem). 

(8) Let Vo= Vi f(c), f'(c), « . P“). Then 
N(a, 6) =V(a) V(b) (mod 2) and N < V(a) = 
vib) (Fourier’s theorem). 

(9) We may assume that f =0 has no multi- 
ple roots. Construct a finite series fo = f, f4 = 
f’, „fı Of polynomials over R such that 
§=fa-—tia forj=1,2,..../—land feR, 
by successive application of the tdivision al- 
gorithm. Let V©= V(folo),file. AO). 
Then N(a, b) = V(a) = V(b) (Sturm’s theorem). 
By means of this theorem we can determine the 
location of real roots as precisely as we wish. 

(10) In order that every root a, of an equa- 
tion f = 0 with a, > O lies on the left side of 
the imaginary axis, i.e., Re «;< 0, it is necessary 
and sufficient that in the following matrix the 
principal minors composed of the first r rows 
and first r columns be positive for al] r = 
1,2, , n (Hurwitz’s theorem): 


Vey Gaye 


So 

S 

a l 
i 
N 

RY 


Also, for feC[X], various results have been 
obtained about under what conditions al] the 
roots off= 0 lie on one side of a given straight 
line or inside a given circle (e.g., the unit circle) 
(- 301 Numerical Solution of Algebraic 
Equations). 
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Algebraic Functions 


A. Definition 


An algebraic function is a multiple-valued 
tanalytic function w = w(z) defined by an tirre- 
ducible algebraic equation P(z, w) =0 with 
complex coefficients. 


B. History and Methods 


The theory of algebraic functions evolved from 
the works of C. F. Gauss, N. H. Abel, and C. 
G. J. Jacobi on telliptic functions in the early 
19th Century. Stimulated by their works, B. 
Riemann and K. Weierstrass established the 
foundations of the theory of complex functions 
and developed the important theory of alge- 
braic functions. 

The equation P(z, w) = 0 defines a curve in 
the 2-dimensional complex tprojective space 
with inhomogeneous coordinates z, w. Inves- 
tigations from this point of view were initiated 
by Riemann, A. Clebsch, and P. Gordan. 

This approach was followed by A. Brill, M. 
Noether, and the Italian school (F. Severi, C. 
Segre, etc.) and has developed into contempo- 
rary algebraic geometry (— 9 Algebraic 
Curves, 12 Algebraic Geometry). 

The set of tfunction elements w(z) satisfying 
P(z, w) = 0 is a tcomplex manifold R, a closed 
(= compact) fRiemann surface, on which z and 
w are tmeromorphic functions. The field Ky, 
consisting of the meromorphic functions on R 
is an talgebraic function field C(z, w). Con- 
versely, for any closed Riemann surface R, the 
field Ky is an talgebraic function field in one 
variable over C, and any pair of functions z 
and w with Ky = C(z, w) has the property that 
R is tconformally equivalent to the Riemann 
surface determined in the above fashion by the 
irreducible algebraic equation P(z, w) = 0 satis- 
fied by z and w. Two Riemann surfaces R,, R, 
determined by the equations P, = 0, P, = 0 are 
conformally equivalent if and only if the fields 
Ky, and Ky, are C-isomorphic. This condition 
is equivalent to the existence of a tbirational 
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transformation between the algebraic curves 
P,=0, P, = 0. The “analytic method” (the 
method of studying algebraic functions as 
functions on Riemann surfaces) is the creation 
of Riemann. It was extended by F. Klein and 
D. Hilbert, and later by H. Weyl, who es- 
tablished in his monograph [6] a rigorous 
foundation of the analytic method for the 
theory of algebraic functions. 

Given an arbitrary algebraic function field 
K in one variable over C, the set R of its 
tprime divisors with a suitable topology and 
analytic structure is a closed Riemann surface 
whose function field Ky coincides with K. The 
“algebraic method” (the method of studying 
algebraic functions as elements of an algebraic 
function tield) was founded by J. W. Dedekind 
and H. Weber at the end of the 19th Century. 
In the 20th Century, the algebraic method has 
made remarkable progress, owing to the devel- 
opment of abstract algebra. It covers the case 
of an arbitrary ground lield as well as that 
of more than one variable. The theory of 
algebraic functions has had considerable in- 
fluence on the development of number theory 
because of a basic analogy between the two 
subjects. 

The funiversal covering spaces (surface) Ñ of 
a closed Riemann surface R can be regarded, 
by conformal mapping, as the Riemann 
sphere, the plane, or the unit disk (or, equiva- 
lently, to the upper half-plane) if the tgenus g 
of R is 0, 1, or >2, respectively. Then the 
tcovering transformation group G, consisting 
of tlinear fractional transformations without 
fixed points in Ñ, is tproperly discontinuous 
and has a compact ‘fundamental domain. 
Conversely, if D is one of the three domains 
just mentioned and if G is the group just de- 
scribed, then = D/G is a closed Riemann 
surface such that D and G are its universal 
covering space and covering transformation 
group. A meromorphic function on §R is repre- 
sented as an tautomorphic function on Ñ with 
respect to G. If y = 0, then G= {1 }, R= R, and 
Ky is the field of rational functions. If g = 1, 
then Ky is the field of ‘elliptic functions. The 
study of algebraic functions as automorphic 
functions was initiated by H. Poincaré and 
Klein. Recently, C. L. Siegel made a remark- 
able contribution to the investigation of the 
case of several variables. The theory of auto- 
morphic functions is also related to number 
theory. Works of E. Hecke, M. Eichler, and 
G. Shimura on this domain are noteworthy 
(- 32 Automorphic Functions, 73 Complex 
Multiplication). 

Another important topic concerning alge- 
braic functions (closed Riemann surfaces) is 
the problem of moduli. Riemann stated, with- 
out rigorous proof, that the set of conformal 
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equivalence classes of closed Riemann surfaces 
of genus g (> 2) depends on 3g = 3 complex 
parameters, called moduli. This has led to the 
modern theory of Teichmiiller spaces, which is 
developing into an extensive new field (- 234 
Kleinian Groups, 416 Teichmiiller Spaces). 

In the rest of this article, we deal mainly 
with the analytic method (for the case of two 
variables — [1]). 


C. Abelian Differentials 


An Abelian differential on a closed Riemann 
surface R is, by definition, a complex tdif- 
ferential form = a(z)dz, where a(z) is a mero- 
morphic function of a local parameter z. Such 
a differential is said to be of the first kind if 
a(z) is holomorphic, of the second kind if the 
residue vanishes everywhere, and of the tbird 
kind otherwise. 

The indelinite integral W(p) = [be w of an 
Abelian differential œw, where pọ is assumed not 
to be a pole of @, is called an Abelian integral. 
It is said to be of the first, second, or tbird kind 
if the same holds for %w. If y is a 1-tcycle on R, 
the quantity fo is referred to as the period of 
œ along y. An elliptic integral is defined to be 
an Abelian integral on a closed Riemann sur- 
face of genus 1. For example, this is the case if 
the equation P(z, w) = 0 defining the surface is 
of degree 2 with respect to w and of degree 3 
or 4 with respect to z. More generally, a closed 
Riemann surface is called byperelliptic if P(z, w) 
is of degree 2 with respect to w, or, equiva- 
lently, if R carries a meromorphic function 
with exactly two poles. An Abelian integral on 
such a surface is called a hyperelliptic integral. 

On a closed surface §R, let V, be the linear 
space over C of the Abelian differentials of the 
first kind. Given a l-cycle of R, there exists a 
unique w,¢€ V, such that Re f y@, 18 equal to the 
tintersection number (a, y) for every 1 -cycle y. 
This differential is also characterized by the 
property ((@, @,)g =) In WA*O,= — oe 
fho for every we V,. If {æ;, . . ..a.,} form a 
basis of the 1-dimensional thomology group 
with integral coefficients, then Re œ, (i= 1, ., 
2g) form a basis of the linear space V, over R 
of the tharmonic differentials on R as well as 
that of the space { Re w w €V,}. Accordingly, 


dimç r, =g, dim, V, = 2g. 


These identities show a close relationship 
between the topological structure of R and the 
space of the Abelian differentials on R (— 194 
Harmonic Integrals). 

One can choose a 1-dimensional homology 
basis {a;,0,;;}7-, SO that (c;, 0) =(Xg+i Xy+) = 
O, (0;,%+;)= 1, and (di, 0,4) =9 (Fj), (j= 
1,2,, g). Such a basis is called a canonical 
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homology basis. If œ and g are Abelian dif- 
ferentials of the first kind, then we have 


wo F È| o a 


tg+j 
= l o| o) >. 
g+j aj 
ad 


Together, these are called the (Riemann) period 
relation. The second formula implies that w 
vanishes identically if the periods of œw along 
all; G= 1, .., y) are zero. 

Let &i,., az, be a 1-dimensional homology 
basis, and let œ; , . ,@, form a basis of V, over 
C. The g x 2g matrix Q with fa, as its (i,j)- 
component is called a period matrix. Corre- 
sponding to the change of bases (a) and (w), it 
is subject to transformation into the form 
AQM, where A is a g x g invertible complex 
matrix and M is a 2g x 2g integral square 
matrix with determinant + 1. Conversely, two 
Riemann surfaces are conformally equivalent if 
they possess period matrices transformable to 
each other in this manner (Torelli’s theorem). 
We can choose @,,..., @, SO that the cor- 
responding period matrix with respect to a 
canonical homology basis becomes (/,, T) with 
the g x g unit matrix J,. Then from the period 
relation, T is symmetric and Im T is positive 
definite. 

On the complex linear space C’%, consider 
the subgroup generated by the 2g column 
vectors of a period matrix Q (the subgroup 
is also denoted by Q). Since it is of rank 2g 
and properly discontinuous, a group mani- 
fold C/Q is obtained. It is determined by R 
uniquely up to analytic isomorphism and is 
called the Jacobian variety of R. The general- 
ized Jacobian variety is introduced in a simi- 
lar fashion by means of Abelian integrals of 
the second and third kinds (- 9 Algebraic 
Curves). 


D. The Riemann-Roch Theorem 


In the present context, a 0-tchain with inte- 
gral coefficients on a Riemann surface ® is 
referred to as a divisor. A divisor d =} n;p; 

(n; € Z, p; e R) is an integral divisor (or positive 
divisor) if n; > 0 in the reduced expression; d is 
a prime divisor if it consists of a single point p, 
and n, = 1. A divisor of a meromorphic func- 
tion for an Abelian differential œ is defined 
by taking the p; as the zeros (poles) off or œ 
and n,{ = n;) as the multiplicity of the zero 
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(pole) at p;. The divisors on constitute an 
Abelian group D in which principal divisors, 
i.e., divisors of meromorphic functions, consti- 
tute a subgroup $. The factor group D/$ is 
called the divisor class group; an element of it is 
called a divisor class, The divisors of Abelian 
differential constitute a single divisor class, 
which is referred to as the canonical divisor 
class (or differential divisor class). The degree 
and the dimension of a divisor class D are 
detined as follows, independent of the choice of 
the representative d =}, n;p;€ D:deg D = È n;, 
dim D = dimc{ f |f is meromorphic, (divisor of 
f)+ dis a positive divisor}. For example, the 
degree of the principal divisor class is zero. 

In terms of these concepts, the Riemann- 
Roch theorem is stated as follows: For a divi- 
sor class D on a closed Riemann surface R of 
genus g and for an integer n, we have 


dim(D + nW)—dim(— D—(1—n)W) 
=deg D+(2n—1)(g—-1), 


where W is the canonical divisor class (- 9 
Algebraic Curves). 

This theorem implies the following prop- 
erties of R: (i) deg W=2g 2. (ii) The holo- 
morphic invariant forms dz? (i.e., analytic 
tensors of order 2), referred to as quadratic 
differentials, constitute a linear space over C of 
dimension 0 (if g= 0), 1 (if g =1), or of dimen- 
sion 3g = 3 (if g > 2). The quadratic differentials 
have close connection with textremal quasi- 
conformal mappings and play an important 
role in the theory of Teichmiiller spaces (= 
352 Quasiconformal Mappings, 416 Teich- 
müller Spaces). (iii) For a point pe , a posi- 
tive integer m is called a gap value if R carries 
no meromorphic function having a pole only 
at p with multiplicity m. Then if g = 0, no point 
has gap values; and if g > 1, every point p has 
exactly g gap values; in this case, p also has 
a nongap value m <g + 1. A point p is called 
an ordinary point if the gap values at p are 
1,2, , g; otherwise p is called a W eierstrass 
point. If y > 2, then the total number N of 
Weierstrass points is not less than 2g + 2 and 
not greater than (g = 1)g(g+ 1) (A. Hurwitz). 
Moreover, the case N = 2g + 2 occurs if and 
only if ‘R is hyperelliptic, and then the gap 
values at Weierstrass points are 1,3,..,2g 1. 
This implies that every closed Riemann surface 
of genus 2 is hyperelliptic. (iv) Suppose the 
genus g of Ris > 2. A conformal mapping f of 
R onto itself with the property that every 1- 
cycle y is always homologous to f(y) is neces- 
sarily the identity transformation. Also, § is 
known to admit only a finite number of con- 
formal mappings onto itself (H. Schwarz); 
the total number does not exceed 84(g— 1) 
(Hurwitz). 
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E. Abel’s Tbeorem 


Abel’s tbeorem is stated as follows: A divisor d 
of degree zero is a principal divisor if and only 
if it is expressed as d= ĝy by means of a 1- 
chain y that has the property that f, œ = 0 for 
every we Ke 

Given a divisor class D of degree zero, con- 
sider a 1-chain y with Ove D. For every l-cycle 
g there corresponds the quantity 


sald) =e0p( 2m /—l1 Re | o), 
y 


independent of the choice of y. Thus D deter- 
mines a tcharacter on the 1-dimensional ho- 
mology group, called the integral character. 
Conversely, every character on the homology 
group is shown to be the integral character 
of some D. In terms of this notion, Abel’s 
theorem can be stated as follows: D is the 
principal divisor class if and only if y,(D)= 

1 for every g, This result shows that the 1- 
dimensional homology group with integral 
coefficients and the group of the divisor classes 
of degree zero (with compact topology) are, 
with respect to integral characters, mutually 
dual (in the sense of Pontryagin) topological 
Abelian groups {(— 422 Topological Abelian 
Groups). For the relationship between Abelian 
integrals and Jacobian varieties, in particular 
the *Jacobi inverse problem, *Abelian func- 
tions, and tRiemann theta functions, = 3 
Abelian Varieties L. Also — references to 234 
Kleinian Groups, 352 Quasiconformal Map- 
pings, 367 Riemann Surfaces, 416 Teichmiiller 
Spaces. 
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A. Introduction 


Algebraic geometry is the branch of mathe- 
matics that deals with talgebraic varieties, that 
is, point sets detined by several algebraic equa- 
tions in a space of any dimension or those 
derived from these sets by means of certain 
constructions (- 16 Algebraic Varieties). It 
may also be considered to be a theory of the 
tfield of algebraic functions in several variables 
in geometric language, and it is closely related 
to the theories of complex analytic manifolds, 
commutative algebra, and homological alge- 
bra. It also has an important connection 

with number theory through the theories of 
automorphic functions, Diophantine equa- 
tions, and zeta functions. 

To investigate local properties of algebraic 
varieties we consider varieties embedded in an 
taffine space; to study global properties we 
usually consider varieties contained in tprojec- 
tive spaces. A quantity (or property) that is 
invariant under tprojective transformations, 
isomorphisms, i.e., tbiregular and tbirational 
transformations, or birational transformations 
is called a tprojective invariant, a relative 
invariant, or an absolute invariant (birational 
invariant), respectively. The study of projective 
invariants is a part of projective geometry, 
whose methods are important in algebraic 
geometry. The notions of relative invariant 
and absolute invariant are used, for example, 
in the classification of algebraic varieties. 
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We usually assume that the coordinates of 
each point of the variety belong to a certain 
fixed ‘field K. In the classical case, namely, 
when the field K is the field C of complex 
numbers, the algebraic varieties are considered 
as complex spaces and are studied by apply- 
ing the theories of partial differential equa- 
tions, tdifferential geometry, etc. Topological 
methods may also be applied. Algebraic geom- 
etry originated from such studies, but, for the 
study of properties such as trational mappings 
or talgebraic systems, it became necessary 
to consider as well the case where the ground 
field K is not talgebraically closed. Further- 
more, to apply this to number theory, it is 
necessary to establish the theory over the field 
of any tcharacteristic p, For this purpose it is 
necessary to establish a theory for varieties 
having ground domains as general as possible. 


B. History 


Analytic geometry began with the study of 
lines and quadratic curves (surfaces) and later 
came to include the study of cubic and quartic 
curves (surfaces), and so on. These subjects 
originally belonged exclusively to analytic (or 
projective) geometry. At that time, the study 
could not have been described by so specific a 
title as algebraic geometry. 

The study of such theories as the construc- 
tion of an algebraic plane curve by families 
of curves of lower degree or the talgebraic 
m-n correspondence on a straight line prob- 
ably began with research such as that by M. 
Chasles. The most outstanding event in the 
history of algebraic geometry was the intro- 
duction and development of the theory of 
algebraic functions (- 11 Algebraic Func- 
tions) by B. Riemann (1857). Before that time 
the degree of an algebraic curve (surface) was 
the only quantity known to be a projective 
invariant of the curve (surface). 

With the theory of algebraic functions, 
Riemann gathered into one family all the 
Curves that can be transformed onto each 
other by birational transformations. As the 
basis for his study, Riemann examined bira- 
tional transformations in place of projective 
transformations. This idea led to the notion of 
the so-called *Riemann surface. The ‘genus of 
the surface was obtained as the characteristic 
number of the family of curves. The concept 
of genus was the first absolute invariant to 
appear in the history of algebraic geometry. 
Riemann based his theory on fAbelian inte- 
grals using *Dirichlet’s principle, under the 
assumption that any algebraic curve reduces 
to one without tsingularities. 

After Riemann many mathematicians tried 
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to reconstruct the theory more precisely with- 
out using transcendental methods. M. Noether 
attempted this reconstruction by using geo- 
metric methods. Using the tCremona trans- 
formation, he confirmed Riemann’s assump- 
tion for curves: that any algebraic curve on a 
plane can be transformed by a birational 
transformation to a plane curve without sin- 
gularities except for simple tnodes. He also 
contributed in making more precise the basis 
conditions for the tRiemann-Roch theorem, 
which is considered to be one of the most 
important theorems in the field. His results 
on space curves and surfaces are also note- 
worthy. J. Pliicker detined the concept of 
genus in geometric terms and introduced the 
tPliicker coordinates. A. Cayley and A. Brill 
worked along similar lines. Cayley’s idea was 
developed later by B. L. van der Waerden and 
W. L. Chow, who introduced the tassociated 
form of an algebraic variety and its tChow 
coordinates. 

Around 1890 the Italian school of algebraic 
geometry appeared. Following the tradition 
established by Noether, they employed alge- 
brogeometric methods and uncovered many 
new facts concerning algebraic surfaces. 
Among those who belonged to this school 
were G. Castelnuovo, F. Enriques, and F. 
Severi. 

In France, H. Poincaré and E. Picard ini- 
tiated their study of algebraic functions of two 
complex variables. After them S. Lefschetz 
investigated the theory of complex algebraic 
surfaces [11, 123. The results attained by the 
Italian and French schools were very sugges- 
tive but lacked rigorous foundations. 

On the other hand, rigorous number- 
theoretic theories of algebraic curves appeared 
in Germany. R. Dedekind and H. Weber de- 
veloped the theory of algebraic function fields 
parallel to that of talgebraic number fields. K. 
Hensel introduced the concept of tp-adic num- 
bers in analogy to tpower series expansions of 
analytic functions. E. Noether constructed an 
abstract theory of tpolynomial ideals from a 
formal theory by E. Lasker and F. S. Macau- 
ley. Under her influence there appeared the 
arithmetic algebraic geometry (of curves) 
over an abstract tield as developed by F. K. 
Schmidt and others. 

In the higer-dimensional case, van der 
Waerden attempted to create a more rigorous 
foundation for algebraic geometry under the 
influence of Noether’s abstract ideal theory 
(c. 1930) [14]. He introduced the concept of 
‘generic points and tspecialization, and specifi- 
cally defined the *multiplicity of intersections 
of two varieties in a projective space. He 
succeeded in getting a rigorous proof of Be- 
zout’s theorem: In n-dimensional projective 


4] 


space, the number of intersections of an r- 
dimensional algebraic subvariety of degree | 
with an (n = r)-dimensional subvariety of 
degree m is always Im if they intersect in only 
a finite number of points. 

The problem of intersections was taken up 
by C. Chevalley and A. Weil in the 1940s. 
Chevalley developed the ideal theory of ‘local 
rings (studied initially by W. Krull); he intro- 
duced topological concepts and applied them 
to the problem of intersections. The theory in 
this direction was later extended further by P. 
Samuel, M. Nagata, and J.-P. Serre. 

Weil gave foundations of algebraic geometry 
over an abstract field and reconstructed the 
theory by introducing geometric language to 
designate objects of abstract algebra [15]. He 
thus gave quite a new aspect to the theory and 
extended H. Hasse’s arithmetization of the 
theory of algebraic functions in one variable to 
the case of several variables. Reconstructing 
Severi’s theory of algebraic correspondence 
over abstract fields, he succeeded in proving 
an analogy of the *Riemann hypothesis on 
‘congruent zeta functions (- 450 Zeta Func- 
tions N). He also constructed, purely algebrai- 
cally, the entire theory of tAbelian varieties 
independent of characteristic. 

Around 1930, 0. Zariski gave another 
foundation to algebraic geometry by applying 
the generalized +Valuation theory that had 
been introduced by Krull. Zariski clarified 
especially the properties of birational trans- 
formations by using valuation theory. Zariski’s 
main theorem states that if a birational map- 
ping is not tregular at a tnormal point P (- 

16 Algebraic Varieties 1), each component of 
the image of P by the mapping is of dimension 
>I. 

Zariski also solved the problem of treso- 
lution of singularities in the affirmative in the 
case of characteristic 0 for varieties of dimen- 
sion < 3. The affirmative resolution of this 
problem (which Riemann assumed) says that 
any algebraic variety in a projective space can 
be transformed birationally to a projective 
algebraic variety without singularities. In 
1964, H. Hironaka gave an affirmative 
answer for any dimension in the case of char- 
acteristic 0. 

Along with the achievements in algebraic 
methods, great development took place in 
analytic methods. Unification of the concepts 
of Riemann surfaces and Riemannian mani- 
folds led to the concept of tcomplex analytic 
manifolds. Furthermore, G. de Rham’s theo- 
rem on the duality of topologically defined 
homology and cohomology based on differen- 
tial forms was proved; also, W. V. D. Hodge’s 
theory of tharmonic integrals was developed. 
In the case of the complex dimension 1, any 
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compact Riemann surface is derived from a 
certain projective algebraic curve. However, 
the situation is not so simple in the case of 
higher dimensions. Weil’s concept of tabstract 
complete algebraic varieties can be consid- 
ered as an analog of compact complex mani- 
folds. If a compact complex analytic variety is 
projective, then it must be an algebraic variety 
(fChow’s theorem). K. Kodaira proved that a 
necessary and sufficient condition for a com- 
pact complex analytic manifold to be biholo- 
morphically equivalent to a projective com- 
plex analytic manifold is that the manifold is 

a tHodge variety. 

Using results on harmonic integrals, J. Igusa 
and Weil established the theory of +Picard and 
tAlbanese varieties associated with algebraic 
manifolds of arbitrary dimensions as a gen- 
eralization of the theory of *Jacobian varieties 
associated with algebraic curves (- 9 Alge- 
braic Curves E). Thus many ambiguities in 
the theory as developed by the Italian school 
were clarified. Later the theory was gen- 
eralized to the case of characteristic p by T. 
Matsusaka, Chow, and S. Lang. The duality 
theorem in this case was later proved by M. 
Nishi and P. Cartier (- 3 Abelian Varieties 
D). 

The concept of *sheaves (- 383 Sheaves) 
had already been used in Kodaira’s theory. 
Serre defined an abstract algebraic variety as a 
tringed space by using an analogy to the con- 
cept of complex analytic spaces ; he considered 
it as a topological space with respect to the 
*Zariski topology. By introducing tcoherent 
algebraic sheaves, Serre claritied the idea that 
classical invariants (such as tarithmetic genus) 
may be considered cohomological quantities 
(— 16 Algebraic Varieties E). 

A. Grothendieck invented the concept of a 
*scheme, which is far more general than that of 
an algebraic variety, by admitting the existence 
of *nilpotent elements in structure sheaves and 
taking as a coordinate ring a general commu- 
tative ring with unity element. By the device 
of taking into account nilpotent elements, an 
analog of the method of successive approxima- 
tion in analysis is now applicable. By master- 
ful use of cohomological techniques, Grothen- 
dieck derived many results, including Zariski’s 
important theorems. 

In the classical case, F. Hirzebruch gen- 
eralized the Riemann-Roch theorem to higher- 
dimensional manifolds. He made use of the 
language of sheaves and some topological 
results of A. Borel and R. Thom [S]. Later 
Grothendieck generalized the theorem for the 
abstract case as well. His idea in this work is 
recognized as the origin of *K-theory. 

Every nonsingular complete curve of genus 
0 is isomorphic to the projective line, and any 
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nonsingular complete curve of genus 1 is iṣọ- 
morphic to a projective curve delined by the 
equation X}X )=X,(X, -X,)(X, —1X,) for 
some 40, 1. On the other hand, the set of all 
isomorphism classes of nonsingular complete 
curves of genus g > 1 is parametrized by a 
normal quasiprojective variety of dimension 
3g = 3. Such facts were first discussed by Rie- 
mann as the problem of moduli. 

Concerning the moduli of manifolds of 
higher dimensions in the classical case, 
Kodaira and D. C. Spencer developed their 
theory of tdeformations of complex structures 
(— 72 Complex Manifolds G). The meaning of 
number of tmoduli is clarified by deformation 
theory. Deformation theory has been extended 
in various ways, and deformation is considered 
as one of the fundamental concepts in alge- 
braic geometry. 

To investigate the global structure of the 
moduli varieties, D. Mumford introduced 
geometric invariant theory (- 16 Algebraic 
Varieties W). 

Etale and crystalline cohomologies initiated 
by Grothendieck and others are useful for the 
study of algebraic varieties of positive charac- 
teristic. In particular, the conjecture made by 
Weil concerning *congruent Zeta functions has 
been solved affirmatively by P. Deligne with 
the help of étale cohomology [2]. 

Many important questions have been 
answered by means of the geometric theory. 
For example, (1) every tvector bundle on Aj is 
trivial (- 16 Algebraic Varieties Z); (2) there 
exist tunirational but nonrational fields over C 
(- 16 Algebraic Varieties J); (3) the funda- 
mental groups of the complements of node- 
curves on Pé are commutative (- 16 Alge- 
braic Varieties I); (4) the cancellation theorem 
holds for A? (= 15 Algebraic Surfaces H); (5) 
whenever the tangent vector bundles are am- 
ple, the varieties are P; (= 16 Algebraic Var- 
ieties R). 
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A. Definitions and General Remarks [1,2,8] 
Let Kbe a field and Q a tuniversal domain 
containing it. An affine algebraic group G 
defined over Kis, by definition, a group G 
which has the structure of a (not necessarily 
irreducible) falgebraic variety defined over kin 
an affine space QY such that the group oper- 
ation (x, y) +x !y on G is an everywhere 
tregular trational mapping defined over k For 
such a group G, the set G, of all k-trational 
points on G is an abstract group. The tirreduc- 
ible component Gy of G (viewed as an alge- 
braic set) containing the identity element g is 
unique and is a normal subgroup defined over 
k with finite index in G; the decomposition of 
G into (absolute) irreducible components 
coincides with that into the cosets of G by G,. 
When G = Gp, the group G is called connected. 
It should be noted that for an algebraic group 
G defined over K, a coset gGp is not necessarily 
delined over K and if it is, a representative g 
cannot necessarily be taken to be k-rational. 
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However, if k is an infinite field and G is con- 
nected, and if moreover k is tperfect or G is 
‘reductive, then G, is tZariski dense in G ([9], 
A. Borel and T. A. Springer [30] ); hence G is 
uniquely determined by G,. (In this case, G, is 
sometimes referred to as a k-group.) When 

kis a topological tield, G, is a +topological 
group with respect to the natural topology de- 
fined by that of k, which is generally stronger 
than the Zariski topology on G,. For instance, 
when k= R, (Gp)p is a {Lie group with finitely 
many connected components. 

A subgroup H of an affine algebraic group 
G, which is Zariski closed, is an affine alge- 
braic group with respect to its natural induced 
structure and is called an algebraic subgroup 
of G. If H is defined over k, then H is k-closed; 
the converse is also true when Kis perfect. 
(An affine algebraic set A is called k-closed 
if A is acommon zero of a set of polynomial 
equations with coefficients in k The set A is 
k-closed if and only if all irreducible com- 
ponents of A are detined over the algebraic 
closure k of k, and for every *Galois auto- 
morphism g of k/k, A°=A.) The notions of 
homomorphism and isomorphism for alge- 
braic groups can be detined in a natural 
manner. For instance, for affine algebraic 
groups G and G’ defined over k, a rational 
bomomorpbism  : G + G’ defined over K or a 
k-morpbism for short, is a homomorphism of 
G into G’ that is at the same time an (every- 
where regular) rational mapping defined over 
k. For a k-morphism ¢ of G into G’, the (set- 
theoretic) image @(G) is a closed subgroup 
of G’ defined over K, the kernel ¢ ~! (e°) is a k- 
closed subgroup of G, and dim g(G) = dim G = 
dim `! (e°). In particular, when dim G = 
dim ~@(G)=dim G’ (or equivalently, when 
~(Gp) = Gy and o`! {(e') is finite), ọ is called an 
isogeny. (Two groups G and G’ are called 
isogenous if there exist a third group G” and 
isogenies G”>G, G"-G’.) When a k-morphism 
Q is bijective and ọ “is also an (everywhere 
regular) rational mapping defined over k, @ is 
called a birational isomorphism defined over K, 
or a k-isomorphism for short. It should be 
noted that a rational homomorphism which 
is an isomorphism of abstract groups is 
not necessarily an isomorphism of algebraic 
groups (e.g., a Frobenius homomorphism); a 
similar statement holds for an injective con- 
tinuous homomorphism of topological groups. 

Given a connected affine algebraic group G 
and a closed subgroup H, both defined over k, 
the quotient space G/H has the uniquely deter- 
mined structure of an algebraic variety defined 
over k such that the canonical mapping G-> 
G/H is separable. The *function field of G/H 
is then identified with the subfield of the func- 
tion field of G formed by all H-invariant ele- 
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ments. In particular, if H is a closed normal 
subgroup, then G/H has a natural structure of 
an affine algebraic group (defined over the 
same ground field k) [8, 10, 11]. 

As an example, we have the group GL{n) of 
all nx n nonsingular matrices (x;;). GL(n) may 
be viewed as an algebraic set in Q"’*!, defined 
by a single equation det(x;,) y = 1, and as such 
is a connected algebraic group defined over 
the ‘prime field. In general, an algebraic group 
reahzed as a closed subgroup of GL{(n) is called 
a linear algebraic group. Since an affine alge- 
braic group is always isomorphic to a linear 
algebraic group, these two terminologies are 
essentially synonymous [ 1, 2]. 


B. Generalization of the Definition 


Replacing the term affine algebraic set (or 
affine cariety) in the definition of an affine 
algebraic group by a more general term ‘alge- 
braic variety, we obtain the notion of an 
algebraic group (algebraic group variety, or 
simply group variety). On this subject, the 
following facts are fundamental. A *complete 
connected algebraic group is an Abelian 
variety (— 3 Abelian Varieties). More gener- 
ally, given a connected algebraic group G 
defined over k, there always exists a (k-closed) 
largest linear connected closed normal şub- 
group L, and the factor group G/L is an 
Abelian variety. Furthermore, for a closed 
normal subgroup H of G, the factor group 
G/H is complete if and only if H > L (Che- 
valley’s theorem [8 ]). In particular, if a con- 
nected algebraic group G is complete and 
linear at the same time, then G reduces to the 
identity group. In view of these theorems, the 
study of algebraic groups can be reduced, in a 
sense, to the study of Abelian varieties and 
linear algebraic groups. For this reason, we 
henceforth restrict ourselves to linear algebraic 
groups, which are simply called algebraic 
groups. (The notion of tgeneralized Jacobian 
variety, introduced by M. Rosenlicht [12], is 
an example of an algebraic group in a general 
sense; — 9 Algebraic Curves.) The notion of 
algebraic groups has been generahzed further 
to that of ‘group schemes by A. Grothendieck 


[3,4]. 


C. Lie Algebras 


Since an algebraic group G defined over khas 
no singularities, the ttangent space g to G at 
the identity element e is detined and has the 
same dimension as G : dim g = dim G. The space 
g can be identified in a natural manner with 
the space of all left-invariant ‘derivations of 
the function field of G, and thus has the struc- 
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ture of a Lie algebra defined over k(— 248 Lie 
Algebras). We call g (the Lie algebra g, over k 
of all k-rational points in g) the Lie algebra of 
G (of the k-group G,), If G is a linear algebraic 
group contained in GL(n), then g, is a Lie 
subalgebra of ql(n, k) with the Lie product 
defined by [ x, y] = xy = yx; a linear Lie algebra 
corresponding to a linear algebraic group is 
called an algebraic Lie algebra. When the 
characteristic of Kis zero, conditions for a 
linear Lie algebra to be algebraic can be given 
in terms of the replica [5]. Also, in the case of 
characteristic zero, for x €gl(n, K), x € g if and 
only if exp(tx)e G, where ț is a variable over k 
and exp(tx) is understood as a fformal power 
series in t (contained in Q). From this, we can 
prove, exactly as in the theory of Lie groups 
(— 249 Lie Croups), a one-to-one correspon- 
dence between k-closed subgroups H of G and 
algebraic Lie subalgebras , of g,, establish- 
ing a complete parallelism of the theories of 
algebraic groups and Lie algebras [5]. This 
parallelism breaks down when k has positive 
characteristic [1, 2]. On the other hand, over a 
field of characteristic p > 0, we have formal 
groups, an analog of local Lie groups intro- 
duced by J. Dieudonné [ 13], and also thyper- 
algebras, which play the role of Lie algebras in 
characteristic 0. 


D. Tori [1,2] 


The group G,, = GL(1), the multiplicative 
group of nonzero elements in Q, is a 1- 
dimensional connected algebraic group defined 
over the prime field. In general, an algebraic 
group G that is isomorphic to the direct prod- 
uct (G,,)" is called an (algebraic) torus. When a 
torus G defined over Kk is isomorphic to (G,,) 
over an extension K of k, G is called K-trivial 
(or K-split), and the field K is called a splitting 
field for G. A torus G defined over Kalways has 
a splitting field K which is a finite separable 
extension of k 

In general, a rational homomorphism y of 
an algebraic group G into G,, is called a char- 
acter of G. If we define the sum of two charac- 
ters y, and Z2 of G by (4, + o)(g)=7,(9) y2(9) 
(g € G), the totality of characters of G is an 
additive group, called the character module of 
G and denoted by X(G). Let G be a torus 
defined over k and X =X(G) its character 
module, and let K be a splitting field for G 
that is a finite tGalois extension of k. If a K- 
isomorphism G =(G,,)" is given by the corre- 
spondence G3g(y,(g), .x,(g)), then the 7; 
are characters of G, and X is a tfree module of 
rank n generated by ¥,,... ,¥,. Furthermore, if 
I’ denotes the Galois group of K/k, then for 
ae I and ye X, the conjugate 7° is also a char- 
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acter of G; under this action of I’, X becomes a 
right f-module. We have complete duahty 
between a torus G and its character module X 
in the following sense. There exists a one-to- 
one correspondence between the closed sub- 
groups G, (defined over K) of G and a (T- 
invariant) submodule X, of X for which X/X, 
has no p-ttorsion (where pis the character- 
istic of k). This correspondence is determined 
by the relation of the annihilators X,= Gł, 

G, =X}, and under the correspondence the 
character modules of G, and of G /G, are 
canonically identiiied with X/X, and X}, 
respectively. Furthermore, let G’ be another 
torus (defined over K and split over K) with the 
character module X’, and suppose that we 
have a (k-) homomorphism y: GG’, Then we 
can define a (I’-) homomorphism 'g: X’> X, 
called the dual homomorphism of , by the 
relation ‘p(y’) = xo Q for x € X’; conversely, 
any (I’-) homomorphism of X’ into X is ob- 
tained uniquely in this manner. In particular, 
ọ is a (k-) isomorphism if and only if its dual 

‘g is a (I-) isomorphism. Since for any free (I’-) 
module X of finite rank there always exists 

a torus G (defined over Kand split over K) 
such that X(G) = X (as a special case of the 
existence theorem of k-forms), the categories 
of all tori (defined over Kand split over K) and 
that of all free (T-) modules of finite rank are 
mutually dual. 


E. Semisimple Elements and Unipotent 
Elements 


A matrix 4 is called ‘semisimple if it is diago- 
nalizable, i.e., if the *minimal polynomial of q 
has only simple roots. A matrix q is called 
tunipotent ifa 1 is nilpotent, i.e., if all charac- 
teristic roots of q are equal to 1, (When the 
characteristic of the ground field is zero, the 
unipotent elements y in GL(n, K) and the nilpo- 
tent elements x in ql(n, k) are in one-to-one 
correspondence by the relation ų = expx.) Any 
nonsingular matrix a can be written uniquely 
as a product of a nonsingular semisimple ma- 
trix a’ and a unipotent matrix a” of the same 
size which are mutually commutative: a = 
wa” = qa’ (multiplicative Jordan decompo- 
sition); a’ (a”) is called the semisimple (unipo- 
tent) part of a and is denoted by a, (a,); a, can 
be expressed as a polynomial of the matrix a 
with scalar coefficients. For an element a of a 
(linear) algebraic group G, the semisimplicity 
(unipotency) of a does not depend on the 
matrix representation of G. Moreover, these 
properties are preserved by homomorphisms 
of algebraic groups. Also, if qe G, then a,, aE 
G. 

For an algebraic group G, we denote the 
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totality of semisimple (unipotent) elements 
contained in G by G, (G,) and call it the semi- 
simple (unipotent) part of G. (Note that G, and 
G, are not necessarily subgroups.) A torus G is 
then characterized by the property that G = 
Go = G,. On the other hand, an algebraic group 
G such that G =G, is called unipotent. For 
instance, the additive group of the universal 


domain, 
1 x 
G,= xeQ?, 
0 1 
is a 1 -dimensional connected unipotent alge- 
braic group. 


F. Solvable Groups and Nilpotent Groups 
[1,2,7,9] 


For two closed normal subgroups H,, H, 
(deiined over k) of an algebraic group G, the 
‘commutator group [H,, H,] (in the sense of 
abstract group theory) is also a closed normal 
subgroup (defined over k) of G. In view of this 
fact, an algebraic group G is called solvable 
(nilpotent), when it is ‘solvable (nilpotent) as 
an abstract group. For example, the totality 
T(n) of nx n nonsingular upper unipatent ma- 
trices, i.e., matrices of the form 


Kaye 


: | , is a connected solvable algebraic 
0 * 
group. A unipotent algebraic group is always 
nilpotent. 

For any connected solvable algebraic group 
G c GL(n), there always exists an element a in 
GL(n) such that a~' Ga c T(n) (Lie-Kolebin 
tbeorem [1,2]). A connected solvable algebraic 
group G has a ‘composition series G = Gy DG, 
>G, = {e} such that each G; is a connected 
closed normal subgroup of G and G,_,/G; 
is isomorphic to either G,, or G,. If G is de- 
fined over k the subgroup G, is a connected k- 
closed normal subgroup of G, and for any 
maximal torus T in G, we have a decompo- 
sition into a tsemidirect product G = TG, (in 
the sense of algebraic groups, i.e., the natural 
map T x G,->G is birational). [t is known 
that for any algebraic group G defined over k 
there exists a maximal torus defined over k 
([4], A. Borel and T. A. Springer [30]). G is 
nilpotent if and only if G has a unique maxi- 
mal torus T; when that is so, T=G, and T is 
contained in the ‘center of G. For a connected 
solvable algebraic group G defined over k we 
can take a € GL(n, K such that a`! Ga € Tin) 
(see the Lie-Kolchin theorem) if and only if all 
characters y€ X(G) are defined over k; when 
this condition is satisfied, G is called k-solvable. 
G, is then defined over k, and G/G, is a k- 
trivial torus. 
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When the characteristic of k is zero, any 
commutative unipotent algebraic group (de- 
fined over K is (k) isomorphic to the direct 
product (G,)". When kis an algebraically 
closed field of characteristic p > 0, any con- 
nected commutative unipotent algebraic group 
defined over kis k-isogenous to a direct prod- 
uct of a certain number of the groups W,„ of 
*Witt vectors (of length m) (Chevalley-Chow 
theorem [ 12]). A 1 -dimensional connected 
unipotent algebraic group detined Over a per- 
fect field kis k-isomorphic to G, [1,2]. 


G. Borel’s Theory 


Let G be an algebraic group and V an alge- 
braic variety (both defined over k). We say 
that Visa transformation space of G (defined 
over k), or simply G acts on J, if there is given 
an everywhere regular rational mapping G x 
Va(g,v)>gve V (defined over K such that 
Ji(G2)=(Giga)v, ev=v (g,,g,€G,vEV). When 
the action of G on V is *transitive, V is called a 
thomogeneous space of G. For a closed syb- 
group H of a connected algebraic group G 
(both defined over k, the quotient space G/H 
has the natural structure of a homogeneous 
space of G (deiined over k). A. Borel [1,2] 
proved the following theorems: 

(1) If G is a connected solvable algebraic 
group and Va complete transformation space 
of G, then G has at least one fixed point in V. 
More precisely, in order that a connected 
algebraic group G defined over k be k-solvable, 
it is necessary and sufficient that for any com- 
plete transformation space V of G defined over 
k for which V, # @,G have at least one k- 
rational fixed point in V [9]. 

(2) Let G be a connected algebraic group. A 
maximal connected solvable closed subgroup 
of G is called a Borel subgroup of G. Then (i) all 
pairs (T, B) formed by a maximal torus T in G 
and a Borel subgroup B containing it are con- 
jugate to each other with respect to inner auto- 
morphisms of G. (ii) For a closed subgroup 
H of G, the quotient space G/H is complete 
if and only if H contains a Borel subgroup 
of G; and, when that is so, G/H is actually a 
projective algebraic variety. (For instance, 
if G = GL(n), B = T(n), then G/B is a so-called 
‘flag manifold.) (ii) The conjugates of B(T) 
cover the whole group G (G,). A closed sub- 
group of G is called parabolic if it contains 
a Borel subgroup of G. A parabolic subgroup 
H coincides with its own tnormalizer N(H); in 
particular, H is always connected. Parabolic 
subgroups are significant in the theory of 
automorphic functions. (For the parabolic 
subgroups associated with BN-pairs — Sec- 
tions Q, R.) 
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When G is a connected algebraic group 
detined over a perfect field k, proposition (i) 
can be sharpened: The pairs (A, H) formed by 
a maximal k-trivial torus A in G and a maxi- 
mal connected k-solvable subgroup H con- 
taining it are conjugate to each other with 
respect to the inner automorphisms detined by 
elements in G,. The normalizer N(H) of such a 
k-solvable subgroup H is a minimal k-closed 
parabolic subgroup of G. When the maximal 
connected k-solvable subgroups of G are re- 
duced to the identity group, G is called k- 
compact or k-anisotropic. (Otherwise, G is 
called k-isotropic.) For instance, the tortho- 
gonal group G = §O(n, f) of a tquadratic form 
f of n variables is k-compact if and only if the 
form f is anisotropic, i.e., the homogeneous 
equation f= 0 has no solution other than 
zero in k. Similar facts hold for other classi- 
cal groups. When kis a *local field, G is k- 
compact if and only if G, is compact as a 
topological group. In general, a k-compact 
group is treductive. 


H. The Weyl Group 


Let G be a connected algebraic group and Q 
an arbitrary torus in G. The ‘centralizer Z(Q) 
of Q is then connected and coincides with the 
connected component of the normalizer N(Q). 
Hence the factor group W= N(Q)/Z(Q) is finite 
and can be identified with a subgroup of the 
automorphism group of Q (or of its character 
module X(Q)) in a natural manner. The group 
W is called the Weyl group of G relative to Q. 
In particular, when Q = T (a maximal torus), 
the order of W is equal to the number of Borel 
subgroups containing T. In this case, the çen- 
tralizer C = Z(T) is called a Cartan subgroup of 
G; it is characterized by the property that C is 

a (maximal) connected nilpotent closed sub- 
group of G which coincides with the connected 
component of its own normalizer N(C). The 
notions of Borel subgroups, Cartan subgroups, 
and maximal tori are preserved under rational 
homomorphisms of algebraic groups. 


1. Semisimple Groups and Reductive Groups 


In an algebraic group G defined over k, there 
exists a largest connected solvable closed 
normal subgroup R, called the radical of G. 
The unipotent part R, of R is called the unipo- 
tent radical of G. When R = {e}, G is called 
semisimple. When R is a torus, namely, R,= 
{e}, G is called reductive. Semisimplicity and 
reductiveness are preserved under forming a 
direct product and taking the image (or in- 
verse image) of an isogeny. For a reductive 
group G, the +commutator subgroup D(G) is 
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e 
semisimple, and G = D(G):R, D(G) NR = finite; 
in other words, G is isogenous to the direct 
product of a connected semisimple algebraic 
group and a torus. In general, if R is the rad- 
ical of a connected algebraic group G and R, is 
the unipotent radical of R, then the factor 
groups G/R, G/R,, are semisimple and reduc- 
tive, respectively. Furthermore, if the charac- 
teristic of the field kis zero, there exists a 
reductive closed subgroup H of G such that G 
decomposes into a semidirect product G = 
H: R, (Chevalley decomposition [5]). (In this 
case, R and R, are k-closed, and H can be 
taken to be k-rational; such an H is unique up 
to inner automorphisms defined by elements in 
G,.) Also in the case of characteristic zero, 
reductive algebraic groups are characterized 
by the property that all rational representa- 
tions are completely reducible. But when k 
has the characteristic p >0, this property char- 
acterizes tori (M. Nagata). 


J. Root Systems [1,2,14] 

Let G be a connected semisimple algebraic 
group, T a maximal torus, and X =X(T) its 
character module. A character ye X is called a 
root of G relative to T if there exists an jso- 
morphism x, of G, onto its image in G such 
that 


t'x,(€)t=x,(a(e) for all €eG,, teT. 


For a root «, such an isomorphism x, is 
uniquely determined up to a scalar multipli- 
cation in G,; hence we put P,=x,(G,). 

If we denote by r the totality of roots (rela- 
tive to T), r satisfies the following axioms, 
where E = X @ Q and E* is the *dual space of 
E with respect to the inner product ( >: (i) For 
each g € r, there corresponds &* € E* such that 
<a*, a> =2 and <a*, BY eZ for all Per. (ii) If we 
define a reflection w, of E by 


W,X=xXx—Ca* xa for xe, 


then w,fer for all Ber. (In particular, w, = 
—ger.) (iil) If a, Per are linearly dependent, 
then ĝ = +«. (iv) If dim E= r, r containsr 
linearly independent elements. 

In general, a finite subset r in a finite- 
dimensional vector space E over Q satisfying 
the axioms (i)-(iv) is called a root system in £E. 
(This root system is sometimes said to be 
reduced, to distinguish it from the root system 
defined in Section Q, which does not satisfy 
axiom (iii).) For a root system r, the elements 
x* of E* corresponding to er are uniquely 
determined by these conditions, and the set 
r* = {a*} is a root system in E* (a er* is 
called a coroot). Also, the group W of linear 
transformations of E (£*) generated by w, 
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(w,«) with ger is finite and is called the Weyl 
group of the root system r, If we identify E* 
with E by means of any W-invariant (positive 
definite) metric on E, then a* =(2/<a, a>)a. 
When r is a root system of a semisimple alge- 
braic group, 


<a* DEZ forall wer, yex, (1) 


so that X is W-invariant, and the Weyl group 
of the root system r can be identified with the 
group N(T)/Z(T) of Section H. (In general, a 
maximal torus in a connected reductive alge- 
braic group coincides with its own centralizer, 
so that the Weyl group W can be identified 
with N( T)/T.) 

When a tlinear ordering (compatible with 
the addition) is given in £, we denote by r, the 
set of all positive roots in r. An element wer, 
is called a simple root if it cannot be written as 
a=a' +o" witha’, a”er,. IfA={a,,...,0,} is 
the totality of (distinct) simple roots in r, , the 
elements «,,, «, are linearly independent, 
and any root yer can be written uniquely in 
the form a = + È! m,%;, with m;e Z, m; > 0. In 
general, a subset A of r having this property is 
called a fundamental system; a fundamental 
system is always obtained in the manner ex- 
plained from a linear ordering on E. For a 
fundamental system A, the cone A, in E*, 
defined as the set of x in £* satisfying the 
inequalities <¢;, x) > 0 (1 < i< r), is called a 
Weyl chamber. If we denote by L, the hyper- 
plane defined by the linear equation <4, x) = 0 
for a root %, then E* =| Jer La = (Ja Aa, and W 
acts tsimply transitively on the set of all Weyl 
chambers {A,'. The Weyl group W is gen- 
erated by r reflections Ws, (l<i<r). 

In a semisimple algebraic group G, Borel 
subgroups B containing a (fixed) maximal 
torus Tare in one-to-one correspondence with 
the fundamental systems A (or r+) relative to T 
by the relation B, = Iyer, P,, where P, = x,(G,). 
(More precisely, every element in B, can be 
written uniquely as a product of the elements 
in P,, where the ordering of the P, is taken 
arbitrarily.) 


K. Bruhat Decomposition 


If we take a representative s, of we W in N(T), 
there is a decomposition G=|)wew Bs, B 
(disjoint union). Furthermore, if for w € W we 
put Ny = Elser, nwe, P,, and in particular N = 
N, = B, and denote by Wo the unique element 
in W such that WA= -A, then the element 
in Bs„B can be written uniquely as a product 
of elements in Nww,» Sw T, N. Hence we have 
G= U Nite Ns 

weW 


which is called a Bruhat decomposition of G. In 
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particular, if we put N’ = sọ, Ns, (which is 
the unipotent part of the Borel subgroup cor- 
responding to -A), then N’TN is a Zariski 
open set in G, and the natural map N’ x T x 
N =G is birational. This implies that the 
function field of G is rational (i.e., ‘purely 
transcendental over Q). 


L. Structure of Semisimple Groups 


A subset r, of a root system r is called a closed 
subsystem if r4 , r= rı, where rı , denotes the 
submodule of X generated by r, A closed 
subsystem satisfies conditions (i), (ii), and (iii) 
of a root system. For a closed subsystem r, of 
a root system r of a semisimple algebraic 
group G, the subgroup G(r,) of G generated by 
the P (a er ,) is a semisimple closed subgroup 
with a maximal torus T, =(G(r,)M T)o, of 
which the root system relative to T, coincides 
with the restriction of r, on T, and the coroot 
system can be identified with r¥ = {a*|«erj,}. 
The subgroup G(r,) is normal if and only if 
r—r,1S also a closed subsystem; when this is 
so, G=G(r,)* G(r —t,), G(r, ) G(r —1,) = finite. 
All connected closed normal subgroups of G 
are obtained in this manner. In order that G 
be simple (sometimes called ahsolutely simple 
or almost simple) as an algebraic group (i.e., 
without proper connected normal subgroups), 
it is necessary and sufftcient that t be irreduc- 
ible (i.e., r cannot be decomposed into a dis- 
joint union of two proper closed subsystems). 
In general, a root system r can be decomposed 
uniquely into the disjoint union r = y, UU r, 
of irreducible closed subsystems r; such that 

r; UU r,(1<i<s) are also closed subsystems; 
correspondingly, G is isogenous to the direct 
product G,x x G, of (absolutely) simple 
algebraic groups G; = G(r,). (G is actually a 
direct product if it is simply connected or an 
adjoint group.) The subgroups G; are deter- 
mined uniquely and only by G. 


M. k-Forms [15] 


Let K be an extension of k and G, an alge- 
braic group defined over K. An algebraic 
group G defmed over k is called a k-form 

of G, if there is a K-isomorphism f of G onto 
G,. Suppose further that K/k is finite sepa- 
rable, and for every Galois automorphism 

c of k/k, put e,=f7of '. Then g, is an iso- 
morphism of G, onto Gj, and the g, satisfy 
the relation p} 0 P, = Ps Conversely, given 

a collection of isomorphisms { ¢,} satisfying 
these conditions, there always exists a k-form 
G (with a K-isomorphism f onto G, such 
that g,=f%0f '), which is unique up to k- 
isomorphism (Weil). In particular, if K/k is a 
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finite Galois extension with Galois group T, 
then { p,} is a (continuous) t1-cocycle of I in 
Aut,(G,) (the group of all K-automorphisms 
of G,), and by the above correspondence the k- 
isomorphism classes of k-forms G are in one- 
to-one correspondence with the (continuous) 
1-tcohomology classes of the cocycle DA (in 
the cohomology set H'(F, Aut,(G,))) (- 172 
Galois Theory J) 

To a given finite separable extension K /k of 
degree d and an algebraic group G, defined 
over K of dimension n, we can associate a 
certain algebraic group Ry ,(G,) defined over k 
of dimension dn, which is obtained from G, by 
restricting the ground field [ 19]. A more pre- 
cise definition is as follows. Let {a,, G3, . , 4} 
(c, = 1) be a set of automorphisms of k/k such 
that o; K( 1 <i<d) are all distinct. Then one 
can find a k-form G of G, = It, Gfi with an 
isomorphism f: GG, such that ¢,=f" of! 
is given by @,((x,)) = (x,«), where i” is defined 
by the relation (6,0) K = ojo K. If we denote 
by p; the canonical projection of G, onto its 
first component G, and put p= P; o f, then the 
pair (G, p) is uniquely characterized (up to k- 
isomorphism) by the following universality 
property: If G’ is any algebraic group defined 
over Kand @ is a K-morphism of G’ into G}, 
then there exists a (uniquely determined) k- 
morphism ¢ of G’ into G such that ọ = po @. 
The group G (together with p) is denoted by 
Rx(G,). For the group of rational points, 

G, = G, g. When the algebraic group G, has 
some additional structure (such as that of 
‘vector space, falgebra, etc.), then Ry,(G,) 
automatically has the same kind of additional 
structure. 


N. Chevalley’s Fundamental Theorems 


Let G, G’ be connected semisimple algebraic 
groups, and let T( T’) be a maximal torus in 
G(G’'), X(X°) its character module, r (r) a root 
system of G (G’) relative to T (T), etc. If we 
have an tisogeny ọ of G onto G’ such that 
<p(T) = T’, then there is a bijection x2’ of r 
onto r’ such that y(x’) = qa%, where y is the 
dual homomorphism of @ T and gq, is a posi- 
tive integer, which equals 1 if the characteristic 
is zero and is a power of p if the characteristic 
is p> 0. Conversely, any injective homomor- 
phism y: X'—X satisfying this condition (with 
respect to a certain bijection rr’ and q,) 
cornes from an isogeny p: G—G' in the manne! 
already stated. In particular, ¢ is an isomor- 
phism if and only if y is an isomorphism such 
that w(r')=r (Le., g, =! for all xer) [7]. The 
isomorphism class of G is thus completely 
determined by the pair (X, r), so that we some- 
times write G = G(X,r). A connected semi- 
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simple algebraic group G defined over k is 
called of Chevalley type over k(or k-split) if 
there exists a k-trivial maximal torus T in G. If, 
in the above theorem, G and G’ are of Cheval- 
ley type over k and T and 7” are k-trivial, then 
the theorem remains true if we replace isogeny 
by k-isogeny. In particular, the k-isomorphism 
class of a connected semisimple algebraic 
group of Chevalley type over kis completely 
determined by (X, r). Chevalley also showed 
that, for any pair (X, r) satisfying condition (1) 
above, there exists a connected semisimple 
algebraic group G(X, r) of Chevalley type 
defined over the prime field. Therefore, since 
the classification of semisimple algebraic 
groups of Chevalley type is reduced essentially 
to that of root systems (X, r), it turns out that, 
over any ground field k, there exist as many 
connected simple algebraic groups of Cheval- 
ley type as connected simple complex Lie 
groups (-+ 249 Lie Groups; Appendix A, Table 
5.1). 

For a given semisimple algebraic group 
G = G(X, r) delined over k, put Xo = 1, (= 
the submodule of X generated by r), X° = 
{xeF|<a*,x>eZ for all xer}. Then we have 
natural isogenies G(X°,r)>G(X,1)>G(Xo,¥) 
(with q, = 1), all of which can be taken to be 
defined over k. The group G(X°, r) (G(Xo, D) is 
called the simply connected group (the adjoint 
group) isogenous to G. When the characteristic 
of Kis zero, these isogenies (which are already 
known in the classical theory of complex Lie 
groups) are essentially the only possible iṣọ- 
genies among the semisimple algebraic groups. 
But when the characteristic is p > 0, there are, 
in addition to these, the Frobenius homo- 
morphism (with q, = p) and the following 
“singular” isogenies (for which q, = 1 or p de- 
pending on «):B,2C,, Fy > Fy, (p=2), G,; >G, 
(p = 3). In particular, when kis a finite field, 
taking the set of fixed points of the singular k- 
isogenies, we obtain the simple finite groups of 
M. Suzuki and R. Ree (- 151 Finite Groups). 


0. Classification Theory 


A connected semisimple algebraic group G 
defined over Kis called k- (almost) simple if 
there is no proper connected closed normal 
subgroup of G defined over k. (When G is k- 
simple and k-split, the factor group D(G,)/ 
center is an abstract simple group except for 
a few special cases [ 17]. For more general 
results — Section Q.) For a k-simple algebraic 
group G, let G, be any one of its absolutely 
simple components, and let k, be the smallest 
field of definition for G, containing K. Then 
k/k is a finite separable extension, and G is k- 
isogenous to R, ,(G,). Hence the problem of 
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classifying all k-simple groups (up to isogeny) 
is equivalent to that of tinding aj] k,-forms of 
simple groups of Chevalley type. This latter 
problem can be reduced, in principle, to the 
classification of compact k,-forms and that of 
certain diagrams (1.e., *Dynkin diagrams along 
with an action of the Galois group) [ 16], J. 
Tits [30] (- Appendix A, Table 5.1). For 
instance, when kis a linite field (or, more gen- 
erally, a field of dimension < 1 [ 15]), there is 
no compact k-simple group; hence, using a 
simple classification theory of the diagrams, 
we can show that the only absolutely simple 
algebraic groups G deiined gyer k are either of 
Chevalley type or of the types introduced by 
R. Steinberg (denoted by 74,, 7D,, °D4, °D4, 
*E,). Connected semisimple algebraic groups 
composed of the groups of these types are 
characterized by the property that they have a 
Borel subgroup defined over k. Such groups 
are said to be of Steinberg type over k(or k- 
quasi-split). Absolutely simple algebraic groups 
over a tp-adic lield have been classified by M. 
Kneser and J. Tits [30,31]. When the charac- 
teristic of kis not equal to 2, the classification 
of simple groups of classical type (except for the 
type D,) is known to be equivalent to that of 
semisimple tassociative algebras with tinvo- 
lution [ 18]. A similar relation also holds be- 
tween some of the exceptional simple groups 
and tCayley algebras or ‘Jordan algebras (H. 
Hijikata, T. A. Springer, J. Tits). 
The following is a }jst of absolutely simple 
algebraic groups of classical type. 
l. k-forms of SL(n) (n > 2). 
1.1. Gy =SL(m, 8)= (ge My(8)|N(g)= 1}, 
where § is a tcentral division algebra 
over k with (R:k)=r?, n=mr, and N 
denotes the treduced norm in M,,(8). 
1.2. G;,=SU(m, 8, f)={geESL(m, &)| 
I (gx, gy) = f(x, y) for x, ye R”}, where 
& is a central division algebra over a 
quadratic extension K of k with an in- 
volution z of the second kind (which 
means that {tek |é' =ë} =k), (K:k')= 
r?°, n= mr, and f is a (nondegenerate) 
tHermitian form of m variables over 
§ with respect to the involution ;, 
II. k-forms of SO(n) (n > 3, n #4), Sp(n) (n 
even, n > 2. 
Gi =SU (n, 8, f), where § is a central 
division algebra over k with an involu- 
tion ; of the first kind (i.e., such that 
{€ek|E'=E} =k), (8:k)=r?, n=mr, 
and f is a nondegenerate c-Hermitian 
form of m variables oyer § with re- 
spect to the involution z. In this case, 
dim{če K |¢'=¢} =r(r + £g)/2 with £o = 
t 1, and G} is a k-form of SO or Sp 
according as ££ = 1 or = 1. (SO(8) may 
have other k-forms coming from the so- 
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called triality.) The case where the 
characteristic of kis 2 can also be dis- 
cussed by a method given by J. Tits 
(Inventiones Math. 5, 1968). 

When k is a local field or an algebraic num- 
ber field, the only central division algebra with 
an involution of the first kind is a *quaternion 
algebra (and, if ; is the “canonical involution,” 
then e= 1). 


P. Algebraic Groups over an Algebraic 
Number Field 


Let G be a connected algebraic group detined 
over an algebraic number field k of finite de- 
gree. Let {v} be the totality of *prime divisors 
(i.e., equivalence classes of valuations) of k 
Taking the ‘restricted direct product of a 
family of locally compact topological groups 
{ G,,}, we obtain a locally compact topological 
group G,, called the adele group of G[19] (- 6 
Adeles and Ideles). In particular, when G = 
Gm, the adele group J =(G,,), is exactly the 
tidele group introduced by Chevalley in class 
field theory. If we identify xe G, with an 

adele whose components are al] equal to x, 

G, becomes a discrete subgroup of G}. 

Concerning the tiniteness property of G,/G,, 
the following results have been obtained 
[21,22]: A character ye X,(G)({ = the module 
of all k-rational characters of G) gives rise 
to a (continuous) homomorphism y4: G,—> 
1=(G,,),. Put G)={geG,|lz,(g)|=1 forall 
XE X,(G)}, where || is the standard norm in 1. 
Then G% is tunimodular, and the quotient 
space G4/G, is of finite volume with respect to 
the (unique) invariant measure on it. G9/G, 
(G,/G,) is compact if and only if the semi- 
simple part G/R (the reductive part G/R,) of G 
is k-compact. From the arithmetic point of 
view, it is important to determine explicitly the 
volume of G°/G, with respect to the invariant 
measure normalized in a certain manner; such 
a volume is cg]led the Tamagawa number of G 
and is usually denoted by 7(G) [19,23]. For 
instance, Siegel’s formulas on the volume of 
the fundamental domain of the unit group of a 
quadratic form f over k are essentially equiva- 
lent to a theorem on the Tamagawa number 
stating that t(SO(f)) = 2. 

Let 9 be the tring of integers in kand L an 
o-lattice in the vector space on which G is 
acting. We can define in a natural manner an 
action of G, on the set of all o-lattices; then the 
orbit G4 L (G,L) of L with respect to G, (G) is 
called the genus (class) of L. The tstability 
subgroup G, ; of Lin G, is open, and the 
double coset space G, ,\G,/G, is finite (fi- 
niteness of the class number). Moreover, let 
{v,,, v,} be the totality of tinfinite prime 
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divisors of K, and put G,, = []}_, Gy. Then G, 
is a Lie group, and the canonical projection 
G, on G, of G4 c G, is a discrete subgroup of 
finite type. (In general, (discrete) subgroups of 
G, which are tcommensurable with G, are 
called arithmetic subgroups.) As in the adele 
case, y€ X,(G) gives rise to a (continuous) 
homomorphism x, : G, >(R * )’, and if G9, = 
{g€G,. |z,,(g)|=1}, then the quotient space 
G°./G,, is of finite volume. Moreover, G°/G, 
(G, /G,) is compact if and only if G9/G, (G,/G,) 
is compact. 

In addition to these, the fapproximation 
theorem and the *Hasse principle are also 
extended to (classical, or general) algebraic 
groups (M. Eichler, M. Kneser, G, Shimura, 
Hijikata, Springer; — [30]). 


Q. Structure of Reductive Groups [24] 


Let G be a connected ‘reductive group defined 
over a field k. Then G has a tmaximal torus 
defined over k and G is ‘split over a finite 
‘separable extension K of K. The structure of 
the group G, can be discussed as in Sections I, 
J, K, and L. Here, we discuss the structure of 
G,, 1.¢., a *k-form of G,. 

Let § be a maximal k-split torus of G, i.e., a 
k-subtorus of G that is k-split and maximal 
with respect to these properties. Any two such 
tori are conjugate over K, 1.e., by an element of 
G,. Their dimension is called the k-rank of G. 
To say that G is of k-rank zero is equivalent to 
saying that G is tk-anisotropic. The centralizer 
Z = Z,(S) of S in G is a reductive k-group, 
and its derived group is k-anisotropic. Let 
N = N,S) be the normalizer of § in G. The 
*Weyl group ,W= N/Z relative to § is called 
the k-Weyl group of G, A k-root of G with 
respect to S is a nontrivial character of S that 
appears when one diagonahzes the represen- 
tation of S in the +Lie algebra g of G, S operat- 
ing via adjoint representation. Denote by ,r 
the set of all k-roots of G with respect to S. 
There is a decomposition of the Lie algebra g 
of G:g=Go + Leer Ga Where g,={X Eg|Ad(s)X 
=2(s)X for all seS). Then go is the Lie alge- 
bra of Z, and there is a unique unipotent k- 
subgroup P, of G normalized by S such that its 
Lie algebra is gą. The set ¿r is a troot system in 
a suitable Euclidean space whose Weyl group 
is isomorphic to ,W; if G is k-split, ¿r is the 
ordinary root system, and the P, are as in 
Section J. In general, ,r need not be reduced 
(i.e., axiom (iii) in Section J need not be satis- 
fied), nor should dim 9, = dim P, always be 1. 

A closed subgroup of G defined over k 
which is minimal among the parabolic sub- 
groups of G is called a minimal paraholic k- 
subgroup of G. (If G is k-split, a minimal para- 
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bolic k-subgroup is a *Borel subgroup of G.) 
Any two minimal parabohc k-subgroups are 
conjugate to each other over k. If P is one such 
subgroup, then there exists a maximal k-split 
torus S such that P is the semidirect product 
of the reductive k-subgroup Z = Z,(S) and the 
tunipotent radical U = R,(P) of P. The ex- 
pression P = ZU is called a Levi-decomposition 
of P, and Z is called a Levi-suhgroup of P. Any 
two Levi-subgroups of P are conjugate by an 
element in U,. There is an ordering of pr such 
that P is generated by Z and P, with & > 0. The 
minimal parabolic k-subgroups containing a 
given maximal k-split torus § correspond to 
the *Weyl chambers of ,r. They are permuted 
simply transitively by the Weyl group ,W. 

Fix an ordering of ,t, and let ,A be the ffun- 
damental system of ,r with respect to the given 
order. For any subset () of ,A, denote by P, the 
subgroup generated by Z = Z,(S) and P,, 
where 4 is a linear combination of the roots of 
,¢ in which all roots not in # occur with a 
coefficient > 0. Then P, contains P and, in 
particular, P, = P. P} is called a standard para- 
bolic k-subgroup of G containing P. Any para- 
bolic k-subgroup of G is conjugate over k to a 
unique P,. If S, is the identity component of 
(\zeolker a), then $ is a k-split torus of G and 
P, = Z(S,)R,,(P,). This shows that any parabohc 
k-subgroup of G has a Levi-decomposition 
and its Levi-subgroups are conjugate to each 
other over k. Let P be a minimal parabolic k- 
subgroup of G containing a maximal k-split 
torus S. Put U = R,(P), Z = Z,(S), and N = 
N,(S). Then, N = N,Z, so G, is the disjoint 
union over ,W of double cosets U,n,,P, (we 
KW), where n, is a representative in N, of 
we,W. More precisely, if we, W, there exist 
two k-subgroups U, and U”, such that U = 
U‘, x U% (product of k-varieties), the map 
U‘, x P> Un,,P sending (x, y) onto xn,,y is an 
isomorphism, and 
(G/P),=G,/P,= (aU) 

we, W 
where 7 is the projection G-+G/P. This is 
called a relative Bruhat decomposition. If G is 
k-split, this gives an ordinary Bruhat decom- 
position {— Section K). If 0 is a subset of A, 
let W, be the subgroup of ,W generated by 
reflections defined by the g’s in 0. If 0, 0 are 
subsets of ,A, then there is a bijection of dou- 
ble cosets 


(Po) \Gu/ (Pods = Wo\k W/ Wy. 


(Note that all these properties follow from the 
fact that (P,, N,) is a BN-pair in G,; — Section 
R.) 

If G is k-isotropic, let G* be the subgroup of 
G, generated by all U,, where U runs over the 
unipotent radicals of the minimal parabolic k- 
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subgroups of G. Kneser and Tits conjectured 
that when G is semisimple and simply con- 
nected and k arbitrary, G, = G*. Platonov 
showed that this conjecture fails for some k- 
forms of SL, (n > 2), but it is true in many 
cases, e.g., when G is tk-split or tk-quasi-split 
(J. Tits, Sém. Bourbaki 29e, no. 505, 1976677). 
In this case, if G is simple as an algebraic 
group and the cardinality of k is > 4, then 
G,/Z(G,) is simple as an abstract group. The 
connected semisimple algebraic k-groups over 
a finite field k are k-split or k-quasi-split (— 
Section 0). J. Tits has given a reduction pro- 
cess for classifying the groups over an arbi- 
trary field k. He defined the index of the k- 
group (to a certain extent a generalization of 
Witt’s theorem characterizing a quadratic 
form by means of its index and anisotropic 
kernel) and gave all possible indices of the k- 
groups and also a complete list in the case of 
local tield (J. Tits [30, 31]). 


R. Buildings and BN-pairs [25] 


The origin of the notations of buildings and 
BN-pairs lies in an attempt to give a systema- 
tic procedure for the geometric interpretation 
of the semisimple Lie groups and, in partic- 
ular, the exceptional groups. The theory has 
various applications to the groups of Lie types. 
To describe this precisely, we must introduce a 
number of detinitions. 

A complex A is a set with an order relation 
c, read “is a face of” or “is contained in” such 
that for a given element A, the ordered subset 
S(A) of al} faces of A is isomorphic to the set of 
all subsets of a set. The subset S(A) of A is 
called a simplex in A. A complex has a smallest 
element, which we denote by 0. For an element 
A, the number rk A of minimal nonzero faces 
of A is called the rank of A. Define rk A = 
sup(tk A| AEA). A morpbism g: A— A’ (where 
A and A’ are complexes) is a mapping of the 
underlying sets such that for every AEA, 

S(A) = S(a({A)). A subcomplex of A is a com- 
plex whose underlying set is a subset of A such 
that the inclusion is a morphism. If A €A, the 
star St(A) of A is the set of all elements of A 
containing A. With the order relation induced 
from A, St(A) is a complex. If BeSt(A), the 
rank of Bin St(A) is called the codimension of 
A in B and is denoted by codim, A. 

A complex A is called a cbamber complex if 
every element is contained in a maximal ele- 
ment, which is called a cbamber, and if given 
two chambers C, C’, there exists a sequence of 
chambers C = Cp, C, ,, Ca = C such that 
c odin. , (Ci-1 1) =codime, (C_, NC) <1 
for all i= 1, 2, , m. A chamber complex is 
called thick (resp. tbin) if every element of 
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codimension | is contained in at least three 
(resp. exactly two) chambers. An endomor- 
phism ¢ of a thin chamber complex A is called 
a folding if o° = ¢ and if any chamber on ¢(A) 
is the image of exactly two chambers in A by 
ọ. For any folding ¢, (A) is called a root. A 
thin chamber complex is called a Coxeter 
complex if for any pair of adjacent chambers 
C, C’, i.e., chambers such that codim,(C N C’) = 
1, there is a root containing C and not C’. Let 
X be a Coxeter complex. For any root (2) 

of X, there is only one root ~'(2), called the 
opposite of (2), such that ø(X)N e(2) does 
not contain any chamber and g(2) U y’(2) = 4; 
there is also an involutive automorphism 
called the reflection associated with ¢ that 
transforms @(Z) onto y'(2). The group W(C) 
generated by all reflections of X is called the 
Weyl group of X, which turns out to be a 
Coxeter group, i.e., there exist sets / and {m;;j, 
(i,j) € 7 x I, where m;; are integers or % and 

M; = 1, such that the group is presented by 

the generators {r;};.; and the fundamental 
relations (r;r)"" = 1, i, jel, m,, #00, A Coxeter 
complex is called irreducible if it is not the 
join of two nonempty Coxeter subcomplexes. 

A building is a thick chamber complex A 
with a system % of Coxeter subcomplexes, 
called the apartments of A such that (i) every 
two simplexes of A belong to an apartment. (11) 
If X, 2” eA, there exists an isomorphism of 2 
onto Y that fixes X N 4” (elementwise). Since 
the apartments of a building are isomorphic to 
each other, we can define the Weyl group, 
rank, and irreducibility of the building to be 
those of its apartment. A building with finitely 
many chambers is said to be of spberical type. 
If a building can be realized as a simplicial 
decomposition of a Euclidean space, then one 
says it is of Euclidean type. 

Examples of buildings are provided by BN- 
pairs. A BN-pair or Tits system in a group G is 
a system (B, N) consisting of two subgroups of 
G such that 
(BNO) B and N generate G; 

(BN) BON=H=AN; and 

(BN2) The group W= N/H has a generating 
set R such that for any r e R and any we W, 
(BN2’) rBw c BwB N BrwB, 

(BN2”) rBr FB. 

The group W is called the Weyl group of the 
BN-pair. For any subset § of R, let W, denote 
the subgroup of W generated by S. Set P; = 
B W,B. Then P, is a subgroup of G and the 
mapping S++P, is a bijection of the lattice of 
all subsets of R onto the lattice of all sub- 
groups of G containing B. A subgroup of G is 
called parabolic if it is conjugate to some Ps. 
Let A be the set of all left cosets of all sub- 
groups P4, S C R, ordered by the opposite of 
the inclusion relation. Let G operate on A on 
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the left. Let X be the subset {nP, neN,§cR} 
of A, and Y[ be the set of G-translates of X, 
Then (A, Y) is a building whose Weyl group is 
W. 

Let G be a reductive algebraic group defined 
over a field k and P be a minimal parabolic k- 
subgroup of G containing a maximal k-split 
torus § and set N = N,(S). Then (P,, N,) iS a 
BN-pair in G,. Therefore there is a building of 
spherical type associated with the group G. 
Conversely, the buildings of rank > 3 and 
irreducible spherical type (roughly speaking) 
all turn out to be associated with simple alge- 
braic or classical groups (J. Tits [25]). This 
result gives a complete and unified description 
of structures that were discovered previously 
in certain cases. For example, the building of 
type A, gives a tprojective space (E. Abe, T. 
Tsuzuku), that of type C, gives a polar space 
(Veldkamp), and that of a k-form of type Eş 
gives a Cayley space (J. Tits). As an applica- 
tion, One can show that a finite building of 
rank 2 3 and irreducible spherical type is 
isomorphic to the building of an ‘absolutely 
simple algebraic group over a finite field. 

When kis local (i.e., endowed with a com- 
plete discrete valuation whose residue field is 
Perfect), the reductive group defined over k 
has another BN-pair such that the associated 
building is Euclidean. This theory was ini- 
tiated by N. Iwahori and H. Matsumoto [26], 
who considered split semisimple groups. Later, 
quasi-split and classical groups were studied 
by H. Hijikata, and the theory for the general 
case was given by F. Bruhat and J. Tits 
[27,28]. To distinguish from the usual BN- 
pair structure, the subgroups conjugate to B in 
this case are called Iwabori subgroups, and 
parabolic subgroups are called paraholic sub- 
groups. The Euclidean buildings are the “ultra- 
metric analogs” of the *Riemannian symmetric 
spaces. In other words, in the study of p-adic 
simple groups, they play a role similar to that 
of the symmetric spaces in the theory of simple 
groups. 
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A. Introduction 


A complex number that satisfies an algebraic 
equation with rational integral coefficients is 
said to be an algebraic number. If the coeffi- 
cient of the term of highest degree of the equa- 
tion is 1, this algebraic number is said to be 

an algebraic integer. The set A of a]] algebraic 
numbers is a field which is the talgebraic clo- 
sure of the rational number tield Q in the 
complex number tield C. The set 1 of all alge- 
braic integers is an tintegral domain which 
contains the integral domain Z of all the ra- 
tional integers. The ‘field of quotients of J is A. 


B. Principal Order 


An extension field k of Q of finite degree 
(which we shall always suppose to be con- 
tained in C) is said to be an algebraic number 
field of finite degree, and kis a subfield of A. 
The intersection o = kN I is an integral domain 
whose field of quotients is k; o is called the 
principal order of k (More generally, a subring 
R of o containing 1 is said to be an order of k if 
the field of quotients of R is k. The set f of all 
elements y of o such that yo C R is an ideal of o; 
in addition f is called the conductor of R.) Let n 
be the degree of kover Q. Then the additive 
group of the principal order o of Kis a tfree 
Abelian group of trank n. A tbasis (@,,. @,) 
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of o as a free Abelian group (or Z-module) is 
said to be a minimal basis of o (or of K. Let oP 
(i = 1, , n) be tconjugate elements of w; over 
Q, and let A = lof] be the determinant whose 
(i,j) entry is œf. Then D, = A? is a rational 
integer that is independent of the choice of a 
minimal basis of o. D, is called the discrimi- 
nant of k If k #Q, then |D,|> 1 (Minkowski’s 
theorem, 1891). For any given rational integer 
m there are only a finite number of algebraic 
number fields whose discriminants are equal 
to m (C. Hermite and H. Minkowski, 1896). 
The proof of these theorems depends on the 
methods of geometry of numbers (- 182 
Geometry of Numbers). 


C. Ideals of tbe Principal Order 


An fideal a of the principal order o is said to 
be an integral ideal of k. In particular, a prime 
ideal ( #0) of o is called simply a prime ideal of 
k. The domain o is not necessarily a tprincipal 
ideal ring but is always a *Dedekind domain. 
That is, every ideal a of o is uniquely ex- 
pressed (up to the order of the factors) as a 
finite product of powers of prime ideals of o. 
This theorem is called the fundamental theo- 
rem of tbe principal order o. 

The quotient ring o/a of o by an ideal a (#0) 
of o is a finite ring. The number of elements 
of o/a is called the absolute norm of a and is 
denoted by N(a). We have N(ab) = N(a)N(b). 
Every prime ideal p (4 0) of o is a fmaximal 
ideal of o, and o/p is a Imite field. Let the 
‘characteristic of o/p be p, where p is a prime 
number. Then o/p is a Imite extension of the 
prime field Z/pZ. Let the degree of v/p over 
Z/pZ be f. Then N(p) = p”, and f is said to be 
the degree of the prime ideal p. 

Let 5 be a complex variable. The (complex- 
valued) function 


Ob) =) NY =[] I =N) >) 


of s € C is called the Dedekind zeta function of 
k (R. Dedekind, 1871). Here the summation 
extends over all ideals a of o, and the product 
extends over all prime ideals p of o. This series 
converges absolutely for Re s > 1, and the 
function ¢,(s) has a single-valued fanalytic 
continuation to a +meromorphic function 

on the whole complex plane (- 450 Zeta 
Functions). 


D. Units 


An algebraic integer ¢ of kis said to be a unit 
of kif `! is also an algebraic integer. Hence € 
is a unit of kif and only if the +Principal ideal 
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(E) is o. The set E, of all units of k forms an 
Abelian group under multiplication, which is 
called the unit group of k. The set of allele- 
ments of E, of finite order coincides with the 
set of all the roots of unity contained in k and 
forms a cyclic group of a finite order w. Let n 
be the degree of k over Q. Then for each ele- 
ment xE k there are n conjugate elements g® 
over Q. Let a (i = 1, , r,) be real for any 
xe k, and let 1+) and g14"2*) (j = 1, . , r,) 
be pairs of complex conjugates for any &E€ k. 
Then we have n = r, + 2r,. The unit group E, 
of k is the direct product of a cyclic group of 
order w and the free Abelian multiplicative 
group of trank r = r,+r,—1. This theorem 
is called Dirichlet’s unit theorem (1846). A basis 
(€i, , €) of this free group is called a system 
of fundamental units of k. 

Let 1! =log|o| (i=1,...,7,), 1a = 
2logla”| (j=r,+1,...,7,+1)) for uek. For r 
elements 4,,, 4, of E,, 


On h [n] 
Resare (Sar A An, 


On, ly, Oy, 


is called the regulator of (y,,, 4,) (Dedekind). 
In order for y,,...,4, to be multiplicatively 
independent, it is necessary and sufficient 
that R [y,,. 4,] #9. The absolute value of 
R [n -> y,] takes the minimum positive 
value R for fundamental units (¢,,.., €). R = 
R [¢,,,E,] is independent of the choice of 
fundamental units (¢,,., €,) of k. R is called 
the regulator of k. In general, R[n;, > n,]|/R 
is equal to the index |E,: H]| of the group 
H generated by the roots of unity in k and 
Mis on, H. W. Leopoldt conjectured that 
units in k, which are multiplicatively indepen- 
dent over Z, remain multiplicatively indepen- 
dent over Z, (the ring of p-adic integers) when 
they are considered as elements of the tensor 
product k ® Q, over Q. This conjecture was 
affirmatively proved in some special cases by 
J. Ax (Illinois J. Math., 9 (1965)) and others. 

If k/Q is a Galois extension, there exists a 
unit ¢ of k such that the conjugates of € over Q 
contain r multiplicatively independent units 
(Minkowski’s theorem). 


E. Ideal Classes 


An o-module contained in k (i.e., oa c a) such 
that ga Co holds for some element a ( #0) of k 
is said to be a fractional ideal of k. For two 
fractional ideals a, b of k the “product” ab 
defined by {'a;f; (finite sum)|a;€4, B:Eb} is 
also a fractional ideal. Thus the set of the 
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fractional ideals of k forms a multiplicative 
commutative semigroup. For a fractional ideal 
a the seta t={aeEk gaco} is also a fractional 
ideal of k, and we have aa~'= o. Thus the set 
of all nonzero fractional ideals of k forms an 
Abelian group 3, under multiplication with 

o as identity. Each fractional ideal a (#0) 

is uniquely expressed as a finite product of 
powers of prime ideals, if we admit negative 
powers. Namely, %, is a free Abelian multi- 
plicative group with the set of all prime ideals 
as basis. Given fractional ideals a and b, we 
say that a is divisible by b if a cb; in this case, 
we call b a divisor of a and a a multiple of b. 
Also, a C b if and only if there exists an integral 
ideal ¢ such that a = bc. Given fractional ideals 
a=T] pyi and b= J] qfi (¢;40, f, #0), we say 
that a and b are relatively prime if {p;} and {q;} 
are disjoint. Usually a fractional ideal of k is 
simply called an ideal of k. 

For an element x ( # 0) of k, (a) = ao is a 
(fractional) ideal of k, and (x) is said to be a 
principal ideal of k. The set P, of all principal 
ideals (a) (aek,a 40) is a subgroup of Jy. 
Since (a) = o is equivalent to ~e E,, we have 
P, = k*/E,, where k* is the multiplicative group 
of all nonzero elements of k. 

Each coset of J, modulo P, is called an 
ideal class of k, and the group €, = %,/P, is 
called the ideal class group of k. Each ideal 
class contains an integral ideal a with N(a) < 
,/|D,| (more precisely, with N(a)<(4/z)'2(n!/n") 

|D,,|). From this it follows that €, is a finite 
Abelian group. The order h of Ç, is called the 
class number of k. For the calculation of the 
class number the fresidue at the pole s = 1 of 
the Dedekind zeta function is used. Namely, 
lim (s- 1)C,(s)=gh, 


sol+ 


g= HaT R Wey |D;l, 


where R, is the regulator of k and w, is the 
number of roots of unity in k (Dedekind, 
1877). This formula is used, in particular, for 
the computation of the class numbers of 
tquadratic fields and tcyclotomic fields (- 347 
Quadratic Fields). The class numbers of cubic 
and quartic (real) tcyclic fields over Q were 
computed by H. Hasse in the case where 

the tconductor of k/Q is less than 100 (Abh. 
Deutsch. Akad. Wiss. Berlin, 2 (1948)). Hasse 
has also given a detailed computation of the 
class number of cyclotomic fields [ 15). In 
general, let the degree n = [k: Q] be fixed and 
let |D,|00. Then 


lim (log(A, R,)/log,/|D,|)= 1. 


(This formula was proved for n = 2 by C. L. 
Siegel, 1935, and for general n by R. Brauer, 
Amer. J. Math., 69 (1947).) 
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F. Valuations 


All the tArchimedean and ‘non-Archimedean 
valuations of an algebraic number field k can 
be obtained as follows (- 439 Valuations): 


Archimedean Valuations. Let n= [k: Q], and 
let n conjugates of we k be a"), a™ such that 
a® (i= 1, , r,) is real, and g"? and g(str2")} 
(j=1,, r) are pairs of complex conjugates. 
W e write |a|,=lo'?| (j=1,7, +m; these 

are Archimedean valuations of k that are 

not mutually equivalent. The equivalence 
classes of these valuations are denoted by 

pi’, ..., p%*"), respectively, and are called the 
tinfinite prime divisors of k. The first r; infinite 
prime divisors are called trea] and the remain- 
ing 7, are called timaginary (or complex). The 
valuations of k defined by 


la|w=lol,, jel... fi 
jo a 
=|al;, j=art+h.., ry tro, 


are called tnormal valuations of k. Here fpo 
(j=r,+1,...,7 +r) are valuations in the 
wider sense. If r, =n we call k a totally real 
field, and if r; = 0 wecallk a totally imaginary 
field. 


Non-Archimedean Valuations. Let p be a prime 
ideal of k and g an element of k. Let (x)= p“b, 
where p and b are relatively prime. Put v,(a) = 
a. Then for any constant p (0 < p <1), 


ap =p 


is a non-Archimedean valuation of k. This 
valuation of k is called the tp-adic valuation of 
k; p-adic valuations for different prime ideals 
are mutually inequivalent. The valuation |æ], 
with p=(N(p)) t} is called a tnormal valuation 
of k. The equivalence class of valuations con- 
taining _|,, is denoted by the same letter p 
and is called a ‘finite prime divisor of k. 

A formal finite product of powers of finite OI 
infinite prime divisors m* = I] pfi is called a 
divisor of k. If all e; > 0, then m* is called an 
integral divisor of k. Given divisors m* = 
II pfi and n*= J] p/i, we write m* n*if e;< 
f= 12). 

Any valuation of k is equivalent to one of 
the valuations defined previously (A. Ost- 
rowski, 1918; E. Artin, 1932). For any element 
æ (#0) of k the tproduct formula [T,|«|, = 1 
holds, where p runs over all finite and infinite 
prime divisors of k and |, are the normal 
valuations of k. Conversely, let k be a field, 
and letV={ — |,} be a set of inequivalent 
valuations of k such that (i) for any a ek (a £0) 
|x|, # 1 holds only for a finite number of p in 
V; (ii) the product formula I], lol, = 1 (aek, 
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x #0) holds; and (iii) there is at least one 
Archimedean valuation in V. Then k is an 
algebraic number field and Vis the set of all 
the prime divisors of k (Artin and G. Whaples, 
Bull. Amer. Math. Soc., 5 1 (1945)). 


G. Ideal Classes in the Narrow Sense 


For a, Bek, the expression a = ß (mod pË) 
means «® 8 > 0 for a real infinite prime divi- 
sor pË and a 6 % 0 for an imaginary infi- 
nite prime divisor pÊ. We call an element 

x € k totally positive if al] real conjugates g” 
(i= 1,. , r,) are positive. In the notation just 
given, this means a = 1 (mod p®) (i = 1, , r4). 
The set of all principal ideals (a) generated by 
totally positive elements a € k is a multipli- 
cative subgroup P% of P,. Each coset of 5, 
modulo P% is called an ideal class of k in the 
narrow sense. Let Ex be the group of all totally 
positive units of k. Then we have (3,: Pk )= 


h2 (Ep: Eg). 


H. Multiplicative Congruence 


Let m be an integral ideal of k, and let k*(m) 
be the multiplicative group of all elements a in 
k such that (a) is relatively prime to m. Any 
element XE k*(m) Can be expressed in the form 
x = B/ysuch that f, y € o and (f), (y) are rela- 
tively prime to m. 

Consider an integral divisor m* = m [J] pÈ 
which is a formal product of m and intinite 
prime divisors pË of k. We call m the finite 
part of m*. Given an element «e k*(m) and 
elements f, y € k*(m)N o such that a = f/y, 
we set a = 1 (mod* nt*) if ß= y (mod nr), and 
a = 1 (mod p®). The set of alļ a in k*(m) such 
that a = 1 (mod” m*) forms a multiplicative 
group. We write a = f (mod” m*) for a, Bek if 
a/B € k*(m) and «/f = 1 (mod* m*). This con- 
gruence is called the multiplicative congruence. 
In the following discussion we shall write 
mod m* for mod” m*. 

We denote by Ņ(m) the group of all ideals 
of k that are relatively prime to an integral 
ideal m, and by §(m*) the group of all prin- 
cipal ideals (a) such that a €k*(m), a = 1 
(mod m*); S(m*) is known as the ray modulo 
m*. Any subgroup H of %,(m) which contains 
S(m*) is called an ideal group modulo m*, and 
the factor group 3,(m)/H is called a group of 
congruence classes of ideals modulo m*. 

If n* m* for integral divisors m* and n* of 
k, then J (m) c 9,(n) and S(m*) c S(n*). If H 
is an ideal group modulo n*, then Q (H) = 
H N Jím) is an ideal group modulo m*, and 
we have %,(1)/H = s,(m)/@(H). For any given 
ideal group H, modulo m* there is a smallest 
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integral divisor {* such that {* m*, and there 
exists an ideal group H modulo j* with @(H) 
= H, (ie., if there is an ideal group H’ modulo 
n* with @(H’) = Ho, then f* n*). We call ĵ* the 
conductor of the ideal group H. The notion of 
multiplicative congruence is used in felass field 
theory and in the theory of tnorm-residue 
symbols. 


I. Ideal Tbeory for Relative Extensions 


If an algebraic number lield K has a subfield k, 
we say that K/k is a relative algebraic number 
field. Let be the principal order of K. For a 
(fractional) ideal a of k, Oa is an ideal of K. 
We write Da = E(a) and cal] E(a) the exten- 
sion of a to K. For ideals a, b of k, wehave 
E(ab)= E(a)E(b) and E(a)Nk=a. 

Let ¥,: K >C be k-isomorphisms (i = 
1,...,n), where n=[K:k]. We write K® = 
(kK) and AY =Y-(A)for 4eK. For an 
ideal A of K, UW ={A" A e UA} is an ideal 
of K®, and Q{ is called the conjugate ideal 
of Yin K®, Let L be the composite field of 
K‘, , K™. Then the ideal generated by 
AH... UA” in L is the extension of an ideal a 
of k We write a = Nx,,(20) and call a the rela- 
tive norm of Y over k. We have Nx (2B) = 
NxjxQONxy(B) and Nx,(E(a)) = a” (for an 
ideal a of K). In particular, for K =Q, Nx9(Q) 
=(N (2). 

Let p be a prime ideal of k Then E(p) = 
Pr PY Bye in O, where P ,,, P, are 
prime ideals of K. Let f; be the degree of the 
finite field O/P, over o/p. Then Nx,(B,) = př; 
fiis called the relative degree of S$, over K, and 
e; is called the (relative) ramification index of 
P; over K We have the relation n = $% e; f; 
between these numbers. If e, = =e,= 1, the 
prime ideal p is said to be unramified for K/k. 
Otherwise, p is said to be ramified for K/k. If 
every prime ideal of kis unramified for K jk, we 
call K/k an unramified extension. (For an in- 
finite prime divisor p , of k we write p,, = 

j BO" if the Archimedean valuation 

ha of kcan be extended to g Archimedean 
valuations hg (i=1,...,g)of K, where e; = 
2 if PË is imaginary and p, is real, e; = 1 
otherwise.) 


J. Relative Differents and Relative 
Discriminants 


Let K/k be a relative algebraic number field 
and 0, 8 be the principal orders of k, K, re- 
spectively. Put M= {A e K Trg,( AD) c o}, 
where Try, is the *trace (- 149 Fields J). Then 
Wt is a (fractional) ideal of K and Y~'= Dy is 
an integral ideal of K; Dx, is called the rela- 
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tive different of K over k. When k =Q, D Ki is 
simply called the different of K.For Ld KK, 
we have the chain theorem: Dip = Dik DK 

Let the conjugates of Ae K over kbe 
AÙ, , A’, and assume that A} = A. Put 
Ox ,(A)=TT2(A -A”) for Ae K. If Ae, 
thenagy,(A)EDgy, Dex, is generated by 
{kal A)| AED}. The integral ideal ÇË gen- 
erated by {A —A“ Ae} in the field L = 
KK?) K® was called an element by D. 
Hilbert. We also have Dy, = EVES, EM. 
The integral ideal Dx, = Nxu(Dx,) of Kis 
called the relative discriminant of K/k. If k= Q 
dko =(Dx). 

For the relative different Dy ik tO be divisible 
by a prime ideal ‘ of K, it is necessary and 
sufficient that E(p) = PP? Pie with e >}, 
where p = $ N k (Dedekind’s discriminant 
theorem, 1882). Hence a prime ideal p of Kis 
ramilied for K/k if and only if p divides the 
relative discriminant dx; there are thus only a 


+ 


linite number of prime ideals of kK which ramify 
for K/k. In particular, K/k is unramilied if and 
only if Dg = 0. 


K. Arithmetic of Galois Extensions 


Let K/k be a relative algebraic number field 
such that K is a *Galois extension of K of 
degree n, and let G be the ‘Galois group of 
K/k. Let o, D be the principal order of k, K, 
respectively. The conjugate ideals of an ideal 
U of K are given by U°={A°| AEA! (cEG). 
If Nq (20) = a, then E(a) = TT ,.¢ Wl’. For a 
prime ideal p of K, E(p)=(B, P- ... P, 
where Nx,(B)=pi(i=l,....g),n=efg, and 
$%,,..., P, are mutually conjugate prime ideals 
of K over k 

Hilbert (1894) developed the decomposition 
theory of a prime ideal p of kK for a Galois 
extension K/k in terms of the Galois group G 
as follows: Let S$ be a prime ideal of ©. Then 


Z=fceG| R =$} 


is a subgroup of the Galois group G of K/k. Z 
is called the decomposition group of $ over k. 
Let G =| j; Zt, be the left coset decomposition 
of G. Then 8; = $": (i = 1, . , g) are all the 
conjugate ideals of $ over k 

The subgroup 


T={cEZ|A°=A(mod), AED} 

of the decomposition group Z is normal, and 
T is called the inertia group of $ over k The 
quotient group Z/T is a cyclic group of order f 


(the relative degree of $). There exists an 
element g of Z such that 


AED, 


and g is uniquely determined mod T; oT gen- 


A°=A*" (mod $), 
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erates the cyclic group Z/T. This ø is called the 
Frobenius substitution (or Frobenius 
phism) of $ over k. For m= 1,2, , 


VO" —lgEZ|A?=A(mod P"*'), AED} 


automor- 


are normal subgroups of Z; the group V’” is 
called the mth ramification group of $ over k 
Let 


V0 — a ye Veith 


=VOID BVO, 


= yas yet) — | 


Let V,— Vo"! (9 =0,1,...,), where vo = —1. 
In particular, Yo =V =T. The integers v|, 
v, „are called the ramification numbers of f. 
The group T/V; is isomorphic to a subgroup of 
the multiplicative group of the finite field O/B. 
Hence T/V; is a cyclic group whose order ey 
is a divisor of N(B)— 1. The groupV,/V,, , 
(m > 1) is isomorphic to a subgroup of the 
additive group of the finite field O/{%, Hence 
Vin/Vm+1 is an Abelian group of type (p, p,, p) 
whose order divides N(‘$). From e = T|= 
(T: V,}{V; it follows that e=eyp", (eg, p)= 1. 
Here G{ denotes the order of a finite group G. 
Hence the decomposition group of %$ is a 
*solvable group. The relation between the 
ramification numbers for K jk and those for an 
intermediate Galois extension F/k was com- 
pletely determined by J. Herbrand (J. Math. 
Pures Appl., 10 (1931)) [Il]. 

Let P be the P-component of the relative 
different Dx, of a Galois extension K/k. Then 


=F pas 0 MG I=D= X (V° 


In particular, d=O0 if T=l,andd=e 1 if 
anes 

Let kz, kr, and kpm) be the intermediate 
fields which correspond to the subgroups Z, T, 
and V™, respectively, in the sense of *Galois 
theory; the fields k}, ky, and k,«m) are called 
the decomposition field, the inertia field, and 
the mth ramification field of R, respectively. 
Let ‘P be a prime ideal of K containing p, and 
let p, and pọ be prime ideals in k, and k, such 
that pz = PN k, and pr = PNky. Then we 
have Elp)=pzp P? for kz/k; Ely,)=Pp 
and Ny jx (Pr)=P pf for k,/k,; and E(p,)= B° 
for K/ky. 

If a prime ideal p of kis unramified for a 
Galois extension K/k then we have E(p) = 
P Pa Pa B= PYG = 1, g), Naa By) = 
p’, and n = fg. The Frobenius automor- 
phism ¢,;:A%= A” (mod $8) (A e D) for the 
prime ideal 8, is uniquely determined, and its 
order is f. Since 3. = Bi, we have o, = 1,10, 7. 
Hence 6,, , 0, belong to the same ‘con- 
jugate class of G. In particular, if G is an 
Abelian group, then o,= ,,= a, and 
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A" = AN"! (mod p), AED. 


We then write 


o={* if G ) 


and cal] this symbol the Artin symbol for p 

for the Abelian extension K jk. For an ideal 
a= [[ p° of k that is relatively prime to the 

relative discriminant of K/k, we define 


(S) nF) «9 


Evidently, we have 


E 


The arithmetic of quadratic fields (— 347 
Quadratic Fields) and the arithmetic of cyclo- 
tomic fields (— Section L) have been developed 
since the 19th Century. 


L. Arithmetic of Cyclotomic Fields 


A complex number č whose mth power is | but 
whose m’th power is not 1 for m’ < m is called 
an mth primitive root of unity. There are (m) 
primitive roots of unity: exp(2mir/m) ((r, m) = 1), 
where ¢ is *Euler’s function. These qm) primi- 
tive roots of unity are the zeros of an irreduc- 
ible polynomial over Q of degree <p(m): 
F(X) =x- 1) 

dim 
where uis the *MObtus function. The coeff- 
cient of the highest term of F,(X) is 1, and the 
other coefficients are all rational integers. 
F,(X) is called a cyclotomic polynomial. An 
example is 


F,2(X)=(X"? 
=X+~- 


D(X? — DAX? — I(X*—1) 


Xg]: 


The algebraic number field K,, = Q(én) 
(e = exp(2zi/m)) obtained by adjoining an 
mth primitive root of unity to Q is a Galois 
extension over Q of degree (m) whose Galois 
group G is isomorphic to the multiplicative 
Abelian group of ‘reduced residue classes of Z 
modulo m:G= {a,(Crr=Ch, (r,m)= 1}. K,, is 
called the mth cyclotomic field. Cyclotomic 
fields are tAbelian extensions of Q. Conversely, 
every Abelian extension of Q is a subfield of a 
cyclotomic field (Kronecker’s theorem, 18.53, 
1877). 

We can choose (1, {,,, Cm -> Cat!) as a 
minimal basis of K,. Let m= If I? |" be 
the decompositions of m in powers of prime 
numbers l, ‚l. Put K= Kp. Then K,, 
is the composite field K,,= KK... K”, 
The different of K,, is given by Dx ig 
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Dy ODO + DO and the discriminant 
of K,, is given by Dx = Dgo Dgo @, = 
o(m)/ọ(l":)). If m= l", the discriminant of K 
is Dy =el* (a=1"'(hl—h— 1)), where ¢= 
— 1 if /"=4 or l=3 (mod 4), and e=1 other- 
wise. Hence the discriminant of K m is Dy = 
Git m/TIi,p? ")%™. Suppose that s 
mÆ2 (mod 4). Then a prime number p ramifies 
for K,,/Q if and only if p divides m. In par- 
ticular, if m= |" (m> 2), then only | ramifies for 
K „/Q and (J) =1%™, NCT) = | (Lis explicitly 
given by [= (1-i,,,)). For /# 2 the ramification 
numbers for } are y= Į} 1 G= 1,2,), and 

the ramitication fields are K, Kp, For [= 2 
the ramification numbers are 1,3,7, and the 


m 


ramification fields are Q, K4, Kg, 

In K,,/Q a prime number p ( ptm) is decom- 
posed as (p)= 1), N(p)=pl(i=1,.---y) 
and fg = qn(m). Here the degree f of p, is de- 
termined as the minimal positive integer f 
such that p” = 1 (mod m). Hence the decom- 
position law of a prime number pin K,,/Q is 
derermined by its residue class modulo m. This 
is a prototype form of class tield theory (- 59 
Class Field Theory). 

The class number of the cyclotomic tield K,, 
can be calculated by Dedekind’s formula (- 
Section E; see also Hilbert [4]). Here we shall 
give the result for m=! (a prime number). 

Let r be a +Primitive root modulo l. For C = 
exp(2zi/l), we put 


7 Lat? = ae 1/2 
sao T tid] i 


Then ¢ is a unit in K,. Define an element g of 
the Galois group of K,/Q by {°= ¢", and put 
e&=87 (i=0,1,...). Then £o, &1,...,€)-; (p= 


(l— 3)/2) are multiplicatively independent units. 


That is, the regulator R [é), €,1 ia] = 

E #0. The units £0,£1,.... &-; are called 
circular units. The class number h of K, is the 
product of two factors, h = h, h,. Here h, is the 
class number of the real subfield K; = Q(¢ + 
(~'), E. E. Kummer called 4, the first factor 
and h, the second factor of the class number h. 
Let X1» X2», 7)-, be the multiplicative char- 
acters of the treduced residue classes of Z 
modulo 1, and let y{i= 1, , p+ 1) be the 
characters among them such that y,( = 1) = =1, 
Then 


=] pti p+1 /I-1 E 
i= (Z ante). pel 


(21)? i=1 \a=1 "ERS 


(Kummer, 1850). Here Rọ is the regulator of 
K;. Since circular units belong to Kj, the class 
number h, of K; is equal to the index of the 
subgroup generated by +1, é,, €,-, in the 
group of units of K;, The class number h of 
K, 1s equal to 1 for {< 19 and it has been con- 
jectured that there exist no more fields K, 
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with h= 1. H. L. Montgomery and K. Uchida 
solved this conjecture by proving that the first 
factor h,>1 for /> 19 (Téhoku Math. J, 23 
(1971)). J. M. Masley and Montgomery proved 
that there are precisely 29 distinct K,, with 
class number | (J, Reine Angew. Math., 
286/287 (1976). 

According to Kummer, | divides h if and 
only if l divides h,. Since h, can be computed 
explicitly, we can readily determine whether 
l| h or not (- 145 Fermat’s Problem; Ap- 
pendix B, Table 4.111). A prime Į is called regu- 
lar if /{h, otherwise it is called irregular. Thus 
an odd prime l> 5 is irregular if and only if 
there exists an even integer j with 2 <j <]—3 
such that | divides the numerator of the jth 
Bernoulli number B,. This criterion of Kum- 
mer can be strengthened as follows. Let § 
denote the /-Sylow subgroup of the ideal 
class group € Kı of the prime cyclotomic tield 
K, = Q(¢,). Because the Galois group G of 
Q(¢,)/Q operates on S, the Abelian group § 
becomes naturally a Z,[G]-module. Choosing 
the canonical character 0:G—Z* defined 
by 67 = C7 for any oe G, we have a direct 
decomposition § = []}=3 §®, where §® = 
{seS| s?=s°, ge G}. Then it is necessary and 
sufficient for || (the numerator of B) that §¢~)) 
#0 (K. A. Ribet, Inventiones Math., 34 (1976)). 
Moreover, because G is naturally isomorphic 
to the multiplicative group of the reduced 
residue classes of Z/IZ, each @' becomes a 
*Dirichlet character. Let 


fet... 
Bi g-i=- > 0 '(a)a; 
wip X a 


then this number, regarded as an element in 
Q,, is equal to a factor of the product (up to 
+ 1 and 2) appearing in the class number 
formula for h,. Let m; be the l-exponent of 

B, a-i for each odd i (2 < i¢]— 3). Then it is 
conjectured that the order of S® is precisely 
equal to |i for each odd i with 2<i</—3, In 
particular, if $’= JI; even S” is trivial, then 
this is known to be true. In general, it is also 
conjectured that lth, holds for any prime | 
(Vandiver’s conjecture). If the group § is 
cyche, then § is of order l"i (A. Wiles, Inven- 
tiones Math., 58 (1980)). 

When a Galois extension K over a finite 
algebraic number field k has the Galois group 
isomorphic to the additive group of the /-adic 
integer ring Z, the extension K/k is called a 
Z,-extension or r-extension. Then for each 
integer n > 0 there exists a unique subfield k, of 
K with degree /" over k. Let k = K,=Q(C) 2), 
and let K be the union of all /"*! th cyclotomic 
fields Kn+1(n>O). Then K/k is a typical ex- 
ample of a Z,-extension with subfields k, = 
K+ 1, Let K/k be a Z,-extension with sub- 
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fields k,,, and let }°" denote the [-component 
of the class number of k,. Then there exist 
integers A> 0, u> 0, v independently of n 
such that e, = An + yl" + v for all sufftciently 
large numbers n (K. Iwasawa, Bull. Amer. 
Math. Soc., 65 (1959)). These numbers A, 4, 
v are called the Iwasawa invariants for the 
Z,-extension K/k. There exists a unique 
Z,-extension K° over the rationals Q. An ex- 
tension K°k over k obtained by taking the 
composite of K° and kis called the basic Zy- 
extension of kor the cyclotomic Z,-extension 
of k. We denote its Iwasawa invariants by 
A(k), ulk), VK). It is known that there are Z,- 
extensions with arbitrarily large jy, But it is 
conjectured that p(k) = 0 holds for the basic 
Z,-extension of any k In particular, in the case 
k= K, = Q(¢,) it has been computed that p(k) = 
0 for |< 125,000 (for |< 4001 by K. Iwasawa 
and C. C. Sims, J. Math. Soc. Japan, 18 (1966); 
for |< 30,000 by W. Johnson, Math. Carp., 29 
(1975); for [< 125,000 by S. Wagstaff, Math. 
Comp., 32 (1978)). By applying the theory of 
uniform distributions to /-adic situations, B. 
Ferrero and L. C. Washington (Ann. Math., 
109 (1978)) proved that p(k) =O when k/Q is 
Abelian. In particular, when K= K, and k, = 
K,"*!, e, > 0 if and only if the class number 
of K, is divisible by / (P. Furtwangler, 1911). 
Since any quadratic field is a subfield of a 
cyclotomic field (by a tGaussian sum formula 
we have Q(./m) cQ(¢,,)), where dis the dis- 
criminant of Q(\/m)), the computation of the 
class number of quadratic fields and the proof 
of the law of reciprocity for the {Legendre 
symbol follow from the arithmetic of cyclo- 
tomic fields. 


M. Aritbmetic of Kummer Extensions 


Assume that an algebraic number field K con- 
tains an nth primitive root of unity. Then a 
*Kummer extension K =k(2/p) (uek) isa 
‘cyclic extension of k Assume that [K: k] =n. 
In order that a prime ideal p of Kramify for 
K/k, it is necessary that p be a divisor appear- 
ing in (n) or (u). If pt(m) and v,(“) #0 (modn), 
then p ramifies for K/k. A prime ideal p which 
is relatively prime to (x) has the decomposition 
E(p) = P, P, with distinct primes $, in K if 
and only if the equation u= ¢" (mod p”) is 
satisfied by some £€ o for any positive integer 
m In particular, if p-+{n) and v,( u) = 0, we have 
E(p) = $; P, if and only if u = č” (mod p) is 
solvable in o. 

If for an element pz of 0 


y=" (mod p) 


is solvable by some €€ o, u is said to be a 
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residue of tbe ntb power modulo p. Assume 
that p)(n) and v(x) = 0. Let f be the minimal 
positive integer such that p/ is a residue of the 
nth power modulo p. Then p is decomposed 
in K as E(p)=8, P, and Nun Bi) =p" 
(i=1,...,g). 


N. Power-Residue Symbol 


Let (, = exp(2zi/n)¢ k, and let p be relatively 
prime to (n) and (x) (x€ k). Then for some r we 
have 


g= = FF (mod p), 


and we write 


This symbol is called the nth power-residue 
symbol (Kummer). Generalizing this definition, 
we can define the nth power-residue symbol 
(a/b), for an ideal b of k which is relatively 
prime to the relative discriminant of k(2/a)/k 
by using the *Artin symbol (( K/k)/b): 


cir) sh off) 


This symbol satisfies 


G5) GG) GG) 
FA 


if all the symbols are well delined. In partic- 
ular, @ is a residue of the nth power modulo p 
if and only if (x/p),, = 1. This symbol coincides 
with the ‘quadratic residue symbol for n =2, 
k=Q, and p#2. 


0. Law of Reciprocity for tbe Power-Residue 
Symbol 


Several formulas concerning the power-residue 
symbol are known which are similar to that 
for the quadratic residue symbol (F. G. M. 
Eisenstein, Kummer, Furtwangler, Takagi, 
Artin, Hasse). These can be proved by means 
of Artin’s tgeneral law of reciprocity in class 
field theory (— 59 Class Field Theory). 

There are many formulas concerning the 
reciprocity of the power-residue symbol. One 
of them is as follows: Letn = Į be a prime 
number. Let «, Be kand assume that (i) g is 
totally positive; (ii) v,(a) = 0 (mod J) if v,(B) #0 
(mod/), and v,(f)=0 (mod!) if v,(a) #0 (mod J), 
for any prime ideal p; and (iii) a= 1 (mod l) and 
B = 1 (mod( 1 —{,)), Then 
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DOs 


(general law of reciprocity, Hasse, 1924). This 
result is a generalization of the formula of 
Eisenstein (1850). If (ek) is totally positive 
and y= 1 (mod l), then 


(complementary 
1924). 


law of reciprocity, Hasse, 


P. Norm Residue 


Let m be an integral divisor of k such that 
m = I; při TI p2 (e; > 0) with finite prime divi- 
sors {p;} and infinite prime divisors {p9}, 
and let f be an element of k that is relatively 
prime to m. For a relative algebraic num- 
ber iield K/k and an element B of K, we set 
B = Ngah B) (mod m) if the following two con- 
ditions are satisfied: (i) B =Nx,(B) (mod pfi) for 
every finite p; and (ii) BY’ >0 for every infinite 
prime p% such that 75) is real and its extension 
to K is imaginary. fe k is then said to be a 
norm residue modulo m for K/k if there exists a 
number B of K such that B= Nx,,(B) (modm). 
Let p be a finite prime divisor of k. If Bisa 
norm residue modulo p° for a sufficiently large 
c, then B is a norm residue modulo pf for any 
e >c. Let c be the smallest such integer (c > 0). 
Then the ideal f, = p° is said to be the p- 
conductor of norm residue for K/k. If p is un- 
ramified for K/k, then c = 0; i.e., every Bek 
which is relatively prime to p is a norm residue 
modulo p° for any e > 0. For a ramified p put 
fp = p‘. For a Galois extension K/k, c is not 
greater than 


5 [V,| Up ye 
, Wpt1 — Up) = . 
p=0 | Vol j Í i=0 V| 


In particular, for an Abelian extension K/k this 
value is an integer and is equal to c (Hasse, J. 
Fac. Sci. Univ. Tokyo, 1934). For example, the 
l-conductor of the cyclotomic field K;4/Q is I”. 
We define the p -conductor for K/k for an 
infinite prime divisor p, of k by foe =p, if Do 
is real and its extension to K is imaginary, and 
oe = | otherwise (- 257 Local Fields F). 
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Q. Norm Residue Symbol 


For an Abelian extension K/k, the positive 
divisor 


f= Tf 


(where p runs over all finite and infinite prime 
divisors of k) is called the conductor of K/k 
(— 59 Class Field Theory). For a e k (a #0) 
take % such that a/a, = 1 (mod f,) and %& = 1 
(mod ffp +), and put (x) = p“b with b relatively 
prime to p. Then b is relatively prime to the 
relative discriminant Dx;,. We define a new 
symbol by 


a, K/k i K/k 
( p 55) i 


where ((K/k)/b) is the Artin symbol. This 

value is independent of the choice of the 
auxiliary element x), The new symbol is called 
the norm-residue symbol (Hasse, J. Reine 
Angew. Math., 162 (1930)). In particular, for 
an infinite real prime divisor p® of k whose 
extension PY’ for K is imaginary, we have 


a, K/k 
po =] or =0 


according to whether the conjugate «”) is 
positive or negative, where g is the automor- 
phism of K/k induced from the complex con- 
jugation of the completion C of K with respect 
to PY. 

The norm-residue symbol has the following 
properties: 


a) (= wr (* A) (= <A, 
p J \ p p / 


(2) if p is unramified for Kk, then 


(c T 
p J \p 


(3) in order that g be a norm residue modulo f, 
for K/k, it is necessary and sufficient that 


( ae 
P *% 


(4) the product formula for the norm-residue 
symbol (Hasse) is 


a, K/k 
=1 
1 p 


where p runs over all finite and infinite prime 
divisors of k; and (5) if the domain of the 
variable g is the whole k (#0), or the set of 
all x such that (œ) is relatively prime to p, or 
the set of alla such that «= 1 (mod p”) (v, + 
I<mv,,,), then the range of ((a, K/k)/p) is 
the decomposition group Z of P, the inertia 
group T of p, or the ramification group V, 
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of p, respectively (Hasse, S. Iyanaga, 1933) 
(- 257 Local Fields F). 


R. Hilbert Norm-Residue Symbol 


The symbol first introduced by Hilbert for 
quadratic fields can be defined in a general 
algebraic number field k containing an nth 
primitive root ¢, of unity. Let «, Be k(«#0, 

p #0), and let p be a prime divisor. Then the 
following nth root of unity ((a, B)/p), is defined 
by using the norm-residue symbol: 


(P) oe. 
a (Ann) 


p 


This symbol ((a, f)/p), is called the Hilbert 
norm-residue symbol. It is also called the 
Hilbert-Hasse norm-residue symbol. For «, x’, 
Bek, wehave 


ax PA foB (a, Bp). 
o( p )-( p ( p ); 


(2) the law of symmetry: 


Gear 
Don SRE. 


and (3) the product formula for the Hilbert 
norm-residue symbol: 


n(~*) = 


(Hilbert, Furtwängler, Takagi, Artin, Hasse). 
For detailed properties concerning the norm- 
residue symbol, power-residue symbol, and 
Hilbert norm-residue symbol and for refer- 
ences for them see Hasse [6]. 

In general, the problem of obtaining various 
laws of reciprocity for the power-residue sym- 


bol is reduced to the one of computing the 
Hilbert norm-residue symbol explicitly. De- 
tailed formulas for these symbols are called 
explicit reciprocity laws; they are treated as 
a topic in the number theory of local fields 
(- 257 Local Fields H). 


S. Density Tbeorem 


Let M be a set of prime ideals of k If 


1 1 

j —_——/ log —— 
ET) l a 
exists, its value is said to be the density of M. 
The density of the set of all prime ideals of k is 
l. Let H be an ideal group modulo an integral 
divisor m. Then the density of the set of all 
prime ideals contained in each coset of 3(m) 
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modulo H is 1/(3(m): H). In particular, let H 
be the ray S(m). Then this result implies that 
each coset of $(m) modulo S(m) contains 
intinitely many prime ideals (a generalization 
to algebraic number fields of the tprime num- 
ber theorem for arithmetic progression). 

Let K/k be a Galois extension, C be a conju- 
gate Class of the Galois group G of K/k, and 
M(C) be the set of all prime ideals p of ksuch 
that the Frobenius automorphism of each 
prime factor ‘B, of p in K belongs to C. Then 
the density of M(C) is {C|/|G| (Cbebotarev’s 
density theorem, Math. Ann. % (1926)). 

Each element g of the Galois group G of K/k 
can be expressed by the permutation z of the 
conjugate fields K"),, K of K over K. Let z 
be expressed as the product of Y tcycles of 
length fi, ‚f Hence n =f, + +f, Let 
Chis St, be the set of all such g in G, and 
let M(f,, ,f,) be the set of all prime ideals 
p of Ksuch that p is decomposed in K/k as 
the product of r prime ideals of K with rela- 
tive degree f,,. ,f,. Then the density of 
M(f,, fis Cf... .Sf)//GI (Artin, Math. 
Amn., 89 (1923)). 


T. Relation to the Arithmetic of Local Fields 


It is quite useful to investigate the relation 
between the arithmetic of algebraic number 
fields and that of local fields. For example, let 
a prime ideal p of an algebraic number field k 
be decomposed as E(p)= Pf. 99 Neal Bi) 
=p/i(i=1,...,g) in an extension K of k. Let 
K and k, be the completion of K and k with 
respect to -adic and p-adic valuations, re- 
spectively. Then we have [Ky k] = e; f,and 
K @x kp = Ky, ++ Kg (direct sum). The 
relative different Dx, is expressed as (the 
p-component of Dx) = []%; Dx, jk» FOr a 
Galois extension K/k the p-conductor i =y" 
for the norm-residue and the conductor p° of 
local extension Ką/k, have the same exponent 
c. For a local field K why, each norm-residue 
modulo f, is a norm of an element of K,. 
Hence precise results concerning the norm- 
residue in local fields can be applied imme- 
diately to a global field K/k (- 257 Local 
Fields). 

We can also apply the method of the idele 
group of an algebraic number field K, and 
therefrom we can prove results concerning the 
ideal class group, unit group, and zeta function 
of k(— 6 Adeles and and Ideles). 


U. History of the Arithmetic of Algebraic 
Number Fields 


C. F. Gauss (1832) was the first to generalize 
the notion of integers to algebraic number 
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fields in considering the elements of ZL,/ —1], 
now called Gaussian integers (Z fi ] is the 


principal order of Q/-1 )). After investiga- 
tions by G. L. Dirichlet and Kummer, the 
notion of ideals was introduced by Dedekind 
(1871) [2]. L. Kronecker gave another founda- 
tion for the arithmetic of algebraic number fields 
(1882) [3]. Dirichlet proved the unit theorem 
and, introducing the analytic method to num- 
ber theory, gave the class number formula of 
quadratic fields (+ 347 Quadratic Fields). H. 
Minkowski first applied the theory of lattice 
points to number theory (- 182 Geometry of 
Numbers), and K. Hensel introduced the p- 
adic method (- 257 Local Fields). Hilbert 
(1897) [4] and Hasse (1926, 1927, 1930) [6] 
summarized the main results on the arithmetic 
of algebraic number fields known at that time. 
In particular, Hilbert’s report centered around 
the arithmetic of Galois extensions, and 
Hasse’s around the class field theory obtained 
by T. Takagi, E. Artin, and H. Hasse (- 59 
Class Field Theory). Since c. 1950, when the 
notions of ideles and adeles were introduced, 
cohomology-theoretic methods have been 
successfully applied to number theory (— 6 
Adeles and Ideles). Recently various local 
methods, for example, the Iwasawa theory of 
Z,-extensions, fformal groups, and fp-adic L- 
functions (T. Kubota and H. W. Leopoldt, J. 
Reine Angew. Math., 214/215 (1964)) have been 
frequently applied to research in algebraic 
number fields. 
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15 (VIII.3) 
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A. Definition 


An algebraic variety of dimension 2 is called 
an algebraic surface. In this article, by a sur- 
face we mean a complete irreducible algebraic 
surface defined over an algebraically closed 
field K. 


B. History 


The history of algebraic surfaces originated 
with the study of algebraic functions of two 
variables. In the case of algebraic functions of 
one variable, the introduction of tRiemann 
surfaces attached to such functions played an 
essential role in the development of the theory. 
The study of algebraic functions of two vari- 
ables led naturally to the consideration of the 
surfaces defined by a suitable polynomial 
equation. H. Poincaré and E. Picard are 
among those who studied the homological 
structure of the surface defined by the equation 
P(x,y, Z) = 0. The theory of tAbelian integrals 
(Picard integrals) is One of the consequences of 
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such topological investigations. S. Lefschetz 
obtained further results in this direction. 

M. Noether and geometers of the Italian 
school, such as F. Enriques, G. Castelnuovo, 
and F. Severi, studied algebrogeometric prop- 
erties of algebraic surfaces. In particular, the 
Italian school geometers recognized the im- 
portance of irregularity and thoroughly inves- 
tigated its geometric meaning. In the early 
20th Century they succeeded in constructing 
the great edifice of the theory of algebraic 
surfaces. Though some of their results lack 
rigorous proof, efforts to build a foundation 
for those results have led to the recent devel- 
opments in algebraic geometry. A signifi- 
cant contribution to the modernization of 
the theory was made by 0. Zariski and K. 
Kodaira. 

The resolution of singularities of an alge- 
bratc surface is one of the most fundamental 
problems in the field. When the tuniversal 
domain is the complex number field, function- 
theoretic methods were used by Italian-school 
geometers and R. J. Walker (Ann. Math., 36 
(1935)). Zariski introduced the tyaluation- 
theoretic method to deal with the problem 
when the characteristic of the universal dọ- 
main is zero. S. Abhyankar (1966) succeeded 
in resolving the case of positive characteristics. 


C. Divisors and Linear Systems 


In what follows, let § denote a nonsingular 
surface. § can be embedded into some projec- 
tive space. Let Z be a linear system of divisors 
on § and fy, ,f, be a basis of the detining 
module for X over K. Associated with X we 
have a rational mapping @, : §P" defined by 
®,(P)=(fo(P),. . .,f,(P)) for general points P 
on S. Pullbacks of hyperplanes by ®,. are 
called variable components of 2. Any member 
of X is a sum of a variable component and a 
fixed component of X. Let §’ denote the clo- 
sure of the image of S by ®,.. If dim S’ = 2, a 
general variable component is irreducible. If 
dim S’ = 1, then a general variable component 
is composed of an algebraic system of dimen- 
sion 1, which is called an algebraic pencil. 
These result from Bertini’s theorems. For any 
divisors D and DP’ on S, the intersection number 
(or the fKronecker index) 7(D: D’) is defmed; 
this number is a symmetric bilinear form such 
that ID: D)= KD; D3) for any divisors D, and 
Dj, linearly equivalent to D and D’, respec- 
tively. If C is a nonsingular curve on S, I(C-D) 
coincides with the degree of the restriction 
C-D of D toC. I(D:D)=(D7) issaidtobethe 
self-intersection number of D. If D is an ample 
divisor on §, then (D?) >Oand/(D- C) > 0 for 
any irreducible curve C on S. These properties 
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characterize an ample divisor (Y. Nakai; = 16 
Algebraic Varieties E). 

Let 2 be an irreducible linear system of 
dimension r( > 1), and let C be a generic com- 
ponent of X. Let C’ be a member of 2 different 
from C. Then the set of C-divisors C- C’ forms 
a linear system of dimension Y- 1 on C. This 
is called the trace of X on C and is denoted 
by Tre. The trace is, in general, not com- 
plete. The integer dim |Tr,2| —dim Trp = 
(2) is called the deficiency of Z. The defi- 
ciency of the complete linear system |D|_is 
denoted by S(D). 

Let x be a point of S, and let @, be the local 
ring of x. Then (5 = | ),¢5 @, is an talgebraic 
coherent sheaf, called the ‘structure sheaf of S. 
Let D be a divisor on §. The sheaf of germs of 
rational functions f such that(f)+D>0 is 
denoted by G', (D). Then H°(S,@s(D)) isa defin- 
ing module for the complete linear system D |. 
D>oO if and only if Os(—D) is a sheaf of C,- 
ideals. The quotient sheaf @,/@,(— D) will be 
denoted by 0,. If D is a tprime divisor, then @, 
is the structure sheaf of the algebraic curve D 
(— 9 Algebraic Curves). Let ¥ be a sheaf on S. 
We set 


(S, F)=E?-o(— 1)! dim HS, F). 


7(S, Os) will be denoted simply by 7(S). We 
call p(S)= y(S) 1 the aritbmetic genus of 
the algebraic surface S. Sometimes y(S) is re- 
ferred to as the arithmetic genus of S. We set 
%s(D) = x(S) x(S, Os(—D)). The integer p, (D) 
= 1 —y;(D) is, by definition, the arithmetic 
genus of the divisor D. If D is a prime diyi- 
sor, then p, (D) coincides with the arithmetic 
genus of the algebraic curve, i.e., p, (D) = 
dim H'(D, Op). 


D. Riemann-Roch Theorem 


Let § be a nonsingular surface and let K de- 
note a canonical divisor on S, i.e., K = (œ) for 
some nonzero rational 2-form œ on S. If C is a 
nonsingular irreducible curve on S$, (K + C): C 
becomes a canonical divisor on C; hence, 
deg((K + C):C)=2p,(C)—2 byacorollaryto 
the Riemann-Roch theorem on C (+9 Alge- 
braic Curves C). Since ]((K + C}: ©) = deg((K + 
C): C), it follows that p,(C)=1I((K + C): C)/2 + 


1. Moreover, the formula 
p,(D)=I((K + D)-D)/2+1 


holds for an arbitrary curve D on S; this is 
called the adjunction formula. For any divisor, 
we definep, (D) tobeI(D (D +K))/2+1.Then 


15, C(D))= x(8)— Zs(— D) = x(S) + pal— D)— | 
= I(D-(D—K))/2+ (8). 
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This formula is called the Riemann-Rocb 
theorem on S (— 366 Riemann-Roch Theo- 
rems C). Applying Serre’s duality theorem to 
D (— 16 Algebraic Varieties E) we have 


dim HS, ((D))= dim H?*(S,O(K — D)) 
for i=0, 1, 2. 


In particular, dim H?(S, O(D))=I(K — D), 

which is called the index of speciality of D; 
dim H'(S, ©(D))=dim H'(S, O(K —D)) is called 
the superabundance of D, denoted by h'(D). 
The inequality 


I(D) +1(K — D) 
> x(S, (D)) =I(D: (D = K))/2+7(S) 


is called the Riemann-Roch inequality, where 
equality holds if and only if k'{(D) = 0. FD isa 
curve with s connected components, then 


(D+ K)=h'(—D)+ p,(D)—s + x(S). 


In addition, if h!(0)= 0, then hk!'( D) = 0 and 
thus {(D+ K)=p,(D)—s+y(S). This is called 
the Riemann-Roch theorem for the adjoint 
system. Note that D + K| is called the adjoint 
system of D. The Noether formula, (K?) + 
c,(S)= 12y(S), is a special case of Hirzebruch’s 
theorem of Riemann-Roch type (- 366 
Riemann-Roch Theorems B). Here, c,(S) 
denotes the second Chern number of $, which 
coincides with the Euler number of § if K is 
the field of complex numbers. 

Let Div(S) denote the group of all divi- 
sors on S; by linearity we can define the bi- 
linear form (D: D’) on Div(S)) = Div(S) ®@ ,Q. 
J = {DE Div(S)) I(D- D') =0 for all D'} is 
a subgroup and X = Div(S$),/J is a finite- 
dimensional vector space over Q, on which the 
nondegenerate bilinear form / is induced. 7 has 
a unique positive eigenvalue; thus the other 
eigenvalues are all negative. This is called the 
index theorem of Hodge; it is derived from the 
Riemann-Roch theorem on S. From this, we 
inferthatif D?=I(D-D)>0, then I(D-D’)?> 
D? -D forany D’, X is said to be the Neron- 
Severi group of § and dim X is said to be the 
Picard number of § (— 16 Algebraic Vari- 
eties P). 

To study [(mD) asa functionof m >> 0, O. 
Zariski writes an arbitrary effective divisor D 
as a sum of D and DO’ eDiv(S)y with non- 
negative rational coefficients such that (1) D? 
is arithmetically effective (or, numerically 
semipositive), i.e., I (D'*)-C) >0 for any curve C 
on S; (2) D‘)=0 or the intersection matrix of 
the support of D™ is negative detinite; and 
(3) 1(D™ D) = 0. Such a decomposition is 
unique and is called the Zariski decomposition 
of D [7]. If mD is a divisor for some m > 0, 
then [(mD)=I(mD“). 
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E. Invariants of Algebraic Surfaces 


There are many invariants besides the arith- 
metic genus discussed earlier. We set ht? = 
dim, H?(S, Q9). Then h”? is equal to the 
number of linearly independent holomorphic 
2-forms; it is called the geometric genus and is 
usually denoted by p,. Since h® ! gives the 
maximum among the deficiencies 6(2) of linear 
systems on S, it is called the maximal defi- 
ciency of S. For a divisor C such that h! (C) = 
0, we have 6(C)=h®:!. The number h°! was 
formerly called the irregularity of §, because 
h°- ! was considered to be a correction term in 
the equality p,(S) = p,—h®'. The study of 
higher-dimensional varieties showed, however, 
that it was unnatural to regard h®:! as a cor- 
rection term. At present, by irregularity we 
mean the dimension of the Picard variety of § 
(— 16 Algebraic Varieties P), and we denote 
this number by q. 

When § is defined over the complex number 
field, we have h?4 = h®P, In particular, q = 
h®: = h!-°, This number is equal to the num- 
ber of linearly independent +tAbelian simple 
integrals of the first kind; it is also equal to 
one-half the first Betti number of S. In cases 
with positive characteristic, these equalities 
do not hold in general. J.-P. Serre gave an 
example of an algebraic surface § such that 
h®!#h!?, and J. Igusa gave an example such 
that g<h®.!=h!-°, Let K be a canonical divi- 
sor of S. The number P, =/(iK) is called the i- 
genus, and P; (i= 2,3, . ) are generally called 
plurigenera. If P, =0and d n, then P, is also 
zero. The numbers p,(S) = h7? —h®™ +, p,(S) = 
hè’, ab’, ht, P, (i=2,3,...)(P,=p,) ae 
tabsolute invariants of $’; i.e., they take the 
same values for any nonsingular surface S’ 
that is birationally equivalent to S. However, 
ht! is not an absolute invariant. For a pro- 
jective plane, all plurigenera P, vanish and 
the irregularity q = 0. Thus if S$ is a rational sur- 
face (- 16 Algebraic Varieties J), i.e., a sur- 
face which is birationally equivalent to P?, 
all P, =q = 0. Conversely, any surface with 
q =P, =0 is a rational surface. This is called 
Castelnuovo’s criterion. A ruled surface is de- 
fined to be a surface that is birationally equiv- 
alent to a product of the projective line and a 
curve. All P, of a ruled surface equal 0 and 
any surface with P, =P, = 0 is a ruled sur- 
face. This is called the criterion of ruled sur- 
faces (Enriques). 


F. Characteristic Linear 
Families 


Systems of Algebraic 


One of the central problems considered by the 
Italian school was to prove that the irregular- 
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ity q is equal to the maximal deficiency h® £, 
For that purpose, Severi introduced the notion 
of characteristic linear systems of algebraic 
families. Let 2 be an irreducible algebraic 
family of positive divisors on S such that a 
generic member C of X is an irreducible non- 
singular curve, and let r be the dimension of X. 
Let 2 be a 1-dimensional subfamily of 2 
containing C as a simple member, and let C 
be a generic member of X L Then the speciali- 
zation of C’: C over the specialization C’-+C is 
a well-defined C-divisor of degree y= I (C: C’). 
The set of C-divisors thus obtained is called 
the characteristic set. The characteristic set 
forms an (Y = 1)-dimensional linear system and 
contains Tr, C|as a subfamily. This linear 
system is called the characteristic linear system 
of X, For any algebraic family of dimension r, 
we have f <dim Cl + q. In particular, there 
exists an algebraic family X that contains oo! 
linear systems and such that for a generic 
curve C we have h'(C) = 0. For such an alge- 
braic family, we have the equality r= dim C| 

+ q; hence the inequality q < h® ! follows. 
Moreover, if the characteristic linear system is 
complete, we have q = h™!, The proof of the 
completeness of characteristic linear systems 
given by Severi is valid only in some special 
cases (€.g., the case p, = 0). For a complex 
algebraic manifold, a rigorous proof was given 
later. When the characteristic is positive, the 
completeness does not hold in general (Igusa); 
however, for the surface with p, = 0, the com- 
pleteness holds (Y. Nakai). The completeness 
holds if and only if the Picard scheme of § is 
reduced [ 14]. 


G. Birational Transformations 
Surfaces 


of Algebraic 


Let S and S’ denote nonsingular surfaces. If 
there exists a birational morphism T: SS’, we 
say that S dominates S’, and we write S > S’, In 
addition, if T is not an isomorphism, we write 
S >S’. In case there does not exist an S’ with 

S > S’, S is said to be relatively minimal. On the 
other hand, if we have §’ > § for any S’ which 
is birationally equivalent to S, § is said to be 
minimal. Any minimal surface is, by definition, 
relatively minimal. If a minimal (resp. rela- 
tively minimal) surface S is birationally equiva- 
lent to S’, we say that § is a minimal (resp. 
relatively minimal) model of S’ or of the field 
K(S). A necessary and sufficient condition for 
S to have a minimal model is that S not be a 
ruled surface (Castelnuovo and Enriques). 

Let S be a nonsingular surface and P be a 
point on S. Replacing P by a projective line 
Pt, we have a nonsingular surface §' and a 
birational morphism T:$’—>S§ such that T-‘(P) 
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=E=P' and S’—E=S—{P} by T. T:§’3§ 
(or T~! in some references) is said to be a 
locally quadratic transformation. Any bira- 
tional morphism between nonsingular sur- 
faces is a composition of locally quadratic 
transformations and an isomorphism. Given 
a birational mapping T: §>S’ and P on S’, 
T`! #P }is called an exceptional curve when- 
ever it is not a point. Moreover, it is called an 
exceptional curve of the first kind if T is regu- 
lar along T~! {P}. Otherwise, it is of the second 
kind. Exceptional curves consist of irreducible 
rational curves. An irreducible curve Ẹ is an 
exceptional curve of the first kind if and only 
if (E?)= = 1 and £ xP!, S is relatively minimal 
if and only if S has no exceptional curves of 
the first kind; S is minimal if and only if S has 
no exceptional curves at all. A relatively mini- 
mal surface that is not minimal is a ruled 
surface. Such a surface is either P? or a P!- 
bundle over a curve. In particular, a relatively 
minimal rational surface is either P? or a P!- 
bundle over P’. Any surface of the latter type 
is occasionally called a Hirzebruch surface and 
its (K?) is 8, where K denotes a canonical 
divisor. However, (K?) of P? is 9. Define the 
linear genus of a rational surface to be 10. If S 
is not a rational surface, taking a relatively 
minimal surface S’, we deline the linear genus 
Pp” of S to be (K)+ 1 of f, 


H. Examples of Algehraic Surfaces 


Let §„ denote a nonsingular surface in P? 
defined by a homogeneous polynomial of 
degree m. Let H denote a divisor on §,, in- 
duced from a (hyper)plane on P3, Then the 
canonical divisor K is linearly equivalent to 
(m—4)H, i.e. K-(m-4)H. Hence, p,=(m—1) 
(m 2) (m — 3)/6. Moreover, q = 0; if K is the 
field of complex numbers, §,, is simply con- 
nected. §, is isomorphic to the product of two 
copies of Pt, i.e, S, = P' x P’; hence, it is a ra- 
tional surface. §, is also rational. There exist 
27 lines on §,, Contracting 6 mutually disjoint 
lines among these 27 lines, we obtain a pro- 
jective plane. Conversely, given 6 points on 
P? in general position, by performing locally 
quadratic transformations with these points 
as centers we get a cubic surface S$, and a 
birational morphism T: S$, +P’. The inverse 
images by T of the 6 given points, the proper 
transforms of 15 lines connecting every pair of 
points chosen from the 6, and 6 conics passing 
every 5 of the 6 points by T~!, exhaust 27 lines 
on S,. If m = 4, then K ~ 0. In general, a non- 
singular surface with K ~ 0 and q = 0 is said to 
be a K3 surface (- 72 Complex Manifolds J). 
K3 surfaces have certain properties similar 

to Abelian surfaces that are defined to be 2- 
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dimensional Abelian varieties (— 3 Abelian 
Varieties). For K = C, the period mapping 
defined by integrating regular 2-forms has 
been studied extensively in connection with 
moduli theory. 

In general, surfaces with the bigenus P, = 
p, = 1 and g =0 are birationally equivalent to 
K3 surfaces; surfaces with P, = p, = 1 and q= 2 
are birationally equivalent to Abelian surfaces. 
Every Abelian surface has the involution i 
defined by i(x) = -a and the quotient surface 
by iis a singular surface with 16 ordinary 
double points. By performing 16 locally qua- 
dratic transformations successively with these 
singular points as centers we obtain a K3 
surface. Such a K3 surface is called a K ummer 
surface. The original singular surface is also 
called a Kummer surface. A quartic surface in 
P? with 16 double points is an example of a 
Kummer surface. 

The theory of birational classification of 
surfaces was developed by Castelnuovo, En- 
riques, and others of the Ttalian school. This 
theory has been extensively enriched and gen- 
eralized in various ways. Kodaira’s theory of 
analytic surfaces includes classification of 
algebraic surfaces (— 72 Complex Manifolds I, 
J), and classification of surfaces in the positive- 
characteristic case has been recently studied in 
detail (4, 5]. When the field K(S) is a subfield 
of a purely transcendental extension K(X,Y), § 
is said to be a unirational surface. If the exten- 
sion K(X, Y)/K(S) is separable, S is a rational 
surface. However, if it is inseparable, § may 
become nonrational, a K3 surface, an elliptic 
surface, or a surface of general type [5, 12]. 
Even for noncomplete surfaces, we have a 
classification theory [ 10] similar to the previ- 
ous ones. The following result is one of the 
applications: whenever S x A’ x A?, § is iso- 
morphic to A?, Here A” means the affine n- 
space (M. Miyanishi, T. Sugie, T. Fujita; 
[7,9,13]). 

Let I be the ring of integers of a real qua- 
dratic field of discriminant d. The Hilbert 
modular group G = SL(2,1)/{ +1} acts on the 
product #7, Æ being the complex upper half- 
plane. The normal complex space #7/G can 
be compactified by adding a finite number of 
points and thus a compact norfhal surface is 
obtained. Resolving these singularities in the 
canonical minimal way, we have a nonsingular 
surface Y(d) over C, which is called the Hilbert 
modular surface with discriminant d. Y(d) is 
simply connected; hence, g( Y(d)) =0. f d =5, 8, 
12, 13, 17, 21, 24, 28, 33, 60, then Y (d) is a 
rational surface. If d = 29, 37, 40, 41, 44, 56, 57, 
69, 105, then Y(d) is birationally equivalent to 
a K3 surface. If d =53, 61, 65, 73, 76, 77, 85, 88, 
92, 93, 120, 140, 165, then Y (qd) is an elliptic 
surface with x( Y (d)) = 1, Otherwise, it is a 
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surface of general type (— [ 19,201 in 72 Com- 
plex Manifolds). 

For O-cycles on a surface — 16 Algebraic 
Varieties J. 
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16 (VIII.4) 
Algebraic Varieties 


A. Affine Algebraic Varieties and Projective 
Algebraic Varieties 


Fix a field k. A subset of the n-dimensional 
taffine space k” over k is called an affine alge- 
braic variety (or simply affine variety) if it 

can be expressed as the set of the common 
zeros of a (finite or intinite) set of polynomials 
F(X,, » X,) with coefficients in k. Similarly, a 
subset of the n-dimensional *projective space 
P’(k) over k that can be expressed as the set of 
the common zeros of a set of homogeneous 
polynomials G,(Y,, . , Y,) is called a projective 
algebraic variety (or simply projective variety). 
In this section, variety means either an affine 
or projective variety. A variety which is a 
subset of another variety is called a subvariety. 
These varieties are the forerunners of the 
modern, more general versions of algebraic 
varieties, which we will discuss later. 

When V is an affine variety in k”, the set of 
the polynomials in k[X] = kLX,, ,X,] that 
vanish at every point of V form an ideal I(V) 
of k[ X]. The residue class ring A, = kLX]/I(V) 
is called the coordinate ring (or affine ring) of 
V. We can regard A, as the ring of k-valued 
functions on V that Can be expressed as poly- 
nomials of the coordinates of k”. When Vis a 
projective variety, the *homogeneous ideal 
generated by the homogeneous polynomials in 
(sea pees a that vanish at every point 
of V is denoted by I(V), and the ring A, = 
k [ Y]/I( V) is called the bomogeneous coordi- 
nate ring of V. 

A variety Vis said to be reducible or irreduc- 
ible according as it is the union of two proper 
subvarieties or not. A maximal irreducible 
subvariety of V is called an irreducible compo- 
nent of V. Any variety can be written uniquely 
as the union of a finite number of irreducible 
components. A variety V is irreducible if and 
only if I( V) is a tprime ideal. When that is the 
case, the field of quotients of A, (when Vis 
affine) or the subfield of the field of quotients 
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of A, consisting of the homogeneous elements 
of degree 0 (when V is projective) is called the 
function field of V and is denoted by k(V). 
Elements of k( V) are called rational functions 
(or simply functions) on the variety V. The field 
k(V) is tfinitely generated over k. The tran- 
scendence degree of k(V) over k is called the 
dimension of V. When V is reducible, the max- 
imum of the dimensions of its irreducible 
components is called its dimension. If W is a 
subvariety of an irreducible variety V, then 
dim V = dim W is called the codimension of W 
on V. A subvariety of pure codimension 1 of 
an affine or projective space can be defined by 
a single equation and is called a hypersurface. 
If the ideal I(V) of a variety V of dimension r 
in a projective space P”(k) is generated by n-r 
homogeneous polynomials, then Vis called a 
complete intersection. Compared with general 
varieties, complete intersections have some 
simpler properties. On the other hand, many 
important varieties are not complete intersec- 
tions, e.g., Abelian varieties of dimension > 2. 

The intersections and finite unions of sub- 
varieties on a variety V are also subvarieties. 
Thus the subvarieties can be taken as the 
*system of closed sets of a topology on V (- 
426 Topology), which is called the Zariski 
topology of the variety V, When k is the field of 
complex numbers, V can be viewed as an 
tanalytic space, and the topology of Vas such 
(the “usual” topology) is much stronger than 
the Zariski topology. For the rest of this arti- 
cle, varieties will be considered as having Zar- 
iski topologies unless stated otherwise. Terms 
such as Zariski open, Zariski closed, and Zar- 
iski dense are used to mean open, closed, or 
dense in a Zariski topology. Suppose a con- 
dition (P) concerning the points of an irreduc- 
ible variety V (concerning the elements of a 
set M parametrized by the points of V) is 
satisfied in a nonempty Zariski open set of V. 
Then we say that the condition (P) holds at 
almost all points of the variety V or at general 
points of V (almost all elements of the set M). 

Let U and V be affine varieties in k" and k”, 
respectively. Then the product set U x Vis an 
affine variety in k"*" and is called the product 
algebraic variety (or simply the product) of U 
and V. Note that the Zariski topology on 
U x Vis stronger than the product of the 
topologies of U and V. When k is talgebrat- 
cally closed, U x Vis irreducible if U and 
V are irreducible. 

Suppose that k is algebraically closed. Let $% 
be a ‘prime ideal of k[X] = k[X,, ,X,], and 
let V be the affine variety in k" detined as the 
zero points of $. Then I(V) = $ (fHilbert zero 
point theorem) (- 369 Rings of Polynomials 
D). Therefore there exists a one-to-one corre- 
spondence between the set of prime ideals of 
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k[X] and the set of irreducible varieties in k”. 
In particular, the tmaximal ideals correspond 
to the points of k”. Similarly, there exists a 
one-to-one correspondence between the set of 
homogeneous prime ideals of k[ Y] other than 

i-o Y;K[ Y] and the set of irreducible sub- 
varieties in P’’(k). 

When we deal with nonlinear algebraic 
equations, we cannot expect asimple, clearcut 
theory without assuming that k is algebraically 
closed. Hence we take an algebraically closed 
field K containing k and regard a variety V in 
k” as a subset of the variety Vy in K” defined 
by the same equations. From now on, we 
suppose that k is algebraically closed. If the 
ideal I(V) of k[X] or k[ Y] determined by a 
variety Vis generated by polynomials with 
coefficients in a subfield k‘ of k, we say that V 
is defined over k’ or that k’ is a field of defi- 
nition for V. Any variety has the smallest field 
of definition, which is finitely generated over 
the prime field. In the theory of A. Weil [92], 
we fix an algebraically closed fjeld K that has 
an infinite transcendence degree over the 
prime field, This K is called the universal 
domain. A point of Vis called a k’-rational 
point of V if all of its coordinates belong to a 
subfield k’ of K. Let K,, K, be two extension 
fields of a field L, and let (x)e K}, (y)EK3. 

We say (y) is a specialization of the point (x) 
over L (notation: (x) p( y)) if all polynomials 
f(X)ELLX,,....X,] satisfying f(x) = 0 also 
satisfy f(y) = 0; in other words, if there exists a 
homomorphism of L-algebras L [x,, ,x,] > 
L[y,,->y..] mapping x; to y;. Let K be the 
universal domain, Van irreducible variety in 
K", and k' (c K) a field of definition for V 
having a finite transcendence degree over the 
prime field. Then there exists a point (x) of V 
such that all points of V are specializations of 
(x) over k’. Such a point (x) (in general not 
uniquely determined) is called a generic point 
of V over k’. The ring k’[x] is isomorphic to 
KEX]J/K V) AK'[X] over k’. (Some authors use 
the term generic point to mean almost all points 
as defined earlier.) 


B. Local Rings 


Let V be an affine variety and let W be an 
irreducible subvariety of V, Let By be the 
subset of A, consisting of the elements that 
vanish identically on W. Then $y is a prime 
ideal of A,. The ring of fquotients of A, with 
respect to Py is denoted by Dy, y or by Dy 
and is called the local ring of W on V (or of 

V at W). Suppose for simplicity that Vis irre- 
ducible. Then Dy is the subring { f/g |f, 8 € Ay, 
g By} of k(V), and the tresidue field of Dy 
modulo the maximal ideal Py Dy can be 
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identilied with k( W). When a function g on 
V (pe k( V)) belongs to Dy, it is said to be 
regular at W. Fora given function ge k( V), 
the set of the points of V where ọ is regular 
is Zariski open. In the case of a projective 
variety, the local ring Dy yw is defined as the 
subring of the ring of quotients of A, with 
respect to $y, consisting of the homogeneous 
elements of degree 0. 

A mapping from an open set U of a variety 
V to k that is regular at every point of U is 
called a regular function on U. The ring of 
regular functions on U is denoted by A,. By 
assigning A, to each open set U, we can define 
a ‘sheaf of rings @ on V, of which the ‘stalk 
Oy, at a point XE V is the local ring Dy. The 
sheaf (, is called the sheaf of germs of regular 
functions on V (or the structure sheaf of V) 
(- 383 Sheaves H). 


C. General Definition 


Consider a pair (V, 0) of a topological space 
V and a sheaf ( of germs of mappings from 

V to k. If V has a finite open covering (U) 
such that each (U,, 0| U) is isomorphic to an af- 
fine variety V, (in the sense that there exists a 
homeomorphism from U; to V, that transforms 
0| U, to the structure sheaf of V), the pair (V, 0) 
is called a prealgebraic variety over k, and (/ 

is called its structure sheaf. Usually (V, @) is 
denoted simply by V. 

A regular mapping between prealgebraic 
varieties is detined as a continuous mapping 
g: VV ' satisfying poge@, , for any xe V 
and pE Oy: gx Furthermore, if g is a homeo- 
morphism and qe is also regular, then g 
is called an isomorphism or a hiregular map- 
ping. The Cartesian product X x Y of pre- 
algebraic varieties X and Y is locally a prod- 
uct of affine varieties. Therefore X x Y has 
the structure of a prealgebraic variety. A pre- 
algebraic variety X is called an algebraic va- 
riety if the image of the diagonal mapping X > 
X x X is closed in the Zariski topology of 
the product variety X x X (“separation con- 
dition”). (This definition is due to J.-P. Serre 
[81].) The separation condition corresponds 
to tHausdorff’s separation axiom. If Wis a 
locally closed subset (i.e., the intersection of 
an open set and a closed set) of an algebraic 
variety V, then it becomes an algebraic variety 
in a natural manner (the germs of regular 
functions at Pe W are taken to be the germs of 
functions induced on W by the functions in 
Oy, p). Locally closed subvarieties of k” or P”(k) 
are called quasi-affine or quasiprojective alge- 
braic varieties, respectively. Definitions of 
irreducibility and local rings for general alge- 
braic varieties are given in the same manner 
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as before. In this article, algebraic varieties will 
often be referred to simply as varieties. 

The notion of an irreducible algebraic 
variety was developed from that of abstract 
algebraic variety (or simply abstract variety) 
defined by Weil. 


D. Scbemes 


The set of prime ideals ( 4 (1)) of a commuta- 
tive ring A with unity element 1 is denoted by 
Spec(A) and is called the spectrum of A. For 
any subset a of A, we denote by V(a) the set of 
the prime ideals containing a. We define a 
topology on Spec(A) in which the closed sets 
are V(a). This, again, is called the Zariski 
topology of Spec(A). For an element f of A, the 
open set D( f)=Spec(A)— V(f) is called an 
elementary open set. The elementary open sets 
form a base of open sets in the Zariski topol- 
ogy of Spec(A). The set of closed points is 
nothing but the set of maximal ideals of A. 
Assigning to each point $ of Spec(A) the tring 
of quotients A,, we obtain a sheaf of rings 4 
on Spec(A). We have the equality T'(D(f), A)= 
Ap, where A, is the tring of quotients by the 
multiplicative system { f” n> 0}. In particular, 
we have I'(Spec(A), A) = A. Regarded as a 
‘local-ringed space with A as the structure 
sheaf, Spec(A) is called an affine scheme. 

A local-ringed space X which is locally 
isomorphic to an affine scheme is called a 
scbeme. A morphism of schemes is, by defi- 
nition, a morphism between them as local- 
ringed spaces. Thus we obtain a ‘category 
whose objects are schemes. We denote it by 
(Sch). Giving a morphism f:X —Spec(A) is 
equivalent to giving a ring homomorphism 
I'(f): A>T(X, 0,). Hence the category of 
affine schemes (which is a tfull subcategory of 
(Sch)) is contravariantly equivalent to the 
category of commutative rings. If there is given 
a morphism of schemes f: XS, X is said to 
be an S-scheme or a scheme over §, and f is 
called the structure morphism and § the base 
scheme. For two S-schemes f: XS, g: YS, 
a morphism of S-schemes is defined to be a 
morphism of schemes h; X > Y with f=g oh. 
Thus we obtain the category of S-schemes 
denoted by (Sch/S). Spec(Z) is the unique ‘final 
object in (Sch), hence (Sch) is nothing but 
(Sch/Spec(Z)). 

The tfiber product always exists in (Sch). In 
fact in the case of affine S-schemes X = Spec(B) 
and Y= Spec(C) with § = Spec(A) we have 
Xx,Y= Spec(B ®, C), and in the general case 
we construct X x , Y by patching together fiber 
products of affine schemes. 

A morphism f: XS is called separated if 
the image of the diagonal morphism Ay;s : 
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X >X x şX is closed. We also say that X is 
separated Over § or X is a separated S-scheme. 
A scheme X is said to be separated if it is 
separated over Spec(Z). All affine schemes are 
separated. 

When K is a field, Spec(K) is a space having 
only one point and equipped with K as the 
stalk of the structure sheaf. For a point x of a 
scheme X, denote by k(x) the residue field of 
Oy... For fe@y, we call the residue class off 
in k(x) the value off at x, denoted by f(x). We 
have a natural morphism i,: Spec(k(x))>X 
whose image is {x}. More generally, we call a 
morphism i of a spectrum Spec(K) of a field K 
to X a point of X with values in K. Such a 
point is determined by a point x in X and an 
embedding of k(x) in K. A point of X with 
values in an algebraically closed field is called 
a geometric point. For a morphism f:X > 
S and a point s in S, the fiber product X x , 
Spec(k(s)) is called the fiber off over s and 
denoted by f ~+ (s). For a geometric point 
Spec(K)S, X x sSpec(K) is called a geo- 
metric liber. 

A scheme X is called reduced if the local ring 
at each point of X has no tnilpotent elements. 
A scheme is said to be irreducible if its underly- 
ing topological space is not a union of two 
proper closed subsets. A scheme is called in- 
tegral if it is reduced and irreducible. Every 
local ring of an integral scheme is an tintegral 
domain. If a scheme X has an affine open 
covering { U; = Spec(A,)} such that every A; 
is a tNoetherian ring, X is said to be locally 
Noetherian. A locally Noetherian scheme is 
called Noetherian if its underlying topological 
space is  +(quasi-)Compact. 

A morphism f{:X— Y = Spec(A) is said to be 
locally of finite type (of finite type) if X has an 
open affine covering (a finite open affme cover- 
ing) {U,= Spec(A,)} such that each A; is a 
finitely generated A-algebra. A general mor- 
phism f:X — Y is said to be locally of finite 
type (of finite type) if there is an open affine 
covering {V;} of Y such that every restriction 
of f:f (VV; is locally of finite type (of 
finite type). If f: X-+ Y is (locally) of finite 
type we say that X is (locally) of finite type 
over Y. 

A scheme of finite type over a field K (i.e. 
over Spec( K)) is called an algebraic scheme 
over K. There is a natural equivalence of 
categories between the category of reduced 
separated algebraic K-schemes (as a full sub- 
category of (Sch/K)) and the category of alge- 
braic varieties over K (defined in Section C) 
equipped with regular mappings as mor- 
phisms. Hence we identify these categories 
from now on. Occasionally, algebraic vari- 
ety means irreducible variety. Nonalgebraic 
schemes are also important as tools for the 
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study of algebraic varieties. For example, for a 
point x in a scheme X there is a canonical 
monomorphism j,: Spec(@y „>X by which 
the unique closed point of Spec(@y x) is map- 
ped to x. If for two algebraic K-schemes X, Y 
and for two points xe X, ye Y there is a K- 
isomorphism Oy „= Oy., then suitable neigh- 
borhoods of x and y are isomorphic over K. 

Many concepts concerning varieties, e.g., 
dimension, generic points, specialization, can 
be naturally extended to the case of schemes 
by virtue of commutative ring theory. 

A morphism of schemes f:X — Y is called 
proper if it satisfies the following two con- 
ditions: (1) f is separated and of finite type, (2) 
for every scheme T and for every morphism T 
-=> Y, the morphism X x yT—> T obtained from 
f by the “change of base” is a closed mapping. 
We also say that X is a proper Y-scheme or X 
is proper over Y. A proper algebraic K-scheme 
is called complete. A projective variety is com- 
plete, while an affine variety over K is com- 
plete only when it is of dimension zero. Every 
algebraic variety can be embedded in a com- 
plete variety (M. Nagata [63]). 

A morphism of schemes f: X > Y is called 
affine if every inverse image by f of an open 
affine subset of Y is again an affine scheme. 

A morphism of schemes f: X > Y is called 
finite if it is of finite type and there is an affine 
open covering { U;= Spec(A,)} of Y such that 
f (U) = Spec(B,), where B; is tintegral over 
A;. For a locally Noetherian scheme Y and a 
morphism of schemes f; X > Y the following 
three conditions are equivalent: (i) f is finite; 
(ii) f is affine and proper; (iii) f is proper and 
every fiber off is a finite set. For a finite sur- 
jective morphism of Noetherian schemes f: 
X— Y, X is an affine scheme if and only if Y 
is an affine scheme. 

A morphism of schemesf: X — Y is said to 
be flat if for each point x € X, Oy , isa tflat 
Oy, x)-module. If, moreover, fis surjective, 
then f is called faithfully flat. Assume that 
g: Y’+ Y is a faithfully flat morphism of finite 
type of locally Noetherian schemes and f: X > 
Y is a morphism of schemes. Then for many 
important properties of morphisms, f has these 
properties if and only if the pull-back fy.: 

Xx y Y’+ Y’ has the same properties (theory of 
descent [29, 30]). 


E. Cohomology Theory 


Let (X, 0) be a ringed space. An -Module 
(i.e., a sheaf of O-modules) F is said to be 
quasicoherent if for each point x of X there 
exist a neighborhood U of x and an fexact 
sequence M —>N —> Fiu >0, where M and N are 
free Oy-Modules. An -Module F is said to 
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be of finite type if F is locally generated by a 
finite number of sections over (; F is of finite 
presentation if, locally, there exists an exact se- 
quence @?+(4- F-—0 where p and q are posi- 
tive integers (they need not be globally con- 
stant); F is coherent if(i) F is of finite type and 
(ii) the kernel of any homomorphism Oy > Fy 
(where n is an arbitrary positive integer, and Ų 
is an open set) is of finite type. Obviously, if F 
is coherent, then F is of finite presentation, 
which implies that F is quasicoherenet and of 
finite type. In the category of U-Modules, the 
full subcategory of coherent sheaves is closed 
under almost all operations of sheaves. If 0 
itself is coherent as an O-Module, ( is said to 
be a coherent sheaf of rings. In this case every 
()-Module of finite presentation is coherent. 

The structure sheaf of a locally Noetherian 
scheme is a coherent sheaf of rings. On a lo- 
cally Noetherian scheme X, every quasicoher- 
ent sub-@,-Module or quotient @,-Module of 
a coherent 0,-Module is coherent. A coherent 
@,-Module on an algebraic variety V is called 
a coherent algebraic sheaf. 

Let X = Spec(A) be an affine scheme. Then 
every quasicoherent (y-Module F on X is 
generated by its global sections. The corre- 
spondence FI (X, F) defines an equivalence 
beeween the category of quasicoherent sheaves 
on X and the category of A-modules; if A is 
Noetherian, then the coherent sheaves and the 
finite A-modules correspond to each other 
under this equivalence. 

Let X be a separated scheme, and W= {U,} 
an affine open covering of X. For each quasi- 
coherent @,-Module F, the cohomology group 
H4(X, F) is canonically isomorphic to the 
tČech cohomology H4(U, F) (- 383 Sheaves 
F). If X is of dimension d, then HX, F) =0 
for every sheaf F of Abelian groups on X and 
q>d. 

For a scheme X we define the cohomological 
dimension cd(X) to be the largest integer q 
such that H4(X, F) #0 for a quasicoherent Oy- 
Module F on X [35]. The cohomological 
dimension cd(X) does not exceed the dimen- 
sion of X. If X is an affine scheme, then cd(X) 
= 0. The converse is true under the assump- 
tion that X is Noetherian (Serre’s criterion 
[29, MIJ). For an algebraic scheme X of dimen- 
sion n, cd(X) = nif and only if X is complete 
(S. L. Kleiman). 

Let f:X — Y be a proper morphism of 
Noetherian schemes. Then for every coherent 
(-Module F and for every q > 0, Rf (F) (> 
200 Homological Algebra) is also coherent. In 
the special case of Y = Spec(k) with a field k 
this means that for an algebraic coherent sheaf 
F on a complete variety X the cohomology 
group H4(X, F) is a finite-dimensional vector 
space over k. 


Let X be a scheme over k and let F be a 
locally free (y-Module of rank n G.e., an Oy- 
Module which is locally isomorphic to 0%). If 
we take an open covering {U,} of X and iso- 
morphisms @;: Fiy, Cx, then p;o ø; ' defines 
a morphism gy:U; U;> GL(n, k), which is 
called the coordinate transformation of F. If we 
construct a tvector bundle B on X by the same 
coordinate transformations fij» then F can be 
regarded as the sheaf B(B) of germs of sections 
of B. By means of the canonical homomor- 
phism GL(n, k)> PGL(n— 1, k) we can con- 
struct a projective bundle P(F) on X (which is 
said to be associated with F). (Note that in 
[29], P(F) is defined to be a projective bundle 
with coordinate transformations ‘gis L.e., as- 
sociated with the dual of F in our sense.) This 
procedure of associating P(F) with a locally 
free (/,-Module F can be generalized for any 
quasicoherent Os-Module of finite type on 
an arbitrary scheme S. 

A closed (locally closed) S-subscheme f:X > 
S of p: P(F)—S is called a projective scheme 
(quasiprojective scheme) over S, or f is said to 
be a projective morphism (quasiprojective mor- 
phism). A projective morphism is proper. A 
reduced projective scheme over a field K is 
nothing but a projective variety over k. We 
can develop the theory of projective schemes 
by means of tgraded rings in a way similar to 
affine schemes. 

A locally free 0y-Module of rank 1 is called 
an invertihle sheaf. Invertible sheaves corre- 
spond to line bundles up to isomorphisms. Let 
P = P*(k) be a projective space, (y,, y,,--. Yy) 
a system of homogeneous coordinates of P, 
and U; the open subset of P defined by y;# 0. 
Denote by O(n) the invertible sheaf on P de- 
fined by the coordinate transformation g; = 
(y,/y)". More generally, let p: P = P(F)> S be 
the projective bundle associated with a locally 
free Us-Module F of rank N + 1 on a scheme 
S. Then there is an invertible sheaf O(n) = 
O(1)®" on P with the properties: (i) for each 
SES its restriction to the fiber p~'(s) = P*(k(s)) 
is O(n) detined above; (ii) p,(O(n)) = S"(F) for 
n > 0 where S”(F) denotes the nth symmetric 
product of F. The invertible sheaf O(I) is called 
the tautological line hundle on P., (Note that 
(( 1) in the sense of [29] is O( -1) in our sense, 
but since the definition of PŒ) is also differ- 
ent, the above property (11) holds without 
modification.) 

For a quasiprojective S-scheme f: X >P(E) 
>S, the restriction of (( 1) to X is denoted by 
Oy( 1) (or simply (( 1)). An invertible sheaf L on 
X is called very ample over § if there exist a 
locally free ,-Module of finite type E on § 
and an S-immersion j: X -> P(F) such that 
Oy( 1) = L; L is called relatively ample over § or 
S-ample if L®” is very ample over § for a cer- 
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tain n > 0 (- Section N). When S is an affine 
scheme, an ample (very ample) sheaf over § is 
simply called ample (very ample). There is the 
following cohomological criterion of ample- 
ness (generalized Serre’s theorem). 

Let Y be a Noetherian scheme, f: X > Y a 
proper morphism, and L an invertible (,- 
Module. Then the following four conditions 
are equivalent: (i) L is f-ample; (ii) for each 
coherent @,-Module F there is an integer 
N such that R%f,(F @ LS”) = 0 for alln > N 
and q > 0; Gii) for each coherent sheaf of 
ideals Y of @y there is an integer N such that 
RIF (4 @ L®")=0 for alln>N. They imply 
the condition (iv): for each coherent Oy- 
Module F there is an integer N such that the 
canonical homomorphisms f*f,(F © L®")> 
F ®L®" are surjective for all n> N. 

Let X be a scheme proper over a field k 
(such an X is called a k-complete scheme), 
and let F be a coherent Q-module. Then all 
H4(X, F) are finite-dimensional k-vector 
spaces (finiteness theorem), and y(F) = 
X( =1)? dim H4(X, P) is a finite number, 
called the Euler-Poincaré characteristic of F 
over X. For every invertible sheaf L, y(F © 
L®") is a polynomial in m, which is said to 
be the Snapper polynomial [48]. Supp F, de- 
fined to be the set {x EX F, £0}, is a closed 
subset of X and is called the support of F. The 
degree of the polynomial y(F ® L®") is at 
most Y = dim supp F, and we have y(F ® L®”) 
= e'm’/r! + lower terms in m for some e € Z. e 
is said to be the intersection number of L” with 
F [48]. When W is a closed subscheme of X 
defined by an (,-ideal J, (L’-W) is defined to 
be the intersection number of L" with 0,/¥, 
where r = dim W. If L is ample, then (L’: W) > 
0 for any W. In particular, when W= X, it 
follows that (L") > 0, where n = dim X. The 
converse of this fact is the Nakai-Moishezon 
criterion, saying that if (L" W) > 0 for any 
closed subvariety W of X where r = dim W, 
then L is ample [56,64]. 

When L is ample, H4(X, F Q L®") = 0 for 
any q > 0 and for sufficiently large m; hence 
y(F @ L®")= dim H°(X, F ® L8") and the 
Snapper polynomial y(F ® LS") is also called 
the Hilbert polynomial or the Hilbert charac- 
teristic function of F.If L is the invertible sheaf 
detined by a hyperplane section of X in PY, i.e., 
L = (@y(1), then y(L®™) = dim H°(X, L®”) = 
dim Rm for sufficiently large m where @ R, 
denotes the homogeneous coordinate ring of 
X in Př. 

In general, for a complete irreducible variety 
V of dimension Y, we put 


V)=1(Oy)= $ (1h 


(h°-4 = dim H4(V, (@,,)) and call it the arithmetic 
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genus of V. Classically, the number p,(V) de- 
fined by pa(V)=(—L)'(x(V)— =h% — ho! 
+...+h°:} was called the arithmetic genus of 
V, instead of y( V). When V is a nonsingular 
irreducible curve, p,(V) is the usual tgenus. If 
V is a projective variety, the constant term of 
the Hilbert polynomial is y(V) and the coeffi- 
cient of its highest term is (deg V)/r'. 

Let V be a tnormal variety, and D a tdivisor 
on V. If, for each point x EV, we denote by 
((D), the set of the functions fe KV) that 
satisfy (f) + D 2 0 on some neighborhood of x, 
we obtain a coherent algebraic sheaf ((D), and 
we have dim H° V, O(D)) = I(D). If moreover, V 
is complete, we put y,(D) =y(V)— x(O(— D)) 
and call it the virtual arithmetic genus of D. 
Classically, the number p,(D) = —1)’"!(y)(D) 
= 1) was called by that name, When D is effec- 
tive and has no multiple components, y,(D) 
coincides with the arithmetic genus of D re- 
garded as a variety. In general, y,(D) stays in- 
variant if D is replaced by a divisor that is 
algebraically equivalent to D (-» Section N). 

If D is a Cartier divisor, O(D) is an inver- 
tible sheaf. For two Cartier divisors D,, D}, 
O(D, +D,)= O(D,) ® O(D,), and D, and D, 
are linearly equivalent if and only if O(D,)= 
O(D,). 

Let V be an irreducible nonsingular variety 
and let Q? denote the sheaf of germs of regular 
differential forms of degree p(a°® =(,). If V is 
complete, then we denote dim H4(V, QP) by 
hP", 

Serre’s duality theorem: Let V be a nonsin- 
gular complete irreducible variety of dimen- 
sion r, B an algebraic vector bundle over V, 
and B* the dual vector bundle of B. Denote by 
B and #* the sheaves of germs of sections of 
B and B*, respectively. Then (i) H’(V, Q”) is 
canonically isomorphic to k, and (ii) H4 ( V, B) 
and H’ #(V, 8* @ Q") are dual to each other as 
linear spaces by means of the cup product of 
the above spaces with H'(V,Q") x k In partic- 
ular, H4(V, QP) is dual to H'~4(V,Q"~"); hence 
we have h? =h Pra, 

This theorem was extensively generalized 
by Grothendieck in the category of schemes 
[1, 331. 

When the field Kis of characteristic 0, we 
furthermore have h”! =h*? by complex con- 
jugation (— 232 Kahler Manifolds C); but in 
characteristic p, there are examples for which 
this symmetry does not hold [61]. In general, 
h®-4 is a frelative invariant but not an tab- 
solute invariant; however, as h? is the dimen- 
sion of the hnear space of the differential forms 
of the first kind with degree p, h”? is an ab- 
solute invariant. Hence h®-? is also an absolute 
invariant in the case of characteristic 0. When 
the characteristic is positive, the absolute 
invariance of °:? has not yet been proved. 
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F. Simple Points and Singular Points 


Let V be a variety over an algebraically closed 
field k We say that a point P of Vis simple 

or that ViSnonsingular or smooth at P if the 
local ring Op is a fregular local ring. Since the 
problem is local, we may assume that Vis an 
affine variety in K”. Then the simplicity of P on 
Vis equivalent to the following condition: P is 
contained in only one irreducible component 
of V, and if that component has dimension r 
there exist n-r polynomials F;(X) in I(V) such 
that rank (OF ,/0X;)x)=p = n—r.A point of V 
that IS not simple is called a singular point or 
a multiple point. The set of singular points on 
V (called the singular locus of V) is a proper 
closed subset of V. A variety with no multiple 
points is called smooth or nonsingular. 

This notion can be made relative. A mor- 
phism f:X — Y of a locally Noetherian scheme 
is called smooth if f is flat and locally of finite 
type and all the geometric fibers off are non- 
singular. In the case of an affine morphism 
S:X =Spec(R[X,,..., Xs fis San) > Y= 
Spec(R) of relative dimension r (by which we 
mean the dimension of the general fiber) with 
a Noetherian ring R, the smoothness off 
amounts to a condition that rank ((@f;/0X,)(x)) 
= § at each point x of X. 

When for a point P of a variety V the local 
ring Opis tnormal, P is called a normal point. 
A simple point is normal. The set of normal 
points is a nonempty open subset of V. An 
irreducible variety whose points are all normal 
is called a normal algebraic variety (or simply 
normal variety). The singular locus of a normal 
variety has codimension > 2. For an irreduc- 
ible variety V, there exists a pair (V’, f) of a 
normal variety V’ and a birational finite mor- 
phism f:V'—> V; V’ is unique up to isomor- 
phisms and is called the derived normal model 
or normalization of V. 

Simplicity and normality for V at a sub- 
variety W are detined in the same way as at 
a point by using the local ring Oy w. 

For a morphism f:X — Y of locally Noe- 
therian schemes, locally of finite type, the fol- 
lowing three conditions are equivalent: (i) f is 
smooth and every fiber off is a discrete set; (ii) 
fis flat and every geometric fiber over Spec(K) 
off is a union of spectra of fields isomorphic 
to K; Gii) f is flat and every fiber off over 
y € Y is a union of spectra of fields that are 
finite tseparable extensions of k(y). These 
conditions are local with respect to X. If a 
morphism f satisfies these equivalent con- 
ditions, we say that f is étale or X is étale over 
Y. A morphism 


f: X= Spec(R[X,, 9 XM . A) 
—> Y= Spec(R) 
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is étale if and only if det((éf,/0.X;)(x)) 9 for all 
xeéX. Hence étale morphisms correspond to 
local isomorphisms in the analytic category. 
For a surjective étale morphism f: X >Y many 
important geometric properties (reduced, 
integral, normal, nonsingular, etc.) hold on 

X if and only if they hold on Y (theory of 
descent). 


G. Dimension Theorems 


Let V be an irreducible variety, and let U and 
W be irreducible subvarieties of V. Then any 
irreducible component of Uf) W that is simple 
on Vhas dimension > dim U + dim W = dim V. 
When the equality holds, the component is 
called a proper component of the intersection 
UN W. If every component of U N W that is 
simple on Vis proper, we say that U and W 
properly intersect on V. Any two subvarieties 
U and W of P”(k) with dim U + dim W2n 
intersect each other. When Vis an irreducible 
r-dimensional variety in P"(k), the number of 
points of intersection VN L of V with an (n= 
r)-dimensional linear variety L is independent 
of the choice of L as long as L is in a “gen- 
eral position.” This number is called the degree 
of Vand is denoted by deg( V). Letting (( 1) be 
the fundamental sheaf of P’(k), we have deg V 
=(O(1)’: V) (- Section E). 


H. Group Varieties 


An algebraic variety G is called an talgebraic 
group if it has a group structure and if the 
mapping G x G->G sending (x, y) to xy ‘is 

a morphism. Every algebraic group is quasi- 
projective (Chow). If G is irreducible, then 

it is also called a group variety; a complete 
group variety is called an *Abelian variety (- 
3 Abelian Varieties B, 13 Algebraic Groups B). 
A scheme G over another scheme § equipped 
with morphisms over $:G x ,<G-+G, GG, and 
S—G, called multiplication, inverse, and unit 
section, respectively, which satisfy the relations 
corresponding to the usual axioms of group, is 
called a group scheme (over S). As a point set, 
G is not a group, while, for any scheme T over 
S the set G(T) = Hom,(T, G) of the morphisms 
from T to G is a group (- 52 Categories and 
Functors M). Consider an algebraic group 
scheme G over § = Spec(k). If the characteristic 
of k is zero, then G is necessarily reduced, so 
an algebraic group scheme over k is essen- 
tially the same as an algebraic group; if k has 
characteristic p, there exist algebraic group 
schemes over k that are not reduced. 
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Į. Rational Mappings 


Let f: V-t V? be a morphism of varieties. If V 
is not complete, the image f(V) is not always 
closed; the closure off(V) (in V’) is called the 
closed image of V. The image f(V) contains an 
open dense subset of the closed image. 

Let V and W be irreducible varieties. A 
closed subset T of Vx W is called an algebraic 
correspondence of V and W. We say that points 
P e Vand Q e W correspond to each other by T 
if (P, Q)e T. If T is irreducible and the closed 
image of the projection T— V coincides with 
V, then the function field k(V) can be identified 
with a subfield of k(T); if we have k(V) = k(T) 
with this identification, then T is called a 
rational mapping from Vto W. Moreover, if 
the same conditions are satisfied for W, then T 
is called a birational mapping (of birational 
correspondence or birational transformation), 
and in this case we have k(V) = k( W). A mor- 
phism can be considered a special kind of 
rational mapping by taking the graph. If T is a 
rational mapping from V to W and W, is the 
closed image of T in W, then k( W,) can be 
regarded as a subfield of k(T) = k( V). If k(V) is 
fseparably generated (tpurely inseparable) over 
k( W,), then T is said to be separable (purely 
inseparable). 

Let T be a rational mapping from V to W, 
and let V’ and W’ be irreducible subvarieties 
of V and W, respectively. If there exists an 
irreducible subvariety T’ of T whose projec- 
tions have the closed images V’ and W’, then 
we say V’ and W’ correspond to each other by 
T. The union of irreducible subvarieties of W 
that correspond to V’ by T is a closed subset of 
W; it is called the proper transform of V’ by T 
and is denoted by T[V’]. Note that V’ >y” 
does not imply TL[V’] >T[ V”]. The set of 
points of W that correspond to the points of 
V’ is called the total transform of V’ by T and 
is denoted by Tf V’}. Identifying k(T) with 
k(V), we have Op r > Dy.y- in general; if the 
equality holds, we say that T is regular (or 
defined) along V’. In that case, W?’ is the unique 
irreducible subvariety of W corresponding to 
Vv’ by T. If V’ is simple and of codimension | 
in V, then T is always regular along V’. The set 
U of points of Vat which T is regular is a 
nonempty open subset, and the restriction of T 
to U defines a morphism from U to W. A 
rational mapping can be defined as the closure 
of the graph of a morphism defined on an 
open subset of V. The study of a rational 
mapping can be reduced to that of the bira- 
tional morphism T> V. Zariski’s main theo- 
rem: Let §: X > Y be a birational mapping, 
and assume that the inverse S~': Y-X is 
regular, and that X is normal along an irre- 
ducible subvariety X’. If there exists an irre- 
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ducible component Y’ of S[ X] with dim X'> 
dim Y’, then § is regular along X’. [t follows 
from the above main theorem that, if T:V—- 
W is a rational mapping and if P is a normal 
point of V such that T[P] contains an iso- 
lated point, then T is regular at P. 

For a birational mapping T: V—> W between 
complete irreducible varieties, a subvariety V 
of Vis said to be fundamental when dim T[ V’] 
> dim V’. When V’ is a point (curve) we say V’ 
is a fundamental point (fundamental curve) with 
respect to T. The most classical example of a 
birational correspondence with fundamental 
points is the quadratic transformation T of a 
projective plane onto itself given by (xo: xı: x2) 
>(X1X2:X2X9:XoX;) with a suitable coordinate 
system. In this case, T? = identity and the 
points P, defined by x; = 0 for j # i correspond 
to the lines x;=0 by T. Let Cr, denote the 
group of birational transformations of P”(k) 
into itself. These transformations are called 
Cremona transformations when n > 2. Cr, is 
generated by linear transformations and qua- 
dratic transformations (M. Noether). Recently, 
Cr, has been studied in detail by M. Dema- 
zure [21] and H. Umemura [91]. Let V be a 
complete nonsingular irreducible variety over 
k It is called relatively minimal if every bira- 
tional morphism from V to a complete non- 
singular variety V’ is an isomorphism. It is 
called minimal if every birational mapping 
from a nonsingular variety V’ to Vis a mor- 
phism. Replacing birational mapping by ra- 
tional mapping, we can define strong (abso- 
lute) minimality. Abelian varieties and com- 
plete nonsingular curves with positive genera 
are strongly minimal. In general, minimality 
implies relative minimality. 

Zariski’s main theorem is closely related to 
the general connectedness theorem due to W. 
Fulton and J. Hansen stated as follows: Let P 
be the product of r copies of P," and A be the 
diagonal subvariety of P. If X is an irreducible 
projective variety and f: X —P is a morphism 
with dim f(X)>(r—1)m, then f'(A) is con- 
nected [22]. The following results are derived 
from this theorem: (1) if X is singular with only 
normal crossings (- Section L), then X can- 
not be imbedded in P?""!, where n = dim X. (2) 
Let X be a nonsingular subvariety of P~“ which 
is not contained in any hyperplane. Suppose 
that by the projection z,:P’>P*~'! with cen- 
ter x, x being a general point of P*, X is iso- 
morphic to 7,(X). Then 3-dim X <2(N = 2) 
(Zak’s theorem). (3) Let X be a nonsingular 
subvariety of P and H be an arbitrary hyper- 
plane section of X. Then dim(Sing H) < 
codim X (J. Roberts). 

By a similar connectedness theorem, it was 
proved that if C is a (reducible) curve with 
normal crossings on P? over C, then the fun- 
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damental group of P? -C is commutative, 
which was conjectured by 0. Zariski [20]. 


J. Rational Varieties 

An irreducible algebraic variety V over k 
whose function field is purely transcendental 
over k is called a rational variety. A complete 
smooth surface § over an algebraically closed 
field is rational if and only if P,(S) = g(S) = 0 
(the Castelnuovo-Zariski criterion, = 15 Alge- 
braic Surfaces E). 

If the function field of V has a finite alge- 
braic extension which is purely transcendental 
over k, then Vis called unirational. A unira- 
tional curve is in fact rational. More generally, 
if the function field k(C) of a curve C over 
k is contained in a field finitely generated 
and purely transcendental over k, then C is 
rational (Liiroth’s theorem). A unirational 
surface over an algebraically closed field of 
characteristic zero is rational by virtue of the 
above criterion, but in the case of positive 
characteristic there are unirational surfaces 
which are not rational (Zariski). There are 
nonrational unirational threefolds even of 
characteristic zero; for example, ąl| smooth 
cubic hypersurfaces in P* (C. H. Clemens and 
P. A. Griffiths [ 16]), and some smooth quartic 
hypersurfaces in P* (V. A. Iskovskii and Yu. 1. 
Manin [45]). See [6]. 


K. Monoidal Transformations 


Let V be an irreducible variety and ¥ be a 
sheaf of ideals of @,. For any affine open set U 
of V,.# |U is determined by an ideal a of the 
coordinate ring A of U, Let a,, a, , a, be a 
system of generators of a, and let U’ be the 
graph of a rational mapping from U to P”(k) 
such that the points PeU and (a)(P):a,(P): 

: d,(P))e P"(k) correspond to each other by 
U’. Then U’ is uniquely determined (up to 
isomorphisms) by U and a only, Suppose 

V has a covering by affine open sets U;. We 
obtain U; over U; as before. By patching them 
together, we get a birational morphism T: 
V'— V, which is unique up to isomorphisms. 
This T with V’ is called the monoidal trans- 
formation or blowing up of V by the ideal 
sheaf .g, Note that in some references, the 
inverse transformation JT”! is said to be the 
monoidal transformation. The inverse image 
ideal T~'(.¥) is an invertible sheaf, that is 
relatively ample over V. If W is the support of 
Cy/ J and W# V, then T-‘(W) has codimen- 
sion | and T gives rise to the isomorphism 
from V’ = T~'(W) onto V- W. Thus one can 
say that V’ is obtained from V by replacing W 
with T-‘(W), which is locally defined by prin- 
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cipal ideals. If every point of W is a nonsin- 
gular point of V and W is itself nonsingular, 

T '(W) is the projective bundle over W whose 
fiber is the projective r-space where r = dim V 
= dim W — 1. In general, if W is a subvariety 
defined by a sheaf of ideals .%, then T is said to 
be the monoidal transformation with center W. 
In particular, if W is a point, T is called the 
(locally) quadratic transformation. 


L. Resolution of Singularities 


Given an arbitrary irreducible variety V, we 
have the problem of finding Out a nonsingular 
projective variety V’ birationally equivalent 
to V. This is called the problem of resolution 
of singularities. In the case of characteristic 
zero, this problem was solved by Zariski 
(1944) for dimension < 3 and by H. Hironaka 
(1964) [37] for any dimension. In the case of 
characteristic p, S. Abhyankar solved the 2- 
dimensional case (1956) and the 3-dimensional 
case (1966). Hironaka’s theorem of resolution 
of singularities is stated as follows; Let V be a 
variety over a field of characteristic zero. Then 
there exists a finite sequence of morphisms of 
varieties: V.+ V, >. V; > = V such that (1) 
V, has no singular points, (2) each V,,,7 V; is a 
monoidal transformation of V; with center D,, 
(3) each D, is a nonsingular subvariety, (4) 
each V, is normally flat along D;. Here, Vis 
said to be normally flat along a subscheme D 
of V defined by the sheaf of ideals f, if the 
quotient modules $?/Y?"" are flat Mp ,- 
modules for all p and all points x of D. 

Let VY be a nonsingular variety and D an 
effective divisor on V. D is said to be a divisor 
with (only) normal crossings at x € V, if Dis 
defined by f,. fy such that (f, ,f,) is a part 
of a tsystem of local coordinates around x. D 
is said to be a divisor with only normal cross- 
ings, if it is SO everywhere. For any subvariety 
W on J, there exist h: V— V that is a composi- 
tion of monoidal transformations with non- 
singular centers, such that h-'(W) has only 
normal crossings. This results from Hironaka’s 
main theorem II [37]. The normal crossing 
divisor is also defined for complex manifolds, 
and similar results hold. 


M. Cycles and Divisors 


Let V be an irreducible variety. We denote by 
B, the set of r-dimensional irreducible sub- 
varieties of V that are simple on V (i.e., are not 
contained in the singular locus of V), and by 
3,(V) the free Abelian group with basis 23,. 
Elements of 3,(V) are called cycles of dimen- 
sion Y (or r-cycles) on V. Let A and B be r- 
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cycles; A =} n; A;, B =£ m; A;(A;EB,, A, # A; 
if i +j). If n; > m; for alli, then we write A > B. 
If A 20, then A is said to be a positive cycle. 
For a O-cycle A =È n;P, the integer deg(A) = 
È n; is called the degree of A. 

A cycle of codimension | is called a divisor. 
A divisor > 0 is usually called effective in- 
stead of positive. If Vis of dimension d and if 
W®@ = 23.-, , the local ring Oy wisa discrete 
valuation ring. The tnormalized valuation 
defined by it is denoted by p,( ), For a func- 
tion f€ kV), we say that Wis a zero of order n 
if vy(f)=n> 0, and that Wis a pole of order 
—nifvy(f)=n<0. Any function fe K V), 
other than the constant 0, has at most a finite 
number of zeros and poles. We denote by 
(f) the sum È vy(f) W extended over all the 
zeros W off, and put (f~")o=(f)~ and (f)g— 
(f),=(. We call (fo, (foo. and (f) the 
zero divisor, the pole divisor, and the divisor 
off; respectively. The divisor (f) is equal to 
È vw(f)W, where the summation is taken over 
We8,_,, and we have (fg)=(f)+(g). When V 
is complete and the singular locus of V has 
codimension > 1, then fis constant if and only 
if(f), =0 (or (f)o=0). Let D, and D, be divi- 
sors; if there exists a function f( #0)e KV) 
such that D, —D, =(f), then D, and D, are 
called linearly equivalent to each other and we 
write D, ~ D,. The linear equivalence class 
containing a divisor D is denoted by cl(D). A 
divisor which is linearly equivalent to 0 on a 
neighborhood of each point of V is called a 
Cartier divisor (some authors call a Cartier 
divisor simply a divisor). If V is smooth, then 
any divisor is a Cartier divisor. If a divisor D 
can be written as D=f) on an open set U, 
then the function fis called a local equation of 
D on U. Let T: V'>V be a rational mapping 
from a normal variety V’ to a complete variety 
V, let D be a Cartier divisor on V, and assume 
that the closed image of T is not contained in 
D. Since T is regular on some open set U such 
that codim( V’ U)> 1, we have a morphism 
o = T y and the pullback @*(D) defined by 
composing the local equations of D with @. 
Taking the closure of this divisor in V’, we 
obtain a divisor on V’, which is denoted by 
TAD). 


N. Divisors and Linear Systems 


Let V be a complete irreducible variety, fo, 
fis... f, be elements of the function field KV) 
of V, and D be a divisor on V satisfying (fi) + 
D > 0 for each i. Then the set X of the divisors 
of the form (È a;f) + D, where the a; are 
elements of k and not all zero, is called a linear 
system. The linear space kfy + kf, +. + kf, is 
called a defming module of 2. The divisors in 
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2 are positive and are linearly equivalent to 
each other; if every positive divisor that is 
linearly equivalent to a member of X belongs 
to 2, then 2 is said to be a complete linear 
system. For any linear system 2, there exists a 
unique complete linear system containing it, 
which is denoted by ||. The maximal positive 
divisor Dp that is contained in all divisors of 2 
is called the fixed component of X, and for 
each DEC we call D = D, the variable compo- 
nent of D (or of X). A point P of V is called a 
base point of a linear system 2 if P is on each 
variable component of 2. A linear system JY 

is called irreducible if its generic member is 
irreducible; otherwise it is called reducible. The 
dimension of a defining module of a linear 
system Ž is denoted by 1(C); we call I(C)- 1 the 
dimension of and denote it by dim J. A 
linear system of dimension 1 is called a linear 
pencil. 

A defining module of a linear system X is 
determined uniquely up to k-isomorphisms. 
Let L be a defming module, and let fo, fis- , f, 
be a linearly independent basis of L over 
k If we associate to each point P of V the 
point O=(fo(P):f,(P):...:,f,(P)) of the n- 
dimensional projective space, then we obtain 
a rational mapping ®, from V to another 
variety V’. Outside the base points of X, the 
rational mapping > is regular; and the base 
points are the fundamental points of @,. We 
say that ®; is the rational mapping defined 
by the linear system 2, When Chas no tixed 
components and ®; is a closed immersion, 

2 is said to be very ample (or ample). For a 
divisor D, the set of positive divisors that are 
linearly equivalent to D is a linear system, 
which is called the complete linear system and 
is denoted by |D|, We usually write [(D) in- 
stead of [(|D]). If |D] is very ample, we say that 
D is very ample. We say that D is ample (or 
nondegenerate) if mD is very ample for some 

m> 0, 


0. Differential Forms 


Let V be an n-dimensional irreducible variety, 
and let A = k( V) be its function field. We de- 
note by D* the set of derivations of over 

k, i.e., the k-linear mappings D: R->§ satisfy- 
ing D( fg)=D(f)g + fD(g). Then D* is an n- 
dimensional linear space over K. Let D denote 
the fdual space of D* over R. For each feg, 
let df be an element of D defined by <df, DY = 
D(f) (De D*), Let x1, X2,- . - , Xp be a separat- 
ing transcendence basis of KĘ over k, in the 
sense that x,, ,x, are algebraically inde- 
pendent over k and § is a tseparable algebraic 
extension over KX i> Xna). (Such a basis exists 
under the weaker hypothesis that Kis tperfect.) 
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Then dx,,.... OX, form a basis of D over R. 
The homogeneous elements of degree r of the 
*Grassmann algebra of D over K are called 
differential forms of degree r on V (or rational 
r-forms). The set of the differential forms of 
degree n is a 1-dimensional linear space over K 
spanned by dx, A dx, A. A dx, 

A set of n functions f,, , f, in K is called a 
system of local coordinates on an open set U of 
V iff; —fi(P), -.f,—-F,(P) is a tregular system 
of parameters of the local ring , for each 
Pe U. In that case, f,,,f, is also a separat- 
ing transcendence basis of K, If P is a simple 
point of V, then there exists a system of local 
coordinates on a suitable neighborhood of P. 
Let œw be a differential form of degree r on V, 
and write w= Yi < m <i, Pw4fi, A A Afi, 
where (f;,.-,f,) is a system of local coordi- 
nates around P. If the coefficients Pj are 
regular at P, then % is said to be regular at P. 

When Vis a complete variety without sin- 
gular points, a differential form that is every- 
where regular on V is called a differential form 
of tbe first kind (or a regular form); the dif- 
ferential forms of the first kind are determined 
by the function field A and are independent of 
the choice of the nonsingular model V. The 
number of linearly independent differential 
forms of the first kind, of degree n, is denoted 
by p and is called the geometric genus of V. 

Let V be a complete variety, W an irreduc- 
ible subvariety of V of codimension 1, and P 
a point of W that is simple on V. Choose a 
system of local coordinates (fi). Given a dif- 
ferential form œ on V, we write it as a “poly- 
nomial” in the df;, and denote by vy(w) the 
minimum of the values of the coefficients for 
the fyaluation vy( ). The number vy(a) is 
determined by w and W, and it is independent 
of the choice of P and of the local coordinates. 
Then w defines a divisor (w) = Dy vy(w) W on 
V, which is called the divisor of a differential 
form œ. The divisor of a differential form of 
degree n (= dim V) is called a canonical divisor 
and is usually denoted by K. The canonical 
divisors form a linear equivalence class of 
divisors. 


P. Albanese Variety, Picard Variety, Néron- 
Severi Group 


Let V be an irreducible variety. Then we can 
construct a couple (A, f) consisting of an 

tA belian variety called the Albanese variety of 
V and a rational mapping f: V+ (called an 
Albanese mapping) such that: (i) the image of 
f generates A, L.e., the sum off with itself n 
times, F: ¥"— A, is generically surjective for 
sufticiently large n; (ii) for every rational map- 
ping g: V>B of V into an Abelian variety B, 
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there exist a homomorphism h; A > B and a 
point be B such that g=h: f +b. The Albanese 
variety is uniquely determined up to isomor- 
phisms and f is determined up to translations. 

In the case of k = C, if Vis a complete non- 
singular variety and if q is the dimension of the 
linear space of differential 1-forms of the first 
kind on V, then the first tBetti number i?, is 
equal to 2q. Let œw, , @ be a basis of the 
linear space and let Y1, , Y2q be a basis of 
the first homology group modulo torsion. Put 
aj; = f, ©; and a, = (a;1,, %4). Then the period 
vectors 4; (1 <j < 29) are linearly independent 
over R in C* If T denotes the discrete sub- 
group of C! generated by the ;, then the quo- 
tient group C*/T’ is the Albanese variety of 
V. The Albanese mapping is given by the map- 
ping Po({§@,,- ,)G@,) (modT), where P 
is a variable point on V and Q is a fixed point 
on V (> 232 Kahler Manifolds C). 

Replacing the term “rational mapping” by 
“morphism” in the definition, we can define the 
strict Albanese variety of V and prove its 
existence. It is a quotient Abelian variety of 
the Albanese variety of V. If V is nonsingular, 
both coincide by virtue of the tstrong mini- 
mality of an Abelian variety (- Section J). 

Let V be a complete normal variety, U the 
set of the simple points of V, and D a divisor 
on V. Then D is said to be algebraically equiva- 
lent to 0 if there exist a nonsingular curve C, 

a divisor I’ on U x C, and two points P and 

Q on C such that D can be written as D = 
ož(T) = oġ(T), where @p and  @ are the mor- 
phisms gp: U >U x P>U x C and pọ:U > 

U x Q-U x C. We denote by G(V), 6,(V), 
and (V) the set of al] divisors on V, the set 
of divisors that are algebraically equivalent to 0, 
and the set of divisors that are linearly equiva- 
lent to 0, respectively. We çan introduce a 
canonical structure of an Abelian variety into 
66V), which is called the Picard variety 
of V. The dimension q of the Picard variety is 
called the number of irregularity of V; if q = 0 
we say that Vis regular. 

The Albanese variety and the Picard variety 
of V are tisogeneous to each other, and each 
one is the Picard variety of the other. If Vis 
a curve, they are isomorphic and called the 
Jacobian variety (— 9 Algebraic Curves E). 

Using Cartier divisors instead of divisors, we 
get an analogous theory to construct another 
kind of Picard variety which turns out to be 
isomorphic to the Picard variety of the strict 
Albanese variety of V. The group of the linear 
equivalence classes of Cartier divisors can be 
identified with H!( V, 0%), where O% is the sheaf 
of multiplicative groups of the invertible ele- 
ments in 0,. From this point of view, we can 
generalize the theory of Picard variety to the 
case of schemes also. The theory thus obtained 
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is called the theory of Picard schemes [60]. 
The quotient group NS(V) = 6(V)/G,(V) is 
finitely generated [66] and is called the Neron- 
Severi group of V. We cal] the rank of NS(V) 
the Picard number of V and denote it by p(V). 
In the case of a nonsingular projective vari- 
ety over k = C we have an inequality p(V) < 
h} (V) (= dim, H! (V, Q})) and the Lefschetz 
number B,( V) = p( V) is a birational invariant 
(where B,(V) is the second Betti number of 
V) [39]. For the positive characteristic case, 
however, the above inequality does not hold 
in general [61]. 

The torsion part of NS(V) is 6,(V)/6,( V 
where &,( V) denotes the group of divisors 
numerically equivalent to zero (- Section Q) 
(T. Matsusaka). The last fact cannot be gen- 
eralized for higher codimensional cycles [25]. 


Q. General Intersection Theory 


Let V be an irreducible variety of dimension n, 
and let A and B be irreducible subvarieties of 
V of dimension r and s, respectively. If C is a 
proper component of A N B, we can define the 
intersection multiplicity i(A B, C; V) of A and 
B along C on V, which has properties consis- 
tent with our geometric intuitions. In partic- 
ular, it is invariant under biregular mappings. 
If A and B intersect properly on V and if 
C,, , Cy are the proper components of A N B, 
we define an (r+s—n)-cycle A: B by A: B= 
>, i(A. B,C,; V) C, and call it the intersection 
product of A and B. If each component X, of 
an r-cycle X =}, n,X, and each component Y, 
of an s-cycle Y = Lig mg Y, intersect properly, 
we define 
X- Y=}, Vp nygX,* Yp. 
Then we have the associative law (X Y): Z = 
X: (Y> Z) for cycles X, Y, and Z whenever 
both sides are defined. Two r-cycles X, and X, 
are said to be numerically equivalent if for 
every (n = r)-cycle Y that intersects them pro- 
perly we have deg(X, Y) = deg(X, Y). 

The theory of intersection is one of the most 
basis theories in algebraic geometry, for the 
other theories can be constructed from it [92]. 


R. Chow Rings 


Let U and T be nonsingular irreducible vari- 
eties. If Z is a cycle on U xT such that Z- 

(U x t) is defined for every pointt of T, we 

put Z: (U x t)=X(t) xt, and we obtain a 
family {X(t)} of cycles on U parametrized by 
the points of T. Such a family is called an 
algebraic family of cycles. Two cycles X, and 
X, whose difference is equal to the difference 
of two cycles in an algebraic family are said to 
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be algebraically equivalent. In particular, if X, 
— X, can be expressed as the difference of two 
cycles of an algebraic family parametrized by 
the points of the affine line, then X, and X, 

are said to be rationally equivalent. The set of 
the cycles that are algebraically (rationally) 
equivalent to 0 is a subgroup 3,( U)(3,4,( U)) of 
J(U). For divisors, rational equivalence coin- 
cides with linear equivalence. 

Let f:V—U be a morphism between non- 
singular irreducible varieties. For an irre- 
ducible subvariety W of V, let W’ denote the 
closure of f( W), and put f;(W) = 0 if dim W> 
dim W’ and f3(W)= m W’ if dim W = dim W’, 
where m= [k(W): k( W’)] is the degree of the 
morphism W— W’. Extending Í. z by linearity, 
we obtain a module homomorphism f}: 3( V) 
— 3(U). If f is proper, then Ís induces a 
module homomorphism f, from 3(V)/3,a(Y) 
=A(V) to A(U). 

Let U and V be nonsingular irreducible 
projective varieties, and f: V+U be a mor- 
phism with graph T. If Ye 3( U) is such that 
I: (Vx Y) is defined, we denote by f#(Y) the 
image of I’: (Vx Y) under the induced isomor- 
phism [`> V. Each class of the rational equiva- 
lence class group A(U) = 3(U)/3,.:( U) contains 
a cycle for which f:¥ is defined. Hence we can 
define f*: A(U)>A(V). Let A: UU x U be 
the diagonal morphism, and define x: y = 
A*(x x y) for x, ye A(U). Then A(U) is a ring 
with respect to this product; moreover, it is a 
tgraded ring with the grading by codimension. 
This graded ring A(U) is called the Chow ring 
of U, and the mappingf*: A(U)+ A( V) is a 
ring homomorphism. If f is proper, we have 
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Zero Cycles. Let V be a nonsingular irreduc- 
ible projective variety over an uncountable 
algebraically closed field of characteristic 
zero (say C). Denote by A,,(V) the group of 
classes of O-cycles of degree 0 on V modulo 
rational equivalence. We say A,(V) is finite- 
dimensional if the mapping V” x V”+A,(V) 
sending (a1, -34n b133 ba) to La;— Èb; is 
surjective for a certain n. 

If A,(V) is finite-dimensional, then the 
canonical mapping A,( V)~»Alb( V) induced 
from V>Alb( V) is bijective (A. A. Rojtman). 
In general, the torsion part of A,(V) is isomor- 
phic to that of Alb( V) (Rojtman [75]). If 
h?:°( V) > 0 for some p> 1, then A,(V) is not 
finite-dimensional (Mumford, Rojtman [74]). 
If Vis a surface which is not of general type 
and with h?-°( V) = 0, then A,(V) is isomorphic 
to Alb( V) (S. Bloch, A. S. Kas, D. Lieberman 
[13]). There exist surfaces of general type with 
h>? =0 such that A,(V) =O. For instance, 
Godeaux surfaces are such surfaces (H. Inose 


[43]). 
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One Cycles. Let V be a nonsingular projective 
irreducible variety and M(V) denote {numer- 
ical equivalence classes of I-cycles on V} &®zR. 
Via the intersection pairing, N(V) is dual to 
the Neron-Severi group tensored by R (- 
Section P); and let NE(V) denote the smallest 
cone in N(V) containing all effective I-cycles. 
S. Mori studied the structure of NE(V) in 
detail. One of his results is stated as follows: If 
the anticanonical divisor is ample, then there 
exist rational irreducible curves /,,, l, such 
that (1) -(I;-K,)<n+ 1 and (2) NE(V)= 
R,[/,]+...+R,[1,], where n=dim V, K, is 
the canonical divisor on V, and [1,] denotes 
the class represented by 1,. Moreover, Mori 
has proved that (1) if K, is not numerically 
effective, e.g., if — Ky is ample, then V con- 
tains a rational curve, and (2) if the tangent 
bundle of V is tample, then Vis isomorphic 
to the projective space [59]. (This is called 

the Hartshorne conjecture.) 


S. Chow Coordinates, Hilbert Schemes 


Consider an irreducible algebraic correspon- 
dence T between irreducible varieties V and 
W. Let V’ and W’ be the closed images of the 
projections of T to V and W, and let a and c 
be their dimensions, respectively. Take generic 
points P, Q of V’, W’, and consider the total 
transforms T{P} of P to W and T/Q} of Q 
to V. Denoting the dimensions of T{ P} and 
T{ Q} by band d, respectively, we have a + 
b=c +d, where both sides are equal to the 
dimension of T. This property is called the 
principle of counting constants. 

This simple principle has wide application. 
For instance, let V be an r-dimensional variety 
in P”(k), and let È ju,,X;=0 (0<i<r) be the 
equations of r + 1 hyperplanes H;. The con- 
dition VN (Ho N N H,) # Ø defines an irreduc- 
ible algebraic correspondence T between V 
and the multiprojective space W = F’“(k) x x 
P”(k) with uj as coordinates: T = {(x,u)|xeV, 
> UijX; = 0}. In this case, we have a =r, b = 
(n D +1), and d =0 in the notation intro- 
duced previously, so that ¢=n(r + 1)— 1. This 
implies that W’ is of codimension | in W; 
hence W’ is defined by a single equation F(u;; 
= 0. This form F is the associated form of V of 
B. L. van der Waerden and W. L. Chow. It is 
a homogeneous form of degree d (d = deg( V)) 
in each (u;o,. Uin) and is symmetric in the 
indices į, More generally, for a positive cycle 
X= n, V, of dimension r and of degree d 
(=È n, deg( V,)) in P"(k), the product JI Fj of 
the associated forms F, of V, is called the as- 
sociated form of X. The coefficients of F, 
arranged in a fixed order and regarded as the 
homogeneous coordinates of a point of a pro- 
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jective space, are called the Chow coordinates 
of X. This is a natural generalization of the 
Plucker coordinates. Given r, d and a projec- 
tive variety U (c P”(k)), the set of Chow coor- 
dinates of the positive cycles that are con- 
tained in U, whose dimension is r and whose 
degree is d, is a projective variety called a 
Chow variety. 

In scheme theory, the Hilbert scheme is 
introduced in the following way [60]. Let PZ 
be the n-dimensional projective space over 
Spec Z. For a locally Noetherian scheme S, 
and for a closed subscheme Z of P} x S, we 
have the composition of the immersion of Z 
and the projection P} x § to S; this is denoted 
by f,:Z-S. For any point 5 of S, fz '(s) is 
a closed subscheme of Pj. and the restric- 
tion of O(1) to fz 1(s) is denoted by L,. Put 
MS) = {Z c P} x S} | fz is flat}. Then M be- 
cornes a contravariant functor to the cate- 
gory of sets, which is representable, j.e,, there 
exists a locally Noetherian scheme M such 
that M is naturally isomorphic to the functor 
Hom( —, M). M is written as Hilb(P7) and 
is called the Hilhert scheme. There exists a 
closed subscheme W of P3 x Hilb(P3) with flat 
fw such that for any Ze M(S), there exists a 
unique ~: S+M = Hilb(P}) in such a way that 
Z = W x „5. In particular, if X is a closed şub- 
scheme of P over a field k, there exist SE 
Hilb(P3) and a field extension k/k(s) such that 
X = fy'(s) ® usk- Thus Hilb(P%) parametrizes 
all closed subschemes of Pj. Fix a polynomial 
P and define .#”(S) to be {Ze M(S)| x(L2”) 
= P(m) for all s}. Æ? is also a contravariant 
functor, represented by a scheme Hilb’(P%) 
which is projective over Spec Z. The direct 
sum of all Hilb?(P3) is just Hilb(P¥). 


T. Algehraic Geometry and Complex Analytic 
Geometry 


When k = C, an algebraic variety is called a 
complex algehraic variety, and it has the struc- 
ture of a fcomplex analytic manifold or (if it 
has singular points) of an analytic space. If we 
denote by Ob „ the ring of holomorphic func- 
tions at a point x of V, then Oy „C O,,, and 
their tcompletions coincide. If x is a simple 
point of V, then OF is the tring of convergent 
power series, and its completion is the tring of 
formal power series, The prime ideals of OF . 
remain prime under completion (M. Nagata). 
Because of this, the analytic behavior of Vin a 
neighborhood of x can be investigated alge- 
braically through the completion of Op x. 

If Vis complete, then the analytic coher- 

ent sheaves on V and the algebraic coherent 
sheaves on V correspond to each other bijec- 
tively; consequently, for propositions that can 
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be stated in terms of coherent sheaves, the 
results in the analytic sense remain valid in the 
algebraic sense also, and vice versa (J.-P. Serre 
[82]) (- 72 Complex Manifolds E). 


U. Topology of Algebraic Varieties 


Every algebraic variety defined over R (or C) 
can be triangulated by real analytic cells [46]. 
Let V be a nonsingular connected algebraic 
variety defined over C. For an algebraic auto- 
morphism g of C we can define V’ by letting 
g operate on the coefficients of the defming 
equations of open affine coverings of V. V and 
V” are not necessarily homeomorphic or even 
of the same homotopy type [83]. Grothen- 
dieck has shown that there is a ‘spectral se- 
quence Ef? = H?( V, O2)=>H?*4(V,C), called a 
Hodge spectral sequence. This spectral ge- 
quence can be defined in a purely algebraic 
way and H?’*4( V, C) can be considered as the 
thypercohomology of the de Rham complex 

{ T(V, Q,), d} of V. If Vis projective, V carries a 
Kahler metric and by the theory of tharmonic 
integrals the Hodge spectral sequence degener- 
ates and E?? is C-isomorphic to the complex 
conjugate of E47? (- 232 Kahler Manifolds B). 
This is also the case if V is complete. 

The topology of a nonsingular projective 
surface was studied by Lefschetz using the 
method of Lefschetz pencils. For a projective 
nonsingular irreducible variety V of dimension 
ndelined over C, a Lefschetz pencil { W,},<p: of 
V is, by definition, a linear pencil consisting of 
hyperplane sections W, of V such that: (i) for 
allt eU = Pè = {t,t , ta}, W, is nonsingular; 
(ii) each W, has only one singular point that is 
an ordinary double point; and (iii) W N W,, 
is nonsingular, where we assume 0, 00 EU. 
Embed V into P” by a high multiple of a 
hyperplane of V and take a generic linear 
pencil {H,} of hyperplanes in P“, Then { W, = 
H, N V} is a Lefschetz pencil of V. By blowing 
up V along W N W, we obtain a smooth 
variety V and a surjective morphism 1:7 >P", 
Let W=n '(0), ny=n|x 1(U). R?rty.Q is a 
local system attached to the monodromy 
representation 9,:7,(U,0)-GL(H?(W,Q)). 9, 
is trivial if p # n = 1. For each point t; there 
corresponds a cocycle 6, of H"! ( W, Q) called a 
vanishing cocycle such that if y; is a loop based 
at 0 going once (counterclockwise) around t; 
we have, for each xe H" (W, Q), 9,(y;)(x)= 
x +(x, 0;>0;, where ( » is the intersection 
pairing of H"~'( W, Q). (yì) is called a Picard- 
Lefschetz transformation. The main results due 
to Lefschetz are restated as follows. (1) (Weak 
Lefschetz theorem). The natural homomor- 
phism H'( V, Q)—> H'( W, Q) is an isomor- 
phism for 0 <i<n-~ 2 and is an injection for 
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i=n—1, or equivalently H(V,W, Q) = O0 for 

0 <i <n = 1. (2) (Strong Lefschetz theorem). Let 
č be the cohomology class of H*( V, Q) corre- 
sponding to the hyperplane section W, and let 
L: H*(V, Q)— H**?(V, Q) be the homomor- 
phism detined by the cup product with ¢. Then 
foreachi<n,L" ':H'(V,Q)>H*""'( V, Q) isan 
isomorphism. The weak Lefschetz theorem is 
true for a cohomology with integral coeffi- 
cients. In fact, V-W has the homotopy type 
of a real n-dimensional finite CW complex, 

and T ( V, W) =0fo y < n. The strong Lefschetz 
theorem is equivalent to the statement that 
H”!( W, Q) is the direct sum of the vector 
space spanned by the vanishing cocycles 6, 
and the vector space spanned by the invar- 
iant cocyles (i.e., @,-,(y)x=x,i= 1, ,d). 
Lefschetz’s original proof of this statement is 
incomplete, and no direct topological proof is 
known. The transcendental proof of (2) is 
given by the theory of harmonic integrals. A 
version of Lefschetz pencils is a proper mor- 
phism f:X >D={z||z|<¢} of a complex mani- 
fold X onto a disk D such that f* =f |f (D*), 
D* =D = {0}, is of maximal rank at every point 
of f~'(D*). Fix a pointse D*. 2,(D*,s) oper- 
ates on HÌ( W, D, W =f }(s), and we have a 
representation @,:7%,(D*,s)— GL(H!(W, Z)). 
For a loop y based at § and going once around 
O the Picard-Lefschetz transformation @,(7) is 
quasiunipotent (i.e., for a certain integer m, 
(yy is unipotent). 

For a nonsingular projective variety detined 
over afield Æ with characteristic p> 0, the 
above Lefschetz theorems hold for an l-adic 
cohomology (l+ p) [19, 30 (SGA 7)] (- Sec- 
tion AA). Using the theory of finite étale cover- 
ings of an algebraic variety detined over a field 
k, we can define the algebraic fundamental 
group and the algebraic homotopy groups, 
which are profinite completions of the topo- 
logical fundamental group and the topological 
homotopy groups, respectively, where k = C 
[5, 30 (SGA 1]. Let (X,x) be a germ of a 
complex space with isolated singular point x. 
(X,x) is always algebraizable, i.e., the com- 
pletion Oy y of the analytic local ring Oy, x is 
isomorphic to the completion of the local ring 
of a closed point of an algebraic variety de- 
fined over C [3]. For topological type of al- 
gebraic surfaces — Moishezon [57]. 


V. Hodge Theory 


Let Hp be a finite-dimensional real vector 
space containing a lattice H}, and let H = 
Hg Spe be its complexification. A Hodge 
structure of weight mon H (or Hp) is, by de- 
finition, a direct sum decomposion H = 
ptam H”, HIS Ht? whereH? 4 isa 
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complex vector subspace and the overbar de- 
notes complex conjugation. If H and H’ carry 
Hodges structures of weight m and m’, respec- 
tively, then H @ H', Hom,(H, H'), APH, and 
H* carry Hodge structures of weight m + mw, 
m'—m, pm, and —m, respectively. For a Hodge 
structure H ofweight m, FPH= @,,,H*™"*, 
p=0,1,., m, induces a decreasing filtration. 
Let H bea Hodge structure of weightm, and 
let Q be a bilinear form on H. If the follow- 
ing three conditions are satisfied, the Hodge 
structure H is said to be polarized by Q. (i) Q 
is detined over Q and is symmetric (skew- 
symmetric) if m is even (odd). (ii) Q (HP, H? 1) 
= 0 unless p = p',q = q'. i) (,/—1)?"4Q(v, 5) > 
0 for nonzero wœ HP4. Let V be a compact 
Kähler manifold. Then H = H"( V,C) carries 
the Hodge structure induced by the type (p, q)- 
decomposition (— 232 Kahler manifolds B). 
This is also the case if V is a compact complex 
manifold which is the image of a holomorphic 
mapping from a compact Kahler manifold of 
the same dimension. Moreover, if Vis projec- 
tive, the Hodge-Riemann bilinear relations 
define a natural polarization on the subspace 
P of H consisting of a]] primitive cohomology 
classes. 

Each algebraic cycle W of codimension 
s determines a cohomology class [W] e 
H?5( V, Q), which belongs toH®*( V). (Such 
a class is called an algebraic cycle.) The con- 
verse of this fact is called the Hodge conjecture, 
which says that H*5( V, Q) H®$(V) is spanned 
by algebraic cycles. The case s = 1 has been 
verified by Hodge., Lefschetz, and Kodaira. 

Let V(W) be a smooth irreducible algebraic 
variety detined Over C (complex manifold), and 
let ọ:V—> W bea projective smooth morphism 
with connected tibers. Then H = RY, C is a 
flat vector bundle on W with the flat connec- 
tion V. V is often called the Gauss-Manin 
connection, and it can be detined algebraically 
if W is also algebraic. 

For each fiber V, = ~'(s), se W, the filtra- 
tion FPH™(V,, C= @ixs,pH*" “( V) induces a 
complex subbundle F” and the connection V 
has the property V(O(F*)) c O(F’ '@T*), 
where T is the tangent bundle of W [26]. 
Moreover, if W is algebraic, V is a differential 
equation with regular singular points on W 
where W 1s a smooth compactification of W, 
such that W ~ W is a divisor with normal 
crossings (— Section L). If we consider the 
subbundle P of H consisting of all primitive 
cohomology classes, the polarization on each 
fiber induces a Hermitian pseudometric on 
P. Curvatures of bundles P N F” have been 
studied by Griffiths [26]. There exists a classi- 
fying space D for polarized Hodge structures 
and there exists a holomorphic mapping of the 
universal covering W of W into D, usually 


called a period mapping. D may not be a 
bounded symmetric domain but has several 
interesting properties [24,27]. In some cases 
D/T with a suitable discrete subgroup T is the 
moduli space of polarized algebraic varieties 
(e.g., curves, Abelian varieties). 

P. Deligne [ 1 8] has generalized Hodge 
theory to arbitrary algebraic varieties (more 
generally schemes of finite type over C). The 
simplest case is the Hodge theory of a smooth 
noncomplete irreducible variety X. By Na- 
gata’s embedding theorem [63] there exists a 
complete algebraic variety X such that Y = 
X-X is a subvariety. By virtue of Hironaka’s 
resolution theorem we can assume that X 
is nonsingular and that Y is a divisor with 
normal crossings. Let Q}(log Y) be a sheaf 
of germs of meromorphic 1-forms with loga- 
rithmic pole along Y, i.e., locally written as 
Èi- G(x) (dx;/x;) + Lhe +1 a(x) dx,, where 
(x;,, X,) is a system of local coordinates with 
center pe Y in X such that x x, = 0 is a local 
equation of Y and a;(x), a,x) are holomor- 
phic at p. Using the complex {Q{ (log Y)= 
A?O\(log Y), d}, with a suitable filtration, 
Deligne has shown that H = H"(X, ©) carries 
a mixed Hodge structure and this structure is 
independent of the choice of X. The mixed 
Hodge structure on H consists of two finite 
filtrations, i.e., Oc.. c W,_, CW, C. CH, the 
weight filtration which is detined over Q, and 0 
c. CF? Flo. CH, the Hodge filtration 
such that F” induces on W,/W,_, a Hodge 
structure of weight n. As a corollary he has 
shown that a meromorphic p-form on X with 
logarithmic pole along Y (i.e., a section of 
QF (log Y) is d-closed on X, and () = 0 if and 
only if œ, is zero in H?(X,C). An important 
application of the theory of this mixed Hodge 
structure on H"(X, C) is the following. Let 
V and W be smooth irreducible varieties, 
and let p: VW be a smooth projective mor- 
phism. If V is a smooth compactification of 
V, the canonical homomorphism H”(V, Q)—> 
H?( W, R™o,,Q) is surjective. Fix a point sE 
W. 7, (W, s) operates on H™( V,,Q). Then this 
action is semisimple [ 18]. 


W. Deformations, Moduli, Algebraic Spaces 


In this section for simplicity the field kis as- 
sumed to be algebraically closed, Let X be an 
algebraic scheme over k A (flat) deformation of 
X over a connected scheme S over kwith base 
point sp consists of the following data: (1) A 
morphism p:X—S that is flat and of finite 
type. If X is complete, pis also proper. (2) A 
closed point sọ € S such that the fiber ¥ x 5 k(sg) 
is isomorphic to X. For any closed point 5 € S, 
the liber X,=X x, K9) is called a flat deforma- 
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tion of X. If X is smooth and complete, we 
assume further that p is smooth. Similarly, we 
can define a deformation of a polarized alge- 
braic manifold, an embedding deformation 
of X in an algebraic scheme Y over k, a de- 
formation of an affine scheme with isolated 
singular points, and a deformation of vector 
bundles on a fixed algebraic scheme over k 
etc. The theory has two aspects: local theory 
and global theory. 

Let (R, m) be a complete Noetherian local 
ring such that R/m = k Set R, = R/m". A for- 
mal deformation Xz of X is a sequence {X,} 
such that (i) X, is a deformation of X over 
Spec(R,,) and (ii) there is a compatible ṣẹ- 
quence of isomorphisms X, ® gr, Ry-;SXy-1 
for any n. Let (FLA/k) be the category of 
finite-dimensional commutative local k- 
algebras. The local theory of deformation is 
the study of the covariant functor F of (FLA /k) 
to (Set), where, for A€(FLA/k), F(A) is the set 
of isomorphism classes of deformations of X 
over Spec(A). The functor is in general neither 
representable nor prorepresentable (i.e., there 
exists a formal deformation X, of X such that 
F(A) = Hom,..),(R, A)). But, under reasonably 
mild conditions on F, F has the hull R [78]. 
That is, there is a forma] deformation XR of X 
and a natural transformation j: G-F, where 
G(A) = Hom, ai,(R, A), such that j is for- 
mally smooth (i.e., for any surjection A’+A in 
(FLA/k), G(A’)—> G(A) x ria F(A’) is surjective) 
and G(k[é])>F (k[e]) is bijective for the ring 
of dual numbers k[¢]. The formal deformation 
Xp is called a yersal deformation of X. The 
hull R is unique up to noncanonical isomor- 
phism. The deformation functor F has the hull 
R if X is (i) a complete algebraic scheme over 
k, (ii) an affine scheme with isolated singular 
points, (iii) a polarized algebraic variety over 
k, or (iv) a vector bundle on a complete alge- 
braic scheme over k, etc. If there exists a de- 
formation 7:¥3§ of X over a scheme S 
with base point sọ over Ksuch that R = O5 s 
X, <3 @R,, the formal deformation {7,, : 
X,,->Spec(R,,)} is called algebraizable. Alge- 
braizability of the yersq] deformation has been 
studied by M. Artin. Since the assumption that 
S is a scheme is rather restrictive, Artin has 
introduced the notion of algebraic spaces and 
has considered algebraizability in the category 
of algebraic spaces [2-4]. For a complete 
algebraic variety, the yersq] deformation is not 
necessarily algebraizable and we need to con- 
sider deformations of polarized algebraic 
varieties. The versal deformation of an affine 
variety with an isolated singularity is alge- 
braizable in the category of algebraic spaces. 
For the global theory of deformations, we 
need the projectivity assumption, and the 
theory is essentially reduced to the theory of 
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Hilbert schemes (or Chow varieties) (— Sec- 
tion S). The problem of moduli is considered 
as the study of the set A4 of all isomorphism 
classes of deformations of X. Usually we con- 
sider the moduli of polarized varieties. 

Let (Sch) be the category of Noetherian 
schemes and .# a contravariant functor from 
(Sch) to (Set) delined by M(X) = {isomor- 
phism classes of families of polarized varieties 
parametrized by X e€Ob(Sch), ¢.g., families 
of polarized Abelian varieties with(out) addi- 
tional conditions}. The functor .@ is called 
a moduli functor. If jt is represented by a 
scheme M, M is called a fine moduli scheme 
[60]. In this case there is the universal family 
m:¥ —+M.In many cases the moduli functor 
is not representable. A coarse moduli scheme 
for a given moduli problem is a scheme A4 
together with a natural transformation ¢: 

M —Hom( —,M)such that (1) ¢(Spec(k)): 
M(Spec(k)) > Hom(Spec(k), M) is bijective for 
any algebrically closed field k, and (2) for 

any scheme N and any natural transformation 
y: M—ĦHom( —, N), there is a unique natural 
transformation 1: Hom( —,M)—>Hom( —, N) 
with w= 4: ¢. A coarse moduli scheme is called 
a moduli space or a moduli scheme. 

In many cases, the moduli space can be 
obtained as the quotient space of a certain 
(locally closed) subset H of a Hilbert scheme 
by the following equivalence relation: s-s’ EH 
if and only if X,3 X, as polarized varieties, 
where 7: ¥ > H, 7 '(s)=X,(T. Matsusaka 
[53]). This equivalence relation is often in- 
duced by an action of a treductive algebraic 
group G. Suppose that a reductive algebraic 
group G operates on an algebraic k-scheme Z. 
A G-invariant morphism f: Z— Y (i.e., for the 
trivial action of G on Y, f is a G-equivariant 
morphism) is called a geometric quotient if (1) 
f is a surjective affine morphism and f,(0,)® 
= 0y, (2) if X is a G-stable closed subset of Z, 
then f(X)is closed in Y, and (3) for x ;, x, €Z, 
f(x,) =f(x,) if and only if the G-orbits of x, 
and x, are the same. Let G be a reductive 
algebraic group, 7: G—Aut{ V) a rational rep- 
resentation on a finite-dimensional vector 
space V over k, and vg #0 a G-invariant point. 
Then there exists a G-invariant homogeneous 
polynomial F of degree > {on V such that 
F(v 9) #0 (W. J. Haboush [32]). This implies 
that if a reductive group G operates on an 
algebraic k-affine scheme Spec(A), then the 
invariant ring A® is a finitely generated k- 
algebra and, moreover, if any G-orbit in 
Spec(A) is closed, the natural morphism 
Spec(A)—Spec(A®) is a geometric quotient. 
For a quasiprojective scheme Z over k with 
an action of a reductive group G we need a 
notion of stable points [60,62]. The subset 
of stable points of Z consists of all geometric 
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points of a G-stable open subscheme Z* of Z, 
and there exists a geometric quotient f: Z*—> Y 
where Y is quasiprojective (Mumford [60], C. 
S. Seshadri [79], Haboush [32]). In this way 
Mumford has shown the existence of coarse 
moduli schemes of nonsingular complete 
irreducible algebraic curves and polarized 
Abelian varieties. But, in general, analysis of 
stable points is very difficult and it is desirable 
to extend the category of schemes so that it 
becomes easier to obtain a quotient. Matsu- 
saka has introduced the notion of a Q-variety 
[52]. M. Artin has introduced the notion of an 
algebraic space (D. Knutson [49]). 

An algebraic space X of finite type consists 
of an affine scheme U and a closed subscheme 
R cU x U such that (1) R is an fequivalence 
relation, and (2) the projections p;: R+U (i = 
1,2) are étale. (These are often written as R3 
UX.) A morphism g: V—>X of an affine 
scheme V to an algebraic space X consists of 
a closed subscheme Wc U x V such that (1) 
the projection W> V is étale and surjective, 
and (2) the two closed subschemes R x y W, 

W x,W of Ux Ux V are equal. Let SAVY 
be an algebraic space. Then Hom( Y, X) is de- 
fined as the kernel of Hom(V, X)3Hom(S, X). 
If Y is an affine scheme, this definition of 
Hom( Y, X) is equivalent to the previous defi- 
nition by virtue of the étaie descent. Thus 
algebraic spaces form a category which con- 
tains the category of schemes. We can define 
the structure sheaf of an algebraic space and 
construct a cohomology theory. Many impor- 
tant notions and theorems for schemes can 

be generalized to those for algebraic spaces. 
Every algebraic space has a dense open subset 
that is an affine scheme. A group algebraic 
space is a group scheme (J. P. Murre). Suppose 
k = C. If an algebraic group G operates on an 
algebraic k-scheme properly with a finite stabi- 
lizer group, the quotient space exists as an 
algebraic space. In this way H. Popp has 
shown the existence of moduli spaces of alge- 
braic surfaces of general type as algebraic 
spaces [70,71] (- 15 Algebraic Surfaces; also 
[54]). Moreover, every separated algebraic 
space X of finite type over C carries a natural 
structure X°” of an analytic space. If there 
exists a proper modification morphism f: X” 
—> Y of a separated algebraic space X onto an 
analytic space Y, then Y carries a structure of 
an algebraic space and f becomes a morphism 
of algebraic spaces (Artin [2]). For any alge- 
braically closed field k, Artin has introduced 
the notion of formal algebraic space and 
formal contraction and has obtained results 
similar to those for algebraic spaces [49]. 

An irreducible compact complex space X 
whose algebraic dimension (— 72 Complex 
Manifolds F) is equal to dim X is called a 
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Moishezon space. As a corollary of the above 
theorem, any Moishezon space X carries a 
structure M of a compact algebraic space such 
that X x M“. 


X. Formal Schemes 


Let A be a ring which we assume to be Noe- 
therian, for simplicity, and | an ideal of A. 
Taking { J"\,.) as a fundamental system of 
neighborhoods of 0, we can introduce a struc- 
ture of a topological ring into A called J-adic 
topology. The fcompletion of A with I-adic to- 
pology is isomorphic to the projective limit 4 
=lim,s)4/” (here A/I" are regarded as dis- 
crete topological rings) and called the comple- 
tion of A along /. If A is Noetherian, then A is 
again Noetherian. There is a canonical con- 
tinuous homomorphism i: AA whose kernel 
comprises the zero divisors a witha = 1 Ẹ 7 
(intersection theorem of Krull; — 284 Noeth- 
erian Rings B). If į is an isomorphism, we say 
A is complete with respect to I. The topology 
of A is the [-adic topology where [= i(I)A and 
A is complete with respect to f, Take a Noeth- 
erian ring A complete with respect to I which 
we consider as an -adic topological ring by 
identifying its completion along / with A. 

On X=V(I)< Spec(A) we can define a sheaf 
of topological rings Oy by T(A(f), 0,) = 
lim,s9 4s/I"4 p for (f)=D(f)NX with 

feA. We call X, (x) the format spectrum of A 
and write Spf(A). T is called a defining ideal of 
Spf(A). A (locally Noetherian) formal scheme is 
by definition a topological local ringed space 
which is locally isomorphic to a formal spec- 
trum (of a Noetherian ring). If we define mor- 
phisms between two formal schemes by those 
in the category of topological local ringed 
spaces, the formal schemes form a category. 

For two formal spectra Spf(A) and Spf(B) 
with deiining ideals 7 and J, respectively, the 
direct product Spf(A) x Spf(B) in the category 
of formal schemes is the formal spectrum of 
the completion of A ® B along the image of 
1®B+A® J. Similarly, we can construct a 
fiber product of formal schemes. A formal 
scheme ¥ is called separated if the image of the 
diagonal morphism Ay: X>Xx X is closed (> 
Section D). 

For a Noetherian ring A with an ideal J, the 
formal spectrum Spf(A) (with a defining ideal 
f) is called the completion of Spec(A) along 
P’ (I). Similarly for a Noetherian scheme X and 
a closed subscheme X’ we can define the com- 
pletion X x of X along X’. Every completion of 
a separated scheme is separated. For a coher- 
ent sheaf F on X one can define its completion 
Fix along X’, which is again coherent (under 
the assumption that X is locally Noetherian). 
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Thus we can develop a theory of formal 
schemes in a way similar to that of schemes, 
which we call “formal geometry” (for the more 
general definitions and further discussions see 
[2; 29, I, IIT; 303). Roughly speaking, a function 
on Xx is a formal Taylor series with respect 
to the direction normal to X’ whose coeffi- 
cients are regular functions on X’. The method 
of formal completion enables us to introduce 
“analytic” or “intinitesimal” methods in alge- 
braic geometry. Among many important 
theorems, we state here the following two 
theorems. (1) The fundamental theorem of 
proper mapping: Let f: X— Y be a proper mor- 
phism of locally Noetherian schemes, Y’ a 
closed subscheme of Y, and X’ = X x y Y’ the 
inverse image of Y’. Denote the respective 
completions of X and Y along X’ and Y’ by 
Å and Y, respectively. We have the induced 
proper morphism of formal schemes f oe 
f. Then we have canonical isomorphisms, 
(R'a lF)) y = B'S, (Fix-)sn > 0, for every coher- 
ent, (y-Module F on X. This theorem can 
be applied to prove Zariski’s connectedness 
theorem: for a proper morphism f: X > Y of 
locally Noetherian schemes with f,(@,) = Oy, 
every fiber f~! (y) off is connected and non- 
empty for ye Y. (2) We use the same nota- 
tion as in (1) and assume, moreover, that Y = 
Spec(A) for a Noetherian ring A, complete 
with respect to an ideal J, and Y’ = V(I). Then 
the correspondence F Fry. gives an equiva- 
lence between the category of coherent y- 
Modules with proper support over Y and the 
category of coherent (-Modules with proper 
support over Y. This theorem plays an impor- 
tant role in the theory of tdeformations of 
algebraic varieties. 


Y. Algebraic Vector Bundles 


In this section a vector bundle is a locally free 
sheaf of finite constant rank (- Section E). A 
quotient sheaf of a vector bundle is called a 
quotient bundle if it is a vector bundle. A sub- 
sheaf F of a vector bundle E is a subbundle 
when both F and E/F are vector bundles. A 
vector bundle is said to be indecomposable 
unless it is a direct sum of proper subbundles. 
Every vector bundle E on P! is a direct sum of 
line bundles, that is, E ~ @ @(a;) (Grothen- 
dieck). This property characterizes P! in the 
category of nonsingular projective varieties. In 
fact, if X is a nonsingular projective variety 
with dim X > 0 and X # P' and if r is an in- 
teger with r > dim X, there are stable (see 
below), a fortiori, indecomposable vector 
bundles on X of rank r (J. Simonis and M. 
Maruyama). Vector bundles are closely related 
to subschemes of the base variety. Let E be a 
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vector bundle of rank 2 on a nonsingular 
quasiprojective variety X and (( 1) an tample 
invertible sheaf on X. For n »( and general 
seH(X, E(n)), E(n)/sO,= L @ I with L a line 
bundle and J an ideal in (, which defines a 
smooth subscheme Y= (s), of codimension 2. 
When X is, for example, P” (n > 3) or affine, 
the converse holds. Let Y be a subscheme of X 
purely of codimension 2 and locally of com- 
plete intersection. If wy x A4 &® Oy for an inver- 
tible sheaf M on X, there is a vector bundle 

E of rank 2 and se H°(X, E) with (s)) = Y. 
Moreover, E is decomposable if and only if Y 
is globally a complete intersection. An Abelian 
surface can be embedded in P*(C), and hence 
we have an indecomposable vector bundle of 
rank 2 on P*(C) [40]. For n > 5, we have no 
examples of indecomposable vector bundles of 
rank 2 on P”(C) [36]. Every vector bundle on 
A” =k" is trivial (D. Quillen [72], A. Suslin). 
From this and the fact stated above several 
results can be deduced; every nonsingular 
curve in A? is set-theoretically a complete 
intersection (L. Szpiro). 

For a vector bundle E on a complete 
scheme, the following are equivalent: (i) for 
every coherent Oy-module F, F © S’(E) is 
generated by its global sections for n>0, (ii) 
for every coherent sheaf F on X, H'(X, F @ 
S’(E)) = 0 for alli > 0 and n > 0, (iii) the ttauto- 
logical line bundle ({ 1) on P(E) is ample. A 
vector bundle having these properties is said 
to be ample. This is a generalization of the 
notion of ampleness of invertible sheaves 
(- Section E). The set of ample vector bundles 
is closed under several operations [34]. No 
good criterion such as Nakai-Moishezon’s 
for the ampleness of invertible sheaves (— 
Section E) is yet known. The tangent bundle 
of a nonsingular complete variety X is ample 
if and only if X =P” (— Section R; S. Mori 
[59]). 

Let Y be a nonsingular projective variety 
over an algebraically closed field and Oy( 1) an 
ample invertible sheaf on Y. A coherent sheaf 
E on Y is said to be stable (or, semistable) with 
respect to (),( 1) if E is torsion-free and if for 
every coherent subsheaf F of E withhO#F # E, 
X(F(m))/r(F)< (or <)x(E(m))/r(E) for all m> 0, 
where r(*) denotes the rank. If E is stable (or 
semistable) and locally free, it is called a stable 
(resp. semistable) vector bundle. Let f; X—S be 
a tsmooth, projective, geometrically integral 
morphism and (),{ 1) an f-ample invertible 
sheaf on X. For a numerical polynomial H 
and an S-scheme T, set L¥is(T)={E Eis a 
coherent sheaf on X, with the properties (a) 
and (b)}/ ~; (a) E is T-flat, (b) for every geo- 
metric fiber X, E, = E x, İs stable with re- 
spect to Oy( 1) x, and y(E,(m)) = H (m), and if 
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E, = E, ® f7#(L) with L an invertible sheaf on 
T, E, ~E}. X45 is a tcontravariant functor of 
the category of tlocally Noetherian S-schemes 
to the category of sets. If § is of tfinite type 
over a *universally Japanese ring A, 2Y,s has 
a coarse moduli scheme My)s(H) (— Section 
W) and it is tlocally of finite type over S (D. 
Mumford, Seshadri, D. Gieseker, Maruyama 
[23, 51]). The set of the classes of semistable 
sheaves under a suitable equivalence relation 
(S-equivalence) on geometric libers of X over § 
also has a coarse moduli scheme My/(H) and 
M,,,(H) is its open subscheme. Moreover, it is 
known that M,,,(H) is projective over § in 
some cases, for example, when A is a field of 
characteristic zero. When dim X/S= 1, X = P? 
or P?, the structures of M,,,(H) have been 
extensively studied [ 11,69]. Theories of vector 
bundles are used in theoretical physics [9]. 


Z. Torus Embeddings 


Over an algebraically closed field K, a torus 
embedding, or a toric variety, is a normal 
scheme X locally of finite type over k on which 
an algebraic torus T acts with a dense open 
orbit isomorphic to T. Such X’s, as well as 
many of their algebrogeometric properties, can 
be described very simply in terms of cones in 
real affine spaces, as Demazure [21] first saw 
in the nonsingular case in connection with 
algebraic subgroups of the Cremona trans- 
formation group, and then as D. Mumford et 
al. [47] as well as K. Miyake and T. Oda [67] 
saw in the general case immediately after H. 
Sumihiro [SS] proved a basic theorem on 
linear algebraic group actions. 

The group N, written additively, of one- 
parameter subgroups of an r-dimensional 
algebraic torus T is a free Abelian group of 
rank r. A convex rational polyhedral cone g in 
Nr = Rz N with 0 as the vertex is the set of 
nonnegative linear combinations of a finite 
number of elements of N such that eN (—o) = 
{O}. A subset z of g is called a face, and is 
denoted by 7 < øg, if there exists a linear func- 
tional m on Np having nonnegative values on 
o and t={yeo|m(y)=0}. A fan A in Np isa 
collection of such o’s satisfying the conditions 
(i) A30,0>t1=>Astand (ii) A35, 0° >0> 
alo <a’. 

Each oéA gives rise to an affine torus em- 
bedding U, as follows: Let M be the Z-module 
dual to N; hence A4 is the group of tcharacters 
of T, For geA, the set MNa={meM|m(y)>0 
for all yeo} is seen to be a linitely generated 
additive subsemigroup of M containing 0 
and generating M as a group. U, is then the 
spectrum of the semigroup algebra k[M N a]. 
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Thanks to the condition (ii) above, the U,’s 
can be naturally pasted together to produce 
a torus embedding X = tle iUs 

Every torus embedding is obtained in this 
way. Equivariant dominant morphisms be- 
tween torus embeddings can be described in 
terms of Z-linear maps between N’s which 
send a fan to another. 

Many algebrogeometric properties of X can 
be described in terms of A, e.g., X is nonsin- 
gular if and only if every g € A can be spanned 
by a part of a Z-basis of N. X is complete 
(proper over k) if and only if the union of 9’s in 
A coincides with N,. X has at worst cyclic 
quotient singularities if and only if every ge A 
is simplicial, i.e., if it is spanned by R-linearly 
independent elements of N. An equivariant 
resolution of singularities of X exists and is 
obtained by a suitable subdivision of A (Mum- 
ford et al. [47]). X has only rational singular- 
ities, hence is Cohen-Macaulay (M. Hochster 
[38] and Mumford et al. [47]). The set of T- 
orbits in X is in one-to-one correspondence 
with geA. A reduced T-invariant subscheme Y 
of X is the union of T-orbits, and hence corre- 
sponds to a subset X of A. M.-N. Ishida [44] 
determined when Y is Cohen-Macaulay or 
Gorenstein in terms of the combinatorics of 2. 

A T-invariant Cartier divisor D on X cor- 
responds to a support function h, which is a 
continuous R-valued function on the union 
| Jesa a which is (i) positively homogeneous, 
ie., h(Ay)=Ah(y) for A>0 and ye|_),ea9, (ii) 
Z-valued on N f ( y sea 0) and Gii) linear 
when restricted to each gc A, Various prop- 
erties of the invertible sheaf (,(D) (— Section 
E) can be described in terms of h, For in- 
stance, when X is complete, (,(D) is tample if 
and only if h is upper convex, i.e., h(y) +h(y’) < 
h(y +y), and moreover, A is the coarest fan 
with the property (iii) above. The cohomology 
of O,(D) can be calculated by means of h 
(Demazure [21] and V. Danilov [ 17]). 

A support function h, on the other hand, 
gives rise to a convex polyhedron in Mp with 
vertices in M. In this way, certain aspects of 
the geometry of convex sets can be thought of 
as a part of the theory of projective varieties 
(R. Stanley [87], B. Teissier [89]). 

Mumford et al. [47] introduced a more gen- 
eral concept of toroidal embedding: A normal 
algebraic variety Y and a nonsingular Zariski 
open subset U such that Y >U is formally 
isomorphic at each point to a torus embedding 
X D T. This concept has been used very effec- 
tively to prove important theorems: (1) Sys- 
tematic nice compactitications of arithmetic 
quotients of bounded symmetric domains (D. 
Mumford et al. [7], 1. Satake [77]). Y. Nami- 
kawa [65] worked out the details in the case 
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of the Siegel upper planes, improving their 
earlier Satake compactifications studied by I. 
Satake, W. Baily, J.-I. Igusa, and A. Borel [12]. 
(2) Semistable reduction theorem (Mumford et 
al. [47]). Let f: V-C be a flat morphism from 
a nonsingular variety V to a nonsingular 
curve C in characteristic 0. After a finite base 
extension C’— C and a modification V’3 
Vx-C', we cangetf ':V’>C’ whose singular 
fibers are reduced with only nonsingular com- 
ponents crossing normally with each other 

(- Section L). Without the reducedness 
requirement, the existence is derived from a 
result of Hironaka [37]. 


AA. Etale Topology 


Let ¥ be a category. We say that a Gro- 
thendieck topology on Y is given if, for each 
SE Ob(S), families of morphisms (covering 
families of S) are given and satisfy the follow- 
ing conditions: (1) If g: T>S is an isomor- 
phism, fo :T> S} is a covering family, (2) if 
{9;: R; > S}j-, is a covering family, for any 
morphism p: S'S, the fiber product R; = 
Rix, S exists and the induced family { Q; : 
R;—>S'} is a covering family of S’, (3) if {¢,: 
R;>S};er 18 a covering family and if for each 
i€ I, {Si a: Si, a> Ri}aca; iS a covering family, 
then {¢?s,,4:S; 7S} icraca, IS a Covering 
family. In general, it is more convenient to 
use a notion of sieves to define a Grothen- 
dieck topology [30 (SGA 4)].) A category with 
Grothendieck topology is called a site. Let X 
be a scheme and Et/X the category whose 
objects consists of schemes étale over X. If we 
choose a family of morphisms {¢;: Y;—> Y hier 
with (Jier ¢;( Y) = Y, as a covering family of 

Y e Ob(Et/X), this defines the étale topology 
on X and the étale site X,,. Similarly, one can 
define the Zariski site X,,, (resp. the flat site 
X ,;), using Open immersions (resp. flat mor- 
phisms locally of finite type). A presheaf on a 
site Y is a contravariant functor from Y to 
(Set). A presheaf F is a sheaf if, for any cover- 
ing family {9;:R;>R}ier F(R) Iie F(R) 
Tj, jer (Ri x pR;) is exact. If Y has a final ob- 
ject X, the functor F> F(X) is left exact on 
the category of Abelian sheaves on Z and 
the cohomology groups H: (X, F) are de- 
fined as the right derived functors. Using 
covering families, one can define the Cech 
cohomology H +(X, F) as usual. In the follow- 
ing, a sheaf means an Abelian sheaf. For a 
*geometric point j;x-X of a scheme, an étale 
neighborhood U of x consists of an étale mor- 
phism f: UX and a morphism j: x U such 
that f oj = i. For a sheaf F on X, the stalk 
F, at a geometric point x is defined by F, = 
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lim T( U, F), U being the étale neighborhoods 
of x. On X a the sheaf G,, is defined by G,(S)= 
T(S, @s)* for each SE Ob(Et/X). The kernel of 
the nth power homomorphism G,,-3G,, is 
denoted by ,. The sequence 1>yu,>G,,> 
G,,— | is exact in X „if pis prime to resid- 

ual characteristics of X, but not necessarily 
exact in X,,,. There are canonical isomor- 
phisms Pic(X)= H! (Xz; OF) %H'(Xq, Ga) 3 
H'(X,,,G,). For X = Spec(k), k a field, the 
étale cohomology theory of sheaves on X, is 
equivalent to the tGalois cohomology theory 
over k; hence H'(X,,, ) 3 H'(G, Fg), where 

G = Gal(k/k), k the tseparable closure of k and 
F; is the stalk of F at the geometric point ®), 
of X. For a morphism f: X > Y of schemes 

and a sheaf F on X,,, the direct image sheaf 
Ía E of F is defined by fẹ F(S) = F(X x ,S), 
SeOb(Et/Y), and higher direct-image sheaves 
R'f, F are detined by the right-derived functor. 
Let X be a separated scheme of linite type 
over a field k. By a theorem of Nagata [63], 
there exists a scheme X proper over K and 

an open immersion j: X—X. For a torsion 
sheaf F on Xa, let jF be a sheaf on X, ex- 
tended by 0 outside X. The cohomology 
with compact support H4(X,,, F) is defined 


et? 
by H(X a, F)= H*(Xa jF). This is inde- 
pendent of compactifications. Similarly, for a 
separated morphism f: X —>§ of linite type of 
schemes and a torsion sheaf F on X,,, one can 
define higher direct image sheaf with compact 
support R4f,F. For the étale topology, torsion 
sheaves are important. All torsion sheaves on 
Xa on a Noetherian scheme X are inductive 
limits of constructible sheaves. A sheaf F on 
Xa is called locally constructible (constant) if F 
is represented by an étale covering of X. A 
sheaf F on X ,, is called constructible if there 


exists a linite surjective family of subscheme X, 


of X such that the restriction of F to X; is 
locally constructible (constant). This is equiva- 
lent to saying that every irreducible closed 
subscheme Z of X contains a nonempty open 
subscheme U such that the restriction F to U 
is locally constant and has finite stalks, i.e., 
there is a covering family (¢;: U;>U} such that 
F U,is constant and the stalk F, is a finite 
Abelian group for all geometric points x of U. 
The cohomology of a torsion sheaf or a con- 
structible sheaf has properties similar to those 
of the classical cohomology. Let f: XS be a 
proper morphism and F a torsion sheaf on 
Xa Then the stalk Rf, F at a geometric 
point s of § is isomorphic to H4(X, «,F), 
where X,=X x 5 Spec k(s). If f is a separated 
morphism of linite type of Noetherian schemes, 
a similar fact holds for the cohomology with 
compact support. Moreover, if F is a con- 
structible sheaf, R4f,F is also constructible 
(finiteness theorem). For an affine scheme X of 
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finite type over a separably closed field and F 
a torsion sheaf on X,,, one has H4(X,,, F) =0 
for q > dim X. Let f: X >S be a separated 
morphism of schemes of linite type on C, Fa 
torsion sheaf on X,,. One has a canonical 
isomorphism (R@f,F)" 5 Rf" F™. In partic- 
ular, if X is proper over C, one has H4(X,,, 
Z/(n))SH(X™, Z/(n)). On the other hand, 

for a nonsingular complete curve C over an 
algebraically closed field, one always has 
H'(C.,, Z) = 0. For a geometric point 5 of § 
the strict localization of S at s is the ring 

Os s= lim T(U, Oy), U being étale neighbor- 
hoods of s. A geometric point f of § is called a 
generalization of s if t is detined by an alge- 
braic closure of the residue field of a point of 
Spec (, ,. In this case, s is called a special- 
ization of t. If f: X +§ is a proper smooth 
morphism the sheaf R4f, Z/(n) is locally con- 
structible (constant), and ifs is a specializa- 
tion of t one can define a cospecialization of 
H(X, é Z/@)) to H(X, é Z/@)) which is 
bijective. Let X be a scheme of finite type over 
a field k. If | #ch(k) is prime, one can define the 
l-adic cohomology H4(X, Q,) = lim, H(X, Z/ 
(/")) @z,Q,. The |-adic cohomology has as 
many good properties as the classical co- 
homology (— Section U). 
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17 (X1.11) 
Algebroidal Functions 


A. General Remarks 


If an analytic function f satisfies an tirreduc- 
ible algebraic equation 


Ag(z)f*+A,(2) fe '+...+A,(z)=0 (1) 


with single-valued meromorphic functions 
A,{z) in a domain G in the complex z-plane, 
then f is called a k-valued algebroidal function 
in G. With no loss of generality, we can as- 
sume that there is no common zero among the 
A(z) and that all the A,(z) are tholomorphic in 
G. When k = 1, the solution of (1) is a single- 
valued meromorphic function in G. If all the 
A,(z) are polynomials, then f is an talgebraic 
function. Thus algebroidal functions can be 
regarded as extensions of single-valued to 
multiple-valued functions and also as exten- 
sions of algebraic to transcendental functions. 
Since (1) is irreducible, its discriminant D(z) 
does not vanish identically. For all the points 
a satisfying D(a)#0, Ap(a) 40, ae G, equation 
(1) determines k holomorphic function ele- 
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ments f,(z), f,{z) in a suitable neighborhood 
of a that determine the analytic function f. 
They can be *prolonged analytically in G in 
the wider sense. At any point satisfying A,(z) 
= 0, at least one element has a pole; and at any 
point satisfying D(z) = 0, there may appear 
ramified elements. Therefore an algebroidal 
function can be defined as a finitely multiple- 
valued analytic function in G with the excep- 
tion of poles and talgebraic branch points. 
Every algebroidal function f(z) determines a 
+Riemann surface, which may be considered a 
tcovering surface 3 of G. This surface 3 is a k- 
sheeted *covering surface over G with no sin- 
gular point except for algebraic branch points. 
Also, f reduces to a single-valued meromor- 
phic function on 33, and all the function ele- 
ments over a point z are different. A k-valued 
algebroidal function can also be characterized 
by this property. 

These two (equivalent) delinitions of alge- 
broidal functions give rise to two distinct 
methods of studying these functions. In adopt- 
ing the first definition, we can make use of 
results in the single-valued case, as did G. 
Rémoundos and G. Valiron. When the lat- 
ter definition is adopted, we can use several 
methods that are also applicable in the single- 
valued case, as did H. Selberg and E. Ulrich. 

Research on algebroidal functions has been 
carried out mainly for the case where G is the 
finite plane |z|< 00 or the unit disk |z|< 1. 
Almost all the known results for algebroidal 
functions are extensions of those on single- 
valued functions, but several results particu- 
larly relevant to algebroidal functions have 
been discovered. The existence of branch 
points makes it difficult to investigate alge- 
broidal functions in some cases. 


B. Absolute Value 


Among several results on the absolute values 
of algebroidal functions, the tmaximum prin- 
ciple, one of the basic principles, holds on the 
Riemann surface 3. The following relation 
holds among the A,(z) and | f,(z)|. Assume that 
(1) has the form 


SE +A (2) fe 1+... 4+4,(z)=0. (1) 


Then log( 1 + A(z))/log( 1 + F(z)) is bounded, 
where A(z)=max|A,(z)| and F(z)=max| f,(z)|. 
An algebroidal function that has no pole 
in z|<o0is called an entire algebroidal func- 

tion. The successive derivatives of an entire 
algebroidal function may have poles at every 
branch point, which is a departure from the 
case of single-valued integral functions. An 
algebroidal function defined by (1) or (1’) with 
entire functions A(z) and zero-free ,4,,(z) is 
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entire. Dividing both sides of (1) by A,(z), we 
obtain equation (1’), and all its coefficients are 
entire. In equation (1’), the forder, type, and 
class of an entire algebroidal function coincide 
with those of the largest A(z) (- 429 Tran- 
scendental Entire Functions). 

For an entire algebroidal function of order 
less than 1/2, F(z) tends to infinity uniformly 
along a sequence of concentric circles |z| =r, 
(r, > 00). However, it can be shown that not 
all the branches of f(z) necessarily tend to in- 
finity on the Riemann surface (3; in this sense, 
*Wiman’s theorem does not remain true. But, 
using minj,,-,max,|f,(z)|, one can obtain some 
extensions of the generalized Wiman theorem. 


C. Picard’s Tbeorem and Its Extension 


Rémoundos first extended tPicard’s theorem 
and *Borel’s theorem to an algebroidal func- 
tion. Every k-valued transcendental alge- 
broidal function in the finite plane takes on 
every value inlinitely often with at most 2k 
exceptional values. There are examples where 
2k values are actually omitted. Hence the 
theorem is the best one possible in this sense. 
T. Varopoulos (Bull. Soc Math. France 53 
(1925), 23-34) introduced the degeneracy index 


A=dim{ (CoxCi pass 
+¢,A,=0} 


(0< 4<k= 1) into equation (1) and showed 
that the number of Picard’s exceptional values 
off is at most k+ A+ 1. There is no single- 
valued meromorphic function with A > 0, so 
this result is relevant only for algebroidal 
functions. Extending this idea further, J. Du- 
fresnoy and others obtained more precise 
results. The situation is the same for the Borel 


CEC cg Ag tc, A t+... 


exceptional values. That is, the +Convergence 
exponent of f(z) = w = 0 coincides with the 
torder off except for at mest 2k values, for 
which the convergence exponents off are 
less than the order off (E. Borel). There are 
at most 2k polynomials P(z) for which f(z) = 
P(z) = 0 has at most a finite number of roots 
(Borel), and furthermore, using the degeneracy 
index, we Can give more precise results. 

Selberg was the first to extend the tNevan- 
linna theory of meromorphic functions to 
algebroidal functions (— 272 Meromorphic 
Functions). Almost simultaneously, Valiron 
obtained the same results starting from the 
coefficients of (1); then Ullrich improved the 
results by considering the effect of branch 
points of 3. 

Let 3, be the part of 3 over |z| < r, let n(r, w 
be the number of roots of f(z) = w = 0 in 3,, 
and let n(r, 3) be the number of branch points 
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of 3,. With these notations we write 
17” t n(n(0, w) 
Ne,w)=> (newn, wh SO Mog, 
kjo t k 
m(r, w) 
! | l ——— do, W#H 
a — (9) - 4 ‘ 
Ikr Ja E E °° 


T(r, w) =m(r, w) +N (r, w). 


Let T(r, f) be the flogarithmic integral of the 
spherical area of the image of 3, under w = 


f, 


S es A li 
T(r, f) D i a Tce do, 


and let N(r, 3) be the logarithmic integral of 
n(r, 3). Then we have T(r, w) =T(r, f) + OU) 
and the ramification theorem: 


N(r, 3) <(2k = 2) T(r, f) + O(1). 

Also, T(r, f )= O(logr) holds if and only if f is 
algebraic. Let .4(z) be the maximum of |A,{z)|, 
and let 


2n 


1 
= — 1@ 
u(r) Fkn |. log A(re'*) dg. 


Then T(r, f) = p(r) + OC 1). As the second funda- 
mental theorem we have 


È New )>Q-ITN-NEO3) 
+Ë Ni (r, w) +O(logrT) 
v=] 


>(q=2k) T(r, f) 


+ 3 N,(r,w,)+ O(logrT), 
v=1 

where N, (r, w) is the logarithmic integral of 
n,(r, w) which is the sum of the multiplicity 
minus one of all the roots of f(x) =w = 0 in 3,. 
Furthermore, the deficiency, ramification index 
of f(z), and ramification index of the surface 3 
are defined by 


6(w)= 1 —limsup N(r, w)/T(r, f) 
=liminfm(r, w)/T(r, f), 
D(w) = liminf N, (r, w)/T(r, f), 
€=hminf N(r, 3)/T(r, f). 
With these notations, we have 
X d(w) +) Dow) < 24+ E< 2k. 


These results contain the Picard theorem 
and the Borel theorem. Furthermore, by con- 
sidering the effect of branch points, the Ahlfors 
theory of covering surfaces can be extended to 
algebroidal functions (Y. Tumura). By using 
this result, ‘Bloch’s theorem can be obtained 
very simply. 


But these results do not contain the Varo- 
poulos result. Taking into consideration the 
degeneracy index J, H. Cartan proved the 
following inequality when } = 0: 

q 
(q=k=1)T <N (r, w,) + S(r), 
and for general 2 he conjectured that the 
number qk = 1 may be changed to the num- 
ber q—k— A= 1. This conjecture is still un- 
solved except for some special cases where for 
example, q = k+ į + 2 (N. Toda, Nagoya Math. 
J.,91 (1983), 37-47). 

Interesting results follow upon a study of 
this degeneracy index. For example, when f is 
an entire algebroidal function, if 2?*;' ô(w,) 
> 2k 2(w, #00), there are at least km 1 
*Picard’s exceptional values in {w,} (Niino, 
Ozawa, and Toda). 

With respect to the relation between the 
number of exceptional values and the order of 
fin |z| < œ, there are some results similar to 
those for single-valued functions. For example, 
if f has k+ 1 Picards exceptional values, the 
order off must be a positive integer or oo. 


D. Asymptotic Values and Other Results 


In the single-valued case, Valiron, L. Ahlfors, 
W. H. J. Fuchs, A. Edrei, W. K. Hayman, and 
others studied the tBorel direction, the number 
of tasymptotic values, the relation between the 
deficiency values and the asymptotic values, 
etc. However, almost none of the correspond- 
ing results holds for the algebroidal case as 
shown by several counterexamples. 

There is no relationship between the order 
of an entire algebroidal function and the num- 
ber of its finite tasymptotic values, which is 
quite different from the single-valued case. 
Furthermore, it is possible to have an infinite 
number of asymptotic values even if the order 
is equal to zero. If an algebroidal function f 
satisfies lim inf T(r, f)/(log r)? < +00, then it 
has at most k asymptotic values (Valiron- 
Tumura). The Ahlfors theorem, which is con- 
cerned with the number of tdirect transcen- 
dental singular points of the inverse function 
and the order of a meromorphic function in 
the single-valued case, was extended to the 
algebroidal function by Lü Yinian (Scientia 
Sinica, 23 (1980)). 

The tJulia direction or the Borel direction 
for an algebroidal function is defined not on 3 
but on |z| < cc because of the appearance of 
branch points. With respect to the Julia direc- 
tion, there are results similar to those for the 
single-valued case, but it is unknown in gen- 
eral whether the Bore! direction exists. A. 
Rauch proved that if f” Teg rar = Q, 
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then there is a me with an angle of at least 
n/p in which L(g)=|* log* F(re"®)/r°*! dr 
diverges. Apart Aas, the theory of distribution 
of values, Selberg obtained some conditions 
under which the inverse functions of fAbelian 
integrals of a special kind reduce to alge- 
broidal functions. 
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A. History 


The theory of almost periodic functions was 
originated by H. Bohr in 1924 as a result of his 
study of ‘Dirichlet series. The theory provides 
a method of studying a wide class of trig- 
onometric series (= 159 Fourier Series) of 
general type. Further generalizations were 
made by N. Wiener, V. V. Stepanov, A. S. 
Besikovich, S. Bochner, and others. H. Weyl, 
J. von Neumann, and others clarified the rela- 
tions between this theory and trepresentation 
of groups, specifically, the relations between 
almost periodic functions in a ttopological 
group and representation theory of a ‘compact 
group. 


B. Almost Periodic Functions in the Sense of 
Bohr 


Let f(x) be a complex-valued continuous 
function defined for all real values of x. A 
number 7 is called a translation numher of f(x) 
belonging to ¢ > 0 if 

sup |fix+1)—fOl<e. 


TOXO 


If for any e> 0 there exists a number l{e) > 0 
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such that any interval of length }(e) contains a 
translation number off belonging to x, then 
f(x) is called almost periodic in the sense of 
Bohr. We denote by B the set of all almost 
periodic functions in the sense of Bohr. 

If f(x) is tperiodic with a period p, then 
fe B, because each number /> p plays the role 
of [(g) for any¢>0. Anyf €B is bounded and 
uniformly continuous. A necessary and suffi- 
cient condition for a bounded continuous 
function on (—00, 00) to belong to B is that for 
any given sequence {h,} of real numbers, there 
exist a subsequence {h, } such that the se- 
quence of functions { f(x + hy )} is uniformly 
convergent in (—oo, 0); i.e., the set { f(x+ 
h) he( —o0, œ)} is totally bounded with re- 
spect to the uniform norm | f ||=sup f(x)| in 
the space of bounded continuous functions in 
(—00, 00). 

If f(x) e B, then f( —x), f(x), af (x) (where 
a is a complex number), and f(x + h) (where 
h is a real number)eB. If f(x), g(x)eB, then 
f(x)+g(x) and f(x)g(x)eB. If fi(x)eB and 
{ f,(x)} converges uniformly to f(x), then f(x) 
eB. For any real number A, expisx (where 
i is the fimaginary unit) is continuous and 
periodic. Hence the polynomial function 
Èr- % expid,x € B. Moreover, if the latter 
function converges uniformly to È a, exp iA,,x 
as m tends to œœ, then the limit function 
is also an element of B. The polynomial 
Li 1% EXPiA,x and the series Yi , Xn EXP iå, X 
are called a generalized trigonometric poly- 
nomial and a generalized trigonometric series, 
respectively. 

For any f eB, its mean exists: 


4 [OT 
M[f]= lim | f(x) dx. 
a 


The convergence of the right-hand formula 
is uniform in aé{ —00, oo), and the limit is 
independent of the choice of a. Thus M[f] 

is a tlinear functional defined on B. Since 
M[expidx]= 1 for 2=0 and =0 for 140, the 
family {exp idx — <4 <o}is an fortho- 
normal system with respect to the tinner prod- 
uct (f,g)= M[f(x)g(x)] defined on B. Let 
a(A)= M[f (x)exp( iAx)] for any f € B; then 
there exist countably many values of à for 
which «(A) differs from zero. Denote these 
values of 2 by 4,, 22, - and write a(4,) = 

X, We call the numbers «,, «,, «,, the 
Fourier coefficients of f(x). The formal series 
Prei Xn exp ipx is called the Fourier series 

of f(x). Moreover, the Parseval equality 
MU|f()/?]=X2, |e, |? is valid for any feB. 
For every periodic function, these definitions 
coincide with the ordinary Fourier coefficients 
and the Fourier series (= 159 Fourier Series). 
Any almost periodic function in the sense of 
Bohr is uniquely determined by its Fourier 
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coefficients; i.e., if two almost periodic func- 
tions have the same Fourier Series, then they 
are identical. For any f € B, its Fourier series 
does not always converge uniformly, but 

f(x) e B can be approximated uniformly by a 
sequence of trigonometric polynomials. Hence 
the almost periodic functions in the sense of 
Bohr are also called uniformly almost periodic 
functions. 


C. Generalizations of Almost Periodic 
Functions 


Let C( —%, œ) be the space (- 168 Func- 
tion Spaces) of all bounded continuous func- 
tions on (—00, œ) with distance p( f, g) = 
SUP_ »<x<w {f(x)—g(x)|. Then a uniformly 
almost periodic function is the limit of a ṣẹ- 
quence of trigonometric polynomials with 
respect to this distance. Generally, let p be a 
‘distance function introduced in a function 
space (whose elements are not necessarily 
continuous in (—00, 0)). Then the limit of a 
sequence of generalized trigonometric poly- 
nomials with respect to the distance p is called 
an almost periodic function with respect to p. 
For example, for p > 1, we set 


atl \/p 
Pool SI= sup l | fagas} 
Dyl f, SI = 


1 feH 1/p 
lim sup f | If i) acuta} 
lo -a<a<am (lJa 

These are distance functions. The properties of 
the corresponding almost periodic functions 
and their relations to other classes of almost 
periodic functions have been studied by Besi- 
kovich [ 1]. 


D. Analytic Almost Periodic Functions 


Let D be a strip domain, a < Re z < b, defined 
in a complex plane. For any tholomorphic 
function f(z) in D and ¢ > 0, a real number 7 is 
called a translation number off belonging to 
e>O if sup_.p|f(z+it)—f(z)| <e. If for any e> 
O there exists a number (e) > O such that any 
interval of length /[(e) contains a translation 
number off belonging to €, then f(x) is called 
an analytic almost periodic function in D., We 
denote the set of all analytic almost periodic 
functions in D = {a < Rez < b}by A(a, b). If we 
fix an x in a <x < b, then g(y) = f (x + iy) for 
any f(z)eA(a, b) belongs to B. 

For any feA(a, b) there corresponds a 
Dirichlet series È; &„ exp 2,z such that two 
analytic almost periodic functions are identi- 
cal if the corresponding Dirichlet series are 
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identical. Here the coefficients 


ty = My[ f(x +iy)exp( = ii, y)] 


are determined independently of x (a <x <b), 
and Parseval’s identity 


M,[|f(x+iy)|7] = > |x, |? exp 2A,,x 


holds (Bohr [3]). If the series 
Y An exp A,x exp id, y 


at x = a and x = b represent the Fourier serjes 
of f,(y) and f,(y)eB, respectively, then there 
exists f€ A(a, b) such that f(z) is continuous on 
D and 


fatiyJ=fly),  flb+iy)=f,(y). 


The behavior of f¢A(a, b) at the boundary or 
exterior points of the domain D = {a < Re z < 
b} has also been investigated by Besikovich 


[1]. 


E. Almost Periodic Functions on Groups 


Von Neumann defined almost periodic func- 
tions on any group, generalizing the charac- 
terization of uniformly almost periodic func- 
tions on (—0, 00), Let B(G) be the set of all 
complex-valued bounded functions on a group 
G. Then B(G) is a metric a with the tdis- 
tance p(f,g)=supyeg| f(x) —g(x)|. If for any 
feB(G) the set A,= { f, ,(x x)= fax) )|a, beG} 
is totally bounded in the metric space B(G), we 
call f an almost periodic function on the group 
G. This condition is equivalent to the total 
boundedness of B; = { f,(x) = f(xa) ae G} or 
Cr = {af (x)= f(ax)|aeG}. We denote the set 
of almost periodic functions on G by ¥(G). 
For f(x), g(x)é€.o/(G), the linear combina- 
tions a: f(x)+ b+ g(x) (a, beC) and the prod- 
uct f(x)g(x) are both contained in .¢(G). If 
fa € (G) and { fa} converges to f uniformly on 
G, then fe.A(G). If fe. A(G), then fay, faraf E 
d(G) also. Hence ./(G) is a closed subalgebra 
invariant under two-sided translation in the 
*Banach algebra i?(G). For any f€./(G) there 
exists only one number M[ f] in the closure 
(with respect to the distance p in B(G)) of 


etd bea | 


A= f$ cif (a,xb)|c¢,>0, ¥ c= 1, a; b; ec} 
(= the least closed tconvex set including A,). 
We call M[ f] the mean off on G. The map- 
ping f>M[f] is a linear functional on (G), 
and we have M[ f]>0 if f>0. 


F. Relation to Bounded Representation 


Suppose that we are given a finite-dimensional 
matrix representation D(x) = (d,(x)) of a group 
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G. Then the following three conditions are 
equivalent: (i) All the d,(x) are bounded on G. 
(ii) All the d,(x) are almost periodic in G. (iii) 
The representation J) is fequivalent toa repre- 
sentation by unitary matrices. The inner prod- 
uct ( f, 8) = MLf(x)g(x)] provides the algebra 
A(G) with the structure of a tpre-Hilbert 
space. Let H(G) be the Hilbert space that is the 
completion of s% (G). If we select D”) = (d#(x)) 
from each L, where L is an equivalence class of 
bounded irreducible representations ol G, and 
if n; is the order of D$, then {( (1/./n,)d2( x)[1< 

i,j <n}, A€ L} is a complete E ine sys- 
tem in the Hilbert space H(G). Any f(x) € 
(G) can be approximated uniformly in G 
by a finite linear combination of the d(x). 


G. Almost Periodic Functions on Topological 
Groups 


When G is a tseparated topological group, we 
denote the set of all continuous functions on G 
contained in ./(G) by £ (G). The statements 
of the theorems in the previous section con- 
cerning (G) and the representation D remain 
valid if we replace »f(G) by d,(G) and replace 
D by a continuous representation of G. In 
particular, if G is the additive group of real 
numbers R, then d,(R) is exactly B. 


H. Relation to Compact Groups 


Every continuous function on a compact 
group G is almost periodic; i.e., 2% ,(G) = C(G). 
The mean value M[f ] of fe.W,(G) is identical 
to fe f(x)dx, where the tHaar measure dx is 
normalized so that fe dx= 1. In this case, the 
theory of bounded representations discussed 
above is the Peter-Weyl theory (- 69 Com- 
pact Groups). 

In general, let G be a separated topological 
group. There exists a continuous homomor- 
phism ọ of G onto a compact group K = K(G) 
with the following two properties: (i) For any 
compact group K’ and a continuous homo- 
morphism og’: G— K’, there exists a continuous 
homomorphism y : K > K’ such that g'= yo 
g. (ii) Such a pair K =(K, ọ} is unique up to 
isomorphism. K is called the Bohr compactifi- 
cation of G, and ¢ is called the canonical map- 
ping. In particular, suppose that G is a locally 
compact Abelian group and G* is its tcharac- 
ter group. We denote by G’ the group G* with 
discrete topology. Let K be the character 
group of G°, and let ~* be the identity map- 
ping G'>G* and @ be its conjugate mapping 
G-—K, which is a continuous homomorphism. 
Then K is the Bohr compactification of G 
with the canonical mapping p. A necessary 
and sufficient condition for f on G to be con- 
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tinuous almost periodic is that a continuous 
function f'on K such thatf = f'oọ must 
exist. If this condition is satisfied, then the 
mean M [f] is identical to fk f'(x) dx. For 
any finite-dimensional continuous unitary 
representation D' of K, D = D’o is a finite- 
dimensional continuous unitary represen- 
tation of G, and vice versa. Hence there exists 
a canonical isomorphism (determined by D = 
D’o ọ) between the equivalence classes of 
finite-dimensional unitary representations 

of a separated topological group G and the 
equivalence classes of finite-dimensional uni- 
tary representations of its Bohr compactifica- 
tion K. The fkernel of the canonical mapping 
0: GK is identical to the intersection of all 
the kernels of finite-dimensional continuous 
unitary representations of G. 


1. Maximally Almost Periodic Croups 


Let G be a topological group. If for each pair 
a, b of distinct elements of G there exists a 
continuous almost periodic function f on G 
such that f(a) #f(b), then G is called a maxi- 
mally almost periodic group. This is the case 
if and only if G has sufliciently many finite- 
dimensional unitary representations. For a 
connected locally compact group G, the fol- 
lowing six conditions are equivalent: (1) G is a 
maximally almost periodic group. (2) There is 
a one-to-one continuous homomorphism from 
G into a compact group. (3) G is the direct 
product of a compact group and a vector 
group R”. (4) G is the tprojective limit of +Lie 
groups that are locally isomorphic to compact 
groups. (5) The quotient group G/Z is com- 
pact, where Z is the center of G. (6) The system 
of all neighborhoods that are invariant under 
the tinner automorphisms constitutes a basis 
for the neighborhood system of the unit [7]. 

Moreover, any discrete free group is maxi- 
mally almost periodic. If there is no continu- 
OUS almost periodic function except constant 
functions, the topological group is called 
minimally almost periodic. Any noncompact 
connected tsimple Lie group is minimally 
almost periodic. 
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A. History 


The term analog computation isa generic term 
describing various techniques of computa- 
tion employing diagrams, or physical systems 
whose equations are similar to the mathemat- 
ical problems in question. The history of ana- 
log computation is probably as old as that of 
digital computation; for example, the ancient 
Greeks tried to solve cubic equations using 
diagrams, and the astrolabe widely used by 
astronomers through the medieval period is 
also a kind of analog computer. Soon after 
the discovery of logarithms, the slide rule was 
invented. In the 18th Century, the planimeter, 
used to measure plane areas, was introduced, 
and in the 19th Century, the nomogram ap- 
peared (- Section D). In the first half of 20th 
Century, a large electronic analog computer 
was developed, thus predating the lirst prac- 
tical digital computer. 

However, analog computation has an essen- 
tial defect, namely, the limitation of accuracy. 
Today it is useful for simple calculation, but is 
rapidly becoming less important as the devel- 
opment of digital computers, including pocket 
calculators, advances. 


B. Graphical Calculation 

Graphical calculation is a method of compu- 
tation by means of geometric constructions 
using common drawing tools. Some typical 
examples of practical graphical calculation are 
the following: evaluation of linear functions of 
several variables (J. Massau, 1887), of systems 
of linear equations (F. J. van den Berg, 1888), 
of polynomials (J. A. Segner, 1761), of alge- 
braic equations (Lill, 1867), graphical integra- 
tion, graphical differentiation, and the solu- 
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tion of ordinary differential equations of the 
first order (J. Massau, 1878), of linear differ- 
ential equations (Czuber), and of ordinary 
differential equations of the second order 
(Lord Kelvin, 1892). 

Using bisquare-root graph paper, sold 
under the names of binomial probability 
paper or stochastic paper; we can handle tF- 
distributions or other probability distributions 
reducible to F-distributions, such as binomial 
or normal distributions, to a fairly good de- 
gree of approximation. This grapbical method 
of statistical inference is, even now, a powerful 
method of statistical quality control. 


C. Graphical Mechanics 


Graphical mechanics is the graphical treatment 
of mechanical problems, especially problems of 
equilibrium. In this method, the fundamental 
constructions are the composition and reso- 
lution of forces by means of force polygons 
(Fig. 1). This method is also applicable to 
problems in dynamics when they can be re- 
duced to problems of equilibrium by means of 
d’Alembert’s principle. 


Fig. 1 

An example of the graphical method of constructing 
the composite and the line of action of three given 
forces F,, F,, and F,. In the force polygon in (b), the 
vector 03 represents the composite R. On the link 
polygon in (a), the point d is a point on the line of 
action of the force. If the fourth force F, has the 
same magnitude as R with opposite direction, both 
polygons are closed, and the four forces F,, F,, F3, 
and F, are in equilibrium. 


Graphical mechanics is most convenient 
when applied to problems reducible to 2- 
dimensional structural mechanics. But we can 
also use jt for 3-dimensional problems if we 
work on projections, e.g., on the plan and the 
elevation, of the original body. In recent years, 
however, the amount of work involved in 
geometric construction has been deemed non- 
negligible, and the technique of computer 


19 D 


Analog Computation 


graphics is now considered to be much more 
convenient. Here the computation is done by a 
digital computer, and the result is displayed in 
a convenient graphical form. 


D. Nomograms 


Nomograms are charts in which we can easily 
read off the corresponding value u, from given 
values Uis> Upi when there is a relation 
F(u,,...,U,) = 0 among n real variables u; , 
....Uu,. The construction of nomograms has 
been thoroughly investigated by M. d’Ocagne 
[5]. 

A function of two variables F(u, ,u2)= 0 can 
be represented by a two-sided scale or by func- 
tional paper. The most useful nomograms are 
those for functions of three variables, F(u}, 
Uy, u3) = 0. Changing this into f(x,y, u; =0 
(i = 1,2, 3) and drawing the curves f.(x, y, u;) = 0 
for fixed values of u;, we have an intersection 
chart, where three curves for the correspond- 
ing values u4, uz, u3 Meet at a point. In prac- 
tice, however, it is much more convenient to 
use the alignment chart, which is a dual of the 
intersection chart. It is especially useful when 
the relation F(u,, u,,u3)=0 canbe decom- 
posed into an equation of the form 


filu) gilu) hilu) 
falu) 922) haluz) =0. (1) 
faluz) g3(us) h3(us) 


Putting x; = f;/h;, yi = 9,/h; (i = 1,2, 3) and 
drawing the curves (x;, y;) with the parameter 
u; scaled on it (u,-sealed), the corresponding 
values 4,, u2, and u, satisfying relation (1) lie 
on a straight line (Fig. 2). Using this property 
we can easily read off the corresponding values 
(by laying down a ruler). In the strict sense, the 
term “nomogram” usually refers specifically to 
an alignment chart. 


uj uo uy 


Fig. 2 


For four or more variables, we can apply 
similar techniques if the function is separable 
into functions of three variables. If it is not 
separable, we can sometimes apply such tech- 
niques such as two-functional scales with 
functional networks, cocircular charts, co- 
planar charts, or moving charts. Using func- 
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tional approximations, an approximate nom- 
ogram has recently been constructed with 
errors considered to be admissible in practical 
use. 

Historically, the origin of Hilbert’s 13th 
problem, which asks if it is possible to rep- 
resent a function of many variables as the 
composite of functions of two variables, 
cornes from a study of nomograms with many 
variables. 


E. Analog Computers 


The analog computer is a special purpose 
machine designed for a specified analog com- 
putation. In a wider sense it includes special 
devices that perform tanalog simulation. The 
following mechanical devices are well known: 
the pantograph for copying plans, the harmonic 
analyzer for obtaining the Fourier expansion 
of a periodic function, and V. Bush’s differen- 
tial analyzer. Since about 1940 large electronic 
analog computers for solving differential equa- 
tions have been extensively developed. 

Analog computers have up to recently had 
the advantages of ease of construction, sim- 
plicity, and inexpensive operation, and they 
were also considered to be fast enough for use 
in real-time computation. To compensate for 
the limitations of analog computers in com- 
parison to digital ones, several hyhrid com- 
puters have also been used. However, with 
the rapid progress of digital computers and 
digital-analog converters, the analog com- 
puters are now considered less important than 
in the past. 


F. Curve Fitting 


Curve fitting is a method of finding a sim- 

ple curve y = f(x) supplying the best possible 
approximation to the values y,, y,, for 
discrete values X;, x,, of the independent 
variable. A polynomial passing through all the 
given points is constructed by means of tinter- 
polation. For experimental data we usually 
construct a curve by using methods, such as 
the tmethod of least squares, that take the 
errors due to observation into account. 

A function constructed to best fit the ob- 
served values of the function y = f(x) express- 
ing some physical law is called an empirical 
formula, in contrast to the theoretical formula. 
Usually, we first assume that the function 
contains several empirical constants, which are 
then determined to fit the experimental data. 

The most common case is a linear approx- 
imation (using a linear function) with a gsuit- 
able change of variables. Semilogarithmic 
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paper, logarithmic paper, or probability paper 
are used to facilitate transformations in terms 
of logarithms or normal distributions. 

We also frequently use polynomials or trig- 
onometric polynomials of lower degree. To 
determine the empirical constants the follow- 
ing methods are used: the graphing method, 
the selection of certain points, the mean value 
method, or the method of least squares. 


G. Orthogonal Polynomials 


Let there be given the values yọ, Y1; Y,—; for 
discrete values x = 0, 1, . , n— 1 with equal 
differences. A polynomial f(x) of degree k (<n) 
that minimizes the sum of the squares G,(n) = 


X(¥m—S(m))? is given by 


v=0 
n-1 
a,= 2, Vn y(n, m)/S,(n), 
n-1 
S (n)= 2 m))’, 


and we have G,(n)= Et yz — Lk ae S,(n). 
Here the function q,(n, x), called a Chebyshev 
q-function, is defined by 


amor SIDE) 


In practical applications, it is better to replace 
q,(n, x) by the function 
gi (n, x)=q,(n, x)/2~’v!M,(n), 


where M,(n) is the greatest common divisor of 


v+tm\/fn—-l—m 
( )( ) (n=, 1,- 9) 
m y- m 


The values q*(n,m)(m=0, 1,...,n—1) are mu- 
tually coprime integers. The function q*(n, m) 
is called the simplest Chebyshev q-function or 
simplest orthogonal polynomial, and is some- 
times denoted by X,,,(x), Cu) or Pn, m(X). 
When two functions satisfy the condition 


(fae= $ flomg(m) =0, 


they are called orthogonal for a finite sum. If 


from 1,x,x7,..., x” we construct a system 


orthogonal with respect to this definition, then 
we have the polynomial 


reas È AAOC 


which has the following connection with q,: 


q,(n, x) 
=((—1)"(n— 1)!/2”(n—v— DP, 1). 
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The polynomial P, „(x) is called a Chebyshev 
orthogonal polynomial or sometimes simply an 
orthogonal polynomial. The polynomials P, ,(x) 
are orthogonal for the finite sum. If n- œ and 
0 <x< 1, the function P, ,(x) tends to P,(1— 
2x), where P, is the ‘Legendre polynomial. 
These functions may be conveniently used in 
least-squares curve fitting. 
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The origin of analysis can be traced back to 
the time when Eudoxus (4th century B.c.) and 
Archimedes (3rd century B.c.) devised the so- 
called method of exhaustion for calculating 
the area of a plane figure and the volume of a 
solid. Their objects of investigation were re- 
stricted, however, to particular types of figures 
or solids. In the 16th and 17th centuries, F. 
Viète, J. Kepler, and B. Cavalieri again took 
up this problem. In the 17th Century, the prob- 
lem of drawing a tangent to a given curve 

was studied by R. Descartes, P. de Fermat, 

B. Pascal, and J. Wallis. Fermat, in particular, 
applied the result to find the maxima and 
minima of certain functions. It is worth noting 
that a certain type of mathematics powerful 
enough to produce similar results was inde- 
pendently developed in Japan around that 
time. In 1684, G. Leibniz in Germany intro- 
duced the symbols dx and dy in treating the 
same problem. He proved that dy/dx repre- 
sents the slope of the tangent to the curve at 

a given point and discovered a new operation 
to calculate it. In 1686, he established the “‘in- 
verse tangent method,” which is what we now 
call integral calculus. He also introduced the 
notation f. On the other hand, I. Newton in 
England developed his “method of fluxions,” 
corresponding to our differential and integral 
calculus, from the viewpoint of mechanics. But 
neither Leibniz nor Newton formulated the 
fundamental concepts rigorously, and there- 
fore they were criticized severely by many 
contemporary scholars. The new calculus 
gained ground in Great Britain rather slowly. 
B. Taylor in England and C. MacLaurin in 
Scotland demonstrated its usefulness in 1715 
and in 1745, respectively. On the continent, 
however, Leibniz’s symbolic calculus was 
taken up by mathematicians of the Bernoulli 
family, G. F. A. de l'Hôpital, G. Fagnano, and 
many others; with it, they solved many scien- 
tific problems which until then had remained 
intractable. This motivated subsequent re- 
searchers to pose new problems in the form of 
differential equations. 

One of these problems, treated by J. le 
Rond ď’ Alembert in relation to the vibration 
of a chord, concerns the tpartial differential 
equation 


0? y/ðt =a? 8? y/ðx? (1) 


for y = y(t, x) with the boundary conditions 
y = 0 for x = 0 and x = Į, He obtained the solu- 
tion y= f(at+x)—f(at—x), where f is an 
arbitrary function of period 21. In 1753, D. 
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Bernoulli showed that solutions of equation 
(1) are given by functions of the form 


a X knx . krx 
yoy X (acos tasin T") 
KZI 


These two kinds of solutions gave rise to the 
question of whether an arbitrary function can 
be expressed by a ttrigonometric series. This 
problem was studied by A. C. Clairaut, J. L. 
Lagrange, and L. Euler. In 1807, J. Fourier in 
France, in treating a problem on the conduc- 
tion of heat, claimed that an arbitrary function 
of period 2r can be expressed as 


a4 <= : 
v=. + ` (a,coskx +b, sinkx), (2) 
k=l 
where the coefficients a, and þ, are given by 


1 T 
a= f f(x)cos kx dx, 
RJ-1 


TA | f(x)sin kx dx. (3) 
m =R 

This series is now called the Fourier series, 

but Fourier never verilied the fact that the 

series (2) with coefficients (3) converges and 

represents f(x). It was only as late as 1820 that 

A. L. Cauchy in France first noted that to 

treat a series properly, one must examine its 

convergence. 

In the 19th Century, the concept of ffunc- 
tions, which had been taken in the sense of 
“analytic expressions,” came to be detined by 
the correspondence relation. Cauchy clarified 
the ideas of ‘limit and fcontinuity, tdifferentia- 
bility and ‘integrability. He showed that a 
function that is continuous in a bounded 
closed interval is integrable in that interval. 
But his proof was not rigorous, as he lacked 
the notion of tuniform continuity. In his 1854 
paper on the trigonometric series, B. Riemann 
in Germany considered the integrability of 
functions that might be discontinuous and 
introduced the concept of what we now call 
the *Riemann integral. 

The theory of tsets, initiated by G. Cantor in 
Germany in his paper of 1874, revolutionized 
analysis. R. Baire, E. Borel, and H. Lebesgue 
in France contributed to the establishment of 
analysis on the basis of set theory. Baire made 
a classification of discontinuous functions. 
Generalizing his results, Lebesgue gave a 
definition of analytic expressions, thus clarify- 
ing the term that had been used vaguely since 
the time of Euler. Lebesgue also tried to define 
the concepts of the integral of a function, the 
length of a curve, and the area of a surface 
from the most general viewpoint. In generaliz- 
ing the notion of *tmeasure introduced by 
Borel, he established in 1902 the theory of 
*Lebesgue measure with which he laid the 
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foundations of the theory of Lebesgue inte- 
grals. The introduction of this theory gave 

to the theory of Fourier series a new turn in 
the direction of functional analysis. Measure 
theory was also employed by A. N. Kolmo- 
gorov of the Soviet Union to lay a solid foun- 
dation for tprobability theory in 1933. 

The study of functions of a complex variable 
was originated by Cauchy in the first half of 
the 19th Century. He began his research by 
introducing the notion of “monogenic func- 
tions”; a function which is monogenic at every 
point of a domain is what we now call a tholo- 
morphic function. He established tCauchy’s 
integral theorem and integral formulas for 
these functions, and deduced from these theo- 
rems the tresidue theorem for functions with 
poles. Making use of the integral formula, 
Cauchy proved that a function that is 
holomorphic at a point a can be expanded in 
a power series of the form Xo &(z =a)” in a 
neighborhood of this point. 

Riemann considered a complex variable w 
as a function of another complex variable z 
when dw/dz is independent of the value of 
the differential dz. This amounts to the same 
thing as a “monogenic function” of Cauchy. 
Riemann’s mapping theorem became a model 
for subsequent developments. Riemann intro- 
duced the concept of tRiemann surfaces in 
order to tuniformize multivalued functions. 
This important idea was basic to the progress 
of analysis and ttopology in the 20th Century. 

K. Weierstrass, who was a contemporary of 
Riemann, developed the theory of functions 
of a complex variable from a purely analytic 
viewpoint. He delined an element of a function 
to be a power series }i/2.9 a,(z a) of z -a, 
representing a holomorphic function in the 
interior of its fcircle of convergence, and de- 
fined an analytic function to be an aggregate 
of such elements that are derived from one of 
them by means of tanalytic continuations, 
along all curves having the point a as the 
initial point. 

Riemann and Weierstrass constructed their 
theory to complete the theory of telliptic func- 
tions and tAbelian functions that was initiated 
by N. H. Abel and C. G. J. Jacobi. Their re- 
sults were a high point of 19th-century class- 
ical mathematics. H. Poincaré in France built 
upon their work. Another high point in the 
theory of functions of a complex variable was 
reached when J. Hadamard and C. de La Vallée 
Poussin in France made use of it to prove the 
tprime number theorem in 1896. 

Weierstrass also initiated the study of func- 
tions of several complex variables and was 
succeeded by Poincaré, P. Cousin, and E. 
Picard in France. They tried to extend the 
theory of functions of one complex variable to 
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that of many variables. F. Hartogs, however, 
discovered a phenomenon quite different from 
the case of one variable. Cousin posed in 1895 
the problem of constructing a function of 
several complex variables that has assigned 
poles, This problem was pursued by A. Weil in 
France and solved by K. Oka in Japan in 
1936. Also, the problem of characterizing 
tdomains of holomorphy was investigated by 
E. E. Levi in Italy, H. Cartan in France, and 
P. Thullen and K. Stein in Germany and was 
finally solved by Oka in 1953. H. Cartan and 
J.-P. Serre reformulated these results in terms 
of tcohomology with coefficients in sheaves. 
This considerably influenced the formulations 
of mathematics thereafter. These results were 
further generalized to tanalytic space by H. 
Grauert, R. Remmert, and Stein in Germany. 
The theory of tcomplex manifolds, which are 
generalizations of Riemann surfaces to several 
variables, was initiated by W. V. D. Hodge in 
England and K. Kodaira in Japan and con- 
tinued by F. Hirzebruch in Germany, M. F. 
Atiyah in England, and I. M. Singer in the 
United States. 

Differential calculus gives a general method 
of finding extreme values of a given function. 
Likewise, in order to find a function that pro- 
duces an extremal of the given ffunctional, the 
calculus of variations was created in the 18th 
Century. For example, Euler considered the 
problem of finding a particular function y(x) 
that renders the functional {? F(x, y, y’) dx 
an extreme value among all those functions 
y(X) for which the plane curve y = y(x) passes 
through two given points (a, A) and (b, B) of the 
plane; he showed that this y(x) must satisfy 
the differential equation dF, /dx = F, = 0 (1744). 
Lagrange, W. R. Hamilton, and others devel- 
oped this result into a general tvariational 
principle that governs not only classical tme- 
chanics but also tquantum mechanics. 

From research on the continuity or differen- 
tiability of the functional with respect to y 
emerged the idea of considering a function as a 
“point” in a ffunction space. This gave rise to 
‘functional analysis, a branch of analysis that 
treats functions as elements of certain spaces 
and utilizes the methods of algebra and topol- 
ogy. The first result in this regard was the 
theory of fintegral equations of V. Volterra in 
Italy and E. I. Fredholm in Sweden in the 
beginning of the 20th Century. Fredholm’s 
work was motivated by his desire to solve the 
tDirichlet problem, the solution of which had 
been used by Riemann in the proof of his 
mapping theorem, etc. However, Riemann’s 
own proof of existence of the solution, called 
the tDirichlet principle, was not rigorous, and 
attempts to save the proof provided one of the 
central problems of analysis for some time. 
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Fredholm’s solutions were different from Rie- 
mann% but D. Hilbert in Germany was able 
to justify Riemann’s original proof. Hilbert’s 
proof was later simplitied and generalized by 
R. Courant, H. Weyl, and others. Hilbert also 
introduced the ‘function spaces /, and L, to 
study the teigenvalue problem of integral 
equations with tsymmetric kernels. Later J. 
von Neumann in Germany established tspec- 
tral theory in abstract tHilbert spaces and 
applied it to the mathematical foundations 
of quantum mechanics (1929). S. Banach in 
Poland created the theory of flinear operators 
in Banach spaces (1932). This theory was 
further generalized to that of linear topolog- 
ical spaces and was applied to the theory of 
distributions. 

In their study of partial differential equa- 
tions and Fourier analysis, Hadamard, J. 
Leray, S. L. Sobolev, T. Carleman, and many 
others had to extend the notion of functions; 
they also enlarged the notion of derivatives. L. 
Schwartz in France introduced tdistributions 
and defined derivatives in the sense of distribu- 
tions to unify these generalizations (1945). M. 
Sato in Japan defined more general generalized 
functions, called thyperfunctions (1958). It has 
become evident that both distributions and 
hyperfunctions have provided the most power- 
ful tools in recent research in the general 
theory of tpartial differential equations, to 
which L. Hormander in Sweden has made 
outstanding contributions. 
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Complex Variables 


A. Holomorphic Functions 


As in the case of tholomorphic functions of 
One complex variable, the definition of holo- 
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morphic functions can be given in two ways: 
the first definition utihzes differentiability, 
following the approach of B. Riemann; and 
the second method utilizes the notion of power 
series expansion as developed by K. Weier- 
strass. In this article, N = {0, 1,2,.}. 


B. Power Series 


Let z be an n-tuple of complex variables 

Ly 520s Zqs and C=(C),.++5C,) a point of C”. 

An infinite sum P of monomials a(z c} = 
k, akglZ1 C1)" += (Zp — Cn)” (= (Ky, Kn 
N”), where a, € C, is called a power series 

with center c and coefficients a,. If, for a bi- 
jection g of N onto N”, the simple series 

È pen lapp hZ = c)”?)\s convergent at z = 2°, 
we say that P is absolutely convergent at 2°. 
Its sum at z° denoted by © a,(z° — c}, is de- 
fined as the sum È app (Z° = c)®’, which is 
independent of the choice of ¢. If P is uni- 
formly bounded at z°, then P is absolutely 
convergent at every point of the open polydisk 
S={z||z;—¢|<|z? —cj|, j=1,...,n}. Further- 
more, in this case P converges absolutely and 
uniformly on every compact set in § (N. H. 
Abel). 

The convergence domain of a power series P 
is the set D of points z° such that P is ab- 
solutely convergent at every point in a neigh- 
borhood of z°, The interior of the set B of 
points at which the infinite sum P is uniformly 
bounded is equal to D. A (complete) Reinhardt 
domain with center c is a domain D in C” such 
that whenever D contains Zo, the domain D 
also contains the torus {z |z;—¢]=|z? cl j= 
1, ,n} (the closed polydisk {z| |z;—c;| <|z?— 
c|,j=1,., n}). If the convergence domain D 
of the power series P is not empty, it is a com- 
plete Reinhardt domain and is also logarithmi- 
cally convex; that is, the set D— U{zlz=¢} is 
mapped onto a convex domain in R” by the 
mapping 2;—>log|z;—c;| j= 1, . ..,n). The set D 
of points at which P is absolutely convergent 
is, in general, greater than D, and it is possible 
that Ď contains exterior points of D. A thorn of 
D is the set of exterior points of D contained in 
D that are located on the planes {z z= c} g= 
1,. , n). An n-tuple r e R", is called a set of 
associated convergence radii if P is absolutely 
convergent at every point of {z| |Z;— c] < 
r,j=1,...,n}but not of {z| |z;-¢|>7,j= 
1,, n}. An n-tuple of associated conver- 
gence radii may not be uniquely determined, 
but it satisties 


lim sup (|a,|r*)!/*! = 1, 
|k|> +00 


(E. Lemaire). 
Let f be a complex-valued function defined 
in a neighborhood of z° € C”. If there exists a 


convergent power series P with center z° such 
that at every point of a neighborhood of z° the 
value off’ and the sum of P coincide, then f is 
called analytic at z° in the sense of Weierstrass, 
and P is the Taylor expansion of f at z°. 


C. Differentiability 


Let f be a complex-valued function detïned in 
a neighborhood of z° e C”. If in a neighbor- 
hood of z? we have 


f(2—-f(2)= 44 lz z)... Halaz) +8, 


with &;,. ..,&„ EC and 
lim e/(\z, — zil +... +|Zn— Za |)=0, (1) 
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then we say that f is (totally) differentiable at 
z°. The function f is then continuous at z°, 
and the partial derivatives éf/éz; (j = 1, , n) 
exist. Furthermore, the Cauchy-Riemann 
differential equations of/6z;=0 (j= 1, , n) 
hold, where ¢f/0z;=(1/2)(0f/0x;—iéf/éy,) and 
f/0z,=(1/2)(f/dx, + idf/ay,) with z,= x, + iy, 
We say that f is holomorphic at z° in the sense 
of Riemann if f is differentiable at every point 
in a neighborhood of z°, Analyticity in the 
sense of Weierstrass is equivalent to holomor- 
phy in the sense of Riemann. Furthermore, if 
the partial derivatives Of/éz, j =1,..., n) exist 
at every point in a neighborhood of 2° then f 
is, without assuming continuity, proved to be 
holomorphic. Thus the holomorphy off in 
each variable z, implies the holomorphy off in 
z=(z,,--)z,) (Hartogs’s theorem of holomor- 
phy, 1906). 

A complex-valued function in a domain G c 
C” is called holomorphic or analytic in G if it 
is holomorphic at every point of G. Let H(G) 
be the ring of holomorphic functions in G. For 
f =u +ive N(G), u and p satisfy in G the dif- 
ferential equations 67 u(z, 2)/02;0%, = 0; that is, 


07 u ĝu 
a =0, 2 
Ox;0X,  OV;OY, (2) 
Oru Aru 


=0, j, k=1,...,n. 
Ox;0y, Ox,0V; ae 
A tdistribution TED’(G) is called plurihar- 
monic in G if it satisfies (2) in G. Then T is 
harmonic and hence a real analytic function 
by *Weyl’s lemma. 

Let G; be a domain in the z-plane with 
piecewise smooth boundary C;. If fe H(G) (G 
= ITj-, G;) is continuous on G, then 


j= 
2s i 10) 
Bni Je, x.x G-a) (29) 


f(2),z€G, 


Kanad] 0.246 (3) 
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(Cauchy’s integral representation). Thus if n > 2, 
then f is determined by its values only on the 
proper subset C = Ci x x C, of @G, which is 
called the skeleton (or determining set) of G. 
For (pluriharmonic) functions of several com- 
plex variables, the boundary value problem, 
not necessarily solvable in its classical form, is 
not so effective as the Dirichlet problem in one 
complex variable. 

As in the case of one variable, the *Laurent 
expansion is valid for every holomorphic 
function in a domain of the form G = [Jj-,G,, 
where the G, are circular annuli œc C. Suppose 
that we are given f€ H(G,) and f, e H(G,), 
where G,, G, are domains in C” such that G N 
G, is nonempty and connected. If f, = f, on 
{z|}z,-z?|<r,y=y,j=1,...,n}, where z°= 
x? + iy? € Gi NG, then there exists a unique 
fe H(G,U G,) such that f|G, =f, and f|G,= f 
(theorem of identity). Thus tanalytic continu- 
ation proceeds as in the case of one variable. 
Similarly, some fundamental theorems in one 
variable, such as *Liouville’s theorem on ten- 
tire functions and the ‘maximum principle, 
hold also for several variables. However, there 
are some properties that reveal the differences 
between the cases of one and several variables. 
For instance, the set of zeros of a holomorphic 
function (— 23 Analytic Spaces B) has no 
isolated point for N > 2. The investigation of 
these remarkable differences is one of the 
purposes of the theory of analytic functions of 
several complex variables. 


D. Shilov Boundaries 


While the maximum principle holds for a 
holomorphic function in a domain G, the set of 
points where the maximum is attained may be 
a proper subset § of 6G. For instance, if G is 
the product of annuli as before, then the skele- 
ton of G can be taken as S. In connection with 
the theory of tnormed rings, G, E. Shilov 
proved that there exists a unique smallest 
member §, (called the Shilov boundary of G) 

in the family of closed subsets S such that 
sup{|f(2)||z€8} =sup{|f(2)||z€G} for every 
fe H(G) continuous on G. The structure of 

Sp is investigated in detail together with 

the pseudoconvexity of G connected with it. 
Applying tPerron’s method for the Dirichlet 
problem to *plurisubharmonic functions and 
the Shilov boundary, H. J. Bremermann solved 
one type of boundary problem. 


E. Local Theory 


Let f and g be two functions defined in neigh- 
borhoods of a set Sc C”, If f= g ina neigh- 
borhood of S, then f and g are called equiva- 
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lent with respect to S. The germ off on S, 
denoted by fs, is the equivalence class off: A 
germ of a bolomorpbic function on S is the 
germ on S of a holomorphic function defined 
in a neighborhood of S, and H(S) denotes the 
ring of germs of holomorphic functions on S. 
Given a point 0 in C”, H(O) = H( {0}) is iso- 
morphic to the ring H, of convergent power 
series at 0, i.¢., the power series that are ab- 
solutely convergent in some neighborhoods of 
0. For every nonzero function fe H(O), there 
exists a system of coordinates {z}, , Z,,) cen- 
tered at O such that f(0, , 0, z,) # 0 for every 
Z, # 0 in a neighborhood of z, = 0. In a neigh- 
borhood of 0, f is then equal to the product of 
an invertible element of H, and a distinguished 
pseudopolynomial 


P(z,)=22 +.4,(Z;, Laem! Z,-1)22 | + 


’ Z,-1)¢ A,,- [z,], 


with a,(0,...,0)=...=a,(0, ...,0)=0, and P(z,) 
is uniquely determined by f and the coordi- 
nates z,,, z, (Weierstrass’s preparation 
theorem). It follows from this that H, is an n- 
dimensional tregular local ring. Considering 
H(0) as the tinductive limit of flocally convex 
rings H(U), where U ranges over a base for a 
neighborhood system of 0, H. Cartan proved 
the preparation theorem in a more precise 
form in which the association f a, is con- 
tinuous with respect to the supremum norm. 
Based on a deep consideration of this situ- 
ation, K. Oka proved a theorem of funda- 
mental importance: The fsheaf (cn defined by 
(en, = H(Z) (ze C”) is tcoherent. 


a AZ a se 


F. Domains of Holomorphy 


Given a domain G c G” for n> 2 it may be 
that there exists a domain G’ strictly greater 
than G such that all the functions that are 
holomorphic in G extend to holomorphic 
functions in G’. For instance, let S = S’ x a, 
where S’ and g are open polydisks in (z,,. , 
Z,-1)-Space and z,-space, respectively, and 

let T c C” be an open set. If there exists an 
open set U (#@)S' such that (U x a) U(S’ x 
6a) CT and if S N T is connected, then all the 
functions that are holomorphic in T extend 
uniquely to holomorphic functions in SU T 
(Hartogs’s continuation theorem). In particular, 
if A is an tanalytic set in a domain G c C” with 
dim A <n 2, then all the functions that are 
holomorphic in G-A extend uniquely to 
holomorphic functions in G. Furthermore, if A 
is an analytic set in G with A 4G, then every 
fe H(G—A) that is locally bounded at the 
points of A extends uniquely to a holomorphic 
function in G (Riemann% continuation theorem 
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for n> 2). The domain G; of holomorphy for f 
is defined to be the maximal domain to which 
f may be continued analytically. A domain G 
is called a domain of holomorphy if G = G, for 
some fe H(G). However, G, is, in general, not 
a subdomain of C". G, is, generally, a mani- 
fold spread over C”; i.e., G, is a connected n- 
dimensional *complex analytic manifold with a 
holomorphic mapping ¢: G,>C" of maximum 
Jacobian rank (g is then an open mapping). 
The same is true for the common existence 
domain of functions in a subfamily of H(G). 
The common existence domain G of all the 
functions in H(G) is called the envelope of 
holomorphy. A  holomorphically complete 
domain is a domain G such that G = G, These 
notions carry over to the case where G is a 
manifold spread over C”. The (general) Levi 
problem of determining the conditions for a 
given domain to be holomorphically complete 
is fundamental to the theory of analytic func- 
tions of several complex variables (— Section 
I). In connection with this problem, various 
notions of pseudoconvexity of holomorphi- 
cally complete domains are defined. 


G. Pseudoconvexity 


An upper semicontinuous real-valued function 
u(—o <u< +00) in a domain GCC" is said 
to be plurisubharmonic if for every z° e G and 
every a ç Ç” the function u(z° + ta) of t is tsub- 
harmonic (including the constant —oo) in all 
the connected components of {t z29+taeG}. A 
domain G is said to be pseudoconvex (or d- 
pseudoconvex) if u = = logd, is plurisubhar- 
monic in G, where d,(Z) is the distance from 

Zz €G to 0G with respect to any norm in C”. 
Every connected component of the interior of 
the intersection of a family of pseudoconvex 
domains is pseudoconvex, and the union of an 
increasing sequence of pseudoconvex domains 
is pseudoconvex. Suppose that we are given a 
domain G and a function y of class C? in a 
neighborhood of G such that G = {z|u(z) < 0} 
and, for some ¢>0, X ,(67u/dz,62,)a,a, >| al? 
for every ae C”. Then the domain G is said to 
be strongly pseudoconvex. Strong pseudo- 
convexity implies pseudoconvexity. Every 
pseudoconvex domain is exhausted by an 
increasing sequence of strongly pseudoconvex 
domains. An open set Pe C” is called an ana- 
lytic polyhedron if P= {z| |y,(z)| < l,a= 

1,, N}, x,€ A(P) (a= 1, ... N). Then every 
connected component of P is pseudoconvex. A 
Weil domain is a connected and bounded 
analytic polyhedron P defined by y,(a = 

1, , N) with N È n, such that for every k 

(1< k<n) the intersection of the hypersurfaces 
|%a,(2)) = 1 (1 <i<k) is of dimension <2n—k. 
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H. Holomorphic Convexity 


A domainG c C" is called holomorphically 
convex if for every compact set K cG, K = 
Nyemert? IE) <supyex |f(o)]} (the holomor- 
phic hull of K) is a compact set contained in 
G. (For a domain G contained in an tanalytic 
set we Can similarly define holomorphic con- 
vexity of G.) Every connected component of 
the intersection of a family of holomorphically 
convex domains is holomorphically convex, 
and a holomorphically convex domain is 
exhausted by an increasing sequence of Weil 
domains. Holomorphic completeness implies 
holomorphic convexity. The converse is true 
for domains in C”, If G is holomorphically 
convex, then for every point { of 0G there 
exists an fe H(G) such that f is not locally 


bounded at ¢ (H. Cartan and P. Thullen, 1932). 


Hence a holomorphically convex domain is a 
domain of holomorphy. Thus a domain in C 
is holomorphically convex if and only if it is a 
domain of holomorphy. (The same is true for 
unramified covering domains over C" (Oka, 
1953).) The union of an increasing sequence of 
domains of holomorphy is a domain of holo- 
morphy (H. Behnke and K. Stein [2], 1939). 
Suppose that we are given a domain G and 
domains S,, T, (a = 1,2,) such that §, U 7” c 
G and supr, |f]=sups.u7,[f| for every fe H(G). 
Suppose also that $} = lim §, is bounded. We 
Say that the continuity principle holds in G if 
Thc G (Tp = lim T,) implies SoG G. The con- 
tinuity principle holds in a domain of holo- 
morphy (Hartogs’s theorem of continuity). 
This implies that if G is a bounded domain 

C C” (n > 2) with connected boundary 0G, 
then every function holomorphic in a neigh- 
borhood of @G extends to a holomorphic 
function in G (Hartogs-Osgood theorem). In 
particular, for n > 2, the set of singular points 
of a holomorphic function has no isolated 


point. A domain is pseudoconvex if the con- 
tinuity principle holds there. Hence a domain 
of holomorphy is pseudoconvex. 


I, The Levi Problem 


Let G be a domain in C” and z°€ dG. If there 
exists an open neighborhood U of z? such that 
every connected component of G N Ų isa 
domain of holomorphy, then G is called Car- 
tan pseudoconvex at z°. On the other hand, if 
every 1-dimensional analytic set that has z? as 
an ordinary point contains points not belong- 
ing to G U {z°} in the neighborhoods of z°, 
then G is called Levi pseudoconvex at 7° Fyr- 
thermore, G is called locally Cartan (Levi) 
pseudoconvex if G is Cartan (Levi) pseudo- 
convex at every point of 0G. Every domain of 
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holomorphy is locally Cartan pseudoconvex. If 
G is pseudoconvex and there exists a neighbor- 
hood U of z° such that GN U ={z| p(z) <0}, 
where ọ € C!( U), then G is Levi pseudoconvex 
ate”, 

The (proper) Levi problem of whether every 
pseudoconvex domain is a domain of holo- 
morphy was proposed by E. E. Levi (1911). 
After unsuccessful efforts by various mathe- 
maticians to solve the problem, it was affirma- 
tively solved by Oka (1942 for n = 2 and 1953 
for manifolds spread over C” for n > 2), H. 
Bremermann, and F. Norguet. The problem 
was solved also by H. Grauert [ 14] (1958) in a 
more general form (- Section L) using results 
on linear topological spaces and by L. Hor- 
mander [ 1 8] (1965) using methods of the 
theory of partial differential equations. A 
fundamental step in Oka’s solution is his 
gluing theorem: Let G be a bounded domain 
c C”, If every connected component of G, = 
{z|x,>a}NG and G,={z|x, <b}NG (a<b) 
is a domain of holomorphy, then G is a do- 
main of holomorphy. Indeed, by virtue of 
the Behnke-Stein theorem and the fact that 
every pseudoconvex domain is the union of an 
increasing sequence of bounded locally Cartan 
pseudoconvex domains, it suffices to solve the 
Levi problem in the case of a bounded locally 
Cartan pseudoconvex domain. The Levi prob- 
lem in this case is solved by the gluing theo- 
rem. Various integral representations of holo- 
morphic functions are known besides the 
Cauchy representation. The Bergmann-Weil 
integral representation in a Weil domain was 
used as an important means of solving the 
Levi problem. 


J. Holomorphic Mappings 


Holomorphic functions with values in a tquasi- 
complete tlocally convex complex vector space 
E have also been investigated. The classical 
theory described above has been generalized, 
to some extent, to this case. In this way, many 
applications of the theory have been dis- 
covered. An E-valued function in a domain G 
c C” is holomorphic if and only if the map- 
ping uof: GC is holomorphic for every çon- 
tinuous linear form yon E. By this theorem, 
most problems concerning E-valued holo- 
morphic functions can be reduced to those of 
ordinary holomorphic functions. Note that the 
vector space H(G) of ordinary holomorphic 
functions in G is a *Fréchet space. The spaces 
C” and complex *Banach spaces belong to the 
above category of E. A. C?-valued holomor- 
phic function in a domain G c C” is called a 
holomorphic (or analytic) mapping of G into 
CP’. An isomorphism in the category of do- 
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mains G c C” and holomorphic mappings is 
called an analytic isomorphism (or biholomor- 
phic mapping). An automorphism in the cate- 
gory is called an analytic (or holomorphic) 
automorphism. With every domain G c C 

is associated the sheaf Oç of germs of holo- 
morphic functions over G. Thus we have the 
notion of a tringed space (G, 0,). A complex 
analytic manifold can be defined as a (Haus- 
dorff) ringed space that is locally isomorphic 
to some (G, 0,). 

A meromorphic function in G is a function 
that is locally the quotient of two holomor- 
phic functions with denominator #0. It may 
be defined more rigorously as a meromor- 
phic mapping of G into P,(C) (- 23 Analytic 
Spaces D). 


K. The Cousin Problems 


The Cousin problems are those of constructing 
meromorphic functions with given zeros or 
poles. In terms of sheaves the problems are 
stated as follows: Let Xg be the sheaf of germs 
of meromorphic functions over a domain G c 
C”. The first Cousin problem asks whether 

the mapping T(G, Xg)>T (G, A) induced 

by the exact sequence 030, 4,9 A 0 
(Po = Hg/Og)is surjective, where T(G, F) is 
the module of tsections of ¥ over G (- 383 
Sheaves C). Let #,* be the sheaf of multiplica- 
tive groups of germs of meromorphic functions 
not identically 0 and (,* be the subsheaf of 
A* formed by germs of nonzero holomorphic 
functions. The second Cousin problem asks 
whether the mapping T(G, %#*)—> P(G, Dg) 
(Dg = HG*/Og*)is surjective. P. Cousin (1895) 
solved the first problem for G = C" or []j_, G; 
and the second problem for G =Ç”, Oka (1935) 
proved that the first problem is solvable in 
every domain of holomorphy. In solving the 
second problem in a domain of holomorphy, 
Oka established the notion of ‘fiber bundles 
and proved that the problem for any domain 
is reduced to holomorphic triviality of a 
holomorphic principal fiber bundle over the 
domain and that holomorphic triviality is 
equivalent to topological triviality when the 
domain is of holomorphy (Oka’s principle). 
Using the solutions of the Cousin problems, 
Oka proved his gluing theorem, described in 
Section 1. 


L. Stein Manifolds 


Abstracting certain important properties of a 
domain of holomorphy, Stein [33] introduced 
the following category of complex analytic 
manifolds (X, Oy): (1) X is paracompact (i.e., 
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each connected component of X has a count- 
able open base). (2) Functions in F(X, Oy) 
separate the points of X. (3) For every point 

x € X there exists a system of local coordi- 
nates around x that is formed by functions in 
1(X, Cy). (4) X is holomorphically convex. 
(X, 0,) is then called a Stein manifold. It was 
later discovered by Grauert [ 12] that con- 
ditions (2) and (4) imply (1) and (3). 

Applying the theory of teohomology with 
coefficients in sheaves, H. Cartan and J.-P. 
Serre obtained for an tanalytic coherent sheaf 
F on a Stein manifold X, the following funda- 
mental theorems of Stein manifolds. Theorem 
A: H°(X, F) generates the stalk Fy (as an Oy- 
module) at every point x of X. Theorem B: 
HX, F) =0 for all q> 0 [7,30]. Conversely, 
for a complex analytic manifold X, if for every 
analytic coherent sheaf .g of ideals defined by 
a 0-dimensional analytic set in X (i.e., a dis- 
crete subset of X), H! (X, J) = 0, then X is a 
Stein manifold. Furthermore, if T'(X, 0y) = 
I (Y, 0y) for a Stein manifold Y (as in the case 
where X g c”), then the fundamental theorem 
A for every coherent sheaf of ideals implies 
that X is a Stein manifold (I. Wakabayashi). 

Due to the fundamental theorems, most 
results on domains of holomorphy hold un- 
changed for Stein manifolds. For instance, the 
first Cousin problem is always solvable. The 
second Cousin problem is solvable if and only 
if H?(X, Z)=O. An n-dimensional Stein mani- 
fold can be realized as a (ramitied) covering 
domain of holomorphy over C”. Furthermore, 
some theorems on differentiable manifolds 
have analogs on Stein manifolds. For instance, 
the cohomology groups of the complex of 
holomorphic differential forms over a Stein 
manifold X are isomorphic to the cohomol- 
ogy groups H*(X, C) (analog of tde Rham’s 
theorem). Every n-dimensional Stein manifold 
X is realized as a closed complex analytic 
submanifold in C2"*?; that is, there exists an 
injective tproper holomorphic mapping f: 
X>C*"*! with df #0. Consider all the holo- 
morphic tprinciple tiber bundles over a Stein 
manifold X whose fibers are isomorphic to a 
complex Lie group G. The analytic isomor- 
phism classes of the bundles and the elements 
in H'(X, G’) (where G° is the sheaf of germs 
of holomorphic mappings of X into G) are in 
one-to-one correspondence. The same is true 
for the topological isomorphism classes of the 
bundles and the elements in H'(X, G°) (where 
G“ is the sheaf of germs of continuous map- 
pings of X into G). The mapping H! (xX, G%)> 
H'(X, G') induced by the canonical injection 
G° > G is bijective (Grauert [ 13]). Every rela- 
tively compact domain in a complex analytic 
manifold is holomorphically convex if it is 
strongly pseudoconvex (Grauert [14]). Hence 
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such a domain is a Stein manifold. It follows 
from this that every real analytic manifold 
with countable base for open sets is realized as 
a closed real analytic submanifold of some R”. 

The notion of Stein manifolds is generalized 
to that of weakly 1-complete manifolds: a 
complex manifold X is called a weakly 1- 
complete manifold if there exists a plurisubhar- 
monic function y of class C® on X, such that 
for any ceR, X,= {xe X |u(x)<c} is relatively 
compact. The family of such manifolds in- 
cludes compact complex manifolds too. For a 
weakly 1-complete manifold also, vanishing 
theorems of cohomology have been estab- 
lished by H. Hironaka, S. Nakano [26], and H. 
Kazama [2] (- 232 Kahler Manifolds D). 

The theory of entire functions of two vari- 
ables has been developed from a new view- 
point established by T. Nishino. An entire 
function f of two variables defines a family of 
Riemann surfaces { f=c c eC} on C?, and 
investigations of the structure of sych a family 
play an important role. For instance the fol- 
lowing is proved in this way: If every irreduc- 
ible component off= c (CE C) is biholomor- 
phic to C!, then there exists an entire function 
g such that { f, y): C? +C? is a biholomorphic 
mapping (Nishino [28]). In this theorem, the 
analyticity of g is obtained from the fact that 
C? is a Stein manifold. 


M. Continuation of Analytic Sets 


The application of the theory of cohomology 
with coefficients in sheaves is not restricted 
to problems concerning Stein manifolds. 
Given Gp = {z||z;|<1,1<j<n} (n>3), Gi = 
{z||z1|<e|z|<1,2<j<n}, and G™=G,U 
(Go —{2|2.=...=Z,_,=0}) (3<m<n), we have 
H°(G™, Ogm) = 0 (1 <p <m=— 2) (Scheja’s 
theorem [32]). Let F be a coherent analytic 
sheaf over a domain G c Ç". If, for every point 
z of an analytic set ASG, ¥,= {0} or p< 

n —dim, A =2 hd, F (where hd, ¥ is the 
thomological dimension of the O, ,-module 
F), then it follows from Scheja’s theorem that 
the mapping H?(G, F)>H?(G = A, F) induced 
by the canonical injection G—A-> G is bijec- 
tive. This generalizes Hartogs’s continuation 
theorem for holomorphic functions, which 
corresponds to the case p= 0. 

Besides the continuation of holomorphic 
functions, we çan consider the continuation of 
analytic sets. Let A be an analytic set in a 
domain G c C" and § an analytic setin G A. 
A point ze G is said to be regular (essentially 
singular) with respect to § if the closure § of S 
in G is (is not) analytic at z. If dim A < dim, S 
for every point z€ S, then § is analytic in G. If 
S is purely d-dimensional and S is analytic at 
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a point of a d-dimensional irreducible com- 
ponent A’ of A, then S is analytic at every 
point of A’ that is not located in any other ir- 
reducible component of A. Furthermore, if 
dim A g dim S and S is purely d-dimensional, 
then the following hold: (1) The set E of essen- 
tial singularities of S is, if not empty, a purely 
d-dimensional analytic set in G, formed by 
irreducible components of A. (2) If every 
irreducible component of A contains points 
of E not located in any other irreducible com- 
ponent of A, then A CE, and A is, if not 
empty, purely d-dimensional. (3) If every d- 
dimensional irreducible component of A con- 
tains points that are regular with respect to S, 
then S is a purely d-dimensional analytic set 
in G (Thullen, Remmert, and Stein). By these 
results it is possible to give a proof for tChow’s 
theorem that every analytic set in P,(C) is 
algebraic. 

The continuation of holomorphic functions 
is related to the continuation of their graphs. 
W. Rothstein investigated the continuation of 
analytic sets to obtain the following analog 
to Hartogs’s theorem of continuity: If G = 
G, UG, G, ={2||2,|<1/2, Xj-2lz;jl < 1}, G= 
{2| \z,|<1, 1/2< 5-12; < 1}, and G= 
{z||z,|<1, Z_,|2z,/? <1} (the envelope of 
holomorphy of G) with n> 3, then every purely 
(n 1)-dimensional analytic set A in G extends 
to an analytic set in G; that is, there exists a 
purely (n = 1)-dimensional analytic set Ain 
G such that A = ÄN G. K. Kasahara and H. 
Fujimoto generalized this theorem to the case 
of analytic spaces. 


N. Nevanlinna Theory for Several Complex 
Variables 


The Nevanlinna theory investigates holomor- 
phic mappings between complex manifolds. 
In function theory of one variable, for a holo- 
morphic mapping f :C>P!(C) from C into 
the 1-dimensional tcomplex projective space, 
the famous +Picard theorem states that if f 
omits three values, then f must be constant. 
R. Nevanlinna developed the quantitative 
theory of value distributions off: L. Ahlfors 
[1], introduced the geometric approach, and 
enunciated the principle that the negative 
curvature of the image manifold restricts a 
holomorphic mapping. In higher-dimensional 
situations, this principle has been realized in 
many cases. The Nevanlinna theory for a 
holomorphic mapping f: C—P”(C) with re- 
spect to thyperplanes was established by Ahl- 
fors and H. Weyl and J. Weyl [34]. Holomor- 
phic mappings into projective spaces have 
been studied in detail by W. Stoll, H. Fujimoto, 
and M. Green. Following the work of S. S. 
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Chern [9] of around 1970, the equidimen- 
sional case was investigated (P. A. Grifliths, K. 
Kodaira, S. Kobayashi and T. Ochiai). J. A. 
Carlson and Grifliths [6] succeeded in obtain- 
ing a tdefect relation for equidimensional 
holomorphic mappings and thypersurfaces of 
the image manifold. This yields the following 
Picard-type theorem (after the formulation by 
F. Sakai, Inventiones Math., 16 (1974)): Let X 
be a tprojective algebraic manifold of dimen- 
sion n and f: C’+X a holomorphic mapping. 
Let D be a hypersurface of X. We take a 
tdesingularization t: X*+X so that the in- 
verse image D* = 77! (D) has at most normal 
crossings as singularities. We denote by K* a 
tcanonical divisor of X*. We assume that 

dim H°(X*, O(m(K* +D*))) 30 


n 


Ro m 
If f omits D, then the Jacobian off vanishes 
everywhere. As a corollary, it follows that the 
funtversal covering manifold of X -D cannot 
be C". When X = P”(C) and P is a nonsingular 
hypersurface of degree d, the above assump- 
tion is satisfied if d>n + 2. In the nonequidi- 
mensional case, Ochiai [29] (with supple- 
mentary works by M. Green, Y. Kawamata, 
and P. Wong) verified the following asser- 
tion, which was first stated by A. Bloch with 
rough arguments: Let X be a projective alge- 
braic manifold of dimension n, and suppose 
dim H'(X, 0y) >n; then the image of a holo- 
morphic mapping f:C—X is contained in a 
proper subvariety of X (— 124 Distributions 
of Values of Functions of a Complex Variable; 
272 Meromorphic Functions). 


0. Hyperbolic Manifolds 


Every complex analytic space X has two jn- 
variant tpseudodistances: the Carathéodory 
pseudodistance cy and the Kobayashi pseudo- 
distance dy, both of which generalize the Poin- 
caré distance p of the unit disk D = {|z| <1}. 
These pseudodistances can be defined by the 
property that dy is the largest pseudodistance 
among all pseudodistances §, on X for which 
all holomorphic mappings (X, 6,)—>(D, p) are 
distance-decreasing, while cy is the smallest 
among all pseudodistances 6, for which all 
holomorphic mappings (D, p)>(X, 6,) are 
distance-decreasing. Then cy < d,. If Y is an- 
other complex analytic space, every holomor- 
phic mapping f:X > Y is distance-decreasing 
with respect to either its Carathéodory or 
Kobayashi pseudodistance. This may be con- 
sidered to be a generalization of the Schwarz- 
Pick lemma. (The ‘Schwarz lemma is often 
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referred to by this name, since G. Pick also 
investigated the distance-decreasing property.) 
The Kobayashi pseudodistance dy can also 
be obtained by integrating an infinitesimal 
differential pseudometric called the Koba- 
yashi pseudometric in the same manner as the 
Riemannian distance is obtained from the 
Riemannian metric (H. L. Royden). 

If dy is a (complete) distance on X, then X is 
said to be (complete) hyperbolic. A Riemann 
surface is hyperbolic in this sense if and only 
if its universal covering is biholomorphic to 
the unit disk. If X is open in Y and if, for 
every pair of sequences of points {p,} and 
{qn} in X converging to distinct points of X, 
lim dy(P„, dn) is positive, then X is said to be 
hyperbolically embedded in Y. The Riemann 
sphere CU { œ} minus three points is not 
only complete hyperbolic but also hyperboli- 
cally embedded in the Riemann sphere. Every 
holomorphic mapping of C into a hyperbolic 
complex analytic space is constant, while every 
holomorphic mapping of the punctured disk 
D* = {0<|z|< 1} into a hyperbolically embed- 
ded space X c Y extends to a holomorphic 
mapping of D into Y. Thus the classical little 
Picard theorem reduces to the statement that 
C = {0, 1} is hyperbolic, while the great Picard 
theorem reduces to showing that C = {0, 1} is 
hyperbolically embedded in the Riemann 
sphere. These classical theorems can be gen- 
eralized in two ways. If M is a tsymmetric 
bounded domain and [ is a discrete arithmetic 
group acting freely on M, then M/T is not 
only complete hyperbolic but also hyperboli- 
cally embedded in its Satake compactification 
(Kobayashi and Ochiai). If X is the comple- 
ment of 2n + 1 hyperplanes in general position 
in the complex projective space P”(C), then X 
is complete hyperbolic and hyperbolically 
embedded in P”(C) (a restatement of a re- 
sult going back to E. Borel, A. Bloch, and H. 
Cartan). 

Although there are some noncompact non- 
hyperbolic complex manifolds X for which 
every holomorphic mapping f:C—X is con- 
stant, a compact complex manifold X is hyper- 
bolic if and only if every holomorphic map- 
ping f:C—X is constant (R. Brody). 

For the *Teichmtller space X = T, of com- 
pact Riemann surfaces of genus g, the Koba- 
yashi distance dą agrees with the Teichmüller 
distance which had been introduced before 
the complex structure of T, was defined (H. L. 
Royden). 

In the study of pseudoconvex domains, the 
*Bergman metric, the Carathéodory distance, 
and the Kobayashi distance serve as useful 
tools. Their behavior at the boundary has been 
studied extensively. 
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Given an n-dimensional complex mani- 
fold X, an invariant measure Yy can be de- 
fined as the largest measure such that every 
holomorphic mapping f from the polydisk 
D” with invariant measure into X is measure- 
decreasing. For an algebraic manifold of gen- 
eral type, this measure is everywhere positive. 


P. Bounded Domainsin C” 


For any bounded domain in C” there is natu- 
rally assigned a tKahler metric called the 
Bergman metric. Using the invariance of this 
metric, E. Cartan proved that all the Her- 
mitian symmetric spaces of noncompact type 
are realized as bounded domains. In view of 
the fact that some important period matrix 
domains (e.g., the 19-dimensional bounded 
symmetric domain of type IV, the Siegel upper 
space, etc.) are of this type, it is obviously of 
great significance in algebraic geometry to 
study discontinuous subgroups of the auto- 
morphism groups of such domains (Pyatetskii- 
Shapiro). 

On the other hand, it is almost impossible 
for general bounded domains to determine the 
explicit form of their Bergman metrics. But 
Cartan also investigated strongly pseudo- 
convex real hypersurfaces in C?, and he solved 
completely the (local) equivalence problem for 
them, introducing a definite type of Cartan 
connections over them in a functorial way. 
Thereafter N. Tanaka (and for hypersurfaces 
also Chern and J. Moser) generalized this 
result to all pseudoconvex real submanifolds 
in higher-dimensional complex manifolds. 
One can further apply this result to the equi- 
valence problem of bounded domains with 
smooth strongly pseudoconvex boundaries, 
for C. Fefferman [ 10] proved by analyzing the 
boundary behavior of the Bergman metric that 
every biholomorphic mapping between two 
such domains is extended smoothly up to their 
boundaries (I. Naruki [27] gave an alternative 


proof). 


Q. History 


In connection with fAbelian functions, ana- 
lytic functions of several complex variables 
have been studied sporadically since the time 
of Riemann and Weierstrass (H. Poincaré, 
Cousin). A series of investigations by Hartogs 
([ 17] (1906), etc.) that revealed the distinctive 
properties of several complex variables ini- 
tiated a new epoch in complex analysis. Levi 
(1910- 1911) generalized Hartogs’s results to 
the case of meromorphic functions, introduced 
the notion of pseudoconvexity, and proposed 
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the so-called Levi problem. After a lapse of 
time, many contributions to this new area of 
complex analysis have been made since 1920. 
The study by K. Reinhardt (1921) of analytic 
automorphisms was further developed by C. 
Carathéodory and Behnke. The tkernel func- 
tion introduced by S. Bochner and S. Berg- 
mann (1922) produced many remarkable 
results. In contrast with tPicard’s theorem 

in one variable, P. Fatou found a holomor- 
phic mapping f: C* —C? with nonvanishing 
Jacobian such that the image f(C?) has an 
exterior point. 

The theory of analytic functions of several 
complex variables has flourished since 1926. 
Behnke and Thullen in Münster, together with 
G. Julia and H. Cartan in Paris, were the most 
active investigators. The results on tnormal 
families of analytic functions of several com- 
plex variables (Julia [20], 1926), the uniqueness 
theorem of holomorphic mappings (H. Cartan, 
1930), and a characterization of a domain of 
holomorphy by holomorphic convexity (Car- 
tan and Thullen, 1932) are their most remark- 
able achievements. Behnke and Thullen [3] 
systematized the results obtained since the 
discovery of the theory by providing a com- 
plete bibliography of articles up to 1934. 

The three major unsolved problems at that 
time-those of Cousin, Levi, and the approx- 
imation of holomorphic functions-were 
intensively studied by Oka from 1936, who has 
given complete solutions [30]. The investiga- 
tion of ideals of holomorphic functions by H. 
Cartan (1944), together with that of ideals with 
undetermined domains by Oka, has developed 
into the theory of coherent analytic sheaves. 
The notion of analytic spaces, first introduced 
by Behnke and Stein (1951), extended the field 
of investigation in the theory of analytic fync- 
tions of several complex variables. The theory 
of cohomology with coefficients in sheaves has 
been effectively applied by H. Cartan and 
Serre (1951-1952). The introduction of the 
notion of Stein manifolds (195 1) came at the 
same time. Grauert’s deep investigations since 
1955, together with those of Stein and Rem- 
mert, have contributed greatly to the develop- 
ment of the theory of analytic spaces. In the 
1960s, active investigations took place also in 
the United States [16]. The theory of auto- 
morphic functions of several complex variables 
has been developed by C. L. Siegel, I, Satake, 
and others in connection with the theory of 
numbers. Entire functions of two variables 
have been investigated from a new point of 
view by Nishino and others since 1968 (— last 
paragraph of Section N). The notion of a 
hyperbolic manifold introduced by Kobayashi 
(1967) enables us to obtain many results on 
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complex manifolds by means of the methods of 
differential geometry. The theory of automor- 
phisms of bounded pseudoconvex domains 
was developed extensively in the 1970s by 
Fefferman, Naruki, and others. The theory of 
an envelope of holomorphy has also been 
successfully applied to the theory of elemen- 
tary particles in physics. 
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22 (1.12) 
Analytic Sets 


A. General Remarks 


The notion of analytic sets was first defined by 
N. N. Luzin and M. Ya. Suslin in 1916, and 

it was extended to that of projective sets by 
operations such as complementation and pro- 
jection (Luzin, 1924). Most mathematicians, 
including Luzin and W. Sierpinski, who 
worked in this field, were in agreement with 
*French empiricism (or tsemi-intuitionism), 
which defended the standpoint of R. Baire, 

E. Borel, H. Lebesgue, and others. An object is 
said to be effectively given if itcan be uniquely, 
individually, and unambiguously determined 
in finite terms so that anyone can reach the 
same object by following the defining pro- 
cedure. Semi-intuitionists claim that only 
effectively given objects have mathematical 
existence, and they do not recognize as a 
mathematical object something that needs the 
axiom of choice for its definition. From this 
point of view, *Borel sets were “well-defined” 
sets to which classical analysis had to be re- 
stricted. Thus the question was raised whether 
it is possible to extend the class of Borel sets to 
a wider class of sets with the same certainty. 
Lebesgue (J. Math. Pures Appl., 1 (1905)) de- 
fined a function not belonging to any class 

of Baire functions by using the totality of 
‘ordinals of the second class. (Later, this 
method was systematically developed as the 
theory of sieves by Luzin.) However, it did 
not satisfy Borel as being effective. Can we, 
then, extend the Borel sets without any use of 
ordinals of the second class? The discovery of 
analytic sets gave an affirmative answer. 

In this article (except in Section 1), we treat a 
space (denoted by X, Y, . ) that is thomeo- 
morphic to a ‘complete tseparable tmetric 
space and its subspace. Denote by 9 the space 
of irrational numbers (a metric space consts- 
ting of the irrational numbers € R with the 
metric |x — y| of x and y). The following prop- 
erties of a subset § of a space X are equivalent: 
(i) § is a continuous image of N; (ii) § is a con- 
tinuous image of a Borel set in X; (iii) § is the 
projection of a closed set in a product space 
X xN; (iv) § is the projection of a Borel set in 
X x Y. We calla set satisfying one of these 
properties an analytic set, an A set, or a J} set 
(in X). The complement of an analytic set is 
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called a complementary analytic set (or simply 
coanalytic set), a CA set, or a TF} set. 


B. The Operation A and Sieves 


When to each (n,, , ng) of finite sequences of 
natural numbers there corresponds a unique 
element E(n,, , n,) of a family F of sets, this 
correspondence {E(n,, . . , n,)} is called a 
schema of Suslin (or system of Suslin) consist- 
ing of sets in F. Denoting an infinite sequence 
of natural numbers by {n;}, the set given by 
Uin [k E(N, m) is called the kernel of a 
system of Suslin, and the operation of taking 
the kernel is called the operation A (analytic 
operation). 

Let Q be the set of al] rational numbers 
between 0 and 1 and F be a family of sets. 
Take a family {C,},<g of sets belonging to F 
with the index set Q (or more geometrically, a 
subset C= |] «gC, x (r) of X x Q when F is a 
family of subsets of a space X), and call it a 
sieve consisting of sets in F. Denoting by fr) a 
(strictly) monotone decreasing sequence of 
elements of Q, we call the set |]; (\kC,, 
(namely, the set of all x such that C™ = 
{r (x, ) €C} is not well-ordered by the 
order < of rational numbers) the set obtained 
by a sieve C or the sieved set obtained by C. If 
the family F is closed with respect to countable 
intersection, then the family of al] sets ob- 
tained by sieves consisting of sets in F is ident- 
ical to the family of all sets obtained by apply- 
ing the operation A to F. When F consists of 
all the closed sets in a given space, this is the 
family of all analytic sets. In particular, it is 
sufficient to take the family of closed intervals 
as F when X is the space of real numbers. 
Note that we can define sieves and sieved sets 
more generally by using the space of real num- 
bers R instead of the set Q of rationals. 


C. Properties of Analytic Sets 


It is evident from the definition that every 
‘Borel set is analytic. If a Borel set is uncount- 
able, then it is the union of a countable set 
and a one-to-one continuous image of 9. The 
analyticity of sets is invariant under countable 
unions, intersections, and Cartesian products 
and the operation A and tBorel-measurable 
transformations. An uncountable analytic set 
contains a tperfect subset (Suslin). Therefore 
the possible cardinality of an analytic set is at 
most countable or is that of the continuum. 
Every analytic set enjoys the *Baire property, 
and in Euclidean space every analytic set is 
tLebesgue measurable (Luzin, Sierpinski). If a 
set E in the Euclidean plane is analytic (coana- 
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lytic), then I(E) is also analytic (coanalytic), 
where I(E) is the set of all x such that the 
section E™ of E that is parallel to the y-axis 
has a positive measure (M. Kondé6 and T. 
Tugué). Concerning the Baire property, the 
similar result for a set E of X x Y is obtained 
by replacing “to have a positive measure” by 
“to be the tsecond category (nonmeager) set” 
(A. S. Kechris). The Lebesgue measure of an 
analytic set is effectively calculable (in the 
sense that the measure of a X} (or 2 }(a)) set is 
a X! (resp. X}(«)) real number (- Section G 
and 356 Recursive Functions) (H. Tanaka). 
Every analytic (coanalytic) set E can be de- 
composed into N, Borel sets. This decompo- 
sition is called a decomposition of E into 
constituents. An analytic (coanalytic) set is a 
Borel set if and only if it is decomposable into 
a countable number of constituents (Luzin, 
Sierpinski). In a space with the cardinality of 
the continuum, there exist analytic sets that 
are not Borelian. For example, in the space 

C( [0, 1]) of continuous functions on the inter- 
val [0, 1] (— 168 Function Spaces) the set of 
all differentiable functions is coanalytic but not 
Borelian (S. Mazurkiewicz). 

The following theorems are especially im- 
portant in analytic set theory. Luzin’s first 
principle (the first separation theorem): For 
every pair of disjoint analytic sets A,, A,, there 
exists a Borel set B such that A, c B and 
BILA, = Ø. An immediate corollary of 
Luzin’s first principle is Suslin’s theorem: If 
both A and X = A are analytic,’ then A is a 
Borel set. Luzin’s second principle (the second 
separation theorem): For every pair of analytic 
sets A and B, there exist complementary analy- 
tic sets C and D such that A—BcCC, B-Ac 
D, and CND = @. A one-to-one continu- 
ous image of a Borel set is Borelian (Suslin). 
More generally, for a given B-measurable 
function f defined on a Bore1 set B, the set A 
(<f(B)) of all points y whose inverse images 
f> (y) are singletons is a complementary 
analytic set (Luzin’s unicity theorem). In this 
theorem, we can replace “a singleton” by “a 
tg-compact set” (V. Ya. Arsenin, K. Kunugui). 
Therefore, if a set is the image of a Bore] set by 
a continuous function such that the inverse 
image of each point is a o-compact set, then it 
is a Borel set. 


D. Generalization to Projective Sets 


A projective set of class n is inductively defined 
as follows: (i) the Borel sets are the projective 
sets of class 0; (ii) the projective sets of class 
2n + | are the continuous images of the sets of 
class 2n; (iii) the projective sets of class 2n are 
the complements of the sets of class 2n = 1. 


110 


The projective sets of class 1 are exactly the 
analytic sets, and those of class 2 are the com- 
plementary analytic sets. The following are 
fundamental properties of projective sets. 
Denote by L, the family of the projective sets 
of class n. Then (1) Lac Lanex and Lapi C 
Lan+24+ (K= 1,2, . ..). (2) the property of 
being a set of class n is invariant under count- 
able unions, intersections, and Cartesian 
products and homeomorphisms; (3) a contin- 
uous image of a projective set of class 2n + 1 
is of the same class; (4) the projection on X of 
a set of class 2n + | in X x Y is a set of class 
2n + 1 in X; (5) the family of the projective sets 
of class 2n + 1 in a space X is the family of the 
projections of all sets of class 2n in X x X (or 
X x N); (6) the kernel of a system of Suslin 
consisting of sets of class nis a projective set 
of the same class, where n 40, 2. 

We frequently call a projective set of class 
2n lapP setora Z} set, and that of class 2n 
a C, set or a TI} set. A B, set or a Aj set is a set 
that is both P, and C,„. The respective families 
of these sets are also denoted by P, (22), 

C, (I}), and B, (A}), respectively. In general, for 
a family § of sets in a space X, we denote by 
C¥% the family of the complements X-E of all 
sets E in &. We write Sep,(%) and Sepy(%) for 
the propositions obtained by substituting “set 
in &,” “set in & and in CY,” and “set of CR” 
for “analytic set,” “Borel set,” and “coanalytic 
set,” respectively, in Luzin’s first principle and 
Luzin’s second principle, respectively. Fur- 
thermore, we say that the reduction principle 
holds for %¥, and denote it by Red(%), when for 
any sets A, BE & there exist A,, B, € & such 
that A, c A, B, cB, A, UB, = AUB, and 

A, N B, = @. C. Kuratowski introduced the 
latter principle Red(%) which implies Sep,(C%) 
and Sepy(C%), and proved Red(/7}). Classi- 
cally, reduction or separation principles for the 
projective sets of higher classes were not 
settled except for Sep,(/7}), Sep, (773) @P. S. 
Novikov), and Red(2}) (Kuratowski). Now- 
adays, it is known that these principles for 

{X) JT}\,53 are undecidable from the axiom- 
atic set theory ZFC. If we assume tthe axiom 
of constructibility V = L, then Red(}) holds 
for n> 3 (J. W. Addison). On the other hand, 
under the assumption of tprojective deter- 
minacy PD (— Section H), it follows that 
Red(/7}) (Red(Z})) holds when p is odd (even) 
(A. Martin, Addison and Y. N. Moschovakis). 


E. Universal Sets 


A set U in §t{x X is called the universal set 
for the projective sets of classn in X if for any 
projective set P of class n in X, there exists 
ZoEN such that P={x|(z9,x)eU}. Concern- 
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ing universal sets, we have the following result: 
for every n> 0, there exists a universal set for 
the projective sets of class N in X that is of the 
same class in N x X. Hence in a space with 

the cardinality of the continuum, there exists a 
projective set of class n + 1 which is not of 
class N. In general, if a class % of sets is closed 
under taking a universal set and continuous 
preimages, then % and C% cannot both have 
the reduction property. Therefore Red(/#}) or 
Red(}) fails for each n> 1. 

Any two universal sets of analytic sets are 
*Borel isomorphic. On the other hand, an 
analytic set E is not Borel isomorphic to any 
universal analytic set if the complement of E is 
an uncountable set without a perfect subset (A. 
Maitra and C. Ryll-Nardzewski). Hence, there 
are at least two equivalence classes of uncount- 
able analytic sets with respect to the Borel 
isomorphism = under the assumption of V= 
L, since the latter assumption implies the 
existence of such a set E (- Section H). 


F. Uniformization Principle 


The uniformization problem arose during 
investigations of implicit functions, For a set E 
in a space X x Y, uniformization of E is the 
finding of a subset V of E such that 


Wx(ay((x, yje E) J!y((x, ye V)), 


where J!y is the tquantifier which means 
“there exists exactly one y.” A Borel set can be 
uniformized by choosing a suitable coanalytic 
set (Luzin). Kond6’s uniformization theorem 
(Japan. J. Math., 15( 1938)) is the most impor- 
tant result in descriptive set theory: every 
coanalytic set is uniformizable by a coanalytic 
set. As a corollary to this, any x} set is uni- 
formizable by a Z} set, and every ©} set is 
obtained as a One-to-one continuous image 
of a coanalytic set. 

Kondô’s original proof was very difficult to 
understand. Since teffective descriptive set 
theory was introduced by Kleene and Ad- 
dison, the proof has been simplitied by Ad- 
dison, and a more elegant one has been given 
by J. R. Shoenfield [7, p. 188]. Nowadays, the 
theorem is also called the Novikov-Kond6- 
Addison theorem, and is one of the most 
powerful and fundamental theorems not only 
in descriptive set theory, but also in the tfoun- 
dations of mathematics. 

The uniformization of an analytic set is, in 
general, not to be found among analytic or 
coanalytic sets. There was a conjecture that 
any analytic set is uniformizable by specifying 
an A, set (difference of two analytic sets). 
Recently, this conjecture was negatively settled 
(J. Steel, Martin). Assuming that V= L, the 
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uniformization of a £} (n > 3) set is determined 
by specifying a X} set, and that of a H; (n > 2) 
set by specifying a it; set. On the other hand, 
if an axiom system of set theory (¢.g., ZF; — 33 
Axiomatic Set Theory) is consistent, then it is 
still consistent even if we add to it the follow- 
ing proposition: There exists a JJ} set whose 
uniformization is impossible by any choice of a 
definable set in the system (P. J. Cohen, A. 
Lévy). However, if we assume the existence of 
a tmeasurable cardinal (MC), it is known that 
every IT} set is uniformizable by a M4 set 
(Martin and R. M. Solovay). On the other 
hand, PD implies “every M! (2?) set is uni- 
formizable by a M! (2}) set for each odd 

(resp. even) n” (Moschovakis). 

There are sufficient conditions on sections 
E™ of a Borel set E in the Cartesian product 
X x Y for E to be uniformizable by the Borel 
set as follows: all sections E™ are (i) countable 
(Luzin, Novikov), (ii) a-compact (Arsenin, 
Kunugui); or, as “large section property,” (ili) 
in the second category (S. K. Thomason, P. G. 
Hinman), or (iv) of positive measure (Tanaka, 
G. E. Sacks). For applications of descriptive 
set theory to analysis, an important uniformi- 
zation result is von Neumann’s selection 
theorem: for any A set E c X x Y, there exists 
a tBaire measurable and ‘absolutely measur- 
able function f: X> Y (- 270 Measure 
Theory L (vi)) such that 


Vx(Ay((x, yje E) <> (x, f(x))€ E). 


Concerning implicit functions, any Borel set 
E cX xY such that all sections FE) are at 
most countable is expressed by a union of 
intersections of E with graphs of some Baire 
functions f,:X > Y,neN (Luzin). Recently, it 
was shown known as a generalization of this 
that any Borel set Ec X x Y such that all 
sections are a-compact is a union of countably 
many Borel sets E, for each of which all sec- 
tions E® are compact (J. Saint-Raymond). 


G. Kleene’s Hierarchy and Effectiveness 


First, projective set theory in any space is 
reducible to the theory in the space of irra- 
tional numbers. Second, if we introduce a 
tweak topology in the set NX = @q of tnumber- 
theoretic functions x with one argument, the 
resulting topological space N^ is homeomor- 
phic to the tBaire zero-dimensional space 

of irrational numbers. Third, any subset B of 
N™ is open and closed in this topology if and 
only if there exist a function €(EN%) and a 
predicate A*(x) that is tgeneral recursive in č 
such that «e B<> A*(x), Fourth, logical opera- 
tions such as “1, v , A, 3x (where x is a varti- 
able ranging over the natural numbers), and Ja 
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exactly correspond to the operations (on sets) 
complementation, union, intersection, count- 
able union, and projection, respectively. On 
the basis of these facts, projective set theory is 
regarded as the theory of the N™ tanalytic hier- 
archy of Kleene. Here, the following example 
is remarkable: We can construct a £} set which 
is universal for the analytic sets (namely, a 

X! [N’] set) in the space of irrational num- 
bers (— 356 Recursive Functions H). 

The connection between projective set 
theory and logic has been discussed by C. 
Kuratowski and A. Tarski. From their point of 
view, semi-intuitionists such as Borel regard 
the set of natural numbers to be precisely clear 
in itself and also the continuum to be immedi- 
ately recognizable by our geometric intuition. 
In their argument rational numbers do not 
play such an important role. They take, a 
priori, the set of irrational numbers as the 
fundamental domain, and intervals with ra- 
tional extremities as the simplest sets of points 
among the subsets of the fundamental domain 
as the starting point of their argument. Here, 
the fundamental domain or each interval is 
not conceived as a totality of its elements, but 
recognized as a “uniform extent.” In contrast 
to this, singletons and individual irrational 
numbers are not so simple. For this reason 
Borel introduced the notion of calculable 
numbers to study definable real numbers. 
Following Luzin, we say that a calculable 
number is a constructible real number in the 
sense that we can give it by an arithmetical 
approximation as precisely as we want. Now, 
this notion is nearly identical to the notion of 
an effectively calculable real number given by 
A. Church or A. M. Turing. 

In the mathematics of the semi-intuitionists, 
the word “effective” has played an especially 
important role, Although these mathema- 
ticians have always agreed not to accept the 
taxiom of choice, the exact meaning of “effec- 
tive” has differed slightly among different 
members of their group or in different stages 
of the development of the theory. Such dif- 
ferences mainly arose in connection with the 
question: How can we tell whether given en- 
tities are finitary or individual? One way to 
guess the original intention held by Bore] and 
others when they used the term “effective” is to 
replace the term by “recursive.” Nowadays, the 
concept of “effectiveness” is used in this sense 
(— 356 Recursive Functions C), and all clas- 
sical results in descriptive set theory essentially 
have effective versions (or refinements via 
relativization). For example, the Novikov- 
Kond6é-Addison theorem is described as fol- 
lows: Any M} (IT} (a) for an 7€°q) set is uni- 
formizable by a J} (resp. /7{{«)) set. Similarly, 
Suslin’s theorem is that every analytic and 
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coanalytic (i.e., 41 (&)) set A is a Borel set of 
class v(x) which is effective relative to the 
“definability x” of given A, where v(a) is the 
+Constructive ordinal relative to (Kleene, 
Tugué and Tanaka, A. Louveau, etc.). 


H. Further Results in Axiomatic Set Theory 
and Strong Axioms 


The recent development of axiomatic set 
theory has yielded the following propositions. 
Under the assumption that V= L (K. Gödel, 
Novikov, Addison): (1) there exists an uncount- 
able 77} set that does not contain any perfect 
subset; (2) there exists a nonmeasurable 4} 
set; (3) there exists a 4} set that does not have 
the Baire property. On the other hand, if the 
axioms of set theory ZFC plus “there exists an 
*inaccessible cardinal number” are consistent, 
then so is “every projective set is Lebesgue 
measurable, has the Baire property, and has a 
perfect subset when it is uncountable” with 
them (Solovay). Concerning these properties, 
*Martin’s axiom MA and “25% > N,” (MA + 

< CH is consistent with ZFC; = 33 Axiom- 
atic Set Theory E) implies that every J} is 
measurable, and has the Baire property 
(Martin and Solovay). The possible cardinality 
of a X; set is at most &, or that of the continu- 
um, which is implied from “every Z} set is a 
union of N; Borel sets” (Sierpinski). It is 
known that one cannot prove or refute from 
ZFC the converse of the latter statement. If 
one assumes MC (the existence of a measur- 
able cardinal), then every £4 set is a union of 
N, Borel sets. Therefore MC implies that the 
possible cardinality of a £} set is at most &, 
or that of the continuum. 

With each set A c œ we associate the 
following infinite game of perfect information 
G(A), played by two players I and II. First 
player I chooses no € , then player II chooses 
n, E w, then I chooses n, € œ, and so on. The 
game ends after œ steps. Let r(k) = ną; if XE A, 
then J wins G(A); otherwise II wins. A strategy 
(for 1 or II) is a rule that tells the player what 
move to make, depending on the previous 
moves of both players. A winning strategy is 
one such that the player who follows it always 
wins. The game G/(A) is determined if one of 
the players has a winning strategy. The axiom 
of projective determinacy PD is the assertion 
that for every projective set A( c w) the game 
G(A) is determined. PD is widely used to solve 
problems on projective sets that are not decid- 
able from ZFC, and is a plausible hypothesis 
that does not seem to contradict the axiom of 
choice at present; and it has pleasing conse- 
quences in descriptive set theory. For n 3 3, 
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reductions and uniformization results of 2} 
(17) sets mentioned earlier are such examples. 
Furthermore, PD implies that every projective 
set is Lebesgue measurable, has the Baire 
property, and contains a perfect subset if it is 
uncountable. 

Now, write Determinacy ( X!) (Determinacy 
(A})), or simply, Det(}) (Det(A})), when G(A) 
is determined for every Z% (Aj) set A. Using 
the fact that the open game is determined, 
namely Det(2°) (D. Gale and F. M. Stewart), 
D. Blackwell has given a short proof of 
Sep,(Z/). The extremely difficult problem was 
to show Det(A}) in ZF (indeed, one cannot 
prove Det(Aj) only from the Zermelo axioms 
Z; = 33 Axiomatic Set Theory B). In 1975, 
Martin solved this problem. Concerning 2), 
Det (x 1) cannot be proved from only the ZF 
axioms, but MC implies Det(2]) (Martin). 
However, it is known that Det(£}) cannot be 
proved even if we assume MC. 

By a pre-well-ordering, we mean a relation 
< having all the properties of a well-ordering 
except for tantisymmetry. Denote by 6) the 
least ordinal that cannot be the length of a 
preewell-ordering on œ belonging to AŻ. It 
is a classical result that 6} =w, (= K,). The 
following results are mainly due to Martin: 

ô} <&,, MC = ô! < N}, and also, PD > 

54 <N,. Let x be an infinite cardinal. A set 

A cX is called x-Suslin when there exists a 
closed set CCX x °K °K={ fl f:w—k}) such 
that 


xeAdf[feew a(x, fec]. 


A is X,-Suslin iff Ae Z] If A is a Z3 set, then 
A is &,-Suslin (Shoenfield). Under the assump- 
tions of MC, if A is a Z} set, then it is X,- 
Suslin (Martin). Furthermore, Det(A},,) implies 
the following facts (Kechris, Moschovakis): 
every 23,47 Set is x-Suslin, where x is the 
cardinal of 83,41; every Z1, set in k-Suslin 
for some «k <83,., 

The axiom of determinacy AD is the asser- 
tion that the game G(A) is determined for 
every set A. AD implies that every set is 
measurable (J. Mycielski and S. Swierczkow- 
ski), has the Baire property, and has a perfect 
subset if it is uncountable (Morton Davis). 
Though AD contradicts the axiom of choice, 
some consequences of AD in the area of pro- 
jective sets are more desirable than the conse- 
quences of the axiom of choice. The following 
are examples of consequences of AD. (1) Aisa 
Z! set iff A is a union of N} Borel sets (Mos- 
chovakis). (2) For each n, 6} is a cardinal 
(Moschovakis). (3) Generalization of Suslin’s 
theorem holds as follows: 4} = Bg: (= the 
smallest Boolean algebra containing the open 
sets and closed under complementation and 
unions of length <6}) (Martin, Moschovakis). 
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I, Polish Spaces, Luzin Spaces, and Suslin 
Spaces 


A topological space homeomorphic to a com- 
plete separable metric space is called a Polish 
space. A subspace E of a Polish space X is 
Polish if and only if it is a G,-subset of X, Le. a 
countable intersection of open subsets of X 
(Aleksandrov and Uryson). A Hausdorff top- 
ological space X is called a Luzin space (resp. 
Suslin space) if we can find a Polish space S 
and a continuous bijective (resp. surjective) 
mapping f:S— X. Every Polish space is a 
Luzin space and every Luzin space is a Suslin 
space. 

Let X be a Hausdorff topological space. A 
subset E of X is called a standard set (resp. 
analytic set) if the set E with the relative top- 
ology is a Luzin (resp. Suslin) space. The 
analytic subsets are closed under analytic 
operations. Every analytic set is obtained from 
closed sets by applying the analytic operation. 
Every analytic set is funiversally measurable. 
Every standard set is a Borel set. Every count- 
able union or intersection of standard sets is 
standard. 

A subset E of a Suslin space is a Borel set if 
and only if both E and E° are analytic. A 
subset F of a Luzin space is a Borel set if and 
only if E is standard. 

Let f be a Borel measurable mapping from 
a Suslin space X into another Suslin space Y. 
Then the image f(X) is an analytic subset of Y. 
Furthermore, if f is injective, then f gives a 
*Borel] isomorphism between X and fl X). If 
both X and Y are Luzin spaces and if f is 
injective, then f(X) is a Borel subset of Y. 
Every Suslin space is tBorel isomorphic to an 
analytic subset of R, and every Luzin space is 
Borel isomorphic to one of the following 
spaces: (1) N,={1,2,.--,n}, (2) N={1,2,...}, 
B R. 

The selection theorem due to von Neumann 
(- Section F) holds when X and Y are Suslin 
spaces. This fact and its ramifications [ 15,161 
are useful in nonlinear functional analysis and 
control theory. 

Practically all useful spaces appearing in 
functional analysis and probability theory are 
Polish or Luzin spaces. Examples: (i) Every 
locally compact Hausdorff space with a count- 
able open base is Polish. (ii) Every separable 
Banach space is Polish. (iii) The set C of all 
continuous functions on [0, 1] with the ftop- 
ology of uniform convergence is a Polish 
space. (iv) The set D of all right continuous 
functions on [0, 1] with left limits is a Polish 
space when it is endowed with the Skorokhod 
topology (- 250 Limit Theorems in Proba- 
bility Theory E). (v) The space % of distri- 
butions and the space ”’ of tempered distri- 
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butions (- 125 Distributions and Hyperfunc- 
tions) are Luzin spaces. 
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Analytic Spaces 


A. General Remarks 


An tanalytic function of a complex variable 
has as its natural domain of detinition a +Rie- 
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mann surface, i.¢., a 1 -dimensional complex 
tanalytic manifold. In the case of several com- 
plex variables, the set of zeros of an analytic 
function, the quotient space of a domain by 

a tproperly discontinuous group of analytic 
automorphisms, the existence domain of an 
talgebroidal function, etc., are, strictly speak- 
ing, not necessarily complex analytic mani- 
folds. It is necessary to consider a more gen- 
eral category of complex analytic manifolds 
with singularities, and the notion of analytic 
spaces is drawn from these examples. Many of 
the properties of complex analytic manifolds 
are extended to analytic spaces; on the other 
hand, theories specific to analytic spaces have 
also been developed. 


B. Analytic Sets 


We say that a subset A of a complex analytic 
manifold G is an analytic set in G if itis a 
closed subset and each point of A has a neigh- 
borhood U such that Uf A is the set of com- 
mon zeros of a finite number of holomorphic 
functions in U. Specifically, if A is locally the 
set of zeros of a single holomorphic function 
that does not vanish identically, then A is 
called principal. Two subsets §, and $, of G 
are called equivalent at z? e G if there exists 
a neighborhood U of z° such that S$, NU = 
S, N U. By this equivalence relation, every 
subsets § of G defines its germ S,o at 2° A 
germ of an analytic set at z? is the germ at z? 
of an analytic set in a neighborhood of 7°, 
Each germ A, of an analytic set at z? = Qe G is 
associated with an ideal [(A9) = {f| fe H(0), 
f|A, =0} in the ring H(O) of tgerms of holo- 
morphic functions at 0. We call A, reducible 
if A, is the union of two germs of analytic sets 
A,’ and A,” with A,'#A,, Ay” # A,; other- 
wise, A, is called irreducible. An analytic set A 
is called irreducible at 0 if the germ at 0 of A 
is irreducible. Properties of A, and [(A,) cor- 
respond to each other. Thus A, is irreducible 
if and only if [( A,) is prime. 

As the ring H(Q) is tNoetherian, in a neigh- 
borhood of every point z? an analytic set A 
is represented as a union of a finite number 
of analytic sets A; that are irreducible at z°. 
These A, are essentially unique. If an analytic 
set A is irreducible at z°, then there exists a 
system of local coordinates (z,,., z,) centered 
at z? and a pair of natural numbers d <n and k 
such that, in a neighborhood of z°, A is a k- 
sheeted ramified covering space with covering 
mapping @:(Z,,.... 2,)(Z,,,24))1.¢., for an 
analytic set R in a neighborhood of 0 € C4, 9: 
A—@'(R)>C!— R is, in a neighborhood of 
z°, a k-sheeted covering mapping, where A 
ọ ~1(R) is a connected d-dimensional com- 
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plex analytic manifold in the neighborhood. 
The coordinates of the k points of A —@g*{R) 
over each point (z,,, z,) are holomorphic in 
these variables. The number d is the (local) 
dimension of A at z° and is denoted by dim,» A. 
From this local representation, we obtain 
Riickert’s zero-point theorem: Every prime 
ideal $ in H(Q) is equal to F(A), with A, an 
irreducible germ of an analytic set at 0. In this 
case the dimension at 0 of an analytic set A 
that defines A, is equal to the tKrull dimen- 
sion of the tlocal ring H(0)/§8. The theory of 
local rings is very important in the study of 
germs of analytic sets. The dimension of a 
general analytic set A at z° is detined by 
dim,» A =sup;dim,oA;, where A=| J; A; in a 
neighborhood of z°, with the A, irreducible 
at 2° If dim,» A; is equal to d for all į, then A 
is called purely d-dimensional at z°. The 
(global) dimension of A is defined by dim A = 
sup,., dim, A. A purely d-dimensional ana- 
lytic set is defined to be an analytic set that is 
purely d-dimensional at every one of its points. 

A point z? of an analytic set A is called 
ordinary (regular or simple) if A has the struc- 
ture of a complex analytic submanifold in a 
neighborhood of z°, The set A’ of ordinary 
points of A is dense and open in A. The set A* 
= A -A’ of singular (not ordinary) points is an 
analytic set in G. If A is purely d-dimensional, 
then A’ is a d-dimensional complex analytic 
manifold and A* is an analytic set of dimen- 
sion <q —1. 

Let A be an analytic set of dimension d in G, 
and B a purely d’-dimensional analytic set in G 
-A with q' > d. Then the closure B of B in G is 
a purely d’-dimenstonal analytic set (Remmert- 
Stein continuation theorem [17]). 

For every analytic set A in G, the tanalytic 
sheaf .%(A) of germs of holomorphic functions 
over G that vanish on A is tcoherent (H. Car- 
tan). We call (A) the sheaf of ideals defined 
by an analytic set A. Let Oç be the sheaf of 
germs of holomorphic functions over G. Then 
Gi =(0¢/.4(A))|A is a coherent sheaf of rings 
over A. 0, is called the sheaf of germs of holo- 
morphic functions on an analytic set A. 


C. Analytic Spaces 


A tringed space (X, 0,) with Hausdorff base 
space X is called an analytic space if for every 
point xeéX, there exists an Open neighborhood 
U of x such that the ringed space (U, @,| U) is 
isomorphic to a ringed space (A, ©), where A 
is an analytic set in an open set G of some C”. 
The structure sheaf Qy is then called a sheaf of 
germs of holomorphic functions. The notion of 
*holomorphic mapping from one open set in 
C” into another is generalized to the case of 
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mappings from one analytic set into another. 
An analytic set Y in an analytic space X and 
the sheaf (y of germs of holomorphic functions 
on Y are detined as in the case where X is a 
complex manifold. The ringed space (Y, 0,) is 
an analytic space and is called an analytic 
suhspace of X. For an analytic space X, the 
notions of dim, X, dim X, irreducibility, and 
pure dimensionality are detined as for an 
analytic set A c G c C”. Every analytic space X 
is the union of a locally finite family of irreduc- 
ible analytic subspaces X, called the irreduc- 
ihle components of X. 

Let ọ : X -» Y be a holomorphic mapping 
of an analytic space X into another, Y. Its 
rank at xe X is detined by r,(x) = dim, X = 
dimy @ '(@(x)). The number r,=sup,exlo(X) 
is called the rank of @, The set of degeneracy 
E, of ọ is the set of points xe X such that 
rgjx(X)<T)x: for an irreducible component 
X’ of X through x. The mapping ¢ is non- 
degenerate if E, = @. For any k €N, {xe X 
rx) < k} is an analytic set (R. Remmert[ 16]). 
In particular, E, is analytic. For a holomor- 
phic mapping ọ :X > Y, the inverse image of 
an analytic set in Y is an analytic set in X. 
However, the image of an analytic set is not 
necessarily analytic. If gis *proper, then the 
image ~(X’) of an analytic set X’ in X is an 
analytic set in Y of dimension f,)y, and is 
irreducible if X’ is irreducible (Remmert’s 
theorem [ 16]). 


D. Modifications and Resolution of 
Singularities 


Let M be a subset of an analytic space X. If, 
for every point x € X, there exists an open 
neighborhood U of x and and an analytic set 
M* in U, containing UM M such that U-M* 
is dense in U, then M is called analytically 
thin. Let o: X > Y be a holomorphic map- 
ping. Suppose that there exist two analyti- 
cally thin sets A4 c X, N c Y such that g 
induces an isomorphism between X — M and 
Y =N. Then X is called a holomorphic modifi- 
cation of Y. If furthermore ¢ is proper, then X 
is called a proper modification of Y. A mon- 
oidal transformation of an analytic space X 
with respect to a coherent sheaf of ideals .¥ is 
detined as in the case of a complex manifold or 
an algebraic variety (— 16 Algebraic Varieties 
L; 72 Complex Manifolds H). It is a proper 
modification f: X* +X such that the inverse 
image ideal sheaf f~' ¥ - Oy. is invertible, and 
is universal among all proper holomorphic 
mappings h: Z—X with the property that 
h~' -Oz is invertible, where f ~! g is the 
tinverse image of ‚f. If J is the sheaf of ideals 
detined by an analytic set Y in X, the mon- 
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oidal transformation of X with respect to f js 
often called the blowing-up of X with center Y. 

H. Hironaka [ 14] proved that, if X is an 
analytic space which is countable at infinity 
(i.e., a countable union of compact sets), then 
there is a proper modification z: X’>X with 
X’ smooth (i.e., free from singular points). Such 
a modification is called a desingularization (or 
resolution of singularities) of X. Moreover, 
over any relatively compact open set U of X, 7 
is the product of a finite sequence of blowing- 
ups 1: X;> Xj_,(Xg =X), with smooth centers 
Y; along which X‘_, is tnormally flat. This 
deep result enables one to derive properties of 
analytic spaces from those of complex 
manifolds. 

Let X and Y be two analytic spaces and G 
an analytic set in X x Y. If the canonical pro- 
jection z:G—X is a holomorphic (or proper) 
modification, then we say that a meromorphic 
mapping (a proper meromorphic mapping) jt 
of X into Y is defined. The set G is then called 
the graph of u. A holomorphic mapping g: X 
> Y can be viewed as a proper meromorphic 
mapping. Let ji: X > Y be a proper meromor- 
phic mapping. Then u(x) (the projection of 
z~! (x) into Y) is a nonempty analytic set in Y 
for every point x € X. Moreover, there exists 
an analytic set N, with X-N dense in X, such 
that u maps X-N into Y holomorphically. 
The smallest set N with this property is called 
the set of points of indeterminacy or the sin- 
gularity set of x. A meromorphic mapping f:X 
>P! (C) is called a meromorphic function on X 
if none of the irreducible components of X is 
mapped to {co} by f. The set f ~! (0) = x((X x 
{0}) NG) is called the set of zero points of X, 
and the set f~'(00) is called the set of poles. 
These are analytic sets in X. Let fiex fy be 
meromorphic functions on X. Then, by a 
suitable proper modification of X, one can 
eliminate the points of indeterminacy of the 
meromorphic mapping f: X>(P'(C))* defined 
by x>(f;(x), - (9), ie., one can modify f to 
be holomorphic. The ring of meromorphic 
functions on X is invariant under proper 
modifications of X. If X is irreducible and 
compact, then the field of meromorphic func- 
tions on X is a simple algebraic extension of 
the ‘field of rational functions of k (<dim X) 
variables. 

Let (X, 0,) be an analytic space. A point 
x € X is called normal for X if Oy „is a ‘normal 
local ring. The set of nonnormal points for X 
is an analytically thin analytic set in X (K. 
Oka). Every ordinary point of X is normal. 
We call X normal if every one of its points is 
normal. Every nondegenerate holomorphic 
mapping of an irreducible X into an irreduc- 
ible and normal Y is an Open mapping if its 
rank is equal to the dimension of Y (Remmert). 
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For every analytic space X, there exists a 
proper modification y:¥ +X with X normal 
such that v is nondegenerate and v X—y~'!(S) 
is an isomorphism, where § is the set of non- 
normal points. Such a proper modification of 
X is unique up to isomorphisms. We call X 
a normalization of X with normalizing map- 
ping v. 

Let ọ : X ~ Y be a holomorphic modifi- 
cation. Suppose that Y is normal at (x?) 
(x° eX). If the set ¢ '(p(x°)) contains an iso- 
lated point, then p~'(p(x°))=x°, and is an 
isomorphism in a neighborhood of x° (an 
analog of *Zariski’s main theorem). In partic- 
ular, if ọ : X —> Y is a holomorphic modifica- 
tion and Y is normal, then maps X —E, 
isomorphically onto the dense open set (X 
E,). Furthermore, if a holomorphic mapping 
(: X > Y is injective and X and Y are jrreduc- 
ible, normal, and n-dimensional, then @(X) 
is an open set in Y and 9 !: @(X)>X is 
holomorphic. 


E. Analytic Spaces in the Sense of Behnke and 
Stein 


Let 9: GG be a proper continuous mapping 
of a connected ‘locally compact space G onto 
a domain G c C". The triple 6 = (G,¢, G) is an 
analytic covering space over G if the following 
conditions are satisfied: (i) ~! (z?) is a finite set 
for every point z° ẹ G. (ii) There exists an ana- 
lytic set A c G of dimension <n = 1 such that 
o G=@'(A)is a local homeomorphism and 
every point of g ‘(A) has a fundamental sys- 
tem of neighborhoods U such that both U 
and U — o~! (A) # Ø are connected. As © is 
unramitied over G-A, the number of points in 
ọ ~! (z?) is constant for z? € G-A and is called 
the number of sheets of ©, A point Ze G is 
called a ramification point of © if the restric- 
tion of g to any neighborhood of Z is not a 
homeomorphism. Denote by B the set of 
ramification points of ©. Then g(B) c A is an 
analytic set of dimension n= 1. Let f be a 
continuous complex-valued function in an 
open set D in G, We call f holomorphic in D if 
for every point 7?ED B and for every open 
neighborhood V of z? = (2°) over which ¢ is 
a homeomorphism, fo g' is holomorphic in 
V. Denote by Og the sheaf of germs of holo- 
morphic functions over G, Then (G, Og) is a 
ringed space. An analytic space in the sense of 
Behnke and Stein is a Hausdorff ringed space 
(X, 0,) that is locally isomorphic to a ringed 
space of the form (G, @g) [3]. Riemann’s 
theorem on removable singularities holds for 
such spaces. Every normal analytic space is an 
analytic space in the sense of Behnke and 
Stein. An analytic covering space © =(G, ¢, G) 
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is a C-covering space (covering space in the 
sense of Cartan) if for every point z? € G there 
exist an open neighborhood V of z” and a 
holomorphic function g in U = @ ~'(V) which 
can be defined by an irreducible polynomial 
of degree k such that its coefficients are holo- 
morphic functions on V and the coefficient of 
its highest term is |, where k is the number of 
sheets of ©, A C-analytic space is an analytic 
space in the sense of Behnke and Stein that is 
locally isomorphic to a C-covering space. The 
category of C-analytic spaces coincides with 
that of normal analytic spaces. According to 
H. Grauert and Remmert [ 10], every analytic 
covering space is a C-analytic covering space. 
Therefore every analytic space in the sense of 
Behnke and Stein is a normal analytic space. 
Let R be an equivalence relation in an ana- 
lytic space X. Given a subset A of X, denote 
by R[A] the set of points of X which are R- 
equivalent to points of A. We call R proper if 
R[K] is compact for every compact set K in 
X. Let g be a proper holomorphic mapping of 
an analytic space X into another Y. For x, 
X’E X, let x =x’(R) be defined by (x) = ex’). 
The equivalence relation R is then proper. We 
consider the quotient space X/R and the 
canonical projection p: ¥ +X/R. With each 
Open set U in the quotient space X [R we can 
associate the ring of holomorphic functions in 
p '(U) that are constant on p ' (X) for every x 
€ U. This leads to a ringed space (X/R, ©,/R), 
which is proved to be an analytic space by 
Grauert’s theorem [9]: All the ‘direct images of 
a coherent analytic sheaf over X by a proper 
holomorphic mapping ~: X— Y are coherent. 
For every proper equivalence relation R in X, 
the ringed space (X/R, 0,/R) is an analytic 
space if and only if for every point¥e X/R 
there exists an open neighborhood V of ¥ such 
that functions in I'(V, @,/R) separate the 
points of V (H. Cartan). 


F. Stein Spaces 


For an analytic space (X, My) let us consider 
the following conditions: (i) Functions in 

I(X, Oy) separate the points of X. (ii) X is K- 
complete; i.e., for every point xe X there exist a 
finite number of f€ (X, Oy) G= 1, , k) such 
that the holomorphic mapping f= (f):X >C* 
is nondegenerate at x. (iii) Every compact 
analytic set in X is a finite set. Condition (1) 
implies (ii), and (ii) implies (iii). If an irreducible 
analytic space X is K-complete, then X is a 
countable union of compact sets (Grauert 

[S]). In fact, if n= dim X, there exist functions 
f,eT(X, Oy) (i= 1, ...,n)such that the holo- 
morphic mapping f =(f;): X >C” is nondegen- 
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erate. The notion of holomorphic convexity 
(— 21 Analytic Functions of Several Com- 
plex Variables H) is carried over to analytic 
spaces. For a holomorphically convex analytic 
space, conditions (i), (ii), and (iii) are equivalent 
(Grauer [8]). A Stein space (or holomorphi- 
cally complete space) is a holomorphically 
convex analytic space that satisfies one of the 
conditions (i), (ii), or (iii). In a holomorphically 
convex analytic space (X,(,), let R be the 
equivalence relation defined by I(X, 6,); i.e., 
for x, x’ € X, X=x’(R) if and only if f(x) = f(x’) 
for every fe T(X, Cy). Then R is proper. The 
analytic space (X/R, @,/R)is a Stein space. A 
Stein space is a generalization of the notion of 
a Stein manifold. Fundamental theorems A 
and B on Stein manifolds hold Verbatim for 
Stein spaces (= 21 Analytic Functions of 
Several Complex Variables L). Therefore the 
main properties of Stein manifolds are in- 
herited by Stein spaces. Let p:X + Y bea 
holomorphic mapping of an analytic space X 
into another, Y. If for every xe X all the con- 
nected components of the fibers @ '(@(x)) are 
compact, then the equivalence relation R’ 
defined by those components (i.e. for x and x’ 
in X, x = x{R') if and only if x and x’ belong to 
the same component of @ '(@(x))) is proper, 
and the ringed space (X/R’, @,/R’) is an ana- 
lytic space. In particular, if X is a holomor- 
phically convex irreducible analytic space 

and R is the equivalence relation defined by 
F(X, Cy), then all the fibers of the canonical 
projection p: X > X/R are connected. 


G. Further Topics 


The notion of analytic space can be gen- 
eralized as follows (Grauert [9]). A ringed 
space (X, Øy) is a general analytic space if it is 
locally isomorphic to a ringed space (A, #4), 
where A is an analytic set in a domain G € C”, 
and Ha = (O/A la for some coherent analytic 
subsheaf # of .¥(A) such that Supp(Cg/ J) 
(={z€G (¢/f), #0})= A. An analytic sub- 
space of (X, (',) is a ringed space (Y, Oy) where 
Y = Supp(@,/.¥) and Oy = ©,/.# for some co- 
herent sheaf of ideals of ©y. (Y, (,) is also a 
general analytic space. A. Douady [6] showed 
that for any general analytic space (X, (y) 
there exists a natural structure of a general 
analytic space on the totality of all the com- 
pact analytic subspaces of (X, @,). The result- 
ing analytic space is called the Douady space 
of (X, Oy). For the proof, the notion of a 
Banach analytic space is used, which is ob- 
tained by first defining analytic subspaces in 
an open subset of a complex ‘Banach space 
and then patching them. A Douady space can 
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be used, for example, to show that the identity 
component of the analytic automorphism 
group of a compact Kahler manifold is a 
‘complex Lie group which is naturally an 
extension of a complex torus by a tlinear 
algebraic group (A. Fujiki, D. Lieberman; = 
232 Kahler Manifolds C). 
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24 (Xx1.1) 
Ancient Mathematics 


A. General Remarks 


To determine the beginning of the history of 
mathematics, one must detine the term “math- 
ematics.” Only speculation based on the ob- 
servation of primitive peoples today can be 
made regarding the development of the number 
concept among prehistoric peoples. The pre- 
historic period ended in Egypt and Meso- 
potamia c. 3000 B.c., and a little later in the 
valleys of the large rivers in India and China. 

Since the basis of the civilizations in the 
river valleys of the ancient world was agricul- 
ture, the administrators first had to control 
watering systems through irrigation, drainage, 
pumping, and canalization; second, they had 
to measure land and harvests for tax collec- 
tion; and third, they had to establish a calen- 
dar by observation of the heavenly bodies. 

All these tasks demanded some knowledge 
of mathematics. 

Additional knowledge of mathematics was 
certainly needed for construction of the pal- 
aces and tombs. We know something of the 
development of mathematical knowledge 
during these ages from some recovered arti- 
facts, but there remains the possibility of new 
finds that will bring about a basic change in 
our knowledge of the history of mathematics 
during this period. 


B. Mathematics in Egypt 


The main sources for our understanding of the 
history of mathematics in Egypt are the Mos- 
cow papyrus and the more important Rhind 
papyrus, both discovered in the 19th Century. 
The Greeks place the origin of their mathe- 
matics in Egypt, but it seems that Egyptian 
mathematics was limited to practical mathe- 
matics. The Egyptians had a decimal numera- 
tion system, but the place value was not clear; 
they used fractions, which they always decom- 
posed into the sums of unit fractions (i.e., frac- 
tions with 1 as numerator); they solved the 
problems of everyday arithmetic that were 
reducible to linear equations; they computed 
approximate areas and volumes of some fig- 
ures for the purpose of measurement of farm- 
land or granaries and for construction work; 
they had exact formulas for the computation 
of areas of triangles and of trapezoids; and 
they used (16/9)? =3.1605 as the value of 
T, but no trace has been found to prove the 
existence of demonstrative mathematics in 
ancient Egypt. 
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C. Mathematics in Mesopotamia 


Sources abound for the study of Mesopota- 
mian mathematics, and these sources may 
very well increase in the future. The Meso- 
potamians kept exact records of astronomical 
observations for long periods of time. Their 
more advanced mathematics was not limited 
to practical use, as was that of the Egyptians. 
They used a sexagesimal system of numeration 
with place value, and also used sexagesimal 
fractions; however, they lacked a cipher to 
denote zero until the 4th Century B.C., and they 
did not have a symbol corresponding to our 
decimal point, so that exact place value had to 
be determined from the context of each ex- 
pression. They had a multiplication table and 
tables of inverses, squares, and cubes of num- 
bers, and they used these tables to solve equa- 
tions, even some simple equations of the third 
degree, as well as simultaneous equations of 
the second degree for two unknowns. They 
had accurate solutions for quadratic equations 
(expressed in words); they discarded negative 
roots, but they admitted both positive roots 
when two existed. They studied integral solu- 
tions of a? +b? = c? (the largest of their solu- 
tions were 12,709, 13,500, and 18,541) and 
approximate computation of quadratic roots. 
which suggests some relation to Greek mathe- 
matics. We have evidence that some of the 
geometric algebra in Euclid’s Elements can be 
traced to Mesopotamian algebra. Some his- 
torians also affirm that the concept of dem- 
onstration in Greek mathematics originated 
with the Mesopotamians, but this theory lacks 
suffcient proof. 

By the 7th Century, the Mayas in Central 
America also possessed a numeration system, 
with the base 20. As far as we know, the Meso- 
potamians and the Mayas were the earliest 
people to possess numeration systems with 
place value (- 57 Chinese Mathematics, 187 
Greek Mathematics, 209 Indian Mathematics). 
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25 (XX.35) 
Approximation Methods in 
Physics 


A. Introduction 


It is often possible to solve a differential equa- 
tion of mathematical physics analytically by 
expanding the solution in a series with respect 
to a small (or large) parameter involved in the 
equation. In general, such an approach is 
called a perturbation method (- Section B). 
For an equation which is difficult to treat 
analytically, it is possible to get a numerical 
solution either by replacing the derivatives by 
difference quotients (*difference method), or by 
expanding the solution in terms of suitable 
functions and determining the expansion coef- 
ficients numerically from the equation re- 
written in a weak form (‘variational method, 
*method of weighted residuals, *Galerkin’s 
method, ‘finite-element method, etc.). For such 
numerical methods — 46 Calculus of Vari- 
ations, 301 Numerical Solution of Algebraic 
Equations, 302 Numerical Solution of Linear 
Equations, 303 Numerical Solution of Ordi- 
nary Differential Equations, 304 Numerical 
Solution of Partial Differential Equations, 441 
Variational Principles. 

In Section C we describe a method for deriv- 
ing an asymptotic expression for a function 
written as an integral in a complex plane. 


B. Perturbation Method 


(1) Regular Perturbation. (a) Initial-value prob- 
lem. Consider an initial-value problem of an 
ordinary differential equation involving a 
small parameter €; 


du 
Te (1) 


u=b(e) at t=0. (2) 


We assume that f is sufficiently smooth as a 
function of t and is regular as a function of u 
and £, and that b is a regular function of €. 

Substituting the power series in e for f, u, 
and b, 


f(t, we) = ft u) +f Ct, u) +, 
u(t; s)=uOXt) + eu (1) + . ..) 
ble) = BO + b+, 
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into equations (1) and (2) and equating terms 
of equal power, we get 


(0) 
du’ FNU, KA= 
dt i ei > 
du” 0 0 1 1 0 
i =f! (t, u! ul 4f! Xt, u! , u0) =b, 
(2) 
du =f uu?) 44 fO yOu? 
dt seem Dero 


+ fOr, uu + Ve, uv), 
u'?(0) = b”, 


the solutions of which determine the perturba- 

tion series for y, The equation for y‘® (the 

unperturbed equation) is nonlinear in general, 

but those for u®, u, are inhomogeneous 

linear equations with identical principal parts. 
The system of differential equations 


du; : 
q Fills May eoume) (i=1,2,...,0), 
u; = b;(e) at t=0, 


or a differential equation of higher order, 


d"u d'u 
t= i(1u Gate) 


d 
U= DOME) ye, et = Be) at r=0, 


can be treated in a similar way. 

(b) Boundary-value problem. Given a linear 
differential operator H = H) +eV involving 
a small parameter ¢ linearly (Hp is a linear 
differential operator and Vis a function of 


(X,,X2,+++5X,)), consider the boundary-value 
problem 

(Hy +eV)u=f(x1,---.X_) (Xis XEN, (3) 
Bu=0 (x), --., X EQ, (4) 


where B is a linear and homogeneous 
operator. 
Substitution of the Taylor expansion 


U=Upteu, + 

into equations (3) and (4) gives 
Ajup=f, Bu, =9, 
Hyu,t+Vug=0, Bu, =0, 


of which the solutions are 
Uy=Kf, u,=—KVKf, 
u, =KVKVKf,.... 


Here K = H l is an integral operator such that, 
for any function p= v(x; , X2, , X,), we have 
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G(x sere Xal Gista 
Ea [Čist Sn) 


X v(i 15 Sade, dén 


where G(x4, 5 Xal Či 3 Ča) is the Green’s 
function for the unperturbed problem. The 
perturbation series 


u=Kf—eKVKf +e’ KVKVKf—... 

converges if VK is bounded and |e] < 1/| VKI]. 
(c) Eigenvalue problem. The solution of an 

eigenvalue problem 

(Hy +eV)u=du (5) 


with the boundary condition Bu = 0 for the 
operators considered in (b) can be obtained in 
power series in ¢: 


L= åo tea e724 _.., 
u=uo teu +e uP + ..., 

where uy and A, are the eigenfunction and 
nondegenerate eigenvalue, respectively, of the 
unperturbed operator Hy, which is assumed to 


be self-adjoint. Substitution of the series into 
(5) gives 

(Hy — Agu = — Vig +2 up, (6) 
(Hy — Aou P= Vu) AP ug FAD YD 

(HS iou = m Vu + AP ug + APh 


$Y), 


If we normalize up and ų by the conditions 
(Uo, Uo) = 1 and (u, uy) = 1, we get from equation 
(6) 


AY) = (Vug, Ug) =o. 


Let § be the inverse of Hy — åp 1 in the sub- 
space perpendicular to to (we put Su, = 0), 
then each term of the perturbation series for u 
and 2 is obtained in the following way from 
the system of the equations given above: 


u= —$Vio, 

A?) = = (VS vu,, Up), 

u?) = S(V—a)SVuo, 

A =(VS(V —a)SVug, Uo), 

u?) = — §(V—a)S(V—a)SVug 
+(VSVuo, Up) S?Vup, 


(2) Singular Perturbation. Formal application 
of the procedure described in (1) often fails 
when the term including the highest-order 
derivative is multiplied by a small parameter 
(or a lower-order term is multiplied by a large 
parameter), i.e., when we deal with singular 


perturbation problems. Typical methods for 
treating such cases are described below. 

(a) WKB method. The method for getting an 
asymptotic solution of the second-order dif- 
ferential equation 
dzu 
Ta" k? P(x)u=0 (7) 
for large values of k was developed for prob- 
lems arising in classical wave motion (Jeffreys, 
1924 [1]) and for problems of quantum 
mechanics (Wentzel, Kramers, Brillouin, 

1926), and is called the Jeffreys method, WKB 
method, or WKBJ method. 

If the function P in (7) does not vary rapidly, 
the solution is expected to have the form 


uocexp{ike(x)}, 


with @(x) not very different from a linear func- 
tion of x, since the exact solution for P = P, = 
const is exp( + ik / Py x). 

Substitution of this expression for 4 in equa- 
tion (7) gives 


ikp” — k? o? +k? P =0. 
Neglecting the first term, we get 


X 
o? = P, 1.€., p(x) = | P 1/2 dx. 


This result suggests a transformation of the 
form 


x 
w= Pity, =| P? dx, 


which gives 


d? w 
az tik’ —Q)w=0, (8) 
where 
d? 2 
Q= Pts (P= p E p=) 


Equation (8) can be transformed further into a 
Volterra’s integral equation of the second 


kind: 
w(z; k)= Ae + Be ™ 


+f sin{k(z—{)} Q(0) yd. 


Therefore, if we expand w as 


1 
w(z; K)=WolZ) 7 wil) s (9) 
we have 
Wo(z) = Ae™ + Be”, 


nae | *sin{kle—Q}O(C)wy-s al 


Zo 
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If P is positive for the interval zọ SZ <z; in 
question and has continuous derivatives up to 
second order, the foregoing series converges to 
the solution of (8). 

On the other hand, when k takes complex 
values, the series (9) does not converge in 
general. The series expansion appropriate for 
this case is 


w(z;k)=exp{4(z;k)}, p= bike 
and we get the asymptotic expansion for w: 


w-exp + ikz+— x | odt Q(z) 


7 2 
sp 20- | Q ac 


{Q"(2)—2027} + | 


1 
Tlk 
Now, in many cases, P takes both positive 
and negative values within the intervalin 
question, €.g., in the problem of transonic flow 
of a gas, in the problem of determining the 
reflectivity and transmissivity of a wave propa- 
gating in a nonuniform medium, or in the 
quantum-mechanical problem of a material 
wave passing through a potential wall. The 
point at which P vanishes is called a turning 
point. In this case there arises the problem of 
analytic continuation of the solution obtained 
in one side into the other through the turning 
point. For the special case when Pac x" (the 
turning point is chosen as x = 0), it can be 
shown that the exact solution of equation (7) is 
explicitly given by 


woc,/z Z,(kz), 


where v = 1/(n + 2), and Z, is a cylindrical 

function of order v. Based on this fact, the 
continuation formula for the case when P 
is approximated by x” near x = 0 has been 
obtained. 

(b) Lighthill’s method. With nonlinear equa- 
tions, formal application of a perturbation 
procedure sometimes leads to the difficulty 
that the solution of the unperturbed equation 
has a singularity not exhibited by the exact 
solution of the original equation and whose 
order increases in higher approximations, so 
that the perturbation series ceases to have any 
meaning near this point. Such a difficulty can 
often be overcome by so-called coordinate 
straining. This method has been applied to a 
number of fluid-dynamical problems, such 
as those arising in theories of aircraft wings, 
boundary layers, and shock waves. 

Before describing the method, it is conve- 
nient to sketch Poincaré% method of getting 
the perturbation series for the period of a 
nonlinear oscillation. In order to solve the 
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nonlinear equation 


du 
—= f(u; To, £), 


dt 

where Tj is the period of oscillation when the 
small parameter ¢ is taken to be zero, Poincaré 
wrote series expansions for both y and the 
period T: 


u=uo(t)t+eu,(t)+.... T=Tot+eT, +..., 


and determined T,, T}, . . . in such a manner as 
to avoid resonance at each step in the deter- 
mination of u,(t), u,(t), . . 

Based on this idea, Lighthill(1949) [2] 
developed coordinate straining in order to 
overcome the difficulty stated above. We illus- 
trate the method here by means of the follow- 
ing simple example. 

Consider a boundary-value problem of a 
nonlinear differential equation 


(teu) +u=0 (O<x< 1): (10) 


u(l)= 1. (11) 


Application of the usual perturbation method 
would give the series solution 


Dat 1 1 an | 1 y 
Saat Ne Se 
x Ox otf 2x x’ 


As is seen, however, the order of an apparent 
singularity occurring at x = 0 increases as we 
proceed to higher approximations so that it is 
not possible to see the true behavior of the 
solution near x = 0. 

In Lighthill’s method, we assume that series 
expansion is valid not only for ų but also for 
the independent variable x, 1.€., 


u=uo()+ eu (E) +... 


Substitution of these series into equations (10) 
and (11) gives 


x=¢+ex (č)+.... 


uo +tuo=0, ug(1)=1, 


cu +u, = — (uo Xi + UX, + uguo), 


u,(1)=0, x,(1)=0, 


From the first equation we have ug = 1/€, 
Before solving the second equation for u} , we 
determine x; (Č) so as to avoid the increase 
of the order of singularity in u,{é) at č =O. 
This requirement is satislied by taking x, = 
(1/2) (€ 1/8), and this gives u, = 0; thus, at 
this stage of approximation, we have 


1 
u=uo(s) + ous ()=s, 


smë O= ttie) 


122 


The guiding principle is the same for higher 
approximations. (The foregoing expression 
happens to be the exact solution, but this a 
fortuitous result of our choice of this particular 
example.) 

The convergence of the series obtained by 
Lighthill’s method has been proved by Wasow 
(1955) [3] for the more general problem 


(x+ a +q(x)u=r(x), 


where e, a, and þ are positive constants and 
q(x) and r(x) are functions regular in |x| <a. 

Lighthill’s method can also be applied to 
partial differential equations. Sometimes it is 
called the PLK method (Poincaré-Lighthill-Kuo 
method), after its successful application to the 
problem of the boundary layer of a thin flat 
plate by Kuo. 

(c) Method of matched asymptotic expan- 
sions. In some cases it is possible to get a uni- 
formly valid series solution if we divide the 
domain into two or more subdomains, solve 
the equation by use of a suitable independent 
variable for each subdomain, and then deter- 
mine the coefficients in each solution by the 
process of matching neighboring solutions 
on their common boundary. Such a perturba- 
tion technique was developed in treating the 
boundary-layer equation in fluid dynamics 
and has been systematized into the method 
of matched asymptotic expansions. 

The idea of the method can be shown by 
the following simple example. Consider the 
boundary-value problem 


(0<x<I), 


u0) = 0, u(1)=1, (12) 


where ¢ is a small positive parameter and a is a 
constant such that 0 <a < 1. In order to solve 
this problem, we first take x itself to be an 
independent variable (the outer variable) in the 
domain x >> ¢ and expand the solution as 


u= —Ug(x) +eU,(x)+ 


Because the highest derivative is multiplied by 
a small parameter g in equation (12), the per- 
turbation equations to be solved are all of first 
order, of which the first one is 


dUy _ 


s4 UplI)=L. 


Solving this, we get an approximation for the 
outer solution: 
U,(x)= 1 —a(1—x). 


On the other hand, we introduce a new 
independent variable ¢ (the inner variable) by 
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putting x = e¢ for x < e. This transforms equa- 
tion (12) into 
d?v dv 
— += = 64, 
dč? dg 
If we assume the expansion 


v= WE) + EV (E+... 


we have the equation for Vp: 
d° V, dV, 
an tap =O, 
dee de 
This gives an approximation for the inner 
solution: 


Vo(Q)=c(1—e*). 

Finally, in order to match the inner and the 
outer solutions, we equate their values at some 
point within the interval ¿<< x << 1 (E00 for 
the inner solution, x0 for the outer solution): 


Vo(20)= Up(0)= 1 -a, 


V,(0)=0. 


from which the coefficient in Vg is determined 
to be c = 1 -a. Therefore we have an approxi- 
mate solution valid in the whole interval 
O<x<l: 


u= U(x) + Vo(d)—(l -a) 
=ax+(1—a)(l-e *"). 


C. Method of Steepest Descent 


If an analytic function f(z) of a complex vari- 
able z is expressed in terms of an analytic 
function g(t) by an integral 


ra= exp{zg(t)} dt, (13) 


where C is a curve on the complex t-plane, 
then an asymptotic expression for f(z) for 
large values of |z| can be derived by the 

method of steepest descent (the saddle-point 
method). The idea of the method may be 
traced back to Riemann, and various asymp- 
totic expressions for cylindrical functions were 
obtained by Debye (1909) [5]. 

The point at which the first derivative of g(t) 
in the integral (13) vanishes is called a saddle 
point. The function g(t) is expanded in a power 
series near this point as 


1 
HE)=G(to) + 59'(to) (t—to)? +.. 


We have an inequality 
zg"(to)(t— fo)? <0 


along the line 


T 
L: arg(t—to)=~—~arg{zg"(to)}. (14) 
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Therefore the absolute value of the integrand 
exp{zg(t)} reaches a maximum at tọ on the 
line L and decreases along it more rapidly 
than along any other direction. Hence if we 
deform the integration path in such a way 
that it passes through the saddle point tọ in 
the direction of steepest descent, the value of 
the integrand is practically zero on the new 
path except very near to tọ when |z] is large, 
whereas the value of the integral remains the 
same. Therefore we can get an asymptotic 
expression of f(z) for large |z| by truncating 
the Taylor series for g(t) up to the second term 
and taking the line L as the integration path: 


1 
fleyexp{calt)} | exp {za talt—ta)} a 


20 
= E E exp{zg(to)}- 


As an example, Stirling’s formula 
nl=./2an n'e" 


for a large positive integer is derived if we start 
with the formula 


n!= | e`t"dt= | e~ "(ns)" d(ns) 


10) 
ve) 
=p"! | exp{n(logs—s)} ds 
0 


and make use of the method of steepest descent. 
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26 (Xx1.4) 
Arab Mathematics 


The role of the Arabs in cultural history has 
been partly that of cultural transmitter. Between 
the 7th and 13th centuries, they established a 
religious empire that extended from India to 
Spain; later it was divided into the eastern and 
western empires. The caliphs of these empires 
encouraged research in the sciences, so the 
capitals Baghdad and Cordova became centers 
of culture where scholars from different coun- 
tries gathered. 

Arabic scholarship is sometimes called the 
stepfather of European culture. During the 
13th Century, Alphonso X (1252— 1284) invited 
Islamic and Hebrew scholars to the Spanish 
court to translate their writings on algebra, 
medicine, and astronomy into Spanish. This 
accomplishment earned him the title of Al- 
phonso the Wise. 

The first contact between Greek and Indian 
mathematics took place in Baghdad under 
Caliph Al-Mansur (754-775); Euclid’s Ele- 
ments was introduced by way of the Byzantine 
Empire, while Brahmagupta’s Brahmasphuta- 
siddhanta came directly from India. Many 
mathematical texts found in the Eastern Ro- 
man Empire and Syria, including some Greek 
works, were translated into Arabic. Though it 
is difficult to discern essential scientific ad- 
vances in Arabian works, the diffusion of 
these translations was instrumental in the 
development of European mathematics. 

The Arabs did not use written numerals 
until Mohammed’s time (570-632). Signs 
representing numbers had been introduced 
into Arabia when its influence encompassed 
Egypt and Greece. Indian numerals were 
imported with Brahmagupta’s book and 
became our present Arabic numerals after a 
series of modifications. 

Among all the branches of Arab mathe- 
matics, algebra was the most advanced. It 
started with Alkwarizmi’s (820) Al Gebr W’al 
Muquabala, the origin of the word “algebra.” 
It was the first mathematical book written in 
Arabic. Its content was essentially a variety of 
methods of solving algebraic equations. Al 
gebr means “transposition of negative terms 
on one side of the equation to the other side 
and changing their signs.” and al muqua- 
bala means “simplification of the equation by 
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gathering similar terms.” For example, qua- 
dratic equations can be brought by these 
methods into one of the following three types: 
x? = px+q; x? +q=px; x? + px =q, where p, q 
are positive numbers. The Arabs expressed the 
rule of solving these equations verbally. They 
apparently knew that quadratic equations 
have two roots, but they adopted only posi- 
tive roots; when the equation had two posi- 
tive roots they adopted the smaller root. The 
proof was given geometrically. It is possible 
that they learned geometric proofs from the 
Greeks. 

For the Arabs, geometry was secondary to 
algebra. They did not appreciate proof as seen 
in Euclid’s Elements. The book on conic sec- 
tions by Apollonius was also translated into 
Arabic, but no essential progress was made in 
this area, The only remarkable contribution 
was that of Omar Khayyam, author of The 
Rubdiyat, who applied conic sections to the 
solution of the cubic equation x? + bx = a. 

In trigonometry, Al Battani (c. 858-929) left 
a notable contribution. He studied The Almag- 
est, the Arabic translation of Ptolemy’s astro- 
nomical work. He added nothing outstand- 
ing to plane trigonometry, but obtained such 
formulas as cos a= cos b cosc + sin b sinc œs a 
for spherical triangles, which were not men- 
tioned in The Almagest. 
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27 (V.18) 
Arithmetic of Associative 
Algebras 


A. General Theory 


Let g be a Dedekind domain G.e., an integral 
domain in which every ideal is uniquely de- 
composed into a product of prime ideals), let F 
be the field of quotients of g, and let A be a 
‘separable algebra of finite degree over F. A g- 
lattice a of A is a g-submodule of A that is 
linitely generated over g and satisfies A = Fa. If 
a subring o of A is a g-lattice containing g, 

then o is called an order. A maximal order is 

an order that is not contained in any other 
order. A maximal order always exists although 
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it may not be unique; in particular, if A is 
commutative, it has only one maximal order. 
Ifweput o,={xeA|xaca} (0,={xEAlaxc 
a}) for a g-lattice a of A, then 0, (0,) is an 

order of A and is called the left (right) order of 
a. If the left order of a is maximal, then so is 

the right order; the converse is also true. If 0, 
and p, are maximal, we cal] a a normai g- 
lattice. To describe the same situation we say 
that a is a left o -ideal or a right 0,-ideal. If 

0; = 9, = 0, then we say that a is a two-sided p- 
ideal. A lattice a with a Co, (or equivalently 
with a C ọ,) is called an integral g-lattice or 
integral left (right) ideal. The product ab of 
two normal g-lattices a, b is called a proper 
product if the right order of a coincides with 
the left order of b. The proper product defines 
the structure of a tgroupoid on the set of all 
normal g-lattices of A. In particular, the in- 
verse a 7! of a normal g-lattice a satisfying 
aa~'=0,,a 'a=0, is given bya `` ={xeA]|xa 
co,}={xeEA axcor}. Fora fixed maximal 
order o, a maximal integral two-sided o-ideal p 
different from o is called a prime ideal of o. If p 
is prime, then 0/p is the matrix algebra of 
degree « over a division algebra, and xk is 
called the capacity of the prime ideal p. The set 
of al] two-sided o-ideals forms a multiplicative 
group, of which prime ideals are independent 
generators. 


B. Maximal Orders of a Simple Ring 


In the rest of this article, A is a tsimple ring, F 
is the tcenter of A, and o is a maximal order of 
A. The prime ideals q of o and the prime ideals 
p of g are in one-to-one correspondence by 
the relation qN g= P; 0/q is a simple algebra 
over g/p, and q°= po for some natural num- 
ber e. The different d of o is defined by d`! = 
{x€A| Tr(xo)<g}. (Tr is the treduced trace 
from A to F; Ð is an integral two-sided o-ideal, 
and is divisible by q¢~! if qf = po.) For q to 
divide Ð, it is necessary and sufficient that 
either e > 1 or o/q not be separable over q/p. 
In particular, if A is a ttotal matrix algebra 
over F, then d= 0, q = po. The ideal of g gener- 
ated by the treduced norms (to F) of the ele- 
ments in a normal g-lattice a of A is denoted by 
N,,,(a). If ab is a proper product, then we have 
Ngjp(ab) = Na4jp(a)N4p(b), where N,/-(D) does 
not depend on the choice of 0; this is called 

the discriminant of A. If [A : F] = n? and q°= 
po, then Narla) = p”, ef =n. 


C. Simple Rings over a Local Field 


Let F be a field that is complete with respect 
to a ‘discrete valuation whose field of residue 
classes is finite, Let g be the tvaluation ring of 
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F, p be the maximal ideal of g, and A be a 
simple algebra with F as its center. If A is a 
‘division algebra and /A : F] = n’, then A has 
only one maximal order o, and o has only one 
prime ideal q such that q"= po; o/q is an exten- 
sion of degree n of g/p. If A is not necessarily a 
division algebra, the relation €9€~! = o° with 
an element ë of A holds between two maximal 
orders o and o’ of A. Furthermore, for any left 
(right) o-ideal a, there exists an element & such 
that a = og (a = a0). By using the notation of 
teyclic algebra, we Can express A in the form 
(K, 9 n”). Here K, is the unramified extension 
of degree n over F, g is the tFrobenius substi- 
tution of K,/F, 7 is a prime element of F, and 
0 <y <n. The element of Q/Z determined by 
r/n (mod Z) is denoted by {A} and is called the 
tHasse invariant of the algebra class contain- 
ing A. The mapping A -{A} gives an isomor- 
phism of the *Brauer group of F onto the 
additive group Q/Z. If M is an extension of F 
of finite degree, then {A”*} = [M: F] {A} holds 
for the algebra A™ obtained from A by scalar 
extension (- 29 Associative Algebras). 


D. Simple Rings over an Algebraic Number 
Field 


Let F be an algebraic number field of finite 
degree, and let A be a simple algebra with 
center F. Then A is a cyclic algebra and is 
isomorphic to a total matrix algebra Over a 
division algebra D, The order n of the algebra 
class of A over F is determined by n? =[D: F] 
(H. Hasse, R. Brauer, and E. Noether). 

Denote by F, the completion of F with 
respect to a tprime divisor p of F, and let A, be 
the algebra obtained from A by the scalar 
extension F, over F. For a finite prime divisor 
p, the meaning of {A,,} is as before; for an 
infinite prime divisor p, put {A,} = 0 or 1/2 
(mod Z) according as A, is a total matrix alge- 
bra over F, or not. Furthermore, define the 
subgroup J, of Q/Z by 


J, =Q/Z, p a finite prime 


divisor, 


p areal infinite 
prime divisor, 


= {0, 1/2 (mod Z)} 


= {0}, 


p a complex infinite 
prime divisor. 


Now let J be the subgroup of the direct prod- 
uct II, J, consisting of all elements of the 

form (a,) (#,€J,) such that æ, = 0 except for a 
finite number of prime divisors and $, &, = 0. 
Then A+/( f A,}) gives rise to an isomorphism 
of the Brauer group over F onto J (Hasse). 
Each {A,/ is called the tp-invariant of A. In 
particular, A is a total matrix algebra over F if 
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and only if A, is a total matrix algebra over 

F, for all p (- 29 Associative Algebras). These 
theorems are closely related to fclass field 
theory. 

If o is a maximal order of A and a, b are left 
o-ideals, then aé = b with an element € of A 
defines an equivalence relation between a and 
b. The number of equivalence classes of left 
ideals with respect to this equivalence is called 
the class number of A; it is independent of the 
choice of o and is equal to the class number 
defined by using right ideals. The product of 
all real infinite prime divisors p with {A,} = 1/2 
is denoted by P,,. If P, is the product of all 
inlinite prime divisors and [A : F] = 4, then A is 
called a totally definite quaternion algebra. 

If o is a maximal order of A and A is not a 
totally detinite quaternion algebra, then a> 
N,,,(a) gives a one-to-one correspondence 
between the classes of left o-ideals and the 
congruence classes of ideals of F modulo P,, 
(- 14 Algebraic Number Fields H) (Eichler’s 
theorem). 

In particular, if A is a total matrix algebra 
over F, then the class number of A is equal to 
the class number of F. The class number of a 
totally detinite quaternion algebra was deter- 
mined by M. Eichler by using the zeta function 
(- Section F) of A [4]. 

Let o be a maximal order of A, a be an 
integral two-sided o-ideal, b be an integer of F, 
and č be an element of o. Furthermore, assume 
b = 1 (mod P,,), Na;r(&)= b (mod a N F) (tmulti- 
plicative congruence). Then there exists an 
element f of o such that Nyjp(f) = b, B=E 
(mod a)(N4;r is the reduced norm), provided 
that A is not a totally definite quaternion 
algebra [5]. This theorem, which is called 
Eichler’s approximation theorem, is widely 
applicable; e.g., it yields the previous theorem 
on the class number and can be generalized to 
the case of semisimple talgebraic groups (— 13 
Algebraic Groups). 


E. Algebras over a Function Field 


The Hasse-Brauer-Noether and Hasse theo- 
rems also hold for normal simple algebras 
over a ‘field of algebraic functions of one vari- 
able over a finite field. On the other hand, a 
normal simple algebra over a tield K of alge- 
braic functions of one variable over an alge- 
braically closed field is a total matrix algebra 
over K (Tsen’s theorem). 


F. Other Notions 


Adeles and ideles for a simple algebra A over 
an algebraic number field of finite degree can 
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be introduced as in the case of number fields 
(— 6 Adeles and Ideles). If N(a) stands for the 
number of elements in o/a, where o is a maxi- 
mal order of A and a is an integral left o-ideal, 
then the zeta function of the simple algebra A, 
called the Hey zeta function, is detined by ¢ ,(s) 
=}, N(a) * (the sum over all integral left o- 
ideals). This function has properties similar to 
those of Dedekind zeta functions (- 450 Zeta 
Functions L). Let p be an infinite prime divisor 
of the center F of A, and let G, be the group of 
elements in A, with the reduced norm 1. Put 
G = J], G, (the product over all infinite prime 
divisors of F). Then the group I of units with 
the reduced norm 1 in o is naturally regarded 
as a subgroup of G. To be more precise, [is a 
discrete subgroup of G, the volume of G/T is 
finite with respect to an invariant measure, 
and G/T is compact if and only if A is a divi- 
sion algebra (— 122 Discontinuous Groups). 
This result can be viewed as a special case of 
more general facts about semisimple algebraic 
groups (- 13 Algebraic Groups). If K is a 
tmaximal compact subgroup of G, then T gives 
rise to a ‘discontinuous group operating on 
the homogeneous space G/K, and we obtain 
tautomorphic forms with respect to I, The 
case where A is a tquaternion algebra has 
been studied extensively (- 32 Automorphic 
Functions). 
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28 (XXI.1 3) 
Artin, Emil 


Emil Artin (March 3, 1898—December 20, 
1962) was born in Vienna. After he studied at 
the Universities of Vienna, Leipzig, and Göt- 
tingen, he taught at the University of Ham- 
burg from 1923 to 1937. In 1937 he left Ger- 
many under the Nazi regime for America, 
where he taught at Notre Dame, Indiana, and 
Princeton universities. He returned to Ger- 
many in 1948 and taught again at the Univer- 
sity of Hamburg until his fatal heart attack. 
In his thesis (1923), he proved affirmatively 
the Riemann hypothesis in function fields 
over finite constant fields in the hyperelliptic 
case, and conjectured that this should be valid 
in general. This was eventually verified by A. 
Weil in 1941. Also in 1923, he introduced the 
*L-function for Galois extension and was led 
to the tgeneral law of reciprocity, which he 
stated as a conjecture and proved four years 
later, thus bringing tclass field theory to com- 
pletion. Around the same period, he estab- 
lished in collaboration with 0. Schreier the 
theory of tformally real fields and solved the 
17th tproblem of Hilbert. He also initiated the 
theory of tbraids, which he later developed 
with F. Bohnenblust in the 1940s. In view of 
these and other ingenious works as well as 
his inspiring teaching in all areas of mathe- 
matics, Artin is considered to have been one of 
the most influential mathematicians of this 
Century. 
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A. Fundamental Concepts 


Let K be a commutative ring with unity ele- 
ment 1 (- 368 Rings A), and let A be a ring 
which is a tunitary K-module (- 277 Mod- 
ules). Such a ring A is called an associative 
algebra over K (or simply algebra over K) if 

it satisfies the condition j(ab) = (ja)b = a(Ab) 
(2e K; a, be A), An (associative) algebra A over 
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K is often written A/K, and K is called the 
coefficient ring (or ground ring) of the algebra 
A= AJK. In particular, if K is a tfield, then it is 
called the coefficient field (or ground field) of A. 
Algebras over fields have been studied in de- 
tail. Notions such as zero algebra, unitary 
algebra, commutative algebra, (semi) simple 
algebra, and division algebra are replicas of the 
respective ones for rings (- 368 Rings). Con- 
sidering both structures as rings and as K- 
modules, homomorphisms and isomorphisms 
are defined in a natural manner, and are called 
algebra homomorphism and algebra jsomor- 
phism, respectively. In this connection, sub- 
algebra, quotient algebra (or residue class 
algebra), and direct product of algebras are 
also defined as in the case of rings. 

An tideal of an algebra A is defined as an 
ideal of the ring A, which is at the same time a 
submodule of the module A over the coeffi- 
cient ring. The fradical of an algebra A con- 
sidered as a ring is then an ideal of A in this 
sense. In fact, the existence of a unity in an 
algebra A implies that any ideal of A consid- 
ered as a ring is necessarily an ideal of the 
algebra A. 

In the rest of this article, we assume that all 
rings have a unity element and that all homo- 
morphisms are unitary. Hence, when we con- 
sider subalgebras of an algebra A, we require 
that they share the unity element with A. If e is 
the unity element of an algebra A over K, then 
the mapping A> Je = j' (AE K) is a homomor- 
phism K - A whose image Ke is contained in 
the tcenter of A, and the scalar multiplication 
Aa is equal to the ring multiplication 4'a (Ae€ K, 
AE A). Conversely, given a homomorphism of 
K into aring A whose image is contained in 
the center of A, we can regard A as an algebra 
over K in an obvious way. Hence we are given 
an algebra A over K if and only if there exists 
a pair (A, p) of a ring A and a homomorphism 
p : K 4A whose image is contained in the 
center of A. There exists a uniquely determined 
(unitary) homomorphism of the ring Z of 
rational integers into any ring; hence any ring 
can be regarded as an algebra Over Z. If the 
coefficient ring K of a nonzero algebra A is a 
field, then K can be regarded as a subfield 
contained in the center of A, and the unity 
element | of K coincides with the unity ele- 
ment of A. 

Let A and B be algebras over K. Then the 
ftensor product A ®, B of K-modules is an 
algebra over K under the multiplication 
(a @b)(a'@b’) =aa' @ bb’ (a,a' €A, b, b’e B). 
This algebra is called the tensor product of 
algebras A and B. Moreover, the mapping 
aa Q 1 (resp. b> 1 @ b) (ae A, be B) gives an 
algebra homomorphism A-/ A @ x B (resp. B> 
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A @ x B, which is called the canonical homo- 
morphism. In particular, if A is a commutative 
algebra, then A ®, B can be regarded as an 
algebra over A by this canonical homomor- 
phism; and in this case A &g B is called the 
algebra obtained by extension of the coeffi- 
cient ring of B to A (or a scalar extension of B 
by A), and is often denoted by B4. The algebra 
K ®x B is canonically isomorphic to B. Fur- 
thermore, (A ®, B) x C and A Q(B Qg C) 
are canonically isomorphic and are written 

A ®B@®C. Let A and B be commutative 
algebras over K. Then, for any commutative 
algebra C and homomorphisms a : A+ C, ß: B 
>C, there exists one and only one homomor- 
phism y: A @, BC such that a(a)=y(a Q 1), 
B(b) = yC1 ® b) (ae A, b e B). This property char- 
acterizes the tensor product A @, B of com- 
mutative algebras A and B. In this sense, 

A @x B is sometimes called the coproduct of A 
and B (- 52 Categories and Functors E). 


B. Examples of Associative Algebras 


As we mentioned, any ring can be regarded as 
an algebra over the ring Z of rational integers. 
But it is often useful to deal with algebras over 
“large? or more “efficient” coefficient rings. 
For instance, we have many rings which are 
algebras Over a commutative ring K, such as 
the fring of polynomials, the tring of formal 
power series in n variables with coefficients in 
K, the tendomorphism ring of a K-module, 
and the tfull matrix ring of degree n over K 

(- 368 Rings C). There are other important 
classes of algebras, such as (semi) group alge- 
bras, Hecke algebras, and crossed-product 
algebras, which wil] be explained later., These 
algebras are defined by a canonical basis con- 
nected directly with a (semi) group structure. 
On the other hand, the ttensor algebra and 
the texterior algebra of linear spaces and the 
Clifford algebra associated with a given 
quadratic form are also important (- 61 
Clifford Algebras, 256 Linear Spaces). 

The most frequently used example of a 
division algebra is the quaternion field H (often 
called Hamilton’s quaternion algebra, W. R. 
Hamilton, 1858). This is a 4-dimensional 
linear space over the real number field R with 
basis { 1, i,j,k}, with the following laws of 
multiplication: 1 is a unity element, į? =j? = k? 
= —1,ij=—ji=k, jk= —kj=i, and ki= —ik =j. 
An element of H is called a quaternion. The 
only finite-dimensional division algebras oyer 
the real number field R are the real number 
field R, the complex number field C, and the 
quaternion field H. 
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C. Group Algebras and Hecke Algebras 


Let K be a commutative ring and K“) be the 
direct sum >',.g K, of modules K,, where each 
K, is isomorphic to K with a group G as the 
index set (- 277 Modules F). The elements of 
K) are the families (A,)seg Of elements of K 
whose components are all zero except for a 
finite number of them. Let {u,},.4 be the 
canonical basis of K‘), namely, u, is an ele- 
ment of K‘@) of which the sth component is 1 
and the others are 0. The module K ©) has the 
structure of an algebra over K, where the law 
of multiplication is determined by u,u, = 

u,,(S, te C). This algebra is called the group 
algebra of G over K. The product A * y of 
elements å = (4) and u= (u) of K is then 
given by 


Gsu= F Aln seG. (1) 


s=rl 


Each basis element u, is often identified with 
the group element s, and in this manner, the 
group G is regarded as a basis of K‘, which is 
usually written KG or K[G]. 

In this definition, the group G can be re- 
placed by a semigroup G, and then the algebra 
K is called a semigroup algebra. As an 
example, let N be the additive semigroup of all 
nonnegative rational integers and N” be the 
direct product of n copies of N. If we denote 
the elements (i,, , i,) of N” by Xi: Xin and 
use multiplication instead of addition, then the 
semigroup algebra of N” over K is exactly the 
tring of polynomials K [X,,-, X,,]. On the 
other hand, if G is a semigroup, then even 
when it is infinite, it may occur that for any 
SE G, there exists only a finite number of pairs 
(y, 1) of elements of G such that s = rl. In this 
case, formula (1) also defines the law of multi- 
plication on the *Cartesian product Kĉ. This 
algebra is called a large semigroup algebra 
and contains K) as a subalgebra. In par- 
ticular, the large semigroup algebra of N” 
is exactly the tring of formal power series 
KUEX pasts Xe li], 

Let H be a subgroup of a group G, and 
assume that the index of H N sHs"'in H is 
fmite for any s€ G. This assumption is equiva- 
lent to the condition that any double coset of 
G by H is a union of a finite number of left 
as well as right cosets. Let H\G, G/H, and 
H\G/H be the set of all tright cosets, ‘left 
cosets, and tdouble cosets, respectively. Then 
each element of the direct sum KO, KOM, 
or KDC can be regarded as a function 
defined on G taking a constant value on each 
right, left, or double coset, respectively. Con- 
versely, any function defined on G can be 


regarded as an element of KAO, KO or 
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kK C/H) if it takes a constant value on each 
left, right, or double coset, respectively, and if 
it vanishes everywhere except on a finite num- 
ber of cosets of respective type. Under such 
identification, let /,, 4,, å, and 4, denote the 
values of a function 4:G—K on seG, le H\G, 
r eG/H, and te H \G/H, respectively. For any 
A, pe K'"°" we define a function J » u by 
(àu), = 2 Ath, SEG, (2) 
where the right-hand side is the sum taken 
over all pairs (r, 1) such that serl, re G/H, 

le H\ G. It can be shown that this sum is a 
tinite sum and that 4 x pe KS Hence the 
module K“#\°) has the structure of an alge- 
bra over K, which is called the Hecke alge- 
bra of (G, H) over K and often denoted by 
H;,(G, H). If H = } the #,(G, H) is exactly 
the group algebra of G. In general, #,(G, H) 
can be regarded as an algebra obtained by 
extending the coefficient ring of HAG, H) to 
K. Furthermore, KDO (K (G ay can be re- 
garded as a right (left) module over the group 
algebra K‘®’, and the endomorphism ring 
Egio( KC) (resp. Ego (K'™)) is canonically 
isomorphic (anti-isomorphic) to the Hecke 
algebra K‘!\6'#) 


D. General Crossed Product 


Let G be a group that operates on a commuta- 
tive ring L, and denote the operation by (s, 4)—> 
s(a) (se G, że L); thus for any seG the map- 
ping 4-»s(A) (że L) is an automorphism of £, 
satisfying s(t(A)) = st() (s, te G). For any J, 

pe L) we define the product ) x we L' by 


(4 * u); =), AME) L(t), se G, (3) 


where { f(r,!)},1¢¢ is a given family of elements 
of L. If this family satisfies the equations 


f(s, r) f (sr, l) = s( f(r, D) f(s, rl), 


then L'® forms a ring. In terms of the canon- 
ical basis {u,},-g, this ring structure is defined 
by the formulas u,u,= f(r, uy, ua =s(A)u, 
(2e L). If K is the subring of L consisting of al] 
że L such that s(2)= A(se G), then the ring L 
is an algebra over K, called the crossed product 
of L and G with respect to the given operation 
and the given factor set f of G. In a narrower 
sense of the term, we consider only the case 
when L is a field, G is a finite group, f(r, |) #9, 
and G operates on L faithfully. In this case, G 
can be identified with the tGalois group of a 
tinite ‘Galois extension L/K, and so the crossed 
product is written (L/K, f). This is a fcentral 
simple algebra over K (- Sections E, F). 

If the operation of G in the above- 
mentioned general crossed product is trivial, 


s.r, leG, 
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namely, L = K, the crossed product of K and G 
is called an algebra extension of G over K with 
respect to f. Usually, we assume that K is a 
field and f(r, I) #0. If f(r, D= 1, the algebra 
extension is the group algebra. If G is a finite 
group whose order is prime to the fcharacter- 
istic of K, then any algebra extension of G over 
K (in particular, the group algebra) is always 
semisimple and ‘separable. If G is a finite 
Abelian group and fis a factor set of G, then a 
bihomomorphism g:G x G>K* (a mapping 
which is a homomorphism in each variable) 

is detined by ¢(s,t)= f(s, t): f(t, s)"'. An alge- 
bra extension of G over K with respect to f 

is a central simple algebra if and only if @ is 
nondegenerate. In particular, if G is a direct 
product of n copies of a group of order 2, 

then by choosing n elements 5;,52,.,5, Of G 
suitably, any element of G can be uniquely ex- 
pressed as 5S, s,.5,, Where {a, b,., z} is a 
subset of { 1,. , n} in the natural order. Hence, 
if we take Uça, 2) = Ug, U, as a basis of an 
algebra extension, then any factor set f is 
determined by fisas) = hijs where Aig= 1 (i<j), 
A= + 1 (i>j), and å; are arbitrary. When å;; 
= -1 (i>j), the corresponding algebra exten- 
sion is a Clifford algebra. Furthermore, if Ài 
= 0, then it is a tGrassmann algebra. If 4;,40 
and nis even, then it is a central simple alge- 
bra whenever the characteristic of K is not 2 
(- 61 Clifford Algebras). 

Now let K be the real number field R and 
A, be the Clifford algebra with 4;,= 1. Then 
A, is the quaternion field. Elements of A, 
are called sedenions, and are important in 
‘spinor theory and *Dirac’s equation. In gen- 
eral, let K be an arbitrary field whose charac- 
teristic is not 2 and putn = 2, hv 1,4,,= 
—1,4,,;=A40, and å>, = u #0, in the pre- 
vious notation. Then the corresponding cen- 
tral simple algebra Q is called a (general- 
ized) quaternion algebra. Thus Q has a basis 
{1, u,v, w} satisfying the following laws: 1 is 
the unity element, w = uv = —vu, u? = 4, and 
v= u(å ueK). Any central simple algebra of 
dimension 4 is isomorphic to a certain quater- 
nion algebra. (In particular, if K = R and į = p 
= -1, then Q coincides with the quaternion 
field H.) For any element x = g + fu + yv + dw 
of Q, the element ¥ =a— fu = yv = ôw is said to 
be conjugate to x, and N(x) = xxe K is called 
the norm of x. An element x of Q is invertible 
in Q if and only if N(x) 40. 


E. Finite-Dimensional Associative Algebras 
over a Field 


For the rest of this article, we assume that the 
algebras considered are unitary and finite 
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dimensional over a field K. Then by general 
properties of left (right) tArtinian rings, any 
associative algebra A has the following struc- 
ture: the tradical N of A is the greatest tnilpo- 
tent ideal, and the quotient algebra A/N = A is 
tsemisimple and decomposed into a direct sum 


of ideals which are simple algebras: 
A=A,+...+Ay,. 


Each simple component A i is a full matrix ring 
of degree r; OVer a certain division algebra D;, 
and A, is decomposed into a direct sum of r; 
minimal left ideals which are mutually A . 
isomorphic: 


Bers. Att a(r: » 
A;=AZ +... +A, 1<i<n, 


where gD, _,, ef? are orthogonal tidempotent 
elements of A; whose sum is equal to the unity 
element of 4 i On the other hand, 


A;=@VA +... +8094 


gives a decomposition of A, into the direct 
sum of minimal right ideals that are mutually 
A-isomorphic. Moreover, we can choose an 
idempotent element e! of A from each residue 
class €% such that fef} forms a system of 
orthogonal idempotent elements, whose sum is 
equal to the unity element 1 of A, and 


ri n ri 
A=) } AeP=F Y ePA 
i=] s=1 


i=1 s=1 


p= 


gives a decomposition of A into a direct sum 
of tdirect indecomposable left (right) ideals. 
Here, Ae{® and Aef (eA and eA) are A- 
isomorphic if and only if i =j. Conversely, 
every decomposition of A into indecom- 
posable ideals is obtained as above. The A- 
submodule Ne® of Ae® is the unique maxi- 
mal proper submodule, and Ae{®/Ne® and 
Ae®/Ne® are A-isomorphic if and only if i = j. 
Any simple A-module is A-isomorphic to a 
certain Ae®/Ne® (- Sections H, J), 

Any simple algebra A over K is isomorphic 
to a full matrix ring M,(D) over a certain divi- 
sion algebra D. This is called Wedderburn’s 
tbeorem. Here n is determined uniquely by A, 
and D is also determined uniquely by A up to 
isomorphism. Moreover, the center of A is 
isomorphic to the center of D. If the center of 
A coincides with K, then A is called a central 
simple algebra (or normal simple algebra) over 
K. In this case, any isomorphism of two simple 
subalgebras of A can be extended to an tinner 
automorphism of A. Let V(B) denote the *com- 
mutor of a simple subalgebra B of A. Then 
V(B) is also a simple subalgebra and V(I’(B)) 
= B,dim A = dim B: dim V(B). In particular, if 
B is central over K, then there is a canonical 
isomorphism Az B ® , V(B). If D is a central 
division algebra, then any maximal commuta- 
tive subalgebra L of D is a field and satisties 


130 


(dim LY = dim D. Moreover, there are tsepa- 
rable extensions over K among such L. In 
general, the dimension of a central simple 
algebra A is a square number r?, where r is 
called the degree of A. 

Two central simple algebras are said to be 
similar if they are isomorphic to full matrix 
rings over the same division algebra. This is an 
equivalence relation, and each equivalence 
class is called an algebra class. On the other 
hand, if A is a central simple algebra and B is a 
simple algebra, then A ®x B is a simple alge- 
bra. Moreover, if B is central, then A Qr B is 
also a central simple algebra. If A and B are 
similar to A’ and B’, respectively, then A @, B 
is similar to A’ @, B’. Hence, the tensor prod- 
uct ® defines multiplication in the set (K) 
of the algebra classes over K. If A is a central 
simple algebra and A° is an algebra anti- 
isomorphic to A, then 4° is also central simple 
and A @, 4° is isomorphic to a full matrix 
ring over K. This shows that (K) forms a 
group, which is called the Brauer group, after 
R. Brauer, who introduced this concept, or the 
algebra class group over K. For a central 
simple algebra A, the degree of a division 
algebra similar to it is called the Schur index of 
A (or of the algebra class of A), and the order 
of the algebra class of A in the Brauer group is 
called the exponent of A. The Schur index is 
divisible by the exponent; and conversely, the 
exponent is divisible by every prime divisor of 
the Schur index. 


F. Extensions of Coefficient Fields 


Let L be an ‘extension tield over a field K. 
Then for any algebra A over K, L @, A can be 
regarded as an algebra over L. This algebra is 
denoted by A® and is called an algebra gb- 
tained by extending the coefficient field to L 
Let us denote the radical of a ring A by R(A). 
An algebra A over K is called a separable 
algebra if it satisfies R(A”) = {0} for any exten- 
sion field L over K. In the special case when A 
is an talgebraic extension field over K, A is a 
separable algebra if and only if every element 
of A (or every element of a subset which gener- 
ates A) is tseparable over K (- 149 Fields). A 
(finite-dimensional) algebra A over K is sepa- 
rable if and only if A is semisimple and the 
center of every simple component of A is a 
separable extension over K. If the quotient 
algebra A/§R(A) of an algebra A is separable, 
then there exists a subalgebra § such that 
A=S+R(A) and SN R(A)= {0}, and S is 
uniquely determined up to inner automor- 
phisms (Wedderburn-Mal’tsev theorem). 

In order that an algebra A over K be central 
simple, it is necessary and sufficient that A} be 


simple for any extension tield L over K. This 
latter statement holds if and only if A* is iso- 
morphic to a full matrix algebra over L for a 
certain extension tield L over K. Such an exten- 
sion field is a tsplitting field of A (- 362 Re- 
presentations F). For a central simple algebra 
A, a (finite) extension field L of degree r over K 
is a splitting field of A if and only if there exists 
a central simple algebra B of degree r which is 
similar to A having a subfield which is K- 
isomorphic to L. In this case, r is divisible by 
the Schur index of A. Furthermore, A has a 
separable splitting field whose extension de- 
gree is equal to the Schur index of A. This 
shows that A has a splitting tield which is a 
finite Galois extension tield over K. 

Let L be a finite *Galois extension gyer a 
tield K and G be its tGalois group. If f is a 
factor set with respect to the operation of G on 
L*, then the crossed product (L/K, f) is a cen- 
tral simple algebra over K (- Section D). The 
product fg of two factor sets f and gis also a 
factor set, and the set of all factor sets forms an 
tAbelian group. Thus (L/K, fg) is similar to 
(L/K, f)@x(L/K,g). On the other hand, factor 
sets f and g are said to be associated with each 
other if there exists a family {A,},.¢ of elements 
of L such that 


f(r, D=glr, Dr(a An r, leG. 


Hence, f and g are associated with each other 
if and only if (L/K, f) and (L/K, g) are similar. 
Therefore, the mapping f>(L/K, f) gives a 
monomorphism of the group H?(G, L*) of all 
associated classes of factor sets (which can be 
identilied with the 2-dimensional tcohomology 
group of G with coefficients in L*) into the 
Brauer group 4(K) over K. Its image coin- 
cides with the subgroup of all algebra classes 
which have L as a splitting field. In particular, 
any algebra class is similar to the crossed 
product of a certain finite Galois extension L 
and its Galois group G (R. Brauer, E. Noether, 
A. Albert, K. Shoda, and others). 


G. Cyclic Algebras 


Let Z be a ‘cyclic extension field of degree n 
over a field K. Then the crossed product of 
Z and its Galois group G is called a cyclic 
algebra over K. For a fixed generator 5 of 

G and any element g 4 0 of K, a factor set 
físi, s (O <i, j <n) can be defined by f(s‘, s) 
=] (itj<n)and f(s',s‘)=o (i+j2n), Let 

(Z, s, &) denote the corresponding crossed 
product. Then (Z, s, a) and (Z, s, ß) are similar if 
and only if «/f is a norm of a certain element 
of Z into K. On the other hand, any crossed 
product of Z and G is similar to a certain 

(Z, $, cc), and the correspondence x—>(Z, s, a) 
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gives an isomorphism of K*/Nz;g(Z*) (the 
norm class group of Z/K) to the group con- 
sisting of the algebra classes over K which 
have Z as a splitting field. If K is a tp-adic field 
or a finite falgebraic number tield, then any 
central simple algebra over K is isomorphic to 
a certain cyclic algebra. In this section, we 
describe this situation in detail. 

Let K be a p-adic field and g the number of 
residue classes modulo p. If A is a central 
simple algebra over K, then its Schur index 
coincides with the exponent, which is called 
simply the index. A finite extension tield L over 
K is a splitting field for A if and only if the 
degree of L is a multiple of the index of A. If n 
is the degree of A, then the field W= K(w) ob- 
tained by adjoining to Ka primitive (q” Ist 
root of unity @ is a cyclic (and tunramified) 
extension of degree n over K, and its Galois 
group is generated by the automorphism 
g of Wdetermined by œ" = œw, g K = iden- 
tity. Moreover, we have A = (W, a, q) fora 
certain yE K*, Let v be the texponential p- 
adic valuation v(a) of x. Then v/n (mod Z) is 
uniquely determined by the algebra class of A, 
which is called the Hasse invariant of A (or of 
the algebra class of A). By assigning to each of 
the algebra classes its Hasse invariant, we get 
an isomorphism of the Brauer group B(K) 
over K to the group Q/Z, the additive group 
of the rational numbers mod Z (H. Hasse, 
1931). 

Let K be a finite algebraic number tield and 
A be a central simple algebra over K. Let p be 
a (finite or intinite) prime divisor of K and K, 
the tp-adic extension field over K. The algebra 
A, which is obtained from A by extending the 
coefficient tield to K p is a central simple alge- 
bra over K,. Except for a finite number of p, 
A, is isomorphic to a full matrix ring over K,, 
and A itself is isomorphic to a ful] matrix ring 
over K if and only if A, is isomorphic to a full 
matrix ring over K, for all p. The index m, of 
A, is called the p-index of A, and the Hasse 
invariant of A, is called the p-invariant of A, 
which is denoted by (A/p). If p is an infinite 
prime divisor, then m, is equal to 1 or 2, and in 
each case, we define the p-invariant by setting 
(A/p) = 0 or 1/2 (mod Z) correspondingly. The 
Schur index of A is the L.C.M. of the p-indices 
M, for all p and coincides with the exponent of 
A. This is called simply the index of A. On the 
other hand, the p-invariants satisfy (A/p) = 0 
(mod Z) except for a finite number of p, and 


5 (4/p)=0 (mod Z). 


p 


Conversely, given a rational number p, for 
each p such that (i) Py = 0 (mod Z) except for a 
finite number of p; (ii)p, = 0 (mod Z) if p is 
infinite and imaginary, p, = 0 or 1/2 (mod Z) if 


29H 


Associative Algebras 


p is infinite and real; (iii) Lp, =0 (mod Z), 
then there is a uniquely determined algebra 
class of central simple algebras A over K such 
that (A/p)= Pp (mod Z) for each p. In this way 
the structure of the Brauer group over a finite 
algebraic number Iield is completely deter- 
mined (Hasse, 1933). 


H. Frobenius Algebras 


Let A be an algebra over a field K. A is called 
a Frobenius algebra if its tregular representa- 
tion and fcoregular representation (— 362 
Representations E) are similar. Thus A is a 
Frobenius algebra if the left A-module A and 
the dual module A* of the right A-module A 
are isomorphic as left A-modules. Let 


ri 


A=F F AP=F Y eA 
2 aps 


be direct decompositions of A into indecom- 
posable left (right) ideals (— Section E). We 
denote e{” by e;. Then A is a Frobenius alge- 
bra if and only if there exists a permutation 7 
on 1,. , nsuch that (i) Ae, S (@n¢) A)*; (ii) r; = 
Fui When there exists a permutation satisfy- 
ing only condition (i), A is called a quasi- 
Frobenius algebra. 

For a subset § of A, the ideals I(S) = 
{ae A|aS=0} and r(S)= {ae A|Sa=0} are 
called the left annihilator and right annihilator 
of S, respectively. Then A is a quasi-Frobenius 
algebra if and only if (ii) l(r(D) =1 and r(l(r))=r 
for any left ideal | and any right ideal r. In 
general, if we are given a left (right) A-module 
M, we denote the right (left) A-module 
Hom ,(M, A) by fi. Then if A is a quasi- 
Frobenius algebra, there is a canonical iso- 
morphism M=M , and the annihilator relation 
gives a One-to-one and dual correspondence 
between the set of the submodules of M and 
the set of submodules of M (M. Hall). If a 
quasi-Frobenius algebra A satisfies (iv) dim r + 
dim I(r) = dim} + dim r(1) = dim A for any left 
ideal | and any right ideal r, then A is a Fro- 
benius algebra; the converse is also true. 

A criterion for an algebra A to be a Frobe- 
nius algebra is that there is a linear form x—> 
A(x) on A such that if A(xa)=0 for all XE A, 
then a = 0. Moreover, if } satishes A(xy)= 
4( yx) (x, y€ A), then A is called a symmetric 
algebra. For example, semisimple algebras 
and group algebras are symmetric algebras. 
If A is a symmetric algebra, then for any left 
(right) A-module M the right (left) A-modules 
M* = Hom,(M, K) and M = Hom, (M, A) are 
canonically A-isomorphic. 

If A is a Frobenius algebra, the radical N of 
A satisfies (N) = r(N), and the annihilator of 
N is a principal left and principal right ideal; 
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the converse is also true. For a two-sided ideal 
z of a Frobenius algebra A, the quotient alge- 
bra A/z is a Frobenius algebra if and only if 
l(z) and r(z) are a principal left and a principal 
right ideal, respectively. If A is a symmetric 
algebra, then any two-sided ideal z satisties 
I(z) = r(z), and A/z is also a symmetric algebra 
if and only if 1(z) = r(z) is a principal ideal 
generated by an element in the center. 

Furthermore, for any extension L of the 
coefficient Tield K, AŻ} is a quasi-Frobenius 
algebra (resp. Frobenius algebra, symmetric 
algebra) if and only if A is (T. Nakayama, 
1939, 1941). The concept of Frobenius algebras 
has been extended to algebras B over a ring A 
(F. Kasch, 1954). 


I. Uniserial Algebras 


In the notation of the preceding section, if each 
indecomposable left ideal Ae; (right ideal e; A) 
of an algebra A over K has a unique compo- 
sition series, then A is called a generalized 
uniserial algebra. If an algebra A is decom- 
posed into a direct sum of ideals which are 
primary rings, then it is called a uniserial alge- 
hra. Any left module over a generalized uni- 
serial algebra A is decomposed into a direct 
sum of submodules which are A-homomorphic 
images of Ae;: An algebra whose radical N is 

a principal left and principal right ideal is a 
generalized uniserial algebra. For an algebra A 
to be uniserial, it is necessary and sufficient 
that every two-sided ideal of A be a principal 
left and principal right ideal. Hence A is uni- 
serial if and only if every quotient algebra of A 
is a Frobenius algebra. If A* is uniserial for an 
extension field L of K, then A itself is uniserial. 
The converse, however, is not always true. If 
AE is uniserial for any extension lield L of K, 
then A is called an absolutely uniserial algehra. 
For A to be absolutely uniserial it is necessary 
and sufficient that its radical N be a principal 
ideal generated by an element in the center Z 
and Z be decomposed into a direct sum of 
simple extensions of K (1.¢., ideals of the form 
K [a]) (K. Asano, G. Köthe, Nakayama, G. 
Azumaya). 


J. Algebraic Algebras 


Here, we consider general (not necessarily 
finite-dimensional) algebras A over a field K. 
We say that A is an algehraic algehra if every 
element of A is algebraic over K, i.e., every 
element of A is a root of a certain polynomial 
with coefficients in K. We say that A satisties a 
polynomial identity p(X,,..., X„) = 0 or that A 
is a PI-algebra with an identity p(X,,, X) = 
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0 if there exists a nonzero (noncommutative) 
polynomial p(X,, ,X,) in X,,, X, with 
coefficients in K such that pla, ,a,) = 0 for 
all a, € A. A Pl-algebra satisfies an identity 
which is homogeneous linear in each variable, 
and also an identity of the form ae Te ie 
O(where[ Jis the sum È+ x; ...x;, taken 
over all permutations of 1,2,, n antl + is the 
sign of the permutation). An algebra is said to 
be locally finite if any finite number of ele- 
ments of A generate a finite-dimensional 
subalgebra. For Pl-algebras, an affirmative 
answer was found for Kurosh’s problem, 
which asks whether an algebraic algebra is 
locally finite if the degree of any element y of 
A (i.e.,dim K [a]) is bounded. 


K. Brauer Group of a Ring 


Let R be a commutative ring. An R-algebra is 
called a separable algebra if A is projective as 
a two-sided A-module (- 200 Homological 
Algebra F). When the base ring is a field, this 
agrees with the classical notion of separability; 
and A is separable over R if and only if 4/mA 
is separable over the residue field R/m for 
every maximal ideal m of R. A central separa- 
ble algebra is also called an Azumaya algebra. 
If P is a finitely generated faithful projective 
R-module (briefly: R-progenerator), the endo- 
morphism ring End,(P) is an Azumaya R- 
algebra. Azumaya algebras A, and A, are 
said to be in the same class (similar) if there 
exist R-progenerators P, and P, such that 

A, ® End,p(P,) = A, © End,(P,). The set of 
similarity classes forms an Abelian group 
with respect to ®. This is called the Brauer 
group B(R) of R (Auslander and Goldman 
[10]). Every element of B(R) is of finite order 
[12,13]. 

B(R) is a covariant functor from commuta- 
tive rings to Abelian groups. If R is a field, 
B(R) coincides with the classically defined one 
(- Section E). If R is a fHenselian local ring 
with the residue field k, the mapping B(R)-+ 
B(k) is an isomorphism [9]. If Risa tregular 
ring with the quotient field K, B(R)-> B(K) is 
ae If further dim R < 2, we have B(R) = 
(\, B(R,), P running over all primes of height 
1 of 3 Where R, is the localization of R at p 
and B(R) and BIR,) are considered as em- 
bedded in B(K) [10]. As an example, put- 
ting these facts together with the structure of 
B(K) of the algebraic number field K (— Sec- 
tion G), we have the structure of B(R) of the 
ring of integers of K: B(R) = 0 if K is totally 
imaginary, and © (Z/2Zy + if K has r( > 0) real 
infinite places. 

A commutative R-algebra § is called a split- 
ting ring of A if the S-algebra § @ A is isomor- 
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phic to End,(P) for some S-progenerator P. 
We denote by B(S/R) the subgroup of B(R) 
consisting of al] algebra classes split by S. 
Since an Azumaya algebra over a ring need 
not have a Galois extension as splitting ring, 
the description of the Brauer groups by means 
of Galois cohomology ceases to have full 
generality. Instead we have the following exact 
sequence of tAmitsur cohomology, assuming 
S is an R-progenerator (Chase and Rosenberg 
[11]): 0H (S/R, U)—> Pic(R)—> H S/R, Pic) 
~ H*(S/R, U)> B(S/R)> H '(S/R, Pic) > 
H *(S/R, U)—> , where we denote by U( 7’) 
and Pic(T) of a commutative ring T the unit 
group and the +Picard group of rank 1 projec- 
tive modules of 7} respectively. The full Brauer 
group B(R) is mapped monomorphically into 
H? (R, U)=lim H? (S/R, U), the limit over the 
faithfully flat R-algebras S. 

Grothendieck and others studied the Brauer 
groups in a more general geometrical context 


[12]. 
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A. Asymptotic Series and Asymptotic 
Expansions 


Let ¢,(x), n=0, 1, 2, be functions delined in 
R,. The sequence of functions {@,(x)}?2o is 
called an asymptotic sequence for x—> oo if 


Pn+1 (x) =0(¢,(x)) as X> 0 (1) 


for each n. When {ọ,(x)} Zo is an asymptotic 
sequence, the formal series È, a,@,(x) is called 
an asymptotic series. In most cases, the func- 
tions @,(x) have the form Ņ(x)ọ(x})". When 
P,(x)=x" an asymptotic series is called an 
m power series. A function f(x) de- 
fined on R+ is said to have the asymptotic 
expansion (or be asymptotically developable in 
the form) 


F(X) ~ ay Po(X) +a pi (x) +... +a)... (2) 
as X 00 if f(x) satisfies 

F(X) aopo(x)—a, P(X) — --. — ap Pal) 

= O(0,41(%)) (3) 


for any integer n>0 as x— œ. The coefficients 
a, (n = 0, 1, ) appearing in (2) are uniquely 
determined. This fact immediately follows 
from the formulas 


ay = lim f(3)/9o(X), -+ 


a, = lim (f(x) — 49 Po(x)— 


x>w 


—ûn-1 Pn-1 (x))/@,(x). 


For example, the Hankel function of the lirst 
kind H(x) has an asymptotic expansion 


Hy (x) 


_ (4v?—1)(4v? — 3°). . . [4v —(2m—1)°7] 

= 2m! 

If v +4 is not a positive integer, Deco (VM) 

(— 2ix) "is not convergent, and (4) means that 
for each n, 


(v, m) 


sala 
To 274) |2" 


=O(x~" ¥?) 


HEG) — 


for x90. 

Assume that f(x) and g(x) admit asymptotic 
expansions in power series for x-t oo. Then 
all functions delined by f(x)+g(x), f(x)— 


g(x), f(x)g(x), f)/g(x) (lim,... g(x) 40) admit 
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asymptotic expansions in power series for x3 
oO, Furthermore, if f(x) has an asymptotic 
expansion for x00, its expansion is obtained 
by termwise differentiation of the f(x)’s. 

For any asymptotic power series 2 o nX 
there always exists a smooth function f(x) 
defined in R, such that 


=g 


f(x)~agta,x !+a,Xx ? +... 


for x œ, and all the derivatives of f(x) are 
asymptotically developable in power series. 
Indeed, a function delined by 


=F a,b O(x/t,,) x 


satisfies the required properties if O(t) is a 
smooth function satisfying 


0, txl, 
w=) se 


and the sequence {t,}/° 9 satislies 1 < ty St; S 


.. and 


|n(n+1)...(2n—1)2na,t,'|<2™ 


for all n. 

Until now only asymptotic series for x 
œ have been considered, but the notion of 
asymptotic series has been adapted in yari- 
ous forms to the spaces of functions under 
consideration. 

(a) Asymptotic series in a complex domain. 
Let g be a boundary point of an open con- 
nected domain D in the complex z-plane. When 
the asymptotic behavior for z>¢ of holomor- 
phic functions in D is considered, we adopt as 
an asymptotic sequence {¢,(Z)}f29 of holo- 
morphic functions in D satisfying for each N 


Pn+i(Z)= 0(@, (2) 


as z tends to g through D. Let D be an angular 
domain with vertex at g, For any asymptotic 
power series Èo a,(z = q)" there always exists 
a function f(z) that is holomorphic in D and 
that admits an asymptotic expansion 


f(z)~ag+a,(z—a)+...+a,(2-—a)" +... (5) 


for za. Concerning the uniform convergence 
of asymptotic expansions, Carleman’s theorem 
is well known [2]: If the asymptotic expansion 
(5) is valid as z tends to g through an arbi- 
trary angular domain in the Riemann sur- 
face of log(z =a) having g as its vertex, then 

the asymptotic expansion (5) is uniformly 
convergent. 

(b) Asymptotic expansions by a large para- 
meter. In the theory of differential equations, 
functions with a large parameter are often 
considered. Let Q be a domain in R” and ka 
parameter > 1, and let g(x, K), n=0, 1, 2,. , 
be functions defined for (x, K) € Q x [ 1, 00). If 
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they satisfy for each x€Q and n 


P11 (X, k)=0(@,(x, k)) (6) 


for k— 00, the sequence { g(x, k)} io is called 
an asymptotic sequence, and the formal series 
paar a,l x)oO,{ x, K) is called an asymptotic series. 
A function f(x, K) is said to be asymptotically 
developable in the form 


F(X, k) ~ dg(x) @o(x, k) +a (xp (x, k) +... 
F Ay(X) Pa(X, k) +i 


for k> œ% if for every xe Q and every n 


f(x, k) = Ag(X)Po(x, k) cosas, 7 Ay—-1(X) Py) (x, k) 

= O(@alX, k)) (1) 
holds as k— œ. Especially, if the order rela- 
tions (6) and (7) hold for each n uniformly in 
xe€Q, we say that f(x, K is uniformly asymp- 


totically developable. In most cases, the 
@,,(x, K are of the form w(x, k)k~". 


B. Methods of Asymptotic Expansion 


There are several general methods of obtaining 
asymptotic expansions of functions represented 
by integrals with a large parameter. These 
methods are useful in applications because 
most fspecial functions are represented by 
integrals (- 39 Bessel Functions, 389 Special 
Functions). 

(a) Laplace’s method. Let h(s) be a smooth 
real-valued function in [a— ô, a + 6] {ô > 0) 
satisfying 


h(a)<h(s) for all sefa—6,a+6]—{a} 
and 
h’(a) > 0, 
and let g{s)e C*([a—6,a+6]). Then 
até 
I(x) -| e **S) g(s)ds 
O-S 
has an asymptotic expansion 
af 2m 
I(x) we e7 xhta) oA 
x 


1 
fsa +ex +... 
h(a) 


texte 


for x— 00, where the c, are determined by 
derivatives of h(s) ats = aof order < 2n+ 2 
and those of g(s) at § =a of order < 2n. 

As an application of this method, con- 
sider the T -function T (x)= {fe ‘t* ' dt. By a 
change of variable t = xs, we have I(x) = 
>f e *(S~ logs) 5 -I1 ds, Since for any 6 > 0 


1-6 0 
eg xls logs) oI di * e~ XS logs) g7] ds 
0 1+6 


<Ce**9 — (y>0), 
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the asymptotic expansion by the Laplace 
method 


1+ô Ir 
E, O * —(1+ce,x7'+...) 
1-3 x 


implies *Stirling’s formula 


(x)= /2rx* e1 + O(x)). 


(b) Stationary phase method. Let h) be a 
real-valued smooth function detined in [a 


6, a + ô] such that 
h’(u) = 0, h’(u) 4 0 and h’(s) £0 for s #a, 


and let g(s) be a function in C9 (a ~ ô, a + 6). 
Then 


atà i 
I(x) = | e*"5) a(s) ds 


0-S 
has an asymptotic expansion 
, , 1 a 
Hovener S- eig >. 


Jx L/h") 
+e, Xx "+ 


where g= h”(a)/|h"(u)) and the c, are deter- 
mined by derivatives of h(s) at s = a of order 
< 2n + 2 and those of g(s) ats = a of order 

< 2n. 

Further, tintegration by parts and the 
tmethod of steepest decent are often used to 
derive asymptotic expansions of integrals with 
a large parameter. 


C. Application tọ Differential Equations 


For a system of linear ordinary differential 
equations with an tirregular singular point at 
z =Q, even when there exists a formal power 
series solution, the power series is generally 
divergent. Taking the hint given by Stirling’s 
formula for the [-function, Poincaré intro- 
duced the notion of asymptotic expansions 
and succeeded in giving an analytic meaning 
to formal solutions of divergent type. Actual 
solutions for *difference equations, tdifference- 
differential equations, and ‘ordinary differen- 
tial equations of canonical form (including 
second-order linear ‘differential equations 

of confluent type) always have asymptotic 
expansions. 

In the study of certain ordinary differential 
equations containing a large parameter k, an 
asymptotic solution is often obtained in the 
form 


u(x, K se"? (ag(x)+ay(x)k 1 + 
+a,(x)k-"+...) 
(*WKB method). Also, in the study of linear 
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partial differential equations, an asymptotic 
solution of the above form plays an important 
role. For example, consider the Schrodinger 
equation 

lou 1 


Lilu] =- Au— v(x)u=0. 
l= ðt (iA) 0) 
An asymptotic series 


1 
u(x,t; 2) mw eiAo(x1) (aat t)+a; (x, jt ee 
l 


+a,{x,t) (z) ' + ) (8) 


satisfies L,[u] -0 forAsooif 


v(x)ọ=0 (eikonal equation), 


1—2Vo'Va,—Aga,=Aa,;, n=0, 1, 2, 


(transport equation), 


where we set a _, =0. For each asymptotic 
solution of the form (8) there exists an actual 
solution of Lj[f{x, t; 4)] = 0 which is asymp- 
totically developable in the form 


f(x, tA)~ ¥ era (x, ia” 
n=0 


Then asymptotic solutions of the form (8) can 
describe the fundamental properties of the 
phenomenon governed by the equation L,[u] 
=o. 

Geometric optics can also be illustrated by 
asymptotic solutions of the form (8) (— 325 
Partial Differential Equations of Hyperbolic 
Type L). The ideas of asymptotic series, asymp- 
totic expansions, and asymptotic solutions 
are essential to the theory of tpseudodifferen- 
tial operators and ‘Fourier integral operators 
(— 345 Pseudodifferential Operators). 
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31 (XVI.1) 
Automata 


A. General Remarks 


Automaton is a generic term for the math- 
ematical models for automatic machines hay- 
ing memory devices and capable of perform- 
ing definite acts within each unit of time. 

There are several types of automata, namely, 
Turing machines for the theory of computa- 
bility [6], linear bounded automata, push- 
down automata, and finite automata for math- 
ematical language theory [10-1 3]. These 
models are also used in the theory of tcomplex- 
ity of computation of algorithms [ 14]. 


B. Turing Machines 


Since the tarithmetization of metamathematics 
was used in the proof of tG6del’s incomplete- 
ness theorem, mathematical formulations of 
the notion of teffectively computable functions 
have been attempted by many mathematicians, 
including S. C. Kleene, A. Church, and others, 
as well as by Gödel himself (— 356 Recursive 
Functions). With this end in mind, A. M. Tur- 
ing and E. L. Post independently introduced a 
sort of ideal computer [ 1,2], called a Turing 
machine. 

Such a machine is capable of being in one 
of a finite numberof internal states (possible 
contents of a memory device) dos, 4134n 
fixed for the machine; and the machine is 
supplied with a tape divided into squares. The 
tape is infinite in both directions. Each square 
can be blank or can have printed on it any one 
of the previously specified symbols Sọ, si, . , 
Sm» Where So stands for the blank. In any situa- 
tion the number of nonblank squares is finite. 
Let Q= {40,415 +--s4n}s M = {S0 S15 -+3 Sm}, and 
consider the mapping t : Qx M-Q x M x 
{R, L, 0}. A Turing machine T is specified by 
(Q, M, t, Yo, F), where F is a subset of Q called 
a set of final states and q is the specified ini- 
tial state. If t(q, s) = (q', s’, x), then the 5-tuple 
(q, 8, q’, s’, X) is called an instruction for T. The 
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machine acts as follows: (1) At the start, the 
machine is in the state qo- Some sequence of 
symbols in M are written on the tape as an 
input, and the machine is on the square next 
to the leftmost nonblank symbol. (2) With 
respect to the present state q and the scanned 
symbol, the machine selects an instruction 
(q, 5,q', S, X); then the next state becomes q’, s 
is replaced by s’, and the machine moves one 
square right or left or stands still according 
as x is R, L, or 0. (3) The machine stops when 
the state becomes an element of F, and then 
the sequence of symbols on the tape is the 
result of the computation and considered to 
be the output. A Turing machine is called 
deterministic if there is always at most one 
instruction associated with a given pair (q, S); 
otherwise it is called nondeterministic. 

We represent a natural number y on the 
tape by y + | squares, each of which has the 
symbol s, printed on it. A pair (y, , Y2) is repre- 
sented on the tape by the following sequence 
of symbols of length y, + y2 + 5: one sọ, the 
representation of y,; one sọ, the representation 
of y,; and one sọ. Similarly, in general we can 
represent a k-tuple (y,, . , Yy) of natural num- 
bers on the tape. Let Ọ be a partial function 
of k variables, k> 0. We say that a Turing ma- 
chine T computes ® if T starts with the tape 
in which the representation of (xio Xa) is 
printed and stops with the tape having the 
representation of (xX,,, x,,x) when ®(x,, 

. > X) is defined and equal to x. A partial 
function is computable (in the sense of Turing) 
if there exists a Turing machine that com- 
putes the function. It is known that a partial 
function is computable if and only if it is par- 
tial recursive. The equivalence of comput- 
ability (in the sense of Turing) and recursive- 
ness for number-theoretic functions is strong 
evidence for the validity of Church’s thesis 

(- 356 Recursive Functions). 

We can apply a Turing machine not only to 
the computation of number-theoretic functions 
but also to the transformation of a finite se- 
quence of letters in any language with a finite 
list of letters. A finite nonempty list of distinct 
symbols is called an alphabet, and a finite se- 
quence of symbols in the alphabet is called 
a word in that alphabet. All the words in an 
alphabet constitute a ffree monoid if we add 
the empty sequence and think of the concate- 
nation of two words as its multiplication. We 
call a finite nonempty list (A ,, B,), , (A., B,) 
of pairs of words a set of production rules, By 
an application of a rule (A;, B;) to a word W, 
we mean that an occurrence of A, exists in W, 
and by the application, A; is replaced by B;. 
We say that two words R and § are equivalent 
if they are transformable into each other by a 
finite number of applications of the production 
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rules. The following problem is called Tbue’s 
(general) problem: [s there an algorithm for 
deciding, for any given alphabet and a set of 
production rules, whether or not any given 
two words are equivalent? E. L. Post and A. A. 
Markov proved by applying Turing machines 
that this decision problem is unsolvable [4, 3] 
(a negative solution for the so-called word 
problem for semigroups). 


C. Universal Turing Machine and 
Generalizations 


Turing showed that it is possible to construct 
a machine U with the following property [ 1]: 
If we supply U with a tape on which a suitably 
coded list of instructions of any given machine 
T is printed then U successively prints on it 
the behavior of T which results from the exe- 
cution of the instructions of T. In other words, 
U simulates the operations performed by any 
machine. Such a machine U is called a univer- 
sal Turing machine. We can construct a univer- 
sal Turing machine with only two internal 
states and also one having only two symbols, 
including the blank sọ (C. E. Shannon and J. 
McCarthy [7]). 

Let y,.-.. Yy be given functions that are all 
defined everywhere on the domain considered. 
Then we can relativize (— 356 Recursive Func- 
tions) the notion of computability of functions 
(in the sense of Turing) to y acon Wes Indeed, 
we Can consider a machine acting as follows: 
When the machine is in a state qi; (j= k 
and we assume that these states are specified 
correspondingly to w,, ., Y, and are not in 
F), then the machine prints the representation 
of w(y,,, Y.) to the right of (yi, , y.) 
and scans the resulting (n;+ 1)-tuple (Yasui 
Yn; WY» , y,,)). In this process, the symbol 
a that is written to the right of (y,, . . , y,,) is 
moved one step to the right so that a is pre- 
served on the right of the resulting (n + 1)- 
tuple. Extending the preceding, Kleene defined 
a machine that computes a functional of finite 
types [S]. A functional of variables of any 
finite types is computable by such a general- 
ized Turing machine if and only if it is partial 
recursive. 


D. Language and Automata 


Let M and N be mutually disjoint alphabets 
and V= MUN. Then G =(N, M, P, S) is called 
a Chomsky grammar if P is a set of production 
rules consisting of finite number of pairs of 
elements of V* and § is specified element of N, 
where V* means the free monoid generated by 
V. For a word ye V*, if the result of a finite 
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number of applications of rules in P is v, then 
we write u—>v, and yis said to be derived from 
u by G. The language generated by G is L(G) 
= {u S>uand ue M*}. N. Chomsky classi- 
fied the grammars into three families with 
respect to the form of production rules: (1) 
Context-sensitive grammar: For each (u, v)jeP, 
the length of ų is less than or equal to the 
length of v. (2) Context-free grammar: For 
each (u, v)e P, u is an element of N and v is 

not a empty word. (3) Regular grammar: For 
each (u, v) € P, we N and v is either an element 
of M or of the form ax, where qe M and xe N. 
Let F,(M), F,(M), and F,(M) be the families 
of languages generated by the grammars in 
the corresponding families classified as above. 
Then Fi(M)2 F,(M)2 F3(M) holds and this 

is called a Chomsky hierarchy [ 10]. Formal 
language theory studies various structures of 
these families of languages. 

Corresponding to the three families of lan- 
guages, we have the following families of auto- 
mata: (1) A deterministic (nondeterministic) 
linear bounded automaton is a Turing machine 
with input tape, namely, T= (Q, MUN, t, qo, F), 
where Q is the set of states, M an input alpha- 
bet, N a tape alphabet, { is a mapping from 
Q x(MU{e}) x(NU {e}) into Q xN x {L,R} 
(the power set of Q x N x {L, R}), qo E Q is the 
initial state, and F is a subset of Q. A word 
we M* is placed on the input tape, and T 
starts its computation from the initial state 
with blank tape and by reading one symbol 
from the input tape, and with respect to the 
mapping t, we decide the configuration of the 
next step. If at a certain stage, T reaches a 
state in F when T reads up w and if the length 
of the tape used is a constant multiple of the 
length of the input tape, then w is said to be 
accepted by T. (The special symbol ge makes 
it possible to change the configuration without 
moving the input tape.) (2) A push-down auto- 
maton is a Turing machine with an input 
tape. The working tape is of infinite length 
and moves in only one direction, say to the 
right. The computation with input word we 
M* starts with the position at the left end of 
the work tape. It uses the tape according to a 
first-in—first-out principle; namely, the ma- 
chine may write a symbol to the blank square, 
but if it is on the rightmost nonblank square, 
it may not go left without erasing the non- 
blank symbol. If it reaches a state in F, then 
w is said to be accepted by T. (3) A finite auto- 
maton is a machine without its infinite tape; 

a word w is accepted if T reaches a state in F 
(without using the symbol e). 

Let L(T) be the set of words over M ac- 
cepted by T; then L(T) is called the language 
accepted by T. Corresponding to the three 
families of automata, DF.(M) and NF,(M), i= 
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1, 2, 3, denote the family of languages ac- 
cepted by deterministic or nondeterministic 
automata. It is known that DF(M) = NF,(M) = 
F(M) for i=2 and 3, and NF,(M)=F,(M); 
however, whether DF, (M) = NF,(M) holds or 
not is still open, and this problem is called the 
LBA problem. These automata are used to 
study the closure properties under various 
operations within the families F,(M). In par- 
ticular, the LBA problem is equivalent to the 
problem of whether F,(M) is closed under 
complementation. 


E. Applications of Automata Theory 


Finite automata are formulated as models for 
digital circuits with memory. Therefore there 
exist structural studies of finite automata; for 
example, methods to decompose a given finite 
automaton to smaller automata correspond- 
ing to a circuit and its subcircuits are studied 
using tsemigroup theory [14,17]. 

Context-free grammars are useful in describ- 
ing the syntax of programming languages. For 
a context-free grammar G, a derivation tree of 
x € L(G) is a tree whose terminal nodes are 
labeled by each symbol of x, its root is labeled 
by S, and other nodes are labeled by some 
elements of N in such a way that if for a node 
with Áe N the nodes connected by an out- 
going edge from the former have labels x,, 
vie, Xp then (A, X;,, x) is a production rule 
of G. Parsing is an algorithm to construct 
a derivation tree of each XE L(G). Various 
parsing algorithms have been studied within 
subfamilies of context-free grammars [ 16] and 
they are used in the construction of compilers. 
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32 (X1.22) 
Automorphic Functions 


A. General Remarks [1] 


Let X be a (complex) tanalytic manifold and F 
a tdiscontinuous group of (complex) analytic 
automorphisms of X. A set of tholomorphic 
functions (without zero) on X, {j,(z)) (ye T), is 
called a factor of automorphy if it satisfies the 
condition j,,.(z) =j,(y(z))j,,(2) for ally, y'e T, 
zéX.A tmeromorphic function f on X is 
called a (multiplicative) automorphic func- 
tion with respect to the factor of automorphy 
{j,(z)} if it satisfies the condition f(y(z))= 
fj (z) for all yel, ze X. When all j,(z) 

are identically equal to 1, i.e., when f(z) is T- 
invariant, f is simply called an automorphic 
function with respect to I, When all j, are con- 
stant (and hence y-j, is a “quasicharacter” 
of I’), f is called a multiplicative function. If 
we denote by J,(z) the functional determinant 
of the transformation y, then for an integer m, 
an automorphic function f with respect to 
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a factor of automorphy of the form j,,(z) = 
J,(z))” is called an automorphic form of weight 
m. (In this case, it is customary to assume 

that fis holomorphic. Also, in this and most 
other cases, we add a suitable condition on 

the behavior off at the “points at infinity,” 
when [\X is not compact (- 234 Kleinian 
Groups).) 


B. The Case of One Variable [2-6] 
Except for the cases where I’ is a tinite group 
or its extension by a free Abelian group of 
rank <2(— 134 Elliptic Functions E), it is 
essentially enough to consider the case where 
X is the upper half-plane § = {zeC Im z>9}. 
In this case, [ may be obtained as a discrete 
subgroup of the *special linear group G = 
SL(2, R). In the following, we restrict our- 
selves to the case where T is a tFuchsian group 
of the first kind (i.e., the case where the tHaar 
measure (I \$) of the quotient space I'\ is 
finite; — 122 Discontinuous Groups). An 
automorphic function (or Fuchsian function) f 
with respect to [ is, by definition, a meromor- 
phic function f on § satisfying the following 
conditions: (i) f is T-invariant, i.e., f(yz)=f(z) 
for all ye T; Gi) f is also meromorphic around 
any tcusp x, of T, i.e., if @ is a real linear 
fractional transformation mapping Xx, to % 
(€.g., Paz) = —1/(z — xo)) and if the trans- 
formation zz + h (h >0) is a generator of 
Pol x, Po ' (where I, is the *stabilizer of xo), 
then f(g '(z)) can be expanded into a tLau- 
rent series of q, = exp((2zi/h)z) (which has only 
finitely many terms with negative exponent) in 
a neighborhood Im z > y, of œ (da is called the 
local parameter around the cusp xq). If we 
denote by R, the compact fRiemann surface 
obtained from the quotient space I'\ by 
adjoining a certain (tinite) number of “points 
at infinity” corresponding to the equivalence 
classes of cusps of J’, then conditions (i) and 
(ii) amount to saying that f gives rise in a 
natural manner to a meromorphic fynction on 
Rr. Thus the field 8, of all automorphic func- 
tions with respect to I’ can be identified with 
the ‘algebraic function field belonging to the 
Riemann surface Rp (— 11 Algebraic Func- 
tions A). (While any (nonconstant) automor- 
phic function with respect to a Fuchsian group 
of the first kind I has the real axis as its tnat- 
ural boundary, an automorphic function with 
respect to a Fuchsian group of the second 
kind can always be analytically extended to 
the lower half-plane through a neighborhood 
of any “ordinary point” on the real axis (= 
234 Kleinian Groups).) 

Let kbe an even integer. A (holomorphic) 
automorphic form (or Fuchsian form) f of 
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weight k or of dimension -k with respect to I 
is, by definition, a holomorphic function f on 
§ satisfying the following conditions: (i) For 


\ 
cveryo=(" i f(oz)=f(z)-(czt+d). (In 


other words, J,(z)** = (da(z)/dz) *? = (cz + d} 
is the factor of automorphy.) (ii) f is also 
holomorphic around any cusp Xp, i.e., in the 
above notation, we have an integral “Fou- 
rier expansion” f(@  '(z))(d@o '(z)/dz)*? = 
D9 4,q;- Note that this definition of weight 
is different from that given in Section A (- 
Sections F, G). If, moreover, the constant term 
a, in this Fourier expansion vanishes at all 
cusps of [, f is called a cusp form. In partic- 
ular, fis a cusp form of weight 2 if and only 

if f(z)dz gives rise in a natural manner to a 
‘differential form of the first kind on the 
Riemann surface Rp. (By a slight modifica- 
tion of condition (ii), we can also define an 
automorphic form of odd weight k. When 


-1 0 , 
( 0 her , condition (i) (for k odd) 1m- 
plies that f is identically equal to zero. So as- 


—1 0 
sume that ( 0 IE Then cusps Xo are 


classified into two categories, according as 
G E E Poly, Po 1 or not, and in the first 
case we should replace the power series in q, in 
condition (ii) by q” x (power series in q,).) 
(Note that the above definition of weight is 
slightly different from the general One given in 
Section A. See also the Siegel and Hilbert 
modular cases (- Sections F, G).) 

We denote by M,(T) (resp. S,(T)) the linear 
space of all automorphic forms (cusp forms) of 
weight k with respect to I’, Since clearly the 
relations Mi, Mi, My, Wh, Sy C Sy 4, hold, 
the direct sum M(T) = %, Vt,{P) is a (commuta- 
tive) tgraded algebra, of which S(T) = ¥, S, (I) 
is an ideal. It is known that W,(T) is of finite 
dimension, and dy = dim W (T), d? = dim S(r) 
can be determined (from the tRiemann-Roch 
theorem) as follows: 


d,= for k<0; 
| for =0 
nn af & pS 
y ° a for a 
g for t = 0, 
d,= d? =g; 
2 es fore “HOO. O27 


N| 


d?=d,—t for k even, >4, 


where s is the number of the equivalence 
classes of telliptic fixed points of [,e;(1 <i<s) 
is the order of the stabilizers of these elliptic 
points, t is the number of the equivalence 
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classes of cusps of T, and g is the tgenus of the 
—1 0 
Riemann surface Rp. (When ( 0 ier 


we can obtain a similar formula for k odd 
except for d,, d?, which, in general, cannot be 
determined from the Riemann-Roch theorem.) 

One method of constructing automorphic 
forms is provided by Poincaré series as fol- 
lows: Let dọ be a ‘linear fractional transfor- 
mation that maps the unit disk to the upper 
half-plane (e.g., the tCayley transformation), 
and put I” = og 'Tao; let p be a function 
holomorphic on the unit disk including its 
boundary (e.g., a polynomial). Then for k >4, 
the series 


P,(z)= >, o(o'(z))(do'(z)/dz}° 
ae 


is tuniformly convergent in the wide sense (Le., 
uniformly convergent on every compact set) in 
the unit disk and expresses an automorphic 
cusp form of weight k with respect to I’, .e., 
we have P,(06 '(z)) (dog '(z)/dz)** € S(T). 
Conversely, every element in ©,(I) (k >4) can 
be expressed in this form. A series of this type 
is called a theta-Fuchsian series of Poincaré (or 
simply Poincaré series). 

For k > 3, we can define a (positive definite 
Hermitian) inner product on ©, (I) as follows: 


(f,9) -Í f(z)g(z) y* * dxdy, 
F 


where z = x + jy and F is a fundamental region 
of F in $. This is called the P etersson metric. 
Since this integral converges also for f, ge 
WM, (T) if one of them belongs to S,(T), we 
can define the orthogonal complement €,(I) 
of S, (1) in MT). As we shall see, €,(1) is 
generated by tEisenstein series. 

Any automorphic function f with respect to 
T can be expressed as a quotient of two auto- 
morphic forms f, , f2 € Mt,(I) for a sufficiently 
large k. If we put w = f(z), the inverse function 
z =f '(w) can be expressed as the quotient of 
two linearly independent solutions of a ‘linear 
differential equation of the form d*z/dw? = 
~(w)z, where g(w) is an talgebraic function 
belonging to the Riemann surface Rp. 


C. Modular Functions and Modular Forms 

[4,7,8] 

The (elliptic) modular group 

r=! (1) 
=SL(2, Z) 
fa b 
ikc d 


is a Fuchsian group of the first kind acting on 


abe deZ,ad-be=1} 


the upper half-plane $. The quotient space 
I'\ can be compactified to get a compact 
Riemann surface Ry, = '\ U {00} of genus 
zero by joining one point at infinity corre- 
sponding to the cusp œ, around which we 
take q = e?*? as a local parameter. The dimen- 
sion d, = dim $i, (I) (k even, > 2) is given as 
follows: 


i _ S[k/12] for k=2 (mod 12), 
'TK/12]+ 1 for k#2 (mod 12), 


where [ ] is the fGauss symbol. 

More generally, an automorphic function 
(automorphic form) with respect to the prin- 
cipal congruence subgroup 


b 0 
rimia}(? "J= Jman} 


is called a modular function (modular form) of 
level N. (For the number of cusps of T ( N), 
denoted by t(N), and the genus of Ryn) > 122 
Discontinuous Groups.) At any cusp of T(N), 
the local parameter q, in (ii) is given by qy = 
exp(2ziz/N). For N, kè 3, the dimensions d,, 
dẹ are given by the general formula of the 
preceding paragraph (including odd k). For 

k > 3, we defme the (extended) Eisenstein series 


by 
| 


y 
Le 
m;=c,{modN)(M, + MZ) 


G,(Z;C4,€), N)= 


where ¢,, €z are integers such that (c,, c2, N) = 
1 and the symbol $; denotes the summa- 

tion excepting the pair (m,, m,) = (0,0). Then 
G,{z; c1» €2, N) depends only on ¢;,cz (mod N), 
and if we take a set {(c,,c2)} suchthat {(c,,c), 
(—c,, —¢2)} forms a complete set of repre- 
sentatives of the (primitive) residue classes 
(mod N), then the corresponding set of Eisen- 
stein series forms a basis of the space &,(I'(N)) 
(k > 3), whence we obtain dim ©, = d, = 4? = 
t(N) (kè 3) (for details, including the case k= 
1, 2, se E. Hecke, Ath Math San Univ. Ham- 
burg, 5 (1927)). 

The Fourier coefficient a, of the Eisenstein 
series can be calculated easily, and we get an 
estimate a, = O(n*~1**) for k> 2, where ¢ is an 
arbitrary positive number. On the other hand, 
for cusp forms, Hecke (Abh. Math. Sem. Univ. 
Hamburg, 5 (1927)) gave a, = O(n*?). In an 
attempt to improve this estimate, H. D. Kloos- 
terman was led to consider the sum 


u, v,4)= xp{ —-| ux+— 
a AN 


(u,v,4€Z), 


called the Kloosterman sum, which is also 
related to the arithmetic of quadratic forms. 
Using A. Weil’s estimate |K (u, v, p)| S 2./p 
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(where p is an odd prime, (u, p) = (v, p) =1), 
based on the analog of the Riemann hypoth- 
esis, we obtain a = O(n*?~"***) The estimate 
a,= O(nk/?~¥2**) is a weak form of what is 
known as the Ramanujan-Petersson conjec- 
ture (— Section D). In the case N = 1, we ob- 
tain the classical Eisenstein series G,(z) = 

G,(z; 0,0, 1) (Keven, > 4), from which we 

define the modular forms g,(z)= 60G,(z), 

g3(z) = 140 G,(z), and A(z) = gł = 27g3 of weight 
4, 6, and 12, respectively. If we denote by (9 = 
go{u; 1, z) the Weierstrass *¢-function with 

the fundamental period (1, z) we have the 
relation f’? = 4g? —g, ~—g,, and A(z) is the 
‘discriminant of the cubic polynomial appear- 
ing in this relation. It is known that every 
modular form can be expressed uniquely as a 
polynomial in gz and g3, or in other words, we 
have M(T)=Clg,, g3]. The polynomial in 
J, 93 expressing a modular form is, moreover, 
isobaric, i.e., consists of terms of the form 
cgy"g3", CEC, where 24+ 3v is a constant 
called the weight of the isobaric polynomial. 
Also, A(z) is a cusp form of the smallest weight, 
and the ideal of all cusp forms, S(T ( 1)), is a 
principal ideal in M(T( 1)) generated by A(z). 
The Fourier expansion of the Eisenstein series 
G,(z) is given as follows: 


1 Sue WA 
— (97) {B 4(—1¥2 k-1 
G,(z) = (27) al wt(-) as ts) 


where q =e*"7 and B, is the *Bernoulli num- 
ber. The discriminant A(z) is expressed as an 
infinite product as follows: 


A(z)=(2n)!2q í a-g}. 


If we put J(z)=g3/A, the function J(z)is a 
modular function and gives an tanalytic iso- 
morphism of the Riemann surface Ry) = 
T( 1)\§ U {co} onto the Riemann sphere 
CU {oo} (which maps C}, 4, «0 to 0, 1, 0, 
respectively). Hence the field of modular func- 
tions Kra) is a rational function field C(J) 
generated by the function J. The analytic 
isomorphism class of tcomplex tori of dimen- 
sion 1 (= felliptic curves) Ew, o) = C/(Za,+ 
Zw) with the fundamental period (@,, œ) 
is uniquely determined by the T( 1)-equivalence 
class of the modulus 7 = qw, /a, (€ $), and hence 
by the value J (t) of the function J. This is the 
historical origin of the name of tmodular 
function [4]. 

As an example of a modular function of 
level 2, we have the /-function: 

p +2)/2)— p(2/2) 


= T pled 


which gives an analytic isomorphism of the 
Riemann surface Ry) = I (2)\ $9 U (three 
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points) to the Riemann sphere CU { œ} (map- 
ping 0, 1, œ to 1, 00, 0, respectively; this prop- 
erty is used in a proof of the fPicard theo- 
rem). Hence we again have Ry.) = C(A). The 
following relation holds between J and A; 


4(1-A+2?)3 


M2= 5G yp 


In general, the field Ry,y) of modular func- 
tions of level N is generated by (g>/g3): e((a, 
+a,z)/N; 1,z) (a,,a,€Z). 


D. The Hecke Ring and Its Representation 
[S-1 13 


In general, let [ be a group, ĮI a subgroup of 
I, and suppose that, for every gef, oTa! 
and Į’ are tcommensurable, which amounts to 
saying that every double coset Fal is a union 
of a finite number of left or right cosets of 

I’. Then in the free module generated by all 
double cosets ToT (oe fF), we can define a 
bilinear associative product as follows: 


Poll = =Z mlo, f; p pEr, 


where, writing IoT =|); ro, rtr = |); Ti, we 
denote by m(a, 7; p) the number of pairs (i,j) 
such that To;t; = Ip. The associative ring thus 
obtained is called the Hecke ring (or Hecke 
algehra) and is denoted by # (Ñ, T). dif I is a 
‘topological group and T is an open compact 
subgroup, the Hecke ring #(I,T) can also be 
interpreted as the ring of all Z-valued continu- 
ous functions on Î with compact support 
provided with a tconvolution product; — 29 
Associative Algebras C.) 

As an example, setting [ = GL*(2, Q), T = 
SL(2, Z), we obtain a Hecke ring #(f,T). In 
virtue of the theory of telementary divisors, a 
complete set of representatives of 1\I'/T is 
given by 


da 


a 
We write T(a,,a,)=T n 


a,,4,€Q,4,,a,>0,a, la. 


I. From the 
a2 
relation T(a,@,,4,4,)T(a;',a;')=T(a,,a,) 
follows the commutativity of # Č, T). Fur- 
thermore, if we put, for a positive integer n, 
T,=T(n) J, T(a,,42), 


a, a EZ 
aja, =n 


then the following multiplication formula 
holds: 


T(n): Tim) =F dT(d, d) . 


din, m 


T(nm/d?). 


We obtain a representation of # rr ) in 
M,(T) as follows. For T=TeT = U To;€ 
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X (Č, T) and fem, (I) we put 
(I T)@=P fal2))(ciz +d) “det(a,), 


where 6; = 6 *) and lis a fixed integer. This 
i i 
representation leaves ©,{I) and €,(T) invar- 
iant. In particular, the representation of T, 
(with [=k 1), called the Hecke operator 
(Hecke, Math. Ann., 114 (1937)), is fHermitian 
with respect to the Petersson metric. 
Following Hecke, we associate with every 
modular form f(z) = $ o4 Ag’ (qQ=er™) a 
*Dirichlet series p(s) = LD, a,n °. Since a, = 
O(n* '**) for feM, (k > 2), o(s) is absolutely 
convergent in the half-plane Res > K The 
conditions for f to be a modular form of 
weight k are equivalent to the following con- 
ditions for @: (s-k) @ (s) can be extended to an 
Tentire function of fmite tgenus (actually, of 
genus 1) and, if we put R(s) = (2z) *T'(s)e(s), 
R(s) satisfies a functional equation of the 
form R(k= s) = (—1)*?R(s). A correspondence 
between f and ¢ that satislies these con- 
ditions is one-to-one; in fact we have a, = 
( —1)"?Res,_, R(s), and the function g(x) = f(ix) 
— (x > 0) and R(s) are related by the tMellin 
transform as follows: 


0 
Risi=| g(x)x*" dx, 
0 
1 
g(x) =— R(s)x-“ ds. 
2ni Res=09 


This correspondence between Dirichlet series 
and automorphic forms can be generalized to 
the case of a certain discrete subgroup of 
SL(2, R), which is not necessarily of the first 
kind (Hecke, Math, Ann., 112 (1936); [ 10]). It is 
also known that if the functional equations for 

©, x(n)a,n * hold for sufficiently many char- 
acters y, then f=)", a,qg" is a cusp form with 
a character for some congruence subgroup 
(Weil, Math. Ann, 168 (1967); [12]). 

Now, suppose that a linear subspace W of 
M,(L) is invariant under all T, (Nn =1,2,... ), 
and let(f,,...,f,)(«=dim Mè) be a basis of Mi. 
If we denote by A(n) the Kx K matrix represent- 
ing T, in this basis, it can be shown that there 
exist x x K (complex) matrices B® (1 <i <x) 
and 4(0) such that we have 


K 


2) =) a" n) =>), fil2)B®. 


Similarly we associate with F(z) a matrix- 
valued Dirichlet series 
= $ n-*2(n) 
n=1 
Then from the multiplicative property of the 
A(n), we obtain the following tEuler product 
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expression of ®(s): 


O(s)=[P]U.-A(p)p s+ ps), 
P 


where 7, denotes the unit matrix of degree x, 
In particular, when dim 9M = 1, i.e., when W 
= Cf, where f is an teigenfunction of all Hecke 
operators 7), the corresponding Dirichlet 
series (p(s) has an Euler product of the follow- 
ing form: 
p(s)=[](1-A,p +p. 

p 
For instance, the Dirichlet series correspond- 
ing to the Eisenstein series G, is (2(2xi)* (k — 
1))C(s)C(s— K+ 1) (where ¢ is the Riemann 
zeta function), which has a well-known Euler 
product. Since S,, = CA, the Dirichlet series 
corresponding to A(z) also has an Euler prod- 
uct. (This is part of the Ramanujan conjecture. 
Ramanujan also,conjectured that the quadra- 
tic polynomial 1 —/,X + p! 1X? appearing in 
this Euler product has imaginary roots, Le., 
|A,|< 2p’ “.) Generalizing the Ramanujan 
conjecture, Petersson conjectured that all 
eigenvalues of the Hecke operator T, in Š, 
have absolute value < 2p" !/? (Ramanujan- 
Petersson conjecture). P. Deligne proved this 
conjecture for K> 2, and his method extends to 
the case when T is a congruence subgroup 
(Publ. Math. Inst. HES, 53 (1974)). For k= 1, 
see Deligne and J.-P. Serre (Ann. Sci, Eade 
Nam Sup.,7 (1974)). It may be noted that this 
conjecture was proved earlier for K= 2 (and for 
almost all p) by M. Eichler (1954) and by G. 
Shimura (1958) (— 450 Zeta Functions M). 


E. The Case of Many Variables [1, 15,161 
Let X be a bounded domain in C™ and F a 
tdiscontinuous group of analytic automor- 
phisms of X. The Poincaré series of weight (= 
Section A) mè 2, defined similarly to the case 
of the unit disk, converges normally in X and 
expresses an automorphic form of weight m If 
T\X is compact, let (f;, , fẹ) be a basis of the 
space of Poincaré series of weight m Then for 
a sufficiently large m (which is a multiple of 
the order of T, for allx € X), the map X 3 z> 
(f,(z), ,f,(z)) defines in a natural manner a 
projective embedding of the quotient space 
T\X into P*~! (C), which actually gives an 
analytic isomorphism of [\X onto a tnormal 
projective algebraic variety. It follows that the 
field of automorphic functions with respect to 
T is an talgebraic function field of dimension 
N, and that every automorphic function can 
be written as the quotient of two Poincaré 
series of the same weight. In the case where 
T\X is not compact, we first discuss some 
examples. 
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F. Siegel Modular Functions [15,17,18] 

The Siegel upper half-space $, of degree # (or 
Siegel space of degree n) is, by definition, the 
space of all nx n complex symmetric matrices 
Z = X + iY with the imaginary part Y > 0 (posi- 
tive definite). (, is analytically equivalent to a 
tsymmetric bounded domain.) The group of all 
(complex) analytic automorphisms of §, is 
given by the real (projective) tsymplectic group 
Sp(n, R)/{ + Lan}, which acts transitively on 

H, by Z>0(Z)=(AZ + B)(CZ + D) for o = 


A B 
f i Sp(n, R). The subgroup F, = Sp(n, Z) 


consisting of integral matrices, or the corre- 
sponding group of linear fractional transfor- 
mations, is called the Siegel modular group 

of degree n, T, is a discontinuous group of the 
first kind acting on §,. 

A Siegel modular form f of weight k or of 
dimension -k is, by definition, a holomorphic 
function on §, satisfying the following con- 

B 

p] Ta {(a(Z))= 
f(Z)det(CZ + D)*; Gi) f has an integral Fourier 
expansion of the form f(Z) = Ero are" T2, 
where T runs over all half-integral tpositive 
semidefinite symmetric matrices of degree n. 
(Note that the Jacobian determinant J,(Z) of 


A B 
g= (¢ i is given by J (Z) = det(CZ + 


D} *-1.} For n22, condition (ii) is superfluous 
(M. Koecher, Math. Z., 59 (1954)). We denote 
by MP =M,(T,) the space of all Siegel modu- 
lar forms of degree n and weight k. When 


. Z ; 
we viteg,3z= (2 3) with Z;€§,-1, 3E 
C"! zeS,, and 


ditions: (i) f A 
1010nS: (1) Tor g = 
eo 


Ms f(Z)= F a,(Z,, 3), 
n=0 


it turns out that ap{Z,, 3) depends only on Z; 
and, writing it as f,(Z,), we have f, eM". 
The mapping @: f— f, thus defined is a linear 
mapping from Wt” into Mi", which is suf- 
jective if Kis even and > 2n (H. Maass, Math. 
Ann., 123 (1951)). We denote the kernel of ® 
by S(” and call feS” a cusp form (viewing 
H,- as a cusp of §,). For fe IM, the fol- 
lowing three conditions are equivalent: (a) 
fe SP; ) fZ) =E rare Te) 

| f(Z)det( Y)*” is bounded. We have Ij” = 
{0} if K<Oor if both kand Nn are odd, and 
MW = C. In general, dim WMP is finite and 

= OKT?) (k-t oo). The graded algebra 
MT) = EL My” is finitely generated, and 
its Krull dimension is n(n + 1)/2 + 1. In par- 
ticular, for n= 2 the structure of the graded 
algebra I(T) is determined explicitly. (The 
even part of this algebra is isomorphic to the 
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polynomial ring of four variables.) Also, 

dim Mi? and the singularities of the quotient 
variety [’,\), are known explicitly (J. 1. Igusa, 
Ame. J. Mah, & (1962), 86 (1964); U. 
Christian). 

As an example of Siegel modular forms, we 
have the Kisenstein-Poincaré series, delined as 
follows: 

Es (Z) = e2nitr(Sa(z)) det(CZ + D) pA 
aer,(S)\In 

where S is an nx # rational symmetric matrix 

> 0, and we put 


U T'U” 
n9-l(; — Jer 'USU =S, 


e?" tr(ST) = det( uF 


This series is convergent for k> n + rank § + 1 
if rank S <n and for k>2n if S>0, and the 
totality of Eg , (those series with S > 0) spans 
M (SH) (Maass, Math. Ann., 123 (1951)). 
‘Theta series delined by integral quadratic 
forms are examples of Siegel modular forms 
(with a certain level) and are of great signifi- 
cance in the arithmetic of quadratic forms 
(C. L. Siegel, Ann. Math, (2) 36 (1935). 

The quotient space I),\, can be compacti- 
fied as follows (I. Satake and W. Baily [15]): 
There exists a positive integer kọ such that, 
for any multiple k of ko, any basis of Wty” 
defines in a natural manner a one-to-one 
biholomorphic projective embedding of TAS, 
of which the image is a Zariski open set of a 
normal projective algebraic variety. Since the 
structure of this projective variety is indepen- 
dent of k, we denote it simply by I,\§, . In 
other words, D\9, is the projective variety 
Proj Mt(I,) associated with the graded algebra 
MT). Then we have TAH, = Ur- TAS,- 
When y 2 2 for r<n,T,\§, is of codimension 
> 2 inI,\§,,, So that (by virtue of fHartogs’s 
continuation theorem) the conditions at the 
points at infinity in the delinitions of modular 
functions and forms become superfluous. In 
fact, for n > 2, if we define a Siegel modular 
function of degree n simply as a Į -invariant 
meromorphic function on §,, then it can auto- 
matically be extended to a meromorphic func- 
tion on the compactilication T,\,,, and hence 
be expressed as the quotient of two modular 
forms of the same weight. It also follows that 
the field of all Siegel modular functions of 
degree n is an algebraic function lield of di- 
mension n(n + 1)/2. (These results can also be 
obtained from the tpseudoconcavity of T \%,, 


without using compactilication (A. Andreotti 
and H. Grauert, Nachr. Akud. Wiss. Göttin- 
gan).) For n = 2, the field of Siegel modular 
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functions is a rational function lield. On the 
other hand, for FI = 1 mod 8, n> 9, the field 
of Siegel modular functions is not rational 
(E. Freitag, Abh. Math. Sem Univ. Hamburg 
(1978)). 

Let I’ be a subgroup of I, of finite index. 
Then [\§, can be compactified in the same 
way as above [23]. However the singularities 
of I\, are extraordinarily complicated (Igusa 
[24]). When T is the principal congruence 
subgroup J) (N) of level N > 2, the blowing-up 
of T,(N)\§, along the cusps is nonsingular for 
n = 2, 3 (Igusa, Mah Ann, 168 (1967)). Fol- 
lowing Igusa’s idea, it is possible to construct 
an explicit nonsingular compactification of 
I\D, in which D is a bounded symmetric 
domain and T is a suitable arithmetic sub- 
group of Aut(D) ([19]; see also Satake, Bull. 
Amer. Math. Soc, 79 (1973)). For D= $x $ 
and T the Hilbert modular group (— Section 
G), F. Hirzebruch has given explicit nonsin- 
gular compactilications of F\D (Ensagnenet 
Math, 19 (1973)), 

The Hecke theory (— Section D) can also be 
extended to the Siegel modular case. For any 
integer M> 0, let §,, = S{”) be the set of 2n x 2n 
integral matrices M such that 'MJM = mJ, 

0 =i x í 7 
where i=(_ and J, is the identity 
matrix of degree n. We define the Hecke 
operator T(m) on MP by 


T(m) f =m"* "+12 X 
(4 8)-Mevim) 


det(CZ 


+ D)*f(MZ), 


where fe M”, Sm = y MeV(m) IM, and V(m) iS 
a complete system of representatives of I\S,,. 
Then for any prime number p, the formal 
operator series D,(X)= DY", T(p")X" is form- 
ally equal to a rational function in X: 
— PAX) 

Q(X)’ 
in which P,(X) and Q,(X) are polynomials in 
X of degree 2” = 2 and 2”, respectively, with 
coefficients in the ring of Hecke operators 
(Satake, Publ. Math. Inst. HES, 18 (1964)). For 


n = 2 we have (Shimura, Proc. Nat. Acud. Sci. 
US, 49 (1963)) 


D,(X)=(1—p** *X*) {1 — T(p)X +(T(p) 
— T(p°) =p Ax? aps T(p)X? 
+ pt EXA E 


Let f(Z)eM be a Siegel modular form of 
degree 2 such that T(m)f =A(m)f for all mè 1. 
We define L ,(s) by 


D(X) 


2 4 
L,{s)=C(2s—2k +4) X ~~ 
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in which ((s) is the Riemann zeta function. It is 
known that L s(5) can be meromorphically 
continued to the entire plane and satisfies the 
functional equation 


P (2k = 2—s) =¥,(5), 


where Y,(s) = (2) 7*P'(s)I'(s— K+2)L,(s) and 
the function (s = k)(s + K+ 2)'¥,(s) is entire (A. 
Andrianov, Trudy Math. Inst. Stddoo., 112 
(1971)). 


G. Hilbert Modular Functions 


Following a suggestion of Hilbert, 0. Blu- 
menthal studied a generalization of modular 
functions of the following type: Let K be a 
totally real talgebraic number field of Imite 
degree, and let K',, K™ be the conjugates 
of K, where n =[K : Q]. (“Totally real” means 
that all the K are real.) For a € K, we denote 
by ge K® the ith conjugate of a. Let D be the 
+Principal order of K, and consider the group 


Tp =SL(2,D) 


6 5 


We define the action of Fp on the n-fold prod- 
uct of the upper half-plane "= {z = (z; >.» z,) 
z;€H} by 


(1) (1) a”) (n) 
ao zı+ß z,+B ) 


2 BindeD20—py= I} 


yDz, 450 yz 4 5m 


x ; F 
for g= (e : ) then T°, becomes a discontinu- 


ous group of the first kind (Tẹ can also be 
considered as an tirreducible discrete subgroup 
of $L(2, R)“). The group I, is called the Hil- 
bert modular group of K (in the strict sense). 
(The defmition is sometimes modilied by re- 
placing the condition a — fy =1 by, say, xô 
By =(totally positive tunit).) If the tclass num- 
ber of K is h, the quotient space I’y\$)" can 

be compactified by adjoining h points at in- 
finity. Therefore, if n > 2, a Hilbert modular 
function can be defined as a meromorphic 
function on $" invariant under [’); and simi- 
larly a Hilbert modular form f of weight k 

or of dimension -k as a holomorphic function 
f on §"such that (f| 0)(z) = flo2)) TT, (yz; 
+6) =*= f(z) for all celp. (In the latter case, 
fis holomorphic at all cusps, i.e., for every 
o€SL(2, K), f|o has an integral Fourier ex- 
pansion.) For Hilbert modular functions and 
forms, results quite similar to those for the 
case N = 1 or the case of Siegel modular groups 
have been obtained (Kloosterman, Maass, K. 
B. Gundlach, H. Klingen). 
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H. Furtber Generalizations 


As further generalizations of the notion of 
modular function, we have Hilbert-Siegel 
modular functions (1.1. Pyatetskii-Shapiro, 
Baily, Christian [20], where we can tind some 
150 references), Hermitian modular functions 
(H. Braun, Klingen), etc. For the most general 
case, 1.€., for an arithmetically defined discon- 
tinuous group acting on a *symmetric bounded 
domain, a unified theory of automorphic func- 
tions has been established in the works of 
Pyatetskii-Shapiro [21,22] and Baily and A. 
Borel [23, 24]. 

On the other hand, exactly as in the classical 
theory, where the elliptic modular function 
gave an invariant of |-dimensional complex 
tori, generalized modular functions can be 
viewed as giving an analytic invariant of a 
certain family of (polarized) tAbelian varieties. 
From this point of view, a deep number- 
theoretic (and algebrogeometric) study of 
automorphic functions (initiated by Hecke and 
Eichler) has been substantially carrjed forward 
by the work of Shimura (see the serjes of his 
papers in Ann. Math. starting from vol. 70 
(1959); see also [9, 24, 25]). 

From the analytic point of view, the theory 
of automorphic functions is closely connected 
with the unitary representation of G in the 
space L,(I\G) (or its adelic analog) [26,27] 
(for adelic analogs = [ 12,131). In this respect, 
the ftrace formula of A. Selberg [28], gen- 
eralizing the +Poisson summation formula, 
is of fundamental importance; and actually 
it can be used effectively for calculations of 
the dimension of the space of automorphic 
forms and of the trace of Hecke operators 
(R. P. Langlands, Amer. J. Math., 85 (1963); 

H. Shimizu, Ann. Math., (2) 77 (1963), J. Fac 
Sd. Univ. Tokyo, 10 (1963); also [24]), When 
X = G/K is a symmetric domain, we can define, 
for any representation p of K, a (matrix- 
valued) canonical automorphy factor, by 
which we defme (vector-valued) automorphic 
forms with respect to a discrete subgroup [` of 
G, and under a further condition (say, I free, 
I\X compact, and the thighest weight of p 
sufftciently large) we obtain a formula for the 
dimension of the space of such automorphic 
forms in terms of the tarithmetic genus of [\X 
and certain numbers related to the “dual” X, 
= G,/K and the representation p [ 15,29, 30]. 
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33 (1.5) 
Axiomatic Set Theory 


A. General Remarks 


Axiomatic set theory pursues the goal of rees- 
tablishing the essentials of G. Cantor’s rather 
intuitive tset theory by axiomatic construc- 
tions consistent with modern theories of the 
foundations of mathematics. 

A system of axioms for set theory was first 
given by E. Zermelo [47], and was completed 
by A. Fraenkel [8]. J. von Neumann [30] 
expressed it in tsymbolic logic, gave a formal 
generalization, and eliminated ambiguous 
concepts. P. Bernays and K. Godel [2,11] 
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further refined and simplified von Neumann’s 
formulation. The theories based on the sys- 
tems before and after the formal generaliza- 
tion are called Zermelo-Fraenkel set theory 
(ZF) and Bernays-Gédel set theory (BG), 
respectively. 


B. Zermelo-Fraenkel Set Theory 


ZF is a formal system expressed in ‘first-order 
predicate logic with the predicate symbol = 
(equality) and based on the following axioms 
1-9 (- 411 Symbolic Logic). These axioms 
do not contain any predicate symbol other 
than €, where x€ y is read “x is an element 
of y.” Any formula containing only Easa 
predicate symbol is called a set-theoretic for- 
mula. Following the usual convention, we omit 
the outermost universal quantifier, and use 
restricted quantifiers such as 3x € a, Vx € a 
Axiom | (axiom of extensionality): 


Vx(xea=xeb)>a=b. 


This asserts that sets formed by the same 
elements are equal. The formula x € a(x € b) is 
denoted by ac b. This means “a is a subset of 
b.” Then Axiom 1 can be expressed by 


achbabca->a=b. 

Axiom 2 (axiom of the unordered pair): 
dxVy(yex=y=avy=b). 
This asserts the existence, for any sets a and b, 
of a set x having a and b as its only elements. 
This x is called the unordered pair of a and þ 


and is denoted by {a, b} The notion of ordered 
pair is characterized by 


(a,b)=(c,d)=a=cab=d. 


An example of such a set is (a, b} = { {a, a}, 


{a,b} }. 


Axiom 3 (axiom of the sum set): 
dx Vy(yex = 4zea(yez)). 


This asserts the existence for any set a of the 
sum (or union) x of all the sets that are ele- 
ments of a. This x is denoted by Ua or S(a). 
We writeau bforu {a, b}and a’ for 
aU {a,a}. 

Axiom 4 (axiom of the power set): 


Yxdy(yex=Vze y(zea)). 


This asserts the existence for any set a of the 
power set x consisting of all the subsets of a. 
This x is denoted by P(a). We have §(P(a)) = 
a, SO § is a left inverse of P, and the products 
SP and PS are idempotent. 

Axiom 5 (axiom of the empty set): 


dx Vy( TyeéX). 
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This asserts the existence of the empty set. The 
empty set is denoted by % or 0. 
Axiom 6 (axiom of infinity): 


dx(0ExvVyex(y'Ex)). 


This asserts the existence of a set containing 
all the natural numbers, where 0, 1 = 0’ = {0}, 
2= 1'={0, 1}, 3=2'={0,1,2}, This defini- 
tion of natural number is due to von 
Neumann. 

Axiom 7 (axiom of separation): 


dx Vy(yex=yean A(y)). 


This asserts that the existence for any set a and 
a formula A( y) of a set x consisting of all 
elements of a satisfying A( y). This x is denoted 
by {yea A(y)}. This is rather a schema for an 
intinite number of axioms, for there are an 
infinite number of A( y). This axiom, also 
called the axiom of comprehension or axiom 
of subsets, was introduced by Zermelo. 

For example, the set of all natural numbers 
is introduced by 


{xea|VyOeyaAVzey(z'ey) > xey)}, 


where a is a set satisfying Axiom 6. The set of 
all natural numbers is denoted by œ or N. 
Axiom 8 (axiom of replacement): 


dx Vyea(azA(y, z} >3zexA(y,z)). 


This asserts the existence for any seta of a set 
x such that for any y of a, if there exists a z 
satisfying A( y, z) then such z exists in x. If the 
relation A( y, z) determines a function, then the 
image of a set by the relation is included in a 
set, so by Axiom 7, the image is a set. This 
axiom was introduced by Fraenkel. 

Axiom 9 (axiom of regularity): 


3xA(x)>3x(A(x)a Vyex( 1A(y))). 


This asserts that if there exists a set satisfying 

A(x), then there is a set x satisfying A(x) but 

every element y of x does not satisfy the prop- 

erty ,4(y). This axiom is also called the axiom 

of foundation. Its dual expression, called €- 

induction, is of course equivalent to Axiom 9. 
Axiom 9’ (axiom of E-induction): 


Vx(VyexA(y)> A(x))>VXA(x). 


Using this we can show that xex, 

(KEY A y €x), etc. If we assume the axiom of 
choice stated below, then this is equivalent to 
the nonexistence of an infinite descending 
sequence 


XE. EXZEX]. 


If a model of set theory satisfies the axiom of 
regularity and has an infinite descending se- 
quence that is not in the model, then such a 
model is called a nonstandard model. 
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Axiom 10 (axiom of choice): 
Vxead yA(x, yo dyVxeaA(x, y(x)). 


This asserts that if for any element x of a there 
is a set y such that A(x, y), then there is a 
choice function y for the formula, i.e., 

A(x, y(x)) for all x in a. Usually a function is 
represented by its graph. A set fis called a 
function detined on a if 


Vx Vy((x,ylefroxea), Vxeary((x, ylef), 


VxeavyV2((x, ye fa(x, Z)E fy =Z). 


This formula is denoted by Fnce(f); then the 
formula A(x, f (x)) is an abbreviation of 


Fne(f) a Jy((x, yje f^ A(x, y)). 


The axiom of choice is equivalent to many 
properties, such as the well-ordering theorem, 
Zorn’s lemma, and Tikhonov’s theorem on the 
product of compact spaces, and it is used 
widely and essentially in mathematics. 

The system of axioms 1 to 9 is called 
Zermelo-Fraenkel set theory and is denoted by 
ZF; the system ZF minus the axiom of replace- 
ment is called Zermelo set theory, denoted by 
Z; and the system ZF plus the axiom of choice 
is denoted by ZFC. 

The system Z is weaker than ZF. Indeed, the 
existence of the set œw of all the natural num- 
bers and of P(ro), P(P(w)), can be proved 
in Z, but the existence of the set{w, p(w), 
P(P(o)), .} cannot be proved in Z. However, 
we Can prove its existence in ZF. 

The theory ZF minus the axiom of intinity is 
called general set theory. Its consistency can be 
reduced to the consistency of the theory of 
natural numbers as follows. Let n be any 
natural number and n = 2 + 272 +1 + 
2"(n, < n, < < ną) be its binary expan- 
sion. We can define a model of general set 
theory by identifying a set with the natural 
number n and defining m € n by “m appears as 
one of the n;.” In general set theory we can 
show the existence of 0, P(O), P(P(0)), but 
the existence of the set {0, P(O), P(P(0)), .} 
cannot be proved. However, we can prove its 
existence in Z. The set consisting of all heredi- 
tary finite sets is the smallest model of general 
set theory. 


C. Bernays-Gédel Set Theory 


The existence of {u| A(u)} for an arbitrary set- 
theoretic formula .4(u) cannot be deduced 
from the axioms of ZF. We call this object a 
class to distinguish it from sets. We introduce 
a generalized logical system of first-order 
predicate logic by adding to the logical system 
used in ZF class variables, class constants, and 
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inference rules with respect to quantifiers for 
classes. For any set-theoretic formula A(u) in 
which no tbound class variable occurs, we 
adopt 


AX Vu(ue X SA(u)) 


as an axiom, where capital letters X are class 
variables. The set theory thus obtained is 
equivalent to Gédel set theory [11]. Von 
Neumann axiomatized set theory by making 
use of the notion of functions instead of that of 
classes [30]. In refining this theory and intro- 
ducing the notion of classes, Bernays and 
Godel [2,11] initiated Bernays-Godel set 
theory, BG. 

ZF and BG are related as follows: Any 
formula provable in ZF is provable in BG, and 
any set-theoretic formula provable in BG and 
having neither class variable nor class constant 
is provable in ZF. In this sense, the systems 
can be regarded as essentially equivalent, but 
as BG has class variables and class constants, 
it is more convenient for expressing set- 
theoretic notions. 

Von Neumann detined the following func- 
tion R by *transfinite induction: 


RO) = Ø, RUR y B(R()), 
where yand f are ordinal numbers and 


| J< B(R(B)) denotes the set sum of $(R(0)), 
B(R(1)), .... BCR(B)), ... (B <a). The function R 
can be detined by a set-theoretic formula, so it 
exists as a class. Now consider the model 

M = M (q) for a fixed ordinal number g. We 
define sets of the model M as elements of R(«), 
and classes of the model as subsets of R(x). We 
denote the e relation of the model by €,,. For 
classes X and Yof the model, we write XE, 
Y2X eY Then a necessary and sufficient 
condition for R(x) to be a model of BG is that 
4 is an tinaccessible ordinal number (— 3 12 
Ordinal Numbers). The existence of an in- 
accessible ordinal number cannot be deduced 
from the axioms of ZF. There is a series of 
studies of axiomatization of set theory in 
which any number of inaccessible ordinal 
numbers is assumed to exist [23]. When R(x) 
is a model of BG (ZF), it is called a natural 
model of BG (ZF). Furthermore, if H is J) a R(x) 
then H satisfies all the axioms of BG (ZF). As 
we do not need the axiom of regularity for 
defining the class H, we see that BG (ZF) is 
consistent as long as BG (ZF) without the 
axiom of regularity is consistent. 


D. Independence of the Continuum Hypothesis 
and the Axiom of Choice 


These axiomatizations of set theory motivated 
a series of studies from the standpoint of the 
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tfoundations of mathematics on problems that 
remained unsolved after the appearance of 
Cantor’s primitive set theory. Among these, 
the problem of the relation between the con- 
tinuum hypothesis and the axiom of choice 
was central. 


Consistency of the Axiom of Choice and the 
Continuum Hypothesis. Gödel [1 1] proved 
that if ZF without the axiom of choice is con- 
sistent, then the system obtained by adding to 
ZF the axiom of choice and the continuum 
hypothesis is also consistent. To show this, he 
constructed a model of ZF satisfying the 
axiom of choice and the generalized con- 
tinuum hypothesis as follows. Assume first that 
M is an arbitrary domain of objects among 
which the ẹ relation is detined. By a formula 
on M, we understand a formula having con- 
stants of M as its only constants, having € as 
its sole predicate symbol, and further having 
exclusively variables whose ranges are re- 
stricted to M. Let us denote by M’ the totality 
of subsets of M detined by a formula A(x) on 
M. Now we put M,={@}, M41 = Ma Mp = 
(Ja<g Ma if B is a timit ordinal number. We 
call x constructible if x € M, for some ordinal 
number «, assumed to be less than the first 
inaccessible ordinal number, if any. We denote 
the totality of constructible sets by L, and the 
totality of sets of ZF by V. We call the asser- 
tion V= L, that is, every set is constructible, 
the axiom of constructibility. If we add this 
axiom to ZF, the axiom of choice and the 
generalized continuum hypothesis become 
provable. On the other hand, if we regard 
elements of L as sets of the model and the 
original e relation as the e relation of the 
model, we have a model of ZF in which the 
axiom of constructibility holds. 


Independence of the Axiom of Choice and tbe 
Continuum Hypothesis. Since the result of 
Gödel, attempts have been made to prove that 
the axiom of choice is independent of the other 
axioms. Fraenkel constructed a model of set 
theory without satisfying the axiom of choice, 
starting from a countable number of objects 
that are not sets. A. Mostowski constructed in 
ZF a model of set theory having objects that 
are not sets, and he proved that the model 
satisfies the axiom: Every set can be flinearly 
ordered, but does not satisfy the axiom of 
choice [27]. E. Mendelson constructed a 
mode1 of set theory that does not satisfy the 
axiom of choice by making use of an infinite 
descending chain a, 3 4) 3 a33..[25]. These 
models, however, do not satisfy al] the axioms 
of ZF or of Zermelo set theory minus the 
axiom of choice, even though they satisfy most 
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of the axioms. Consequently, they are not 
sufficient for proving independence. 

P. J. Cohen [3,4] proved the following 
results in connection with the independence of 
the axiom of choice, the continuum hypo- 
thesis, and V= L: If ZF is consistent, each of 
the following conditions has a model. (1) The 
axiom of choice as well as the generalized 
continuum hypothesis holds, but there exists 
an a satisfying a ¢ L and ac w, (2) P(w) is not 
well-ordered. (3) The axiom of choice holds, 
but the continuum hypothesis does not hold. 
(4) $B(B(w)) cannot be linearly ordered. 

By (1), we see that V= L is independent of 
the axiom of choice and the generalized con- 
tinuum hypothesis: (2) shows the independence 
of the axiom of choice; and (3) shows that the 
continuum hypothesis is independent of the 
axiom of choice. In (4), R (SB(a)) corresponds 
to the set F of all real-valued functions detined 
on the interval [0, 1], and (4) shows that the 
proposition “F can be linearly ordered” is not 
deducible in ZF without the axiom of choice. 


E. Some Recent Results 


Boolean-Valued Set Theory. D. Scott and 
R. M. Solovay defined models of set theory in 
which the set-theoretic formula has values in a 
complete Boolean algebra. This viewpoint is 
motivated by P. J. Cohen’s original notions of 
the forcing relation and the generic filter. 

According to the relation between sets and 
their representing functions, a function f: A+ 
B corresponds to a subset of A when B is a 
complete Boolean algebra. ® by transtinite 
induction we put 
VE= |] PPV PY, 

pa 

where P(A) is the Boolean-valued power set 
of A defined by 


ue P)(A)=u: AB. 


Let V™ be the union of all Vs; then the 
truth values of the formulas fu € v] and [u= v] 
are detined by transtinite induction as 


[uev] = i v(x): [x=u], 
[u=] 


= [] «= [xer}- I] 


xedom (u) xedom(v) 


v(x) => | xeu], 


where a = b is an abbreviation for the element 
—a +b in B. 

In V® the following properties are satistied: 
(i) The truth value of any formula provable 
from ZFC is 1, the largest element of B. 

(ii) Any complete homomorphism h: B, >B, 
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can be extended to 
h: V8) V», 
so that the commutative relation 


h([A(uy, -3 Un) |= [Alh(u), ---, hlun))] 


is satisfied for every restricted formula; further, 
if h is an epimorphism then it is satisfied for 
every formula. 

(iii) (maximum principle) For any formula A(x) 
there is an element y of V‘) such that 


[3xA(o] = [AW]. 


Moreover, if [3xR(x)] = 1 then there an ele- 
ment u of V such that | R(u)} = 1 and 


[Sx(R(x)A A(x) | = [AW]. 


Property (i) means that if /A/ < 1, then A is 
not provable from ZFC. Many consistency 
results in set theory are obtained in this way 
by constructing special partial order struc- 
tures, topological spaces, or complete Boolean 
algebras. 


Axiom of Strong Infinity. The axiom of infin- 
ity, Axiom 6, of ZF asserts that there exists a 
set including the set œ of al] natural numbers. 
The set @ is the smallest infinite cardinal num- 
ber. In general, the axiom of strong intinity is 
stated in a form such that certain special prop- 
erties of q are satistied for an uncountable 
cardinal number. 

Though such an axiom asserts the existence 
of large cardinal numbers, their properties 
sometimes reflect, for example, those of the 
real numbers, or of the set of real numbers, etc. 

We state here some typical examples: 

(i) Weakly inaccessible cardinal numher. This 
is a regular limit cardinal number, that is, 


Cflm,)=@,, VB <a(ws,,<a,). 


(ii) Strongly inaccessible cardinal number. 


cfm, )=@, VB <a(2°8<«@,). 


(iii) Weakly compact cardinal number. œ, is 
uncountable and the space X = 2° with < œw,- 
topology is w,-compact. 

(iv) Measurable cardinal number. œ, is un- 
countable and there is a nonprincipal <œ,- 
additive 2-valued measure p: P(w,)->{0, 1). 

(v) Strongly compact cardinal number. œ, is 
uncountable and any product of 2 = {0, 1 } with 
< w,-topology is w,-compact. 

The strength of these properties increases in 
the order presented here. Properties (i) to (ili) 
are compatible with the axiom of constructi- 
bility V= L. And (iv) implies V# L while it is 
compatible with V= L(y). But (v) implies 
VÆ L(a) for any set a. 

There are many other stronger axioms of 
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infinity, and the consistency proofs of these 
“axioms” within ZF or ZFC would involve 
essential difficulties. Inconsistency proofs of 
these properties, if they exist, would be very 
interesting. 


F. Examples of Results 


(1) Cardinality and Cofinality (— 3 12 Ordinal 
Numbers). (i) (W. B. Easton [6]) Let W be a 
model of ZFC (the Zermelo-Frankel axioms 
plus the axiom of choice) in which the GCH 
(generahzed continuum hypothesis) is valid, 
Le., Va(2°« = 0,41), and let g be a function 
from ordinals to ordinals in ¥ such that Va, 
Bla < P => Ora Sgp) and Vala, < cfa). 
Then there is a Boolean model St of ZFC, N > 
Wt, having the same cofinality and satisfying 
2°4 =(g(a) for every regular cardinal. (This 
means that Konig’s condition (cf(2°z) > @,) is 
the only restriction on the cardinality of 
powers of regular w.) 

Gi) (J. H. Silver [39]) Suppose 


wo <cf(a) = cfo} < Oa 

Then for any 4 < cf(a) 

Vv <a(2°r<@,, )72°<a,, 3. 
However, the validity of the implication 
Yn < (2r = 0,41) 2°° =O 44 


still remains an open question. And it is 
known that if there is a model for 


Yn<o(2°"=@y44), 29° > Mgt 


then there is a model in which there are many 
measurable cardinal numbers. 

(iii) (K. Prikry [31]) Let œ, be a measurable 
cardinal number in a model M of ZFC. Then 
there is a Boolean extension N of M in which 
the notion of cardinality is not changed but 
the notion of cofinality is changed, that is 
cf(w,) = o, in M but cf(w,) = @ in N. 

(iv) (Solovay) Let @, be a strongly compact 
cardinal number. Then for any strong-limit 
singular cardinal w, > œ, the continuum hypo- 
thesis holds at «,; that is, the implication 


ci(m,)<@g, Vp <B(2°1< wg) 92° = 55, 


is provable. 


(2) Lebesgue Measurability and the Baire Prop- 
erty. As is well known, every Aj (Borel) set 
(and consequently every Z} (analytic) set) of 
real numbers is Lebesgue measurable and has 
the Baire property. 

(i) (Gödel) V= L implies the existence of a 
A} set of real numbers that is neither Lebesgue 
measurable nor has the Baire property. 


Let DC denote the principle of depending 
choice: 


VxeadyeaP(x, y) 
>Jf:w>aYnewP( f(n), f(nt D). 


DC is adequate for the development of the 
classical notions of measure theory, such as 
Jordan decomposition, the Radon-Nikodym 
derivate, etc. Let I denote the hypothesis 
da(cf(w,) = 2,4 VB < a(2°8 < w,)) (strongly 
inaccessible). 

Gi) (Solovay [40]) The consistency of ZFC 
and 1 implies that of ZF and either of the 
following two axioms: 

(a) DC plus the hypothesis that every set of 
real numbers is Lebesgue measurable and has 
the Baire property. 

(b) The axiom of choice plus the hypothesis 
that every set of real numbers definable by an 
() sequence of ordinals is Lebesgue measurable 
and has the Baire property. 

Axiom (a) means that DC is not strong 
enough to construct a (Lebesgue) nonmeasur- 
able set, while Axiom (b) implies that every 
projective set is Lebesgue measurable and has 
the Baire property, and hence implies that the 
set of all constructible real numbers, as a X4 
set, has Lebesgue measure 0, and is of the first 
category. 


(3) Martin’s Axiom. Let B be a Boolean 
algebra. We say that B satisfies œw, C.C. (chain 
condition) if the cardinality of every disjoint 
family of positive elements of B is at most @,. 
Let B* denote the topological space consisting 
of all homomorphisms h: B—>2 = {0, 1} with 
the open base U(a)= {h|h(a)= 1} (ae B); then 
B* is a tBaire space. Then Martin’s axiom 
(MA) is: Let B be an œ c.c. Boolean algebra 
and & < 2”. Then the intersection of a dense 
open sets is dense in B*. Since {h X ,h(a,)= 
h(b)} is dense and open in B* if 2a, = b, 

MA means the existence of an h e B* preserv- 
ing any given set of g ( < 2”) equations in B, If 
2” = w,, MA merely reduces to the Baire prop- 
erty of B*, However, if 2” > œ, then the œ 

c.c. hypothesis is essential, for there exists a 

B satisfying œ, C.C. such that B* contains œ, 
dense open sets with empty intersection. 

(i) (Solovay and S. Tennenbaum [43]) The 
consistency of ZF implies that of ZFC, MA, 
and 2°? > %4. 

(ii) (D. A. Martin and Solovay [24]) ZFC, 
MA, and 2” > œ, imply the propositions: 

(a) Vo, <2°(2%« = 2”), hence 2” = 2%; 

(b) the totality of the first category sets of 
Lebesgue measure zero sets is a-additive for 
anya<2”; 

(c) every X : set of real numbers is Lebesgue 
measurable and has the Baire property. 
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(4) Suslin’s Hypothesis (SH) is: Every dense, 
linear, order complete set without end points, 
having at most q disjoint intervals, is order 
isomorphic to the continuum of real numbers. 

(i) (T. J. Jech, Tennenbaum) The consistency 
of ZF implies that of (a) ZFC, 7 SH, and 
GCH, as well as (b) ZFC, 1 SH, and 2” > w, 

Gii) (R. Jensen [15]) The consistency of ZF 
implies that of ZFC, SH, and 2” = @,. 

(iii) (Solovay and Tennenbaum [43]) ZFC, 
MA, and 2” > œ imply SH. 

(iv) Jensen [15]) V=L implies 7 SH. 


(5) Measurahle and Real-Valued Measurahle 
Cardinals. A cardinal x > % is said to be 
measurahle if there is a measure 4: B(xK)> 

{0, 1} with (a) u(x)=1, (b) Vv < x(u({v})=0) 
and (c) L(y <a Ay) = Lv<gH{A,);. and K is said 
to be real-valued measurahle if there exists a 
uu: B(x)  [0, 1] that satifies (a), (b) and (c), and 
is not measurable. 

Let MC (RMC) denote the existence of a 
measurable cardinal (real-valued measurable 
cardinal). 

(i) (S. Ulam [45]) The existence of an W- 
additive measure p:8(A)-[0, 1], with p(A)= 
1 and Yxe A(u({x})=0), implies RMC or MC; 
RMC implies the existence of an extension 
of Lebesgue measure defined on $( [0, 1]); 
every real-valued measurable cardinal is < X?” 
and weakly inaccessible. 

(ii) Every measurable cardinal is strongly 
(hyper) inaccessible, and (a) JaF(«)>Ja< 
w, F(a) for any 2 i formula F(a) on the ordinal 
numbers, (b) JaF(a)+Ja <p, F(a) for any IMi 
formula F(a) and the smallest measurable p. 
(Many results have been obtained concerning 
the ordinal magnitude of œ, and the measur- 
able cardinals.) 

Gii) (Solovay [41]) The consistency of ZFC 
and MC is equivalent to that of ZFC and 
RMC. 

(iv) (Martin and Solovay [24}]) ZFC, RMC, 
and MA are not consistent. 

(v) (Lévy and Solovay [21]) The consistency 
of ZFC and MC implies that of ZFC, MC, 
MA, and 2° > œ. 

(vi) (Solovay) ZFC and MC imply that 
every X1 set of real numbers is Lebesgue 
measurable. 

(vii) (Martin, Solovay) ZFC, MC, MA, and 
2” > w, imply that every Z} set of real num- 
bers is Lebesgue measurable and has the Baire 
property. 

(viii) (Silver [38]) The consistency of ZFC 
and MC implies that of ZFC and MC as well 
as the existence of (Lebesgue) non-measurable 
A} sets of real numbers. 


(6) Axiom of Determinateness. We consider the 
following inlinite game. Let A be a set consist- 
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ing of functions œw to œ. The players I and II 
choose alternatively natural numbers n; and 
m;, and, for the resulting inlinite sequence, if 


(11, 1,,Nz,---, My, My, -.. JEA, 


then 1 wins, otherwise II wins. 

A function 6: | Jeo 0">% is called a 
strategy. If II plays according to the sequence 
m=(m,,M,....™m,, . ) and Į plays according 
to the strategy a, then the resulting sequence is 
o*m 


=(o(%),m;, o(m,), ...,m,0(m,,...,m,), ...). 


The strategy g is called a winning strategy for 
Į of the game defined above, if for any play m 
of II, the resulting play o*m is in the set A, and 
similarly for the winning strategy of II. 

The axiom of determinateness AD is the 
assertion that for any A c wm”, the player I or 
II has a winning strategy. 

(i) (J. Mycielski [28]) Assume ZF and AD; 
then we have: 

(a) axiom of choice for countable sequence of 
sets of real numbers; 

(b) every set of real numbers is Lebesgue 
measurable and has the Baire property, so the 
axiom of choice does not hold; 

(c) every uncountable set of real numbers 
contains a perfect set. 

(ii) (Solovay) Assume ZF and AD; then we 
have: 

(a) cardinalities of œ, and 2” are not 
comparable; 

(b) œ and œ, are measurable cardinals, but %, 
is a singular cardinal such that cf(w,) = (Dy. 

(iii) (Martin) In ZFC, every Borel, namely 
A‘, game is determined. 

(iv) (Martin, Solovay) In ZFC: 

(a) if there exists a measurable cardinal, then 
every analytic, namely M}, game is determined; 
(b) if every A} -game is determined, then there 
is a model of ZFC with many measurable 
cardinal numbers. 

Since the axiom of determinateness AD is 
very strong, there is some suspicion that it 
might contradict ZF. But at present there is no 
such evidence. 
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Axiom of Choice and 
Equivalents 


A. The Axiom of Choice 


In set theory, the following axiom is known as 
the axiom of choice: For any nonempty family 
Y{ of nonempty subsets of a set X, there exists 
a single-valued function f, called the choice 
function, whose domain is Y1, such that for 
every element A of Y[ the value f(A) is a mém- 
ber of A. This axiom is equivalent to each of 
the following three propositions: (1) If {A,},., 
is a family of sets not containing the empty set 
and with an index set A, then the Cartesian 
product [I à A, is not the empty set. (2) If a set 
A is a disjoint union J , A, of a family of 
subsets {A,},,, which does not contain the 
empty set, there exists a subset B, called the 
choice set, of A such that every intersection of 
B and A (A€ A) contains one and only one 
element. (3) For every mapping f from a set A 
onto a set B, there is a mapping from B to A 
such that fo g = 1, (identity mapping). Also 
equivalent to the axiom of choice are the well- 
ordering theorem and Zorn’s lemma, which 
are discussed in the following sections. 


B. The Well-Ordering Theorem 


In 1904, E. Zermelo [2] first stated the axiom 
of choice and used it for his proof of the well- 
ordering theorem, which says that every set can 
be twell-ordered by an appropriate tordering. 
Conversely, the well-ordering theorem implies 
the axiom of choice. Many important results 
in set theory can be obtained by using the 
axiom of choice, for example, that tcardinal 
numbers are comparable, or that various 
definitions of the tiniteness or infiniteness 

of sets are equivalent. Various important 
theorems outside of set theory, e.g., the exis- 
tence of bases in a ‘linear space, tcompactness 
of the direct product of compact ttopological 
spaces (fTikhonov’s theorem), the existence 
of a subset which is not tLebesgue measurable 
in Euclidean space, etc., are proved using the 
axiom of choice. But for those proofs the well- 
ordering theorem or Zorn’s lemma (stated 
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below) are used more often than the axiom of 
choice. 

Using the well-ordering theorem, it can be 
proved in the following manner that for every 
field k, any linear space X over k has a basis. 
Let {x,},-, be an enumeration of X. By ttrans- 
finite induction, we can define the function f 
from X to B= {0, 1} such that (i) if X} = 0, 
f(Xo)=0; if Xo #0, f(x) = 1; Gi) for v >0,if x, 
is expressed as a linear combination of ẹlẹ- 
ments of {x |u <v, f(x,)= 1}, f(x,)=0; other- 
wise f(x,)=1. Then U ={xeX|f(x)=1}isa 
basis of X. 


C. Zorn’s Lemma 


An ordered set X is called an inductively 
ordered set if every ttotally ordered subset of X 
has an tupper bound. A condition C for sets is 
called a condition of finite character if a set X 
satisfies C if and only if every finite subset of 
X satisfies C. A condition C for functions is 
called a condition of finite character if C is a 
condition of finite character for the graph of 
the function. Zorn’s lemma [4] can be stated 
in any one of the following ways, which are al] 
equivalent to the axiom of choice. It is often 
more convenient to use than the axiom of 
choice or the well-ordering theorem. 

(1) Every inductively ordered set has at least 
one maximal element. 

(2) If every well-ordered subset of an ordered 
set M has an upper bound, then there is at 
least One maximal element in M, 

(3) Every ordered set A4 has a well-ordered 
subset W such that every upper bound of M 
belongs to W. 

(4) For a condition C of finite character for 
sets, every set X has a maximal (for the rela- 
tion of the inclusion) subset of X that satis- 
fies C. 

(5) Let C be a condition of finite character for 
functions from X to Y, Then, in the set of 
functions that satisfy C, there is a function 
whose domain is maximal (for the relation of 
the inclusion). 

Using Zorn’s lemma (1), we can prove again 
in the following way any linear space X over a 
field k has a basis. Let Y{ be the set of all non- 
empty subsets A of X such that arbitrary finite 
subsets of A are linearly independent over k. YU 
is not empty. If we order 9{ by the relation of 
inclusion, then 9 is an inductively ordered set. 
By Zorn’s lemma ( 1), there is a maximal ele- 
ment U of Qf. Since U is maximal, U is a basis 
of X. 

The same theorem is proved as follows 
using Zorn’s lemma (4). Condition C for the 
subset A of X, that arbitrary finite subsets of A 
are linearly independent over k, is a condition 
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of finite character. Hence there is a maximal 
subset U that satisties C and is a basis of X. 

Concerning recent developments regarding 
the axiomatic basis for the axiom of choice 
-> 33 Axiomatic Set Theory D. 
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Axiom Systems 


A. History 


A mathematical theory is based on a specific 
system of axioms, 1.€., a system of hypotheses 
from which the whole theory is deduced with- 
out reliance on other assumptions. 

One of the first deductive methods of math- 
ematical reasoning was utilized by Thales, who 
returned to Greece from Egypt with the knowl- 
edge of surveying methods, and who deduced 
additional results from that empirical knowl- 
edge. His method gave impetus to the devel- 
opment of Greek geometry, which flowered 
with the Pythagorean school and research by 
members of Plato’s Academy. In the course of 
this development, the deductive method led to 
the idea of constructing the whole theory upon 
a system of “absolutely obvious” statements 
from which the whole theory could be de- 
duced. Euclid systematized Greek geometry in 
his Elements utilizing this idea. His work 
became the basis of geometry after the Re- 
naissance, and Greek geometry came to be 
called *Euclidean geometry. In the Elements 
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Euclid called the basic obvious statements 
common notions when they were of general 
nature, and postulates when they were specifi- 
cally geometric. Both were later called axioms 
(or postulates). 

Among the axioms stated by Euclid, the 
“lifth postulate” concerning parallels was 
longer and more complicated than the other 
axioms. Many efforts were made to deduce 
this particular axiom from the other axioms. 
The failure of these attempts suggested the 
possibility of establishing a tnon-Euclidean 
geometry, which was actually done by N. 1. 
Lobachevskii and J. Bolyai, who replaced the 
fifth postulate by its negation and showed that 
the new system of “axioms” was as valid as the 
classical one. This development naturally led 
to a new evaluation of the idea of axioms, and 
eventually the traditional concept of recogniz- 
ing the axioms as obvious truths was replaced 
by the understanding that they are hypotheses 
for a theory. D. Hilbert [ 1] established the 
latter idea as axiomatization and claimed that 
the whole science of mathematics should be 
built upon a system of axioms. His idea be- 
came the foundation of present-day mathe- 
matics. Hilbert reorganized classical geometry 
based upon his idea and published his result in 
Foundations of Geometry (1899). 


B. Systems of Axioms 


The system of axioms of a theory, i.e., the 
system of basic hypotheses from which we 
hope to deduce the whole theory, is written in 
undefined terms (or in terms of undefined con- 
cepts) by means of which all other terms are 
defined. On the other hand, a given theory is 
axiomatized by specifying sucha system of 
axioms upon which the theory may be re- 
organized. It should also be noted that a sys- 
tem of axioms determines a ‘structure (— 409 
Structures). 

A system of axioms is considered to be 
mathematically valid if and only if it is consist- 
ent. It is also desirable that the axioms in such 
a system be mutually independent (i.e., the 
negation of any one of the axioms is still con- 
sistent with the others). When such a system is 
not independent, it can be simplified by delet- 
ing redundant axioms from it. 

When any two models of a system of axioms 
are isomorphic to each other, we call the sys- 
tem complete or categorical (— 409 Struc- 
tures). For example, the system of axioms (I)— 
(V) postulated by Hilbert as the foundation 
for Euclidean geometry is complete (— 155 
Foundations of Geometry), whereas the sys- 
tems of axioms for the theories of groups, 
‘rings, or tfields are not complete since there 
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are nonisomorphic groups, etc. Although it is 
desirable that the systems of axioms postu- 
lated for a given theory (e.g., the ‘theory of real 
numbers, or Euclidean geometry) be complete, 
the study of partial systems that may not be 
complete is also important (— 156 Founda- 
tions of Mathematics). 
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36 (XIl.1 7) 
Banach Algebras 


A. Definition [1-4] 


A real or complex *Banach space R is called a 
Banach algebra (or normed ring) if a multiplica- 
tion law between elements of R is introduced 
that makes R an fassociative algebra and if 
\|xy|] < |x|] || yl| is satisfied. The complex case 
is generally easier to handle, and a real Banach 
algebra can always be embedded in a complex 
one isomorphically and isometrically. When 
a Banach algebra contains a unity element e 
with respect to the multiplication, i.e., it is 
unital, we can suppose |/e|| = 1. If it is not yni- 
tal, we can adjoin a unity element to it. 


B. Examples 


1. Let Ca (M) be the set of complex- (real-) 
valued continuous functions x{¢} on a locally 
compact space Jt that vanish at intinity. 
Endowed with the pointwise operations 

and the tsupremum norm (- 168 Function 
Spaces), it forms a complex (real) Banach 
algebra. In the complex case, it has the invo- 
lution x*(€)=x(). 

2. Let X be a Banach space. The set (X) 
of tbounded linear operators on X forms a 
Banach algebra if we define addition, multi- 
plication by scalars, multiplication between 
elements, and the norm in the usual fashion. 

3. Let G be a locally compact Hausdorff 
group (- 423 Topological Groups) and ji be 
its tleft-invariant Haar measure. The Banach 
space L,(G) (with respect to u) can be made 
into a Banach algebra by defining xy(g) = 
{x(h)y(h-'g) duh) (- Section L). 

4. Every subalgebra of a Banach algebra 
R, which is closed in the norm topology, is a 
Banach algebra with respect to the original 
algebraic operations and norm and is called a 
closed subalgebra of R. 

Throughout this article, al] Banach algebras 
are supposed to be complex Banach algebras 
unless otherwise specified. 


C. Spectrum of an Element 


We define a new operation x: y in R by setting 
X'y=X+y—xy. Ifx-y=y-x=0, y is called a 
quasi-inverse of x. When a unity element e 
exists, y is a quasi-inverse of x if and only if 
e-y is an inverse of e x. For a complex 
number A such that |4| > |\x||, we see that 
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À~! x possesses a quasi-inverse y given by the 
strongly convergent series y= — 0, 47 "x". 
The set of all complex numbers 4 such that 
Atx does not have a quasi-inverse is called 
the spectrum of x and is denoted by Sp(x). 
(When x itself does not have an inverse, in 
particular, when R does not contain a unity 
element, we include 0 in this set.) Sp(x) is a 
bounded closed set in the complex plane, and 
sup{ |å] Z4eSp(x)} =lim,, ,,, |x". In Exam- 
ple 2, this spectrum is the spectrum of the 
operator x, and the problem of determining 
the behavior of this spectrum constitutes one 
of the central problems in the theories of 
Banach and Hilbert spaces. 


D. Representations of a General Banach 
Algebra [ 1,2] 


We understand by a representation of a 
Banach algebra R on a Banach space X any 
algebraic homomorphism z of R into the alge- 
bra B(X) (see Example 2) satisfying ||z(x)|| < 
||x|| for all xe R. We call X the representation 
space. A Banach algebra always possesses 

an isomorphic and isometric representation. 
Especially important are the irreducible rep- 
resentations. A vector subspace (closed or not) 
Y of X is an invariant subspace if x(x) Y c Y 
for any XE R. A representation is called alge- 
braically irreducible if the invariant subspaces 
are trivial, i.e., they are only {0} or X. A repre- 
sentation is said to be topologically irreduc- 
ible if closed invariant subspaces are trivial. 
The tkernel of an algebraically irreducible 
representation is called a primitive ideal, which 
Can alternatively be detined in the following 
way: A left ideal J ( # {0}, R) is regular, by 
definition, if R contains an element u, a unity 
element modulo J, such that x ~ xy e J for any 
XE R. Such an ideal is always contained in a 
maximal left ideal, which in turn is necessarily 
regular and closed, A two-sided ideal I is 
primitive if it is the set of elements a in R for 
which aR c J, where J is some fixed regular 
maximal left ideal. If R is commutative, a 
primitive ideal is a regular maximal ideal, and 
conversely. The intersection of all primitive 
ideals is called the radical of R, and when it 

is {0}, R is called semisimple. 

The set of primitive ideals 3 is known as the 
structure space of R, in which the hull-kernel 
topology (or Jacobson topology) is introduced. 
The tclosure of a set Qf in J is, under this 
topology, the set of primitive ideals containing 
the intersection of the ideals in Q{, This topol- 
ogy is rather intractable; even in commutative 
cases, it does not coincide with the Gel’fand 
topology (— Section E), in general. 
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E. The Gel’fand Representation of a 
Commutative Banach Algebra 


A complex Banach algebra is a field if and 
only if it coincides with the field C of complex 
numbers (Gel’fand-Mazur theorem). This is the 
most fundamental fact in the study of com- 
mutative Banach algebras. Now let R be a 
commutative Banach algebra. Then every 
regular maximal ideal J of R is closed and the 
quotient algebra R/J is isomorphic to C, so 
that the quotient mapping R-R/J is viewed 
as a complex-valued homomorphism of R. 
Conversely, if g is a nonzero complex-valued 
homomorphism of R, it has norm <1 as a 
linear functional on R and its kernel, say Rg, is 
a regular maximal ideal of R such that @ is 
exactly the quotient mapping R > R/R,. The 
set %II(R) of nonzero complex-valued homo- 
morphisms (or regular maximal ideals) of R 
endowed with the tweak* topology of the 
Banach space tdual R’ of R is called the maxi- 
mal ideal space of R. WM(R) is a locally com- 
pact Hausdorff space, and its topology is 
called the Gel’fand topology. For each xE R we 
define a function $ on Mi(R) by setting X(@) = 
(x). Then the mapping xX is a homo- 
morphism of R into the algebra C,(IR(R)} of 
all continuous complex-valued functions on 
M(R) vanishing at infinity. This is the Gel’fand 
representation of R whose image, the Gel’fand 
transform of R, is denoted by R. Concerning 
this, we have (1) W(R) is compact if R has a 
unity element; (2) Sp(x) equals the closure of 
the range X($t(R)) of $; (3) the representation 
xX is norm-decreasing if Co(Wt(R)) is en- 
dowed with the tsupremum norm; (4) ||la 
(=sup{|x(¢)|| peWUR)}) equals lim, |x". 
The tkernel of the Gel’fand representation of 
R is the radical of R, which consists of gen- 
eralized nilpotent elements of R. 


F. Banach Star Algebras 


An involution in a Banach algebra R is an 
operation x-»x* that satisfies (1) (x + y)* =x* 
+y*; (2) (Ax)* = Ax*; (3) (xy)*= y*x*; (4) (x*)* 
=x, A Banach algebra with an involution is 
called a Banach *-algebra. A x-homomorphism 
Ü between two Banach -algebras is an alge- 
braic homomorphism which preserves invo- 
lutions, i.e., P(x*) = B(x)*. To represent a 
Banach +-algebra we prefer a *-representation, 
i.e., a representation X- T, on a Hilbert space 
such that T,, is equal to the adjoint 7,* of T, 
for any xER. 


G. C*-Algebras [5-10] 


A Banach »*-algebra A satisfying ||x*x|| = |\x|l? 
for al] x € A is called a C*-algebra. Every C*- 
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ilgebra is *-isomorphic and -isometric to a 
3anach algebra of operators on a Hilbert 
pace (see Example 2) that contains, along 
with an operator, its adjoint (the Gel’fand- 
Vaimark theorem). A C*-algebra is semi- 
imple, and a commutative C*-algebra A is 
«isomorphic to C,($R(A)) under the Gel’fand 
‘epresentation. A topologically irreducible +- 
‘epresentation of a C*-algebra is also algebrai- 
cally irreducible (R. V. Kadison), and the set of 
‘unitary equivalence classes of these irreduc- 
ble +-representations is called the dual space. 
[t becomes a topological space if we introduce 
‘he hull-kernel topology inherited from the 
structure space, but other topologies are also 
introduced. Moreover, for the study of separ- 
ible C*-algebras, Borel structure in the sense 
of G. W. Mackey is a very powerful tool. C*- 
algebras also have intimate connections with 
the theory of ‘unitary representations of a 
topological group (see below) and with quan- 
tum physics. Many works have been published 
on *-representations, dual spaces, etc. 

A linear functional @ on a C*-algebra A is 
said to be positive if «@(x*x) > 0 for any x € A. 
For any nonzero positive linear functional @ 
of A, there exist a *-representation (z,,, H) of 
A on a Hilbert space If, and a vector č, E€ He 
such that w(a)=(z,,(a)¢,, Čo) for all ae A and 
that the subspace {7,,(a)é,,|a€ A} is dense in 
H._,. We call (za, H) a cyclic representation of 
A induced by w. A *-representation (z, H) of A 
is called universal if for any *-representation 
(p, K) of A there exists a to-weakly continuous 
+-homomorphism of n(A)” onto p(A)” such 
that p(a)=(f07)(a) for all ae A, where n(A)" 
is the double tcommutant of n(A) and SO is 
a tyon Neumann algebra (— 308 Operator 
Algebras). Von Neumann algebras 7(A)” for 
universal representations 1 of A are mutually 
-isomorphic, so that they determine the envel- 
oping von Neumann algebra of A. Especially, 
the direct sum z of all cyclic representations 
To is a universal representation and there is a 
unique isometric isomorphism of n(A)” onto 
the second dual A** of A that is bicontinu- 
ous in the o-weak topology of n(A)” and the 
o(A**, A*)-topology of A**. So A** is identi- 
fied with the enveloping von Neumann algebra 
of A. 


H. Some Classes of C*-Algebras 


Let A be a C*-algebra. If is a CCR (liminal 
C*-) algebra if it is mapped to the algebra of 
compact operators under any irreducible +*- 
representation. It is a GCR (postliminal C*-) 
algehra if every nonzero quotient C*-algebra 
of A has a nonzero CCR closed two-sided 
ideal. These classes of C*-algebras have inter- 
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esting properties. For example, the dual space 
of a separable C*-algebra is a tT} space (tT, 
space) if and only if the algebra is GCR (CCR) 
(J. Glimm). 

A is called an AF (approximately finite) 
algebra if it is the uniform closure of an in- 
creasing sequence of finite-dimensional C*- 
algebras (0. Bratteli). AF algebras are ¢lassj- 
fied by so-called dimension groups. To see this, 
we call an ordered Abelian group G (written 
additively) a Riesz group if (a) for any integer 
n>Q and any geG, ng 20 implies g>0 and 
(b) for any g;, h; (1 <i<m,1<j<n) in G with 
gi <h; for alli and j, there exists k € G with 
gi Sk <h; for all i, j. Then the isomorphism 
classes of AF algebras correspond bijectively 
to the isomorphism classes, as local semi- 
groups, of generating upward-directed heredi- 
tary subsets of the positive cones of countable 
Riesz groups, and a dimension group, defined 
otherwise, is exactly a countable Riesz group 
(G. Elliot, E. G. Effros, D. E. Handelman and 
C. L. Shen) (— M). 

Now consider two C*-algebras A and B. 
The algebraic ttensor product A ® B over C 
becomes a *-algebra in the natural way. A 
norm £ on A ® B is a C*¥-crossnorm if ||xyll, < 
ixis ylis and ||x*x\l p= llcllg for x, YEA @ B 
and ||a@ b||,< |\al| ||b|| for ae A, be B. The 
completion of A Q) B under such a norm f is 
a C*-algebra, which is denoted by A ®), B. 
There are two special C*-crossnorms on A & 
i?: |! lain and |l | pax: The former is called the 
spatial (minimal, injective) C*-crossnorm and 
is defined by |/x\|min=SUPp, y I(E Q WO, 
where @ and w run over all *-representations 
of A and B, respectively. The algebra A ® min B 
is called the spatial tensor product of A and B. 
The latter is called the greatest C*-crossnorm 
and is defined by ||X|l max =SUPr || T(x)|], where 
T runs over all *-representations of A ® B. 
And the algebra A @,,,, B is called the projec- 
tive C*-tensor product of A and B. Any C*- 
crossnorm f on A ® B lies in between these 
two norms, ie, ||x|min < Ilg < l |lmax for 
xEÁ® B, A C*-algebra A is called nuclear if 
A ® B has a unique C*-crossnorm for any C*- 
algebra B. It is known that any GCR algebra 
is nuclear and that an inductive limit of nu- 
clear C*-algebras (e.g., any AF algebra) is 
nuclear. Given a linear mapping @ between 
C*-algebras A and B, we define for any inte- 
ger n 2 1 a linear mapping 9, :A ® M,->B ®© 
M, by setting ~,(x © ey) = g(x) Q e; where 
{e;} are the matrix units for the C*-algebra 
M, of nx n complex matrices. @ is said to be 
completely positive if @, is positive for any n > 
1. A C*-algebra A is then called injective if, 
for any C*-algebras B, C with BS C and any 
completely positive contraction @: B— A, there 
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exists a completely positive contraction @: 
C+A that extends g. A C*-algebra A is 
nuclear if and only if its enveloping von Neu- 
mann algebra A** is injective. Every injective 
C*-algebra B is not necessarily a von Neu- 
mann algebra but every AW*-algebra, i.e., 
every maximal commutative -subalgebra M 
of B, is monotone complete (i.e., every increas- 
ing net of self-adjoint elements has a least 
upper bound in M) and the lattice of projec- 
tions in B is conditionally complete. 


I, Crossed Product 


Let A be a C*-algebra, G a locally compact 
Hausdorff group with left-invariant mea- 
sure u, and gy a continuous action of G on A 
as «-automorphisms «,, géG. Let K(G, A) 

be the linear space of continuous functions 
from G to A with compact support, which is 

a x-algebra under the multiplication y * z 

and the involution y* given by (y x z)(g) = 

fe y(h)a,(z(h*g)) du(h) and y*(g)= A(g~")- 
ag(y(g ')*) for y, ze K(G, A), where A is 

the tmodular function of G. By completing 
K(G, A)under the norm | y|, ={¢ | rll du(g), 
we get a Banach +-algebra L,(G, A). L,(G, A) is 
then made into a C*-algebra by furnishing the 
norm ||x|!=sup, ||7(x)|], where z runs over all 
-representations of L(G, A). This C*-algebra, 
denoted by A ®, G, is the crossed product of 

A by G relative to the action a, Crossed prod- 
ucts are useful in the structure theory of Ç*- 
algebras. 


J. Extensions by C*-Algebras (BDF Theory) 
[12] 


Let H be a separable infinite-dimensional 
Hilbert space and Y@(H) the ideal of B(H) 
consisting of all compact operators. The quo- 
tient C*-algebra BA) LEIH) is called the 
Calkin algebra. We denote it by 2(H) and 

the quotient mapping by 1: 3B(H)> A H). By 
an extension of Z(H) by a separable unital 
C*-algebra A we mean a unital (preserving 
unity elements) *-isomorphism t of A into 
AH), We call two extensions t4, t, :A > 9(H) 
equivalent if, for some unitary operator ye 
BH), n(u)t, (a)n(u)* =1,{a) for any ae A, and 
denote by Ext(A) the set of all equivalence 
classes [t] of extensions t by A. Ext(A) forms 
a commutative semigroup with respect to the 
addition [t] = [t,]+[t,], where t(a)=1, (a) ® 
t,(a)e AH) © AH) AH © H). Call an ex- 
tension t;A—2(H) trivial if there is a unital 
*_isomorphism o of A into @(H) with t = 70. 
Then (1) all trivial extensions are equivalent, 


and we define the unity element of the semi- 
group Ext(A) (D. Voiculescu); (2) Ext(A) is a 
group if A is commutative (L. G. Brown, R. G. 
Douglas, and P. A. Fillmore) or, more gener- 
ally, if A is nuclear (M. D. Choi and E. G. 
Effros); (3) Ext(A) is not always a group (J. 
Anderson). For further information — 390 
Spectral Analysis of Operators J. 


K. Derivations in C*-Algebras 


A linear operator 6 in a C*-algebra A is a 
derivation in A if its domain D(6) is a dense 
subalgebra of A and 6(xy) = 6(x)y + xd(y) for x, 
ye D(6). dis a *-derivation if, moreover, x € D(6) 
implies x* € D(S) and 6(x*) = 6(x)*. 6 is called 
bounded (unbounded, closed) if it is bounded 
(unbounded, closed) as a linear operator. 

Every bounded derivation 6 is expressed 
as (x) =Gx —xd with some 4 in the envelop- 
ing von Neumann algebra A** of A(R. V. 
Kadison and S. Sakai). The element @ can be 
taken from the multiplier algebra M(A) = 
{be A**|bA + AbCA} of A if A is simple 
(Sakai) or if A has continuous trace (C. A. 
Akemann, Elliott, G. K. Pedersen, and J. 
Tomiyama). If A is separable, we see that all 
bounded derivations in A are given by ele- 
ments in M(A) if and only if A is the C*-direct 
sum of a family of simple C*-algebras and a 
C*-algebra with continuous trace (Elliott, 
Akemann, and Pedersen) [13] and that every 
bounded derivation in the quotient C*-algebra 
A/I, I being any closed two-sided ideal, can be 
lifted to a derivation in A (Pedersen). 

Next we consider unbounded derivation. By 
a *-automorphism group on A we mean a tone- 
parameter group p, t € R, of *-automorphisms 
of A such that, for each xE A, p,x is contin- 
uous in ¢ € R. A C*-dynamical system is a 
pair consisting of a C*-algebra A and a x- 
automorphism group p, on A. The fact that the 
time evolution of a physical system is often 
represented by such a dynamical system has 
made the study of unbounded derivations 
quite active. We have to see if a given deriva- 
tion ĝis tclosable and if, in case of a closable 
6, the closure ô generates a *-automorphism 
group. Sample results: (1) If a *-derivation 6 
is well-behaved in the sense that for every gelf- 
adjoint x in D(6) there exists a tstate ọ with 
|(o(x)| = |x|! and @(d(x))=0, then 6 is closable 
and its closure is well-behaved (A. Kishimoto 
and Sakai); (2) if a *-derivation 6 is closable, 
its closure § generates a *-automorphism 
group if and only if ò is well-behaved and 
(1 +6)D(6) is dense in A (R. T. Powers and 
S. Sakai; 0. Bratteli and D. W. Robinson). 
For further results = [ 14]. 
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L. Applications to the Theory of Topological 
Groups [2, 3,6, 15] 


Banach algebras have many applications in 
different branches of mathematics and quan- 
tum physics. Here we mention some that con- 
cern topological groups. Let G be a locally 
compact Hausdorff group and p be its left- 
invariant measure. We make the complex 
algebra L,(G) of Example 3 into a Banach 
x-algebra by detining x*(g)=x(g ')A(g *), 
where A is the modular function of G. This is 
called the L,-algebra (or group algebra). It is 
not C* but is semisimple. Considering a uni- 
tary representation of G is equivalent to con- 
sidering a *-representation of the L,-algebra. 
Replace the norm of xe L,(G) by sup||z(x)]|, 
where the supremum is taken over al] the +- 
representations. The new norm satisties the C* 
condition, and the completion of L,(G) with 
respect to this norm is a C*-algebra, which we 
call the C*-group algebra of G. The dual of the 
C*-group algebra thus detined is called the 
dual of the group G, and this notion plays an 
important role in the study of topological 
groups. Unitary representations of a group G, 
*-representations of the L,-algebra of G, and 
*-representations of the C*-group algebra of G 
are all characterized by positive definite func- 
tions on G. A function p(g) is positive definite, 
by definition, if it is measurable on G and 


[po h)x(g)x(h)du(g)du(h)>0 


for any continuous function x(g) with compact 
support. 

The Abelian case. When G is an Abelian 
group (- 422 Topological Abelian Groups), a 
regular maximal ideal M of the L,-algebra R 
of G and a character y of G are in a one-to-one 
correspondence by the relation 


(the left-hand side is the value of x at M under 
the Gel’fand representation). Moreover, the set 
of regular maximal ideals of R provided with 
the Gel’fand topology and the set G of charac- 
ters of G provided with the Pontryagin topol- 
ogy (the tcharacter group of G) are homeo- 
morphic by this correspondence. Therefore the 
Gel’fand transform of an element x of the L,- 
algebra R is seen to be a function X(y) on G 
defined by the integral on the right-hand side 
in the above expression, which is properly 
called the Fourier transform of x. Of course, 
the Fourier transform can be detined for other 
classes of functions (e.g., for the L,-space over 
G. the Fourier transform in the sense of Plan- 
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cherel), and classical theories of Fourier 
series, Fourier integrals, and harmonic analysis 
(- 192 Harmonic Analysis) are studied from a 
more extensive point of view. Thus the state- 
ment that the Fourier transform of an element 
x of the L,-algebra of G is a continuous func- 
tion vanishing at inlinity (— Section E), for 
example, is a version of the classical tRiemann- 
Lebesgue theorem. Bochner’s tbeorem in 
classical Fourier analysis is restated thus: A 
continuous tpositive definite function on an 
Abelian group can be put in the form 


p(g= | 7(g)dp(y), 


where p is a uniquely determined bounded 
positive ‘Radon measure on the character 
group G. Developing these theories further, we 
obtain an alternative proof of the *Pontryagin 
duality theorem (H. Cartan and R. Godement). 
A closed ideal I in the L, -algebra R deter- 
mines a set Z(D in G as the set of common 
zeros of the Fourier transforms of elements of 
I, We ask whether, conversely, J is character- 
ized by Z(1). This question is the problem of 
spectral syntbesis, and many important results 
have been obtained. The statement that J must 
coincide with R when Z(1) is empty is a for- 
mulation of the generalized Tauberian theorem 
of N. Wiener. A considerable simplification 

of the proof was accomplished by using the 
theory of Banach algebras, which was the first 
application of the theory (Į, M. Gel’fand). 


M. Holomorpbic Functional Calculus 
[16,171 


Let R be a unital commutative Banach algebra 
with maximal ideal space M = MR). We de- 
fine the joint spectrum Sp(X) of a finite n-tuple 
X= {X15 ++ Xp} in R by (1 (9), &.(9)) 
geMt}, a nonempty compact subset of C”. 
Then the bolomorpbic functional calculus says 
the following: For each X = {x,,-,%} CR 
there exists a unique algebra homomorphism 
®, of the algebra H(Sp(X)) of tgerms of holo- 
morphic functions on §p(X) into R with the 
following properties: (1) ®,(1) = 1, the unity 
element of R, and ®,(z;) = X; 1 Si <n; (2) if 
n<m, X'={x,,....x} and F(z,,....z,)= 
F(z,,.... z,) with Fe H(Sp(X)), then ®,.(F)= 
®,(F); (3) if { F, k = 1,2, } are holomorphic 
in a fixed neighborhood U of Sp(X) and F,> 
F uniformly on U, then 0,(F,)>®,(F) (L. 
Waelbroeck). From this follows the implicit 
function tbeorem: Let Xis X ER, fEeC(M, 
and let F(w,z,,.. , Z„) be holomorphic in a 
neighborhood of the set Sp( f, X1,- - , Xp) = 
{(F(@), 21 (@),, %,(@))| EM. Suppose that 
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F(f,%,, ...,%,)=0 but 0F/3w #0 on Sp(f,x,, 
.++)X,). Then there exists a unique ye R such 
that =f and F(y,x,,.,x,)=0(R. Arens and 
A. Calderon). As an application we obtain 
Shilov’s idempotent tbeorem: If K is a closed- 
open subset of W, there exists x € R with X= 

1 on K and X=0 on M-—K. Set Q(R)= {xe 
R\e?™*= 1). Then Q(R) is an additive sub- 
group of R and Shilov’s theorem says that 

the Gel’fand representation gives an jsomor- 
phism of Q(R) onto Q(C(W)) (= H? (WR, Z)). 
Another theorem of this sort is the Arens- 
Royden theorem: R ~'/exp(R) = C(M) '/ 
exp(C(M)) X H! (M, Z), where R~!is the set 
of invertible elements of R and exp(R)= {e*| 
XE R}. Further extensions related to K-theory 
have been given by J. L. Taylor and others. 
(For functional calculus for one variable > 
25 1 Linear Operators) 
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A. General Remarks 


The notion of Banach space was introduced in 
1922 by S. Banach and N. Wiener, working 
independently. The idea was to apply topo- 
logical and algebraic methods to fundamental 
problems of analysis, such as mapping prob- 
lems in infinite-dimensional function spaces 

(- 168 Function Spaces, 197 Hilbert Spaces, 
25 1 Linear Operators). 


B. Definition of Banach Spaces 


We associate to each element x of a *]inear 
space X over the real (or complex) number 
field a real number ||x|| satisfying the following 
conditions: (i) ||x|| 0 for all x, and ||x|| =0 is 
equivalent to x =Q; (ii) ||ax|| =|a|* ||x|| for any 
real (complex) number 4; (ili) |x + y|} <|]x|| + 

|| yl]. Then ||x|| is called the norm of the vector 
x, and X is called a normed linear space. The 
norm is thus an extension of the notion of 

the length of a yector in a tEuclidean space. A 
normed linear space X is a tmetric space under 
the distance p(x, y)=||x—y||. We write s- 
lim,,..oX, =X or simply x,—x when lim, lX, 
—x||= 0 and say that x, converges strongly to 
x. If this metric space X is fcomplete, then X 
is called a Banach space. In normed linear 
spaces the addition and the multiplication 

by scalars are continuous. A closed linear 
subspace M of a Banach space X is again a 
Banach space, and the tquotient space X/M 
becomes a Banach space if the norm of a tcoset 
is defined by ||x +M || =infmey [lx +mlj. The 
subset {x ||x|| < 1} of a normed linear space 

X is called the (closed) unit ball (or unit sphere) 
of X. 

Examples. tFunction spaces C, L, (l<p< 
œ), M, W, Ho, tsequence spaces c, |, m and 
BY are all Banach spaces (- 168 Function 
Spaces). 


n> © 


37 D 
Banach Spaces 


C. Linear Operators and Linear Functionals 


Suppose that a linear subspace D(T) of a 
linear space X is the (definition) domain of 

a mapping T with values in a linear space 

X, T is called a linear operator if T(x + 

By) = a Tx + BTy for any scalars «, B and x, ye 
D(T). R(T)={Txe X, |xeD(T)} is called the 
range of T. In the special case where X; is 

the real or complex number lield, T is called 
a linear functional. If X and X, are both 
normed linear spaces, then T is continuous if 
and only if s-lim,,, Tx, = Tx whenever s- 
lim,., X,=X. This is equivalent to the condi- 
tion SUPxep(7), |x| <1 || Tx|| <œ. In particular, if 
D(T) =X, the linear operator T is continuous 
if and only if the set {Tx ||x|| <1} is bounded. 
In this case, T is called a bounded linear oper- 
ator, and |T|| = sup), <1|| Tx ||is called the 
norm of the operator T. In particular, a linear 
operator T satisfying || T]| < 1 is called a con- 
traction. (Sometimes T is called a contraction 
only when ITI < 1. In that case, an operator 
with | T || < 1 is called a nonexpansive opera- 
tor.) The scalar multiple, sum, and product of 
linear operators are defined by (xT)x = a( Tx), 
(T +S)x= Tx +Sx, and(ST)x = S( Tx), respec- 
tively. The identity operator J in X is defined 
by I: x =x for all x EX. If the inverse mapping 
T! of x3 Tx exists, then it is called the in- 
verse operator of T. A normed linear space X 
is said to be isomorphic to a normed linear 
space Y if there exists a bounded linear opera- 
tor T from X onto Y with bounded inverse. 

If T can be chosen to be isometric (i.e. || Tx||_ = 
||x|| for all xe X), then X is said to be iso- 
metrically isomorphic to Y. 


D. The Dual Space and the Dual Operator 


The totality of continuous linear functionals 
f defined on a normed linear space X is a 
Banach space X’ under the previously de- 
fined linear operations and the norm | f|| = 
SUP)x) <1 |f(x)|. This X’ is called the dual (or 
conjugate) space of X. In view of useful prop- 
erties of the tinner product in ‘Hilbert spaces, 
it is sometimes convenient to write <x, f > for 
f(x). Let X and Y be normed linear spaces, 
and let T be a linear operator with the dense 
domain D(T) in X and the range R(T) in Y. 
If (f, g) is a pair with fe Y’ and ge X’ satisfy- 
ing the equation (Tx fF) = <x, g) for any © 
D(T), then g is determined uniquely by f The 
operator T' defined by T’f= gis linear and is 
called the dual operator (or conjugate or adjoint 
operator) of T. This is an extension of the 
notion of the ftranspose of a matrix in matrix 
theory. If T is a bounded linear operator then 
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T’ is also a bounded linear operator such that 


ITI = IT]. 


E. The Weak Topology and the Strong 
Topology 


Let X be a normed linear space and X’ its 
dual space. Take a finite number of elements 
Xi X2, X, from X’, and consider the subset 
of X:{xeX |supı cienl <x, xi) <e}, e>O0. If 
we take the totality of such subsets of X as a 
tfundamental system of neighborhoods of 0 
of X, then X is a ‘locally convex topological 
linear space, denoted sometimes by X,,. This 
topology is called the weak topology of X. If 
a sequence {x,}C X converges to x € X with 
respect to the weak topology of X, then it is 
said to converge weakly. This is equivalent to 
the convergence (x,, f <x, f) for any f e 
X’. The original topology of X determined by 
the norm is then called the strong topology 
of X, and to stress the strong topology we 
sometimes write X, in place of the original 
X. Take a finite number of elements x,, X3, 
< Xa from X, and consider the subset of 
X'{x'EX'| sup, cicnl<Xj.X’>|<e}, €>0. If 
we take the totality of such subsets of X’ as a 
fundamental system of neighborhoods of O of 
X’, then X’ is a flocally convex topological 
linear space. We write this space as X’ and 
call the topology the weak* topology of X’. 
The topology of X’ detined by the norm || f || is 
called the strong topology of X’, and to stress 
the strong topology we write X;. The terms 
“weak” and “weakly” are used in reference to 
the weak topology, for instance, weak closed- 
ness and weakly closed. Similar conventions 
are used for the weak* and strong topologies. 
The unit ball of the dual space of a normed 
linear space X is weak*-compact (Banach- 
Alaoglu theorem). Then by the tKrein-Milman 
theorem (— 424 Topological Linear Spaces) 
the unit ball of X’ is the weak* closure of the 
fconvex hull of its textreme points. If X is a 
Banach space, a convex subset K of X’ is 
weak*-closed if and only if the intersection of 
K and each weak*-compact subset is weak*- 
closed (Krein-Shmul’yan theorem). 


F. The Hahn-Banach Extension Theorem 


Let M be a linear subspace of a real linear 
space X and p(x) a real-valued functional 
detined on X such that p(x+ y)< p(x)+ p(y) 
and p(Ax)=Ap(x) for all x, ye X and A>0. If 
a linear functional f, defined on M satisfies 
fi(x) <p(x) for all x € M, then there is a linear 
functional f on X which extends f, and satis- 
fies f(x) <p(x) for all xeX (the Hahn-Banach 
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(extension) theorem). The Hahn-Banach exten- 
sion theorem has many applications. (1) Let M 
be a linear subspace of a normed linear space 
X. Then for any f, € M’, we can construct an 
feX' such that f(x)=f,(x) for all xe M and 
fll = Wf, ||.) For any x #0 of a normed 
linear space X, we can construct an hE x’ 

such that fo(Xq)=[|xoll and || foll = 1. (3) For 
any closed linear subspace M of a normed 
linear space X and a point xọ¢ M we can 
construct fo¢X’ such that || fpl =, fo(xo)= 
inf,ca |Xo— m|| and fo(x)=0 for all xe M. A 
proposition more general than (3) is Mazur’s 
theorem, which is useful in applications: (4) Let 
a closed subset M of a normed linear space X 
be convex. Then for any xo € M wecan con- 
struct an fy¢X’ such that f(x) > fo(x) for all 
XE M. By (4) we can prove, e.g., that a convex 
set of a normed linear space 1§ weakly closed 
if it is strongly closed. This proposition has 
the following corollaries: (5) A convex set A 
containing 0 is closed if and only if (A°)° = 

A, where A? is the tpolar of A (the bipolar 
theorem). (6) For two closed convex sets A and 
B, containing 0, the polar of A N B coincides 
with the weak* closure of the convex hull of 
A° U B°, For complex linear spaces, most of 
the propositions in this section are valid with 
Re f(x) instead of f(x). 

The Hahn-Banach theorem can also be em- 
ployed to prove the existence of the general- 
ized limit (or Banach limit) Lim,,,,, ¢, defined 
for all bounded real sequences {€,,} such that 
lim inf, o Cn S LIM, oo Cn < lim sup,,,,, ¢, and 


A> © 


Lim,,,..(#6, + By,,) = «Lim,.,,,¢,+ B Lim,..., Nn: 


G. Duality in Normed Linear Spaces 


An element Xo of a normed linear space X 
gives rise to an element x% of (Xy determined 
by <X9, x > = <x’, Xo» for all x'e X{. If we write 
Xo =J Xos then J is a linear operator satisfying 
lJ Xql| = xoll by (2), and so the space X is 
isometrically isomorphic to a linear subspace 
of (X); If X, coincides with (X;), under this 
isomorphism, we call X a reflexive (or regular) 
Banach space. A necessary and sufficient con- 
dition for the normed linear space X to be 
reflexive is that the unit ball of X be weakly 
compact. A convenient criterion for the reflex- 
ivity of X is that any bounded sequence {x,} of 
X contains a subsequence weakly convergent 
to a point of X (Eherlein-Shmul’yan theorem). 
In this connection, a Banach space X is reflex- 
ive if and only if each X’E X’ attains its norm, 
i.e., there is an Xo € X such that ||xq || = 1 and 
|<X9,x’>| =||x'|| (James’s theorem). 

A normed linear space is said to be uni- 
formly convex if for any ¿> 0 there exists a 6 > 
O such that |x] <1, || yl < 1 and ||x—y/| >e 
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implies || x + y| <2 = 6, A normed linear space 
is said to be uniformly smooth if for any ¢ > 0 
there is a 6>0 such that ||x||= 1 and | y|| <ô 
imply |x+ yl + lx- yl <2+el]|y|]. A Banach 
space X is uniformly convex (resp. uniformly 
smooth) if and only if the dual X’ is uniformly 
smooth (resp. uniformly convex). If a Banach 
space X is uniformly convex or smooth, it 

is isomorphic to a Banach space that is 
uniformly convex and uniformly smooth 

(P. Enflo). The space L, with 1 < p< mis 
uniformly convex and uniformly smooth. 
Any uniformly convex or uniformly smooth 
Banach space is reflexive (Milman’s theorem). 


H. The Resonance Theorem 


Let {T,} be a sequence of bounded linear 
operators from a Banach space X into a 
normed linear space Y. The uniform bounded- 
ness theorem (resonance theorem or Banach- 
Steinhaus theorem) states that supPp>1ı | T| < œ 
if supp >1 || p xI| < œ% for every xe X. As a corol- 
lary, we have supņ„> ||x,|| < œ for any weakly 
convergent sequence of X. Another corollary 
states that the set {xe X limsup,..,, | Txi < 
oo} either coincides with X or is a subset of X 
of the first category. This implies the so-called 
principle of condensation of singularities, which 
gives a general existence theorem for func- 
tions exhibiting various kinds of singularities, 
for example, a continuous function whose 
‘Fourier expansion diverges at every point of a 
*perfect set of points having the cardinal nym- 
ber of the continuum. 


I. The Closed Graph Theorem 


Let T be a bounded linear operator from a 
Banach space X into a normed linear space Y. 
If the image of the unit ball of X under T is 
dense in the unit ball of Y, then for any ¢>0 
and any yo € Y the equation TX =y, has a 
solution x with ||x ||< (1 + £) //y, | (Banach’s 
theorem). By using the *Baire category theo- 
rem, we can derive from this the open map- 
ping theorem: Every bounded linear operator 
from a Banach space X onto a Banach space 
Y maps each open set in X onto an open set 
in Y. As an application of the open mapping 
theorem, we can prove the closed graph theo- 
rem: A linear operator T from the whole of a 
Banach space into a Banach space is con- 
tinuous if and only if T is a closed operator, 
i.e., S-lim,_,,. X. =x and s-lim,,, TX, = y imply 
Tx = y. This theorem plays an important role 
in modern treatments of linear partial differen- 
tial equations. 


noo 
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J. The Closed Range Theorem 


Let X and Y be Banach spaces and Ta linear 
closed operator with domain D(T) dense in X 
and with range R(T) in Y. Under these con- 
ditions, the following four propositions are 
mutually equivalent. (1) R(T) is a closed set in 
Y. (2) R(T’) is a closed set in X’. (3) R(T) = 
{yeY|<y, y*> =0 for all y*eD(T’) such that 
T'y* =0}. (4) R(T) ={x* eX" | <x, x*) =0 for 
all x € D( T) such that Tx = 0}. These four 
propositions, as a whole, are called the closed 
range theorem. This theorem implies (5) R(T) 
= Y if and only if T’ has a continuous inverse; 
and (6) R(T’) = X’ if and only if T has a con- 
tinuous inverse. The following theorem is of 
similar nature: the following three propositions 
on two closed linear subspaces M and N of a 
Banach space are mutually equivalent. (7) M + 
N is closed. (8) M° + N° is strongly closed. 

(9) M° + N° is weak*-closed. 

The Hahn-Banach theorem, the resonance 
theorem, the open mapping theorem, the 
closed graph theorem, and the closed range 
theorem can be extended to various classes of 
tlocally convex topological linear spaces. By 
virtue of this extension, we are able not only to 
treat various fundamental problems of analysis 
from a unified viewpoint but also to develop 
the theory of functional analysis itself in a new 
direction (— 424 Topological Linear Spaces; 
concerning linear operators on a Banach space 
— 68 Compact and Nuclear Operators, 251 
Linear Operators, 390 Spectral Analysis of 
Operators). 


K. Differential and Integral Calculus of 
Functions with Values in Banach Spaces 


Calculus involving functions from a set to a 
Banach space is also an effective tool in yarj- 
ous problems. A function x(t) detined on an 
interval [a, b] with values in a Banach space X 
is said to be strongly (weakly) continuous if x(t) 
converges strongly (weakly) to x(t») as tty. 
For a strongly (weakly) continuous function 
x(t), the Riemann integral can be defined in a 
standard way, using strong (weak) convergence 
of the Riemann sum 


b 
| x(t)dt=lim} x(t/)(t;.1 — ti). 
a 
The tfundamental theorem of calculus, i.e., 
strong (weak) differentiability of the indefinite 
integral, remains true. Various defmitions of 
integrals of a Banach space-valued function 
on a fmeasure space are discussed elsewhere 
(- 443 Vector-Valued Integrals). 

Now let x(A) be defined on a domain Q in 
the complex plane with values in a complex 
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Banach space X. x(A) is said to be holomorphic 
if f(x(A)) is tholomorphic in Q for every fe X’. 
If x(A) is holomorphic, then there exists an X- 
valued function y(A) on Q such that 


IS" A+Q—xA))—yAlo0a s 60. 
Therefore there is no difference between 
“strong” and “weak” in analyticity. *Cauchy’s 
integral theorem remains true for a holo- 
morphic function x(A) with values in X, and 
the tLaurent expansion 


co 


xA= } a,(A—Ap)", 


n=— © 
1 =e 
amz x(A)(A— Ao)" ! då, 
2ni Je 


is valid with the integral taken in the Rieman- 
nian sense. Banach space-valued holomorphic 
functions on complex (or real) tanalytic mani- 
folds of higher dimension can be defined in a 
natural way by means of power serjes expan- 
sion. A function @(A) defined on a domain in 
the complex plane with values in the Banach 
space B(X, Y) of bounded linear operators 
from X to Y becomes holomorphic if fi (O(A)x) 
is holomorphic for every x¢ X and fe Y’. An 
operator-valued holomorphic function is often 
called an analytic operator function. 


L. The Approximation Property 


A Banach space X is said to have the approxi- 
mation property if there is a family {S,} of 
bounded linear operators of tfinite rank in X 
such that inf, || (S, =) T || =0 for all fcompact 
linear operators T in X. There is a Banach 
space that fails to have the approximation 
property (Enflo [9]). More surprisingly, the 
space of all bounded linear operators on an 
infinite-dimensional Hilbert space fails to have 
the approximation property (A. Szankowski). 
The approximation property plays a deci- 
sive role in the theory of ttensor products of 
Banach spaces. A Banach space X is said to 
have the hounded approximation property if a 
family {S,} in the definition of the approxi- 
mation property can be taken bounded, i.e., 
sup, ||S,|| < œ. The bounded approximation 
property does not follow from the approxi- 
mation property. The bounded approximation 
property is closely related to the existence of a 
basis. A sequence {e,} in a Banach space X is 
called a Schauder hasis or simply a hasis (or 
hase) for X if to each XE X there corresponds 
a unique sequence of numbers {cc,} such that 
lim, o |X — Lg=1 ke || =0. Most separable 
Banach spaces appearing in analysis have 
bases. A separable Banach space has the 
bounded approximation property if and only 
if it is isomorphic to a complemented linear 
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subspace (— Section N) of a Banach space 
with a basis (A. Pełczyński). 


M. Injective and Projective Banach Spaces 


Banach spaces of the type tC(Q) with compact 
Q and of the type tl (Q) on a set Q play a spe- 
cial role in the theory of Banach space. This 

is already seen in the fact that every Banach 
space is isometrically isomorphic to a subspace 
of a space C(Q,) as well as to a quotient space 
of a space L (0,). A Banach space X is said to 
be injective if for any Banach space Y and 

its linear subspace M, each bounded linear 
operator T from M to X can be extended to a 
bounded linear operator 7 from Y to X, that 
is, Tx = Tx foral] XE M.A Banach space is jn- 
jective if and only if it is isomorphic to a com- 
plemented linear subspace of the space C(R) 
for a compact Hausdorff space Q with the 
property that the closure of every open set is 
open. Such a topological space is called tex- 
tremally disconnected. The tmaximal ideal 
space of the tBanach algebra L,.(Q) is extrem- 
ally disconnected. Hence the Banach space 
L,(Q) is injective, Whether every injective 
Banach space is isomorphic to a space C(Q) is 
still an Open problem. However, a Banach 
space is isometrically isomorphic to a space 
C(Q) with Q extremally disconnected if and 
only if it is injective with the property of norm- 
preserving extension, 1.€., || TI =|| T|| is always 
possible (Nachbin-Goodner-Kelley theorem). In 
this connection, the following propositions on 
a Banach space X are mutually equivalent. (1) 
X’ is isometrically isomorphic to a space 

tL (x). (2) For any Banach space Y, its linear 
subspace M, and e> 0, each compact linear 
operator T from M to X can be extended to a 
compact linear operator J from Y to X with 
| T\|<(1+8)||T|| Q. Lindenstrauss). A Banach 
space X is said to be projective if for any 
Banach space Y and its closed linear subspace 
M, each bounded linear operator § from X to 
the quotient space Y/M is lifted to a bounded 
linear operator § from X to Y, ie., p(Sx)= Sx 
for all xe X, where ø is the quotient mapping 
from Y to ¥/M. Projectivity of X is character- 
ized by its being isomorphic to the space 1, (Q) 
on a set (J, A Banach space is isometrically 
isomorphic to a space l} (R) if and only if it 

is projective with the property of norm- 
preserving lifting, 1.€., (S| = |S || is always 
possible (- [8]). 


N. Complemented Subspaces Problems 


A linear subspace M of a Banach space X is 
complemented, i.e., there is a closed linear 
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subspace N such that M AN = {0} and M + 

N = X if and only if M is the range of a 
bounded projection P, i.e., P? = P and R(P) = 
M. Each nonzero closed hnear subspace of a 
Hilbert space is complemented, or more pre- 
cisely, it is the range of a projection of norm 
one (- 197 Hilbert Spaces). This property 
distinguishes Hilbert spaces from general 
Banach spaces: (1) A Banach space of more 
than 3 dimensions is isometrically isomorphic 
to a Hilbert space if each nonzero closed linear 
subspace is the range of a projection of norm 
one (Kakutani’s theorem). (2) A Banach space 
is isomorphic to a Hilbert space if each closed 
linear subspace is complemented (J. Linden- 
strauss and L. Tzafriri [ 10]). 


0. Quasi-Banach Spaces and Fréchet Spaces 


Let X be a linear space over the real (or com- 
plex) number field. Suppose that a real-valued 
function ||x|| on X satisfies (i) and (ii) of Sec- 
tion B and Gii’) |x + yl] <k(llx||+ Ily) with a 
constant k > 1 independent of x and y. Then 
||x|| is called the quasinorm of x, and X equip- 
ped with a quasinorm is called a quasinormed 
linear space. Let 0 < p < 1 be the root of the 
equation k = 2/9)! Then there is a distance 
d(x, y) = d(x -y) depending only on x -y such 
that d(x—y)< |x—yl]?<2d(x—y). Hence a 
quasinormed linear space is a metric space in 
which a sequence x, converges to x if and 
only if ||x, — x || +0. If a quasinormed linear 
space X is complete under this metric, then X 
is called a quasi-Banach space. The tfunction 
space L, is a quasi-Banach space for 0<p< 1. 

If we denote by ||x]| the distance d(x-0) of 
a quasinormed hnear space, then it satisfies (i) 
and (iii) of Section B and (ii’) || —x|} = ||x|| and 
limpo [|%_X_,—&X|| =0 whenever 4,0 and 
lim, |x — x| =O. A functional ||x|! satisfying 
(i), (i), and (iii) is called a pseudonorm. If a 
hnear space X equipped with a pseudonorm is 
complete, then X is called a Fréchet space (in 
the sense of Banach). The ‘function space $(Q) 
is a Fréchet space. Quasinormed linear spaces 
and Fréchet spaces are ttopological linear 
spaces, but they need not be tlocally convex. 
Hence it is possible that there is no continuous 
linear functional except for zero. However, the 
open mapping theorem and the closed graph 
theorem hold for linear operators from a 
Fréchet space into a Fréchet space. 
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The Bernoullis, Protestants who came origi- 
nally from Holland and settled in Switzerland, 
were a signiticant family to the mathematics 
of the 17th Century. Ina single Century, the 
family produced eight brilliant mathemati- 
cians, all of whom played important roles in 
the development of calculus. 

The brothers James (1654-1705) and John 
(1667- 1748) and Daniel (1700— 1782), John’s 
second son, were especially outstanding. James 
and John were close friends of G. W. tLeibniz, 
with whom they exchanged the correspon- 
dence through which it might be said that 
calculus developed. James studied problems 
related to the ttautochrone and fbrachisto- 
chrone, as well as problems in geometry, dy- 
namics, and other fields, including the tisoperi- 
metric problem. He was the first to change the 
name calculus summatoris to calculus integralis 
(1690). His Ars conjectandi was published after 
his death in 1713; in it is found the tląaw of 
large numbers, which made his name promi- 
nent in the theory of tprobability. James had 
little guidance, learning mathematics on his 
own. He was a professor of experimental 
physics at the University of Basel and later 
became a professor of mathematics. He taught 
mathematics to his brother John, who suc- 
ceeded him as professor at the University. 
John’s many achievements appeared in such 
publications of the time as Acta eruditorum 
and Journal des savants. In 1701, the begin- 
nings of the tcalculus of variations were seen in 
his solution to the isoperimetric problem. He 
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was the first to use the term functio, the root of 
the present term function (1714). 

Despite discord between the brothers and 
also between fathers and sons, the Bernoullis 
were ardent teachers and brilliant researchers, 
who instructed not only their sons but also 
such mathematicians as tEuler. Their achieve- 
ments were numerous in consolidating the 
content and form of calculus and also in ex- 
panding its application. Daniel was especially 
outstanding in the theory of probability; he 
also made contributions to the field of thydro- 
dynamics and to the tkinetic theory of gases. 
The eldest John’s eldest son Nicolas (1695- 
1726) achieved distinction as a professor of 
mathematics in St. Petersburg. Daniel’s 
youngest brother, John (1710-1790), suc- 
ceeded his father, John Sr., as a professor at 
the University of Basel. The son of John, Jr., 
also named John (1744- 1807), was the chair- 
man of mathematics at the Academy of Berlin. 
His brother, another James (1759-1789), was 
a professor of experimental physics at the 
University of Basel. Nicolas (1687-1759), a 
grandson of the founder Nicolas (162331708) 
and son of Nicolas the painter (1662-1716), 
held Galileo’s old chair of mathematics at 
Padua from 1716 to 1719. 


Nicolas 
1623-1708 
James Nicolas (painter) John 
1654-1705 1662-1716 1667-1748 
Nicolas Nicolas Daniel John 


1687-1759 169551726 1700-82 1710-90 


John James 
1744-1807 1759-89 


In this article, the first names have been 
given in English. The German names corre- 
sponding to James, John, and Nicolas are 
Jakob, Johann, and Nikolaus, respectively. 
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Bessel Functions 


A. General Remarks 


Bessel functions were first introduced in order 
to solve *Kepler’s equation concerning plane- 
tary motions and were systematically investi- 
gated by F. W. Bessel in 1824. Since then they 
have appeared in various problems and have 
become important. 


B. Bessel Functions 


Separating variables for the Helmholtz equa- 
tion AY + k? Y = 0 in terms of cylindrical 


coordinates, we obtain Bessel’s differential 
equation 

@widw v? 

—+-—+{ 1—— |w=0 1 
dz? zdz ( =) 0) 


for the component of the radius vector. The 


following two linearly independent solutions 
of(l): 


1 eee 
noai | e engine. 
fiA L, 


1 PEP ox 
H'?)(z) -f e` isinctivi qe, (2) 
L, 


T 


are called the Hankel functions of the first and 
second kind, respectively, where the contour L i 
of the first integration is a curve from ( = g + 0) 
+i% to —0—i0o, and L, is a curve from +0 
—iœ to (x—-0)+ioo. If both z and v are real, 
we have 


Hy(2) = Hy(2), 
where Z is the complex conjugate of z. Hence 
J2) =(Hy” (2) +H (z))/2, 

N,(2) = Y,(2)= (HP) = HO"(2))/2i (4) 


H?)(z) = H®(2), (3) 


are real functions, If both z and v are complex, 
the functions J,(z) and N,(z) defined in (4) are 
also called Bessel functions and Neumann 
functions, respectively. The other names for 
J,(z), N,(z), and H,(z) are Bessel functions of 
the first, second, and third kind, respectively. 
Each of them satisfies the following recurrence 
formulas: 

dC,(2) 


a ee = C,-1 (2) a Cy (z), 


(2v/z)C,(z) =C,_,(z) + C,41(2). (5) 


In general, functions satisfying the simulta- 
neous tdifferential-difference equations (5) are 
called cylindrical functions. Every cylindrical 
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function C,(z) is represented in the form C,(z) 
= 4, (v) HP (z) +a) HPG), where a,(v) and 
a,(v) are arbitrary periodic functions of period 
1 with respect to v. 

If v = n (an integer) we have 


J_,(Z)=(—1)"Jn(2), 

N_,(z)=(—1)°N, (2), (6) 
which show the linear dependency of J_, and 
J,, and N_,, and N,, respectively. If v #n(an 
integer), as the fundamental solutions of (1) we 
can take a pair J, and J_,, or N, and N_,. In 
(2) if we take as a contour of integration a 
curve from (—2z+0)+ico to (x—0)+i00, we 
obtain an integral representation for J,(Z), 
which yields the relations 


J (ze) = e'™" J (2), 
deze se I (2. (7) 


If v = n (an integer) and Re z > 0, we obtain 


1 f7 
na=z | eizsing + ing dt (8) 
-1 
1 (* 
al cos(zsin€ —~f)d€, (9) 
0 


which is called Bessel’s integral. These repre- 
sentations imply the following expansions by 
means of tgenerating functions: 


eit D (10) 


n=- © 


cos(zsin 2) =Jo(z)-+2 Ý Jpq(z)cos 2n, 
n=1 


x (11) 
sin(zsin€)=2 È Jy ,4,(z)sinQnt+ LC. 
n=0 
Making a change of variable y= exp( ių) in 


(2), we obtain 


1 Z l Me 
no-z] exp(=(u-2)) “i du, (12) 
L 


where L is a contour starting at the point at 
infinity with the argument = 7, encircling the 
origin in the positive direction, and tending 
to the point at infinity with the argument 7, 
From (12) we obtain a power series expansion 


z Y @ (—1)" gm 


O=% Hom'T(vtmtiO 2° 


(13) 
obtained also from (1) by a power series ex- 
pansion at z = 0, which is a tregular singular 
point of (1). Substituting (13) into 


N,(z) = (os vzJ,{z) = J_,(z))/sin vz, (14) 


we obtain a power series expansion for N,(z). 
A power series expansion for N,(z) for an inte- 
ger n is obtained by taking the limit yn (— 
Appendix A, Table 19). In particular, if v = 
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n + 1/2 (n is an integer), we have 
(2z2}"+"2 d” [sinz 
Jn az’y 


J n+12(z)=(—1)” ’ 
Z 


n=0,1, 2, 


which is represented by elementary functions 
and is sometimes called simply the half Bessel 
function. Bessel functions for half-integers have 
appeared also as radius vector components 
when the variables in the Helmholtz equation 
are separated by spherical coordinates. The 
function 


ja(z)= n/2z Ja+12(2) 
is called the spherical Bessel function. 
We have the following addition theorem: 
HP (kp)e™ = X J (kr) HY, (kr je’, 
u=1,2, 


where 
p= /r} +15 —2r,r coso, 


pcosý =r; —r coso, psiny=r,sing. 


C. Zero Points of the Function J,(z) 


From the differential equations satisfied by 
J,(az), we have 


1 
e2- p?) | 2J,(a2) J,(Be)dz 
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=BI (HIB) aS, (0e)J,(B). (15) 
By letting Ba in (15), we have 


\ z(J,(az))* dz 


-(( —Suneo? +00?) (16) 


If g and f are distinct roots of J,(z) = 0, we 
have from (15) 


1 
| zJ,(az)J,(Bz)dz=0, Revy> -1. (17) 
0 


The integral formulas (15), (16), and (17) are 
called Lommel’s integrals. 

As for the zero points of J,(Z), the following 
facts are well known: J(0) =0 ifv >0. J,(z) has 
no multiple zero points other than z = 0. J,( —«) 
= 0 if J (æ) = 0. Every zero point of J (Z) is real 
if yv > -1. Between two adjacent zero points 
that are positive, there exists one and only one 
zero point of J,_,(z) and J,,,(z), respectively. 
J,(Z) has a countably inlinite set of zero points 
on the real axis. When v > 0 is rational, every 
zero point of J,(z) except z = 0 is a ttranscen- 
dental number. The transcendentality of 1 is 
a special case of this result for v = 1/2. 


39 D 
Bessel Functions 


D. Expansion by means of Bessel Functions 


Let f(r, o) be defined for 0<r <1, —1<Q<a, 
and Xn do bn 2> -s Ans , OO s < Apsara S= 


1, 2, for every n) be zero points of J, (x) (n = 
0, 1,2, . . . ). Then we have an expansion 


(a,,,cosng +b, ,sinng)J,(a,, st), 
(18) 


which is called the Fourier-Bessel series, The 
coefficients a, , and bẹ, are determined by the 
properties of tFourier series and (16) and (17) 
as follows: 


= _ En 
b, s (Jus 1(On,5))” 


ee COS np 
Jn nS d dr; 
x | [e P)I n(n, ae io rdo dr 


= |, fj Sby= a= 2) 
The integral transformation 
ie) 
g(y)= | Xf (x) J,(xy)dx (19) 
0 


is called the Fourier-Bessel transform. If f(x) is 
sufficiently smooth and tends rapidly to zero 
as X00, the following inversion formula 
holds: 


f(x) = | yg(y)J,(xy) dy. (20) 


There are other types of series expansions in 
terms of Bessel functions as follows: Dini’s 
series 


8 


And (Am) 
1 


(åm is the mth positive root of xJ}(x)+ HJ,(x) 
= 0, where H is a real constant); Kapteyn’s 
series 


iv 0) 


» And yi m((V + m)x); 


m=1 

Schlémilch’s series 
a eo 

2y 2 amJo(mx); 
2 m=0 


and the generalized Schlömilch series 


i Am (MX) +b,,H,(mx) 


1 a 
TE 2 (mx/2)° 


where H,(mx) is the Struve function (- 
Section F). 
E. Asymptotic Expansion 


If zļor v is sufftciently large, the asymptotic 
representation for Bessel functions is obtained 
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by applying the tmethod of the steepest de- 
scent for (2). If |z|>|v|, we have 

2 T oT 
H®(z)}~ /—expi{ z——v—— ], 
@) TZ pas 3 4 
—n<argz<2n, 
2 T x 
H®'(z)~ /—exp( —i{ z—-v—— 
ne —(e-4)), 
—2n<argz<7, 
2 
J\(z)~ /—cos pepe : 
TZ 2 4 
—n<AargZ <T, 


2 
N (z) ~ [Žsin(2-5v-4). 
TZ 2 4 


Hence H{?(z) tends to zero as |z|— 00 in the 
upper half-plane, and becomes large exponen- 
tially as |z| > œ in the lower half-plane. The 
results for H’ are obtained by interchanging 
“upper half-plane” with “lower half-plane” in 
this statement. 

If both |z| and |y| are sufficiently large, we 
have the Debye asymptotic representations. 
For example, if z = v sec B (v >0, B> 0), we have 


H@-?)(ysec p) ~ (av tan 8/27"? 
x exp( + i(v(tan f — p)= 7/4)). 
Ifz=vsecha (v>0,a>0), we have 
J, (v sech a) 
~(2ny tanh a) 12 exp v(tanha — a), 
N, (v sech a) 
~ (nv tanh «/2)~"? exp via = tanh aj, 
If |v] ~|z|, we have 


anf 


t 
HC? (y sec f) ~—— 


B 


xexp( +i (+ v (ian B — tan? B -+))) 


x H44;)((v/3)tan? b) +007”, 


which is called Watson’s formula. 


F. The Wagner Function 


As an application of Bessel functions to the 
theory of nonstationary aircraft wings, T. 
Theodorsen introduced the function 


C(z) = HPHP) + AY(z)) 


[6], and H. Wagner considered the function 


1 2C(—i 
k,(s) al ee) 
Br 


2ni W 


[5], where H'?(z), H(z) are Hankel func- 
tions, and for means a Bromwich integral 
giving the inverse Laplace transform. Then 
C(z) and k,(S) are called the Theodorsen func- 
tion and Wagner function, respectively. The 
function k,(S) is equal to the lift coefficient 
when a 2-dimensional flat wing suddenly 
proceeds forward a distance s at an angle of 
incidence 1/n. 


G. Functions Related to Bessel Functions 


The following functions are closely related to 


Bessel functions (- Appendix A, Table 19.IV). 


(1) Modified Bessel functions. 
Lo) = e™ J(e'*? 2), 
sopia 

(2) Kelvin functions. 
ber,(z) +i bei,(z) = J,(e +3" 2), 
her,(z) +ihei,(z) = HP (e+3""4 2), 
ker,(z)= —(n/2)hei,(z), 
kei,(z)=(2/2)her,(z). 

(3) Struve function. 

w flo (Diet mee 
Nel È mame 
(4) Anger function. 


J,(zj= i cos(v — z sin #) dé. 


0 


When v is an integer n, we have J,(z)= J,,(z). 
(5) H. F. Weber function. 


1 tT 
Est | sin(v@ — zsin #)d0. 
XJ 


The last three functions satisfy certain in- 
homogeneous Bessel differential equations. 
Many other functions, such as tAiry’s integral, 
can be represented by Bessel functions. 
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A. General Remarks 


Biometrics is the branch of science that ap- 
plies mathematical and statistical methods 
to biological problems, and deals with all the 
phenomena that affect the physical, social, and 
psychological well-being of human beings. 
These phenomena involve the relationships 
of groups of human beings to other human 
beings, to animals, microbes, and plants, and 
to physical and chemical elements in the en- 
vironment. In dealing with these problems the 
biometrician encounters such theoretical tasks 
as analyzing autocorrelated data in time series 
and such practical undertakings as cost-vs- 
benefit evaluations of health programs. 
Biometrics began in the middle of the 17th 
century when Sir William Petty and John 
Graunt developed a new method of analyzing 
the London Bills of Mortahty. Petty and 
Graunt essentially invented the field of vital 


> 


statistics by studying the reports of christen- 
ings and causes of death and proposed a 
method called “political arithmetic.” 

Some fields to which biometries is relevant 
are given below. 


B. Statistical Genetics 


After early development in vital statistics, 
statistical genetics was founded on the new 
ideas emerging in statistics. Major contri- 
butions were made by Charles Darwin (1809- 
1882), Francis Galton (1822-1911), Karl 
Pearson (1857- 1936), and Ronald A. Fisher 
(1890-1962). 

Galton was the first to use the term “regres- 
sion” in statistics (> 403 Statistical Models 
D), when he observed that sons regressed 
linearly on their fathers with respect to stature. 
He called the phenomenon a “regression to 
mediocrity” because the deviations of the 
stature of sons were less than those of fathers. 
This gave rise to the measurement of COT- 
relation in the bivariate normal distribution 
by means of the coefficient of correlation 
(Pearson, 1897). Pearson is credited with the 
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creation of the discipline of biometry (bio- 
metrics), and he established the journal Bio- 
metrika to promote studies in the tield. Fisher’s 
major contributions were to genetics and 
statistical theories. His General theory of nat- 
ural selection appeared in 1930. This Jandmark 
book, along with earlier and later publica- 
tions, represented Fisher’s attempts to give 
quantitative form to Darwin’s views and to 
frame a statistical theory of evolution. 


C. Bioassay 


Bioassay is a set of techniques for evaluating 
the effectiveness of dosages of drugs by moni- 
toring biological responses. It entails the use of 
special transformations, such as probits and 
logits, as well as the application of regression 
to the estimation of dosages that are p percent 
effective within stated confidence limits. Prob- 
lems to be solved include measuring relative 
potency, slope-ratio assays, and quantal re- 
sponses vis-a-vis tolerance distributions. 


D. Demography 


Demography, which includes traditional vital 
statistics, rates and ratios, life tables, com- 
peting risks, actuarial statistics, and census 
enumeration techniques, is a part of bio- 
metrics. In this category, many tabulations of 
data consist of time series of events or rates 
classitied by age. For the analysis of such data, 
the cohort analysis techniques described in 
Hastings and Berry [3] are employed. 


E. Epidemiology 


The quantitative description of an epidemic 
should state the sensitivity and specificity of 
any diagnostic tests, as well as the true in- 
cidence or prevalence of the epidemic from 
survey results. Within an epidemiological 
theory, a disease is studied by the use of deter- 
ministic and stochastic models, wherein the 
theory of Markov chains can be applied (- 
260 Markov Chains). Differential equations 
involving probability-generating functions or 
tmoment-generating functions can be solved to 
yield the tprobability distribution, tmean, and 
tvariance of the number of infected individuals 
as functions of time [4,5], Fundamental to 
this whole field of application is a clear under- 
standing of causality and association. 

When clinical trials are possible, two groups 
of persons, “treated” and “untreated,” are 
monitored Over a period of time with regard to 
the incidence or recovery from the disease 
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under study. The techniques in this procedure 
include compiling reports on persons to be 
observed, using double-blind techniques, and 
combining multiple-response variables by use 
of multivariate analysis (- 280 Multivariate 
Analysis). 

If clinical intervention is forbidden on 
ethical grounds, e.g., studying congenital mal- 
formation by infecting pregnant women with 
German measles, the relative risk of expo- 
sure is estimated from retrospective studies. 
In practice, almost all statistical studies in 
epidemiology are retrospective with the sole 
exception of clinical trials. The research is ex 
post facto because investigators are mostly 
confined to describing and analyzing sudden 
and/or obvious events in the etiology of the 
disease. 


F. Clinical Trials 


In clinical trials, many problems have arisen 
for which biometricians have had to develop 
special techniques. One such technique takes 
account of unexpected adverse effects of drugs, 
and the consequent early termination of a 
trial. Moreover, when data demonstrate a 
trend earlier than expected, investigators will 
desire to end the accession of patients and to 
stop further treatment with what may be an 
inferior regimen. This means that the bio- 
metrician must be familiar with the problems 
of multiple examinations of data, multiple 
comparisons, and other adjustment procedures 
required by the ex post facto dredging of data. 


G. Future Trends 


There are two areas in which the biometrician 
has played a leading role recently. These are 
pertinent in many different applications and 
problems, and considerable methodological 
research has been devoted to the two areas. 
The areas are “mathematical modeling” and 
“effects of hazardous substances.” 


Mathematical Modeling. The relationship 
between a set of independent variables and the 
dependent or response variable(s) is usually 
referred to as a mathematical model. The 
model may take the form of a standard multi- 
ple regression analysis with a single response 
variable or with multiple response variables as 
in multivariate analysis (- 280 Multivariate 
Analysis). 

It is generally assumed that specialists with 
substantive knowledge of the specitic field 
of applications (epidemiology, toxicology, 
pharmacology, radiology, genetics, etc.) play a 
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crucial role in determining a model or rela- 
tionship between a set of independent vari- 
ables and response variable(s). However, it is 
mainly the biometrician who finally selects the 
specific model establishing the functional 
relationship and who attempts to measure the 
strength or influence of the independent vari- 
ables therein. The biometrician is also ex- 
pected to contribute substantially to the deci- 
sion as to whether the relationship is causal, or 
merely one of association or correlation. For 
example, in the measurement of carcinogenic- 
ity of a food additive or drug, questions arise 
as to whether a substance can be judged harm- 
ful if it “accelerates” the appearance of a tumor 
even though it does not increase the incidence 
of the abnormal growth. In general, the answer 
to this question is in the affirmative when an 
unequivocal dosage-response relationship 

is indicated between the substance and the 
tumor. 


Effects of Hazardous Substances. With the 
successful conquest of most of the infectious 
diseases that have plagued mankind through- 
out history, health authorities have recently 
been concentrating on two chronic diseases 
whose etiology is yet to be determined: cardio- 
vascular disease and cancer. In both cases 
there is no disagreement with the thesis that 
heredity exercises a determining influence, but 
the role of the environment in causing many 
cases is also unquestioned. Measurement of 
the risk due to potentially toxic substances in 
the environment, principally with respect to 
these two diseases, represents the greatest chal- 
lenge to the biometrician today. The social 
benefits of success make this a tantalizing area 
of research, though exceptional complexities 
are involved. 

For related topics -> 263 Mathematical 
Models in Biology. 
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A. Introduction 


The Boltzmann equation is an equation of 
motion of a rarefied gas given by L. Boltzmann 
in 1872 [1]. He used it successfully in his 
pioneering work on the kinetic theory of gases 
and on the more general statistical mechan- 
ics (- 402 Statistical Mechanics), but the 
equation was held to be inconsistent with the 
tclassical mechanics used in its derivation; 
objections were raised by, e.g., J. Loschmidt 
(1895) and E. Zermelo (1896). Through the 
controversies it became widely recognized that 
the Boltzman equation should be justiiied by 
means of the ‘theory of probability, and many 
such justifications have been proposed. Also, 
this equation has been studied extensively 

as a significant nonlinear partial differential 
equation. 


B. Boltzmann Equation 


Let f= f(t, x, č) be the density of gas molecules 
having position x € R? and velocity č e R° at 
time t. The Boltzmann equation is a conser- 
vation law for f of the form [1-3] 


h= VS -alt x): Vef+OLf]. (1) 


Here a(t, x) denotes the external force and Q is 
a quadratic nonlinear integral operator in é- 
space describing binary collisions of the mole- 
cules. The integral kernel of Q, called the colli- 
sion cross section, depends on the intermole- 
cular forces. The two classical examples are the 
hard ball model (a gas of rigid spheres) and the 
inverse power law potential (one proportional 
tor °, where ris the intermolecular distance 
and § > 1). The latter gives rise to singularities 
in the kernel, and a cutoff is customarily ẹm- 
ployed to avoid this difficulty. Grad’s hard 

(s > 5) and soft ($ < 5) cutoff potentials may be 
effectively used in the study of (1) [3]. 

If the gas is contained in a vessel (domain), 
then f must also satisfy boundary conditions 
determined by the assumed law of reflection 
at the walls (boundaries) (specular reflection, 
random reflection, etc.). For spatially homo- 
geneous gases with no external forces, (1) re- 
duces to 


fh=QlLsl, (2) 


with f = f(t, č). The Maxwellians, those solu- 
tions satisfying the tMaxwell-Boltzmann dis- 
tribution law, are the only stationary solutions 
of (2); they describe equilibrium states. 
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C. Justification by tbe Tbeory of Probability 


Consider (2). The collision process is a tMar- 
kov process, and the tmaster equation ob- 
tained from the tChapman-Kolmogorov equal- 
ity is equivalent to (2) under the assump- 
tion of “propagation of molecular chaos” (G. 
Uhlenbeck (1942)). This assumption becomes 
valid in the limit as N (total number of mole- 
cules) >œ (M. Kac (1955), H. McKean 
(1967), F. Grünbaum (1971)) [4]. Also, non- 
linear Markov processes defined by (2) (with 
probability density f/N}) have been studied 
(McKean (1967), H. Tanaka [5]). No such 
results have yet been established for (1). 


D. Existence of Solutions 


T. Carleman [2] gave the first solutions of the 
Boltzmann equation, solving globally in time 
the *Cauchy problem for (2) for the hard ball 
model. His result has been extended to a wide 
class of potentials. 

The spatially inhomogeneous case (1) is also 
known to have global solutions if a(t, x) =0 
and if the initial data for f are nearly Max- 
wellian. This was first proved by S. Ukai [6] 
assuming periodicity in x, and then proved for 
the Cauchy problem by Ukai [7] and T. 
Nishida and K. Imai [8], and for the initial 
boundary problem by J. P. Guiraud [9] (ran- 
dom reflection) and Y. Shizuta and K. Asano 
[ 10] (specular reflection) for bounded domains, 
and by Ukai and Asano [ 1 1] for exterior 
domains, all assuming Grad’s hard cutoff 
potentials. The case of soft cutoff potentials 
was also solved (Ukai and Asano, R. Caflisch 
(1980)). All the solutions are unique and 
tend to Maxwellians as t> 00, with certain 
decay rates. There are some results also on 
1-dimensional shock wave solutions (B. Nico- 
laenko (1975)) and stationary solutions (J: P. 
Guiraud (1972) (bounded domains), Ukai and 
Asano (1980) (exterior domains)). 

For initial data far from the Maxwellian, (1) 
remains unsolved even locally in time for hard 
cutoff potentials. As for noncutoff potentials, 
the existence theorem in [5] is the only result 
known so far for either (1) or (2). 


E. Hilbert Expansion 


Put f €= f/N with g= 1/N.If fë has a power 
series expansion in €, then (1) gives an infinite 
system of equations for the coefficients. D. 
Hilbert (19 12) proposed a method of solving 
this system as an application of his theory of 
integral equations, initiating the attempt to 
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solve the Boltzmann equation. No proofs of 
convergence exist, but his expansion and the 
improved one by S. Chapman (1917) and D. 
Enskog (19 17) made possible the applications 
of (1) to hydrodynamics. The zeroth-order 
approximation to the expansion gives rise to 
the compressible Euler equation, and the first- 
order approximation yields the compressible 
Navier-Stokes equation. The solutions f* of 
the Cauchy problem for (1) with nearly Max- 
wellian initial data converge to those of the 
compressible Euler equation locally in tas 
é—Q (Nishida [12], Caflisch (1980), and 

to those of the compressible Navier-Stokes 
equation asymptotically as t + 00 when => 0 is 
fixed [13]. 
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A. Boolean Algebras 


Boolean algebra was introduced by G. Boole 
to study logical operations (- 411 Symbolic 
Logic). It is now included within the more 
general concept of tlattice or lattice-ordered 
set (- 243 Lattices) and appears not only in 
logic but also very often in analysis in the form 
of a particular lattice of sets, ¢.g., the lattice of 
tmeasurable sets. 

Let L be a given set and suppose that to any 
pair of its elements x, y there correspond two 
elements x U y, x N y of L (called join and meet 
of x and y, respectively) such that the follow- 
inglaws arevalid: (1) x Up=yUx, xNy=yfx 
(commutative law); (2) x U (y Uz) = (x Uy) Uz, 
xN(yNz)=(xNy)Nz (associative law); (3) xU 
(y Nx) =(x U y) Nx =x (absorption law); (4) x U 
(yAz)=(xUy)N(xUz), xA(yUz)=(xN y)U 
(x N z) (distributive law); (x, y, z€ L). From 
(1), (2), and (3) it follows further that x U x = 
x Nx = x (idempotent law). If x < y is defined 
to mean x U y = y, L becomes an tordered set 
with respect to the ordering < Now suppose, 
moreover, that the following law holds: (5) 
there exist a least element 0O and a greatest 
element J, and for any element x there exists an 
element x’ satisfying x U x’ = ],x N x’ =0 (law of 
complementation). Then L is called a Boolean 
algebra (or *Boolean lattice). In this case x’ is 
uniquely determined by x and is called the 
complement of x. The binary operations 
(x, y)-x U y, x Ny together with the operation 
x-—x’ are called Boolean operations. These 
operations obey de Morgan’s law (xU y)’ = 
x Ny xN =x uy. 


B. Generalized Boolean Algebras 


Suppose that a < b holds for two given ele- 
ments a, b of an ordered set. Then the set of all 
elements x satisfying a <x <b is denoted by 
[a, b] and is called an interval. An interval of a 
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Boolean algebra is also a Boolean algebra with 
respect to the induced operations U and N, 
where the least and greatest elements are a and 
b, respectively, and the complement of x in 

[a, b] is equal to a U (x N b) =(a Ux’) N b. More 
generally, if a set L with two operations U, f 
satisfying (1)—(4) above has a least element 0 
and if each interval of L satisties (5) (1e., is a 
Boolean algebra), then L is called a generalized 
Boolean algebra. 


C. Boolean Rings 


A ring L satisfying the condition xx = x for all 
x € L (i.e., all of its elements are idempotent) is 
called a generalized Boolean ring, and if it has 
a unity element then it is called a Boolean ring. 
A generalized Boolean ring L satisfies x +x = 0 
for all x € L and is necessarily a commutative 
ring. A (generalized) Boolean algebra L be- 
cornes a (generalized) Boolean ring if for any 
elements x, y of L the sum x + y is defmed to 
be the complement of x Ny in the interval 

[0,x U y], and the product xy is delined to be 
xN y. A nonempty subset J of a (generalized) 
Boolean algebra L is an ideal with respect to 
the corresponding structure of the ring if and 
only if xU yeJ for x, yeJ and xN yeJ for 
xej, ye L. More generally, in any lattice, a 
nonempty subset that satisfies these conditions 
is sometimes called an ideal of the lattice. 


D. Representation of a Boolean Algebra 


Any Boolean algebra L is isomorphic to a 
Boolean lattice of subsets in a set X. If Lis of 
finite theight, then L is isomorphic to the 
Boolean lattice P(X) of all subsets of X. In 
general X can be taken to be the set of all 
maximal ideals of L, Let ae L and let O(a)= 
{m|meX, a¢m}. The isomorphism is obtained 
by the mapping a-+O(a). If we define a topol- 
ogy in X such that (O(a) ae L} is the topen 
base, then X is a compact, totally discon- 
nected T, space and O(a) is characterized as 

a compact open set in X. Such a space X is 
called a Boolean space (M. H. Stone [3, 4]). 

In any complete Boolean algebra L, the 
complete distributive laws hold: (sup,x;)N y = 
sup,(x; N y) and its dual. These are equivalent 
to the stronger relations: (sup,x,;) (sup, y) = 
sup,,(x; N y;) and its dual. In order for a 
Boolean algebra L to be isomorphic to the 
Boolean algebra P(X) of all subsets of X, it is 
necessary and sufficient that the following 
strongest complete distributive laws hold: 
inf, (sup si) Xy) = SUPr(INf,X;, o) (Where F is the 
set of all functions @ assigning to each i€ I a 
value o{i)eJ(i)) and its dual. 
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A. General Remarks 


A complex-valued function defined on a subset 
E of the complex z-plane is called a bounded 
function defined on E if its range f(E) is 
bounded, that is, if there exists a positive 
constant M such that | f(z)| < M on E. How- 
ever, when studying the theory of bounded 
functions, we usually restrict ourselves to the 
consideration of tanalytic or tharmonic func- 
tions. On the other hand, the classes of func- 
tions f(z) satisfying conditions such as Re f (z) 
>0 or a<arg f(z)<f rather than | f(z)|<M 
are studied by a method similar to that ap- 
plied to the study of bounded functions. 

tSchwarz’s lemma, tLiouville’s theorem, and 
*Riemann’s theorem on the removability of 
singularities (which will be explained later) are 
among the classical theorems in the theory of 
bounded functions. 


B. Maximum Principle 


When a function f(z) is holomorphic and not 
constant in a domain D of the complex plane, 

| f (z)| never attains its maximum in the interior 
of D. In particular, when f(z) is continuous on 
the bounded closed domain D= D U @D, the 
maximum of | f(z)| on D is taken on its bound- 
ary ôD. This fact is called the maximum (mod- 
ulus) principle. 

As a direct application of the maximum 
principle, we candeduce Schwarz’s lemma: If a 
holomorphic function f(z) in |z| <R satisfies 
| f(z)| <M and f(0)=0, we have | f(z)|<M- 
|z|/R (|z|<R). The equality at zo, 0<|zy|<R, 
occurs only for the functions f(z) = e4M z/ R 
(where A is a real constant). 
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C. Lindel6f’s Principle 


E. Lindelof extended Schwarz’s lemma and 
obtained various extensions of the maximum 
principle, from which he, together with E. 
Phragmén, deduced several useful theorems on 
the behavior of a function that is single-valued 
and holomorphic in a neighborhood of the 
boundary. We mention some representative 
theorems: 

Let z = (f) and w = W(f) both be mero- 
morphic and tunivalent functions in |{]< 1 that 
map |{|< 1 onto D, and D,,, respectively. Set 
p(0)=z and w(0)=w,. Let D,(p) and D,(p) 
denote the images of |] < p (0 < p< 1) under 
the mappings @ and yw, respectively. Under 
these circumstances, if a function f(z) that is 
holomorphic in D, satisfies f(D,) c D, and f(z) 
= Wo, then f(D,(p)) c D,(p). Furthermore, 
unless f(z) maps D, onto D,, univalently, 
f(DAp)) is contained in the interior of D,(p) 
(an extension of Schwarz’s lemma). 

Let f(z) be analytic in a bounded domain D 
but not necessarily single-valued. Suppose that 
| f(z)| is single-valued. Suppose, furthermore, 
that there is a positive constant M such that, 
for each boundary point ¢ of D, except for a 
finite number of boundary points and for each 
e > 0, the inequality | f(z)| < M holds on the 
intersection of D with a suitable neighborhood 
of Ç, and suppose also that each of the excep- 
tional points has a neighborhood such that 


f(z) is bounded on the intersection of D with 


this neighborhood. Under these assumptions 
we have | f(z)|< M. Moreover, if | f(zg)|= M at 
a point Zg of D, then f(z) is a constant (an 
extension of the maximum principle). 

Let f(z) be holomorphic in an angular 
domain W: arg z|<az/2. Suppose that there is 
a constant M such that, for each ¢ > 0, each 
finite boundary point has a neighborhood 
such that | f(z)|< M +¢ on the intersection of D 
with this neighborhood, and that for some 
positive number f > g and for sufftciently large 
|z| the inequality | f(z)|<exp|z|'/* holds. Under 
these assumptions we have | f(z) < M in D 
(Phragmén-Lindeléf theorem). 

Let f(z) be a function that is holomorphic 
and bounded in a closed angular domain W: 
a < arg z < ß except for the point at infinity. 
Suppose that f(z)+aas z> œ along a side of 
the angle and that f(z)>b as z=% along the 
other side of it. Then we have q= band f(z)>a 
uniformly as 7 œ in W (Lindelöf ’s asympto- 
tic value theorem). 


D. Bounded Functions in a Disk 


If f(z) is a bounded holomorphic function in 
the unit disk |z|< 1, it has a limit at every 
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point zo on the circle C: |z|= 1, except for a set 
of 1 -dimensional measure zero, as z tends to Zo 
from within an angle with vertex at z, and çon- 
tained in |z|< 1 (or along a *Stolz’s path at zp) 
(Fatou’s theorem). Under the same assump- 
tion, if the boundary value function f(e") = 
lim,..,-o f(re’®), which exists by Fatou’s theo- 
rem, is equal to a constant a for a set of posi- 
tive measure on the circle C, then f(z) =a in 
|z| < 1 (F. and M. Riesz theorem). These theo- 
rems are valid for some kinds of meromorphic 
functions in |z| < 1 (- 272 Meromorphic 
Functions D). 


E. Three-Circle Theorem and Related 
Theorems 


Let f(z) be a function that is single-valued, 
holomorphic, and not identically zero in an 
annulus p<|z|<R. Set M(r)=max,,.-,| f(2)| 
(p <r < R) for f(z). Then log M(t) is a tcon- 
vex function of log r in log p < log r<log R 
(Hadamard’s_ three-circle theorem). The same 
assertion holds for a function f(z) that is not 
necessarily single-valued, as long as | f({z)| is 
single-valued. When f(z) is single-valued, a 
stronger assertion can be obtained (0. Teich- 
müller). The following theorems are regarded 
as the analog of Hadamard’s three-circle 
theorem for the respective basic regions: 

Set L(o)=sup_ 5 real flo + it)| (<0 <P) 
for a function that is bounded and regular in a 
strip æ < Re z < ß. Then log L(a) is a convex 
function of g in a < g < f (Doetsch’s three-line 
theorem). 

Set [(o) = lim sup, ,..|f{o + it)| (x < 0 < P) for 
a function that is holomorphic and bounded in 
a half-strip % < Rez < B, Imz > 0. Then log I(a) 
is a convex function of g in a < g < f (Hardy- 
Littlewood theorem). 

Set 


1 2n 
L(r)=—— IP dO 
(=z | irre”) 
for a holomorphic function in a disk |z| < R. 
Then for every Pp > 0, log 11 18 an increasing 
convex function of logr for —œ < logr < log R 
(Hardy’s theorem). 


F. Hardy Class 


Hardy’s theorem motivates us to introduce a 
class of functions. An analytic function f in the 
unit disk is said to belong to the Hardy class 
HP’ (0<p<o) if I,(r, f)=(22) ff" | fre”) d0 
remains bounded as r-1, For the case p= 00, 
fe H” if I(r, f)=max,,,-,| f(z) is bounded. 
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An analytic function fis said to be of the class 
N if fő" log* |f(re”®)| d0 is bounded for r< 1. 

f €Nifand only if f = o/y, where p,weH®. 
feH” if and only if |f|? has a tharmonic majo- 
rant. If 1 <p < 00, H” is atBanach space with 
the norm || f||,=(sup,<; [,(r, f)) "P for p< œ 
and || f||,,=sup,.,1,,(r, f) for p= œ, and if 0< 
p <1, H’ is a complete metric space with the 
metric sup, <; lpr, f—g). For 0<p<q< o the 
inclusion relation N > H? > H4 > H™ holds, so 
that Fatou’s theorem and that of F. and M. 
Riesz are valid for functions of H?” (p> 0). 

For a nonnegative integer p and a sequence 
of nonzero complex numbers a, |g, |< 1, the 
infinite product B(z)=z? 17, (%,(«, —z)/|a,| (1 

%,Z)) converges locally uniformly in the unit 
disk if and only if 2, (1 —|a,|)< oo. Then 
B(z) is called a Blaschke product and {a,} a 
Blaschke sequence. If the sequence f, }U {0} is 
the set of zeros of an analytic function f (an m- 
tuple zero appears m times in {a,}, and z = 0 is 
a p-tuple zero), B(z) is said to be generated by 
the zeros of f. If fe N, then 1, (1—|a,|)< 00 
for the set {a,} of zeros off: An analytic func- 
tion f in the unit disk is called an inner func- 
tion if its modulus is less than 1 and its non- 
tangential limit on the unit circle is of modulus 
1 almost everywhere. A Blaschke product is an 
inner function. An inner function f without 
Zeros is said to be singular if f(0) > 0. A sin- 
gular inner function f can be represented by 
f(z)=exp{ —[o*(e* +z)/(e” —z)du(0)} with a 
positive measure p tsingular with respect to 
Lebesgue measure. An outer function F(z) for 
HP (p > 0) is an analytic function that can be 
represented by 


: 1 2n pif +7 
F(z) = erea | 08H) ao} 
0 eS 


where r is a real number and (0) is a non- 
negative function such that (@)e L? and 
logw(0)eL'. An H?-function f (p> 0) is fac- 
torized uniquely as f(z) = B(z)S(z) F(z), where 
B(z) is a Blaschke product (generated by the 
zeros off), S(z) a singular inner function, and 
F(z) is an outer function for H” (with y(0)= 

| f(e’*)|) (factorization theorem). Conversely, 
such a product belongs to H”. The inter- 
polation prohlem raised by R. C. Buck asks 
whether or not there exists a sequence of dis- 
tinct points {z, |z,|<1} such that, given any 
bounded sequence of complex numbers {wp}, 
there is a function fe H® for which f(z,) = Wa- 
The sequence {z,} in the problem is called an 
interpolating sequence. L. Carleson proved that 
a necessary and sufficient condition for {z,} to 
be an interpolating sequence is that there be a 
6>0 such that IT, 24|(z,—2)/(1 —2,2,)|26 
(k=1,2,...), An interpolating sequence is a 
Blaschke sequence, and any sequence {z, || z, 
= 1} contains an interpolating subsequence 
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tending to the unit circle. An analogous theo- 
rem is obtained for a function of H? (p> 1). 


G. Corona Problem 


The Hardy space H® on the unit disk D is 
furthermore a commutative Banach algebra 
with identity. Let M denote the tmaximal ideal 
space of H®, It is a compact Hausdorff space 
with respect to the tGel’fand topology. To 
each point ze D, there correspond a homomor- 
phism 9;: 9,(f) =f(z) for f e H”, and a mazi- 
mal ideal M, = { fe H® | f(z) = 0} as its kernel. 
This correspondence gives a continuous injec- 
tion of D into the maximal ideal space M of 
H”. By identifying z and M,, D is regarded as 
an open subset of M, and the corona problem 
then asks: Are there points of M outside the 
closure of D? The negative answer was given 
by L. Carleson, that is, D is dense in A4 (corona 
theorem). The corona theorem is obtained as a 
direct consequence of the following theorem 
and is equivalent to it. For functions fe H”, 
i=1,2,..,n,satisfying $| f,(z)|>6in D with 
some 6 >Q, there exist gge H”, i=1,...,n, such 
that > f(z)g,(z) = 1 in D. As a trivial conse- 
quence of the corona theorem, we have the 
following cluster value theorem: Let C( f, {) 
denote the ‘cluster set of fe H” at ¢ with 

I¢|= 1; then C(f,Q)=f(M), where f is the 
Gel’fand transform off and M, is the tfiber of 
M over ¢, M is decomposed into pairwise 
disjoint *Gleason parts. A Gleason part is a 
connected open set or a singleton according as 
each point of it is captured in the closure of an 
interpolating sequence or not. Each point of 
the *Shilov boundary I of M forms by itself a 
trivial Gleason part, and Į is adherent to a 
sequence S in D with the property that the set 
of its nontangential limit points covers the unit 
circle (Brown-Shield-Zeller theorem). Let D 

be, in turn, a general bounded domain. The 
corona theorem is proved for some classes of 
such domains. A Shilov boundary I lies only 
over the set of points on ôD which are not 
tremovable for bounded analytic functions. It 
is adherent to a sequence S in D on which 
sups| f(2)|=|If\|..- For any fe H” and fe aD, 
the boundary of fi (M,) is contained in fi T) if T, 
= IMM, is not empty. This, together with the 
cluster value theorem, has an important im- 
plication in the theory of cluster sets. The ex- 
tremal problem of maximizing f'(z5)|, Zy€ D, 
among all fe H?” satisfying II fll in 1 has the 
unique normalized solution G such that G’(z,) 
= max | f'(Z)|. G(z) is called the Ahlfors func- 
tion. The Ahlfors function has unit modulus on 
the Shilov boundary (Fisher’s theorem). The 
theory of H® is also applied to the problem of 
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trational approximation of bounded analytic 
functions. 


H. Applications of the Maximum Principle 


Theorems of the following type are useful for 
some problems of holomorphic functions: 

Let D be a domain. Suppose that there 
exists an arc of angular measure g that is on a 
circle of radius R centered at a point z, of D 
and not contained in D. Let C denote the 
intersection of the boundary of D with the disk 
|z = Z| < R. If f(z) is a single-valued holom- 
orphic function that satisfies f(z)| <M, and if 
lim sup; f(z)|< M for every ¢e C, then the 
inequality f(zọ)| < _M'~""m"" holds for every 
positive integer n satisfying 2n/n < « (Lindeléf’s 
theorem). 

Let D be a domain bounded by two seg- 
ments OA, OB both starting from 0 and mak- 
ing an angle za, and a Jordan arc AB, and 
let R be the maximal distance between 0 and 
the points on AB. Suppose that f(z) is holo- 
morphic in D and that lim sup] f(z)| as z> 
Ce ðD with ze D is not greater than M for 
{e OAU OB and mfor fe AB. Then we have 
| f(z)|<.M'~*m4 (where 4=((|z|/R)*”) at every 
point on the bisector of the angle / AOB in D 
(Carlemann’s theorem). 


I. Holomorphic Functions with Positive Real 
Parts 


Holomorphic functions with positive real parts 
are intimately connected with bounded func- 
tions. Concerning these functions we have the 
following classical result, which is equivalent 
to Schwarz’s lemma: If f(z) is holomorphic in 
z|< R, Re f (z) > 0 in the same domain, and 
J(0)= 1, then (R—|z])(R+|z|)<Re f(z) <(R + 
\z\)/(R =|z\)(\z|< R). The right or left inequality 
becomes equality for some Zp), 0 < |zo| < R, only 
it fl2)=(Rzp Fzoz)/(R +292), respectively. 

In order to prove various results for the 
Class of functions with positive real part, 
Herglotz’s integral representation, which is 
based on Poisson’s integral representation and 
unique to this class, can be used effectively. It 
is given by 


2m oiP 4 7 
f(z | = 


0 e?z 


dp(ọ), |z|<R, 


where p(ọ)is monotone increasing (real- 
valued) with total variation 1 and is deter- 
mined uniquely up to an additive constant by 
f(z). An analogous integral representation is 
introduced for a holomorphic function in an 
annulus. 

Similar to the case of holomorphic functions 
is the notion of Hardy classes to harmonic 
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functions, that is, a harmonic function u in the 
unit disk is said to be of the class h? (0< p< 
oo) if I,(r, u) is bounded for r<1.y4 €h” if and 
only if y+ ive H”, where vis conjugate to u. h! 
is equal to the vector space of functions which 
are representable as a difference of two non- 
negative harmonic functions. The Herglotz 
theorem stated above is then a corollary of the 
theorem of integral representation for ht- 
functions. 


J. Coefficient Problems 


There are many classical results for partial 
sums and coefficients of the Taylor expan- 
sion of bounded functions in a disk. Let f(z) 
= ro cz" be the Taylor expansion of a 
bounded function in |z|< 1. Set its partial sum 
S.(Z)= DB sotyz (n=O, 1, . ..). and let t,(z)= 
(1/(n+1))X"_,5,(z) (n=0, 1,...), which is the 
sequence of the arithmetic Means of the par- 
tial sums (the Fejér sums). Then | f(z)| < 1 in 
|z|<1 if and only if |t,(z)| <1 for |z|= 1 (n= 

0, 1, ) (L. Fejér). Thus the sequences {t,(z)} 
for bounded functions f(z) are uniformly 
bounded, whereas the sequences {s,,(z)} are not 
uniformly bounded. Indeed the maximum 
value of |s,(1)| over the set of function f satis- 
fying | {(z)|< 1 for|z|< 1 is 


an {—1/2\? 
1+) ( l ) 
j=l: J 

= 3 (= oy 

j=0 2-4... 2j 
(G,~x 'lognas n>). 

The following result is decisive for coeffi- 
cient problems: Set h,, = Lh-9€,—jC)-; (4 SY), 
h,, = h,, for f(z) = D2 9c,2", let mz be the 
maximal eigenvalue (a nonnegative real num- 
ber) of the Hermitian matrix ( —h,,) and 
let m=lim m,(> 0). 

Let H be a Hermitian form in inlinitely 
many variables given by 


G, 


n 
B,v=O0> 


2 


20 oO 
=m) al?) 
y=0 H 


=0 


00 
5 CyXu+v 
v=0 


Then a necessary and sufficient condition for 

| f(z)| < 1 when |z| < 1 is that H be positive 
semidetinite, i.e., the sequence of the ‘principal 
minor determinants of H, 


1 2 m 
A = ea. 
(i ean 


is all positive or positive for an initial finite 
number of them and zero for the remainder 
(I. Schur). 
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A corresponding result for functions with 
positive real part in a disk can be stated in a 
simpler form: A holomorphic function f(z) = 
1/242 7.,¢,2" in |z|< 1 satisfies Re f(z)> 0 if 
and only if 


Migs fe. be ge En 
1 : 

$ “1150, n=1,2,... 

E E | 


(C. Carathéodory). Furthermore, for n = 
1,2, , when we regard (c,,. , C,) as a point 
of complex n-dimensional Euclidean space, we 
can determine the domain of existence of 
points satisfying this criterion by Carathéo- 
dory. This result is generalized for coefficients 
of the Laurent expansion of a function that is 
holomorphic and single-valued in an annulus. 
Next, there are some results for a function 
that omits two values: If f(z) = a, + a, z+ 
... (a, £9) is holomorphic in |z|< R and 
f(z) 40, 1 in the same domain |z| < R, then 
there exists a constant L{dp, a,), depending 
only on a, and a,, such that R < L(ap, a,) 
(Laudau’s tbeorem). Under these circum- 
stances, | f(z)|<S(ao, 0) in |z|< OR for 0<0<1, 
where S(a,, 0) is a constant depending only on 
a, and ĝ (Scbottoky's theorem). These theo- 
rems have applications in value distribution 
theory. On the other hand, coefficient prob- 
lems have been investigated as extremal prob- 
lems in H?-spaces under general conditions. 


K. Angular Derivative 


Let f(z) be holomorphic in |z| < 1. If f(z) >Wo 
uniformly as z—2, along Stolz paths with end 
point at Zo and if the limit lim,- ((f(z)— Wo)/(z 
—Z,)) = D exists, we call D the angular deriva- 
tive of j”(z) at zy. In the case of the half-plane 
Re z > 0, the angular derivative at the point Zo 
on the imaginary axis is similarly defined. It 
should be noted that f(z) = wọ is replaced by 
1/f(z) for wọ = œ and 1/(z = Zo) by z for Zo = œ. 
In the latter case, a Stolz path is a path con- 
tained in an angular domain |argz| <a ( < 2/2) 
and tending to «©, The study of angular de- 
rivatives was initiated by G. Julia (1920) and 
J. Wolff (1926) and was further advanced by 
Carathéodory (1929) and E. Landau and G. 
Varilon (1929). 

A fundamental theorem for angular deriva- 
tives can be stated as follows: If a holomorphic 
function f(z) in Re z > 0 satisfies Re f(z) 2 0, 
there exists a constant c (0 £ c < +00) such that 
f(z)\/zc and f(a uniformly as z=% 
along every Stolz path. Moreover, the pth 
derivative of f(z) for an arbitrary positive 
p, denoted by D?f(z), has the property that 
zP! D?f(z)c/T (2 — p} uniformly. Further- 
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more, the inequality Re f(z) > c Re z holds 
everywhere in Re 7 > 0. An analogous theorem 
is valid for the unit disk. 

An important problem in the theory of 
conformal mapping is to find some condition 


for a mapping w = f(z) of the unit disk (or half- 


plane) G onto a simply connected domain B to 
have a nonzero and finite angular derivative at 
a boundary point Zp, that is, the condition for 
conformality at the boundary point. Cara- 
théodory showed that a sufftcient condition is 
the existence of two circles that are mutually 
inscribed and circumscribed at the boundary 
point wọ = f (Zọ) Æ œ of B and that lie inside 
and outside of B, respectively. Ahlfors later 
established a necessary and sufftcient condi- 
tion for the existence of the angular derivative 
by making use of his distortion theorem for a 
strip domain. The angular derivative was 

used by Wolff in his research on the iteration 
of conformal mappings. 


References 


[1] L. Bieberbach, Lehrbuch der Funktionen- 
theorie II, Teubner, 1931 (Johnson Reprint 
Co, 1969). 

[2] C. Carathéodory, Funktionentheorie 1, II, 
Birkhauser, Basel, 1950; English translation, 
Theory of functions, Chelsea, Į, 1958; II, 1960. 
[3] E. G. H. Landau, Darstellung und Be- 
griindung einiger neuerer Ergebnisse der 
Funktionentheorie, Springer, 1929. 

[4] J. E. Littlewood, Lectures on the theory of 
functions I, Clarendon Press, 1944. 

[5] P. L. Duren, Theory of H?-spaces, Aca- 
demic Press, 1970. 

[6] R. Nevanlinna, Eindeutige Analytische 
Funktionen, Springer, second edition, 1953; 
English translation, Analytic functions, 
Springer, 1970. 

[7] L. Carleson, Interpolations by bounded 
analytic functions and the corona problem, 
Ann. Math., (2) 76 (1962). 

[S] H. S. Shapiro and A. L. Shield, On some 
interpolation problem for analytic functions, 
Amer. J. Math., 83 (1961). 

[9] K. Hoffman, Banach spaces of analytic 
functions, Prentice-Hall, 1962. 

[10] K. Hoffman, Bounded analytic functions 
and Gleason parts, Ann. Math., (2) 86 (1967). 
[11] M. L. Weiss, Cluster sets of bounded 
analytic functions from a Banach algebraic 
viewpoint, Ann. Acad. Sci. Fenn., 367 (1965). 
[12] T. W. Gamelin, Lectures on H”(D), Univ. 
Nacional de la Plata, 1972. 

[13] T. W. Gamelin, The algebra of bounded 
analytic functions, Bull. Amer. Math. Soc., 79 
(1973). 


180 


44 (XVII.1 0) 
Branching Processes 


A. General Remarks 


A brancbing process is a mathematical model 
for random motion of a family of particles 
each of which is in an isolated process of multi- 
plication and death. Examples of such ran- 
dom motions are population growth, miosis of 
genes, growth of the numbers of neutrons in an 
atomic chain reaction, and cascade showers of 
cosmic rays. The simplest and most fundamen- 
tal branching process is discussed in Section 
B. For a historical introduction to the study 

of branching processes — Kendall [2] and 
Harris [3]. 


B. Galton-Watson Processes 


Although there is a similar process with con- 
tinuous parameter t€R, we consider here 
only the case of a discrete time parameter 
(i = 0, 1,2,.). Suppose that we are given a 
family of particles of the same kind. Each 
member of the family splits into several par- 
ticles according to a given probability law 
independently of the other members and its 
own past history. Let Z, be the number of 
particles of the family at a moment (or gener- 
ation) n; then {Z,} gives rise to a Markov 
chain. This is called the Galton- Watson pro- 
cess. A precise mathematical description of the 
process is as follows: A Galton-Watson pro- 
cess is a Markov chain {Z, n=0, 1, .. . } on the 
nonnegative integers with ftransition proba- 
bility defined in terms of a given tprobability 
distribution {p, k=0, 1,},p, 20, Diop, = L 
by 

_ fpř if ist, j>0, 
py= PUZa sizi} if i=0, j>0, 


ij 
where the probability distribution { po k= 

1, . } denotes the i-fold tçonvolution of 
the probability distribution { p, K=0,1,...} 
and d;, is the fKronecker delta. Then p, is 
interpreted as the probability that an object 
existing in the nth generation has k children in 
the (n + 1)th generation. We assume that Zo = 
1 a.s. for the rest of this section. An impor- 
tant tool in the analysis of the process is a 
tgenerating function f(s) = Lj2o ps, Be 
Denote the generating function £o Le = 
k)s* (S< 1) of Z, by f,(s). Then we have fo(s)= 
S, fils) =f(s) sfinld)= =f) (ij =0,1,2,.), 
and the texpectation of Z, is given by E(Z,) = 
m", where m= f'(1) =E (Z,) is the expecta- 
tion of Z,. A Galton-Watson process is said 
to be subcritical, critical, or supercritical if 
m<1,=1, or > 1, respectively. If we have Z,= 


0 for some n, then Z,,,= Zp+2 = - - = 0. The 
probability q = P (lim, Z, = 0) is called the 
extinction probability. The case f (s) = sg is 
excluded in the following. The extinction pro- 
bability q of the process {Z,} is the smallest 
nonnegative solution of the equation s = f(s). 
Itis lifm<1 and <1 ifm>1. Moreover, 
P(lim, Z, = 00)= 1 —q when m> 1, and hence 
the process {Z, is ttransient. 

If we put W, = Z,/m" when m < 00, {W,} 
gives rise to a ‘martingale, and the limit W= 
limp» W, exists with probability 1. If m> 
1, then the tmoment-generating function 
p(s)= E(exp( —sW)) of W satisfies the Königs- 
Schröder equation (ms) = f(¢(s)), Re s > 0. If 
m <1, then P(Z, = 0 for some n) = 1. The fol- 
lowing theorem on the conditional distribu- 
tion of Z,, given that Z, 4 0, was first proved 
by Yaglom [6] under moment restrictions. If 
m <1, there exist b, = lim, P(Z,=k Z,#0), k = 
1,2 ,.... Then {b,{A=1,2 ,... }isaprobability 
distribution and its generating function g(s)= 
YZ, bs", |s| < 1, is the unique solution of 
the equation g( f(s))=mg(s)+ 1 —m, |s| <1, 
among generating functions vanishing at 0. 
Furthermore, when m < 1, Ee jb;< œ if and 
only if Lj; p(jlogj) < œ (i.e, E[Z, logZ,]< 
%0). 

If m= 1 and g? = variance of Z} < %, then 
lim, P[Z,/n> u| Z, #0] =expl —2u/o7], u>0. 
If g? = œ, then this result is still correct with 
the interpretation that the limit in the left- 
hand side equals 1. 

For further details regarding the theory of 
Galton-Watson processes — [3, S]. 


C. Multitype Galton-Watson Processes 


These are generalizations of the Galton- 
Watson process, involving k types of particles 
(k > 2), say, Ti, Ta, , Tj. Let p'(r,, T2., T) be 
the probability that a particle of type T; splits 
into fa particles of type T, (m = 1,2,. . , k), and 
set f'(S,, S2,- > S,)= È, pili ra -o Si Sh? 

. Skt. The number of particles Z, =(Z}, Z7, 
...,Z,) at a moment n gives rise to a tMarkov 
chain over Z* , and its transition probability 
is determined by p'(r,, r2, . r,) as for Galton- 
Watson processes. Here Z*, denotes the space 
of all k-dimensional tlattice points whose 
components are nonnegative integers. We 
now define a mathematical model of the 
process mentioned above. A Markov chain 
{Z,,|n=0,1,...} on Z*, is called a multi (k)- 
type Galton-Watson process if its transition 
probability is given by 


Pi, j) = P[Z,4,=j Z,= i] = coefficient of 
k 


[ [ s} in 


=1 


Z> 


(f'(s1,525 


ar) 


~ 
I 
— 
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for i=(i,,1,,+- cn i) J=UtJ25 a EZ’, n= 
0,1,...and s=(s,,5,,-.»,5,)€D,, where D, = 
{s=(51,5, vy 5,)JER*|[5)| <1, [= 1,2 Y e k} 
When m,;=0f'(1,. 1)/és;< œ (i j= 1,2, 
., k), the tconditional expectation of Zp4m, 
given Zm is given as E[Zn4n Zn] = Z_M", 
where M is the matrix (m,,). From now on we 
assume that m, < œ (i,j = 1,2,., k). A multi- 
type Galton-Watson process {Z, n = 0, 1, . } 
is said to be singular if the generating func- 
tions f'(s1,$5,...,5), /=1,2 ,..., karealllinear 
in S1, S2,, S With no constant terms. We 
assume nonsingularity throughout. If there is 
a positive integer N such that every compo- 
nent of MY is positive, the process Z,, is said 
to be positively regular. In this case, M has a 
positive teigenvalue 4 that is simple, and 4> 
|u| for all other eigenvalues u. The eigenvalue 
A plays the role of min the case of the Galton- 
Watson process. For every i, 1< i <k, set g'= 
P[Z,,=0 for some n|Z,)=e;], where e;=(e},e?, 
€f), ef = dj. Then q' is called the extinction 
probability of the process {Z, n = 0, 1, .} 
starting with a single particle of type T,; let 
q=(q',q’,....q*). Suppose that the process 
{Z, n=0, 1,... } is positively regular and not 
singular. Then q=(1,1,.... Dif A< 1, and 0< 
q'<1(i=1,2,...,k) ifA>1, and q satisfies the 
equation q=(f'(q), f7(q), . .f*(q)). Further- 
more, the process {Z, n=0,1,...}is ttran- 
sient, 1.e., P[Z, = j inlinitely often] = 0 for any 
j #0, je Z*. Moreover, if 2>1, then W, = 
Z,,/A" converges with probability 1, and 
Jim Z,/M=vW as, 
where v denotes the positive left teigenvector 
v=(v,,., W) for J of the matrix M, and W is 
a nonnegative trandom variable. P[ W> 0] > 
O holds if and only if 


E[Z{) logZ{}]<oo forall 1<ij<k. (1) 


Here Z{) denotes the number of type T; par- 
ticles in the first generation for a process with 
Zo = £; 1 <i <k. If condition (1) holds, the 
*moment-generating functions of W, @,(a) = 
E{exp[—aW]|Z,)=e,], i=1,2,...,k, a>0, 
satisfy p;(x) =f (p, (4/4), Go(a/A),. , (4/4), 
l<i<k. 

When A <1, a theorem analogous to Ya- 
glom’s holds. For further information = Joffe 
and Spitzer (J. Math. Anal. Appl. 19 (1967)) 
and Athreya and Ney [S]. 


D. Markov Branching Processes 


The branching processes dealt with in Sections 
B and C are limited in the sense that gener- 
ation times are lixed. We now formulate a 
continuous-time version of branching pro- 
cesses. The treatment in this section is limited 


44E 


Branching Processes 


to Markov processes that are extensions to 
continuous time of the Galton-Watson pro- 
cess. A Markov process {X,, P;} on Z}, is 
called a Markov branching process if the tran- 
sition probability P(t, i,j)= P,[X,=j]@jeZ\, 
t 2 0) satislies 


È P(t,i, w-( Pt, hs’) 
j=9 j=0 


for allie Z! and |s|< 1. Then there exist 

a positive constant a and a probability 

distribution { p,; K= 0,2,3,.} such that 

lim,jo( 1 ~ P(t, i, i))/t = iaifie Z}, and 

lim, | P(t, i,j)/t =iap,_;4,if j2i— 1, j#i. The 

+Kolmogorov equations are 

d j+1 

— P(t, ij)= —jaP(t,i,j)+a lp;-14, P(t,i, l) 

T J jaP(t,i,j TE Pj-i+1 
(‘forward equation) 


and 


d 
gre i, j)= —iaP(t, i,j) 


(se) 


tia > 


I=i-1,14i 


Pi-i+1 Plt, Lj) 


(tbackward equation). 


Then a and { p, k=0,2,3,. } are interpreted 
as follows. An object existing at f has a proba- 
bility adt of dying in the time interval [t,t+ av 
of length dt. If it dies at any time t, the proba- 
bilities are po, P2, p3, - that it is replaced by 

0, 2, 3,. objects. When EKo P(t, i, K) < 1, the 
number of particles attains +20 in a finite time 
interval with positive probability. In order that 
Eo P(t, i, K) = 1 for allie Z}, it is necessary 
and sufticient that for each 1 >£>0, f 1 (flu) 
—u)~' du =œ, where f(s)= Èico, j1 pjs for 

Isi <1. 

Most of the theory of the Galton-Watson 
process carries over to the continuous case. In 
particular, concerning the flimit distribution of 
X,, more precise results have been obtained. 
(For detailed discussion of Markov branching 
processes — [3,8].) 

Some work has also been done on the 
asymptotic behavior for temporally inhomo- 
geneous cases of multitype Markov branching 
processes. 


E. Branching Markov Processes 


We now give a systematic treatment of gen- 
eral branching processes in which an object is 
characterized by a parameter x in a ‘compact 
*Hausdorff space § with a countable open 
thase. The results are formulated in terms of 
strong Markov processes. Set S° = {8}, where 
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@ is an extra point. For every positive integer 
n, let S"=SxSx...xS/~, where ~ isthe 


n 

tequivalence relation given by the ‘permu- 
tations of coordinates. S” is compact with 
respect to the ‘quotient topology, and its 
ttopological direct sum S = 112) S" is flocally 
compact. Let S=SU {A} be the one-point 
tcompactification of S. A point x ES is de- 
noted by x=[Xx,,X),...,X,] if x is the tequiv- 
alence class containing (x,, X2,.., X ES”. 

x = [x] is denoted simply by x. Set B(S) = 

{f|f a bounded Borel measurable function on 
S}, BYS)= {FEBO |I fll <1}, C()={flfa 
bounded continuous function on S}, and 
C*(S) = C(S) N B*(S). B(S), C(S), . are defined 
similarly. For f€B*(S), define fe B(S) by f(x) = 
Lif x=, =M f(x) if x= [x xax ]E 
S", and = 0 if x =A. A ‘strong Markov pro- 
cess X = {X,, P,} on is called a branching 
Markov process if its semigroup {T,} (- 261 
Markov Processes) satisfies 


A n P 
T,/ (x)=(T,f)s(x) for every xeS and 
feB*S), 


where for ge B(S), gis is the restriction of 

gon S [10-12]. We set(i) Z,=n if Xe S", 
n=0,1,...,00, where §” = {A}, and (ii) T= 

inf{ t Z, #Zp}. Z and q are called the number 
of particles and the first splitting time, respec- 
tively. Let X = {x,, P} be a strong Markov 
process on S. We assume that the semigroup 
{H,} of X is a strongly continuous semigroup 
on C(S). Let ọ, be a nonnegative continuous 
tadditive functional of X. Let p,(x), n =0, 2, 
3,. be a sequence of nonnegative functions 
in B(S) such that E£ 9,4, P(X) =1. Con- 
sider a sequence of stochastic kernels 7,,(x, dy), 
n=0,2,3,....onSx S", i.e., for lixed A € B(S"), 
T,(x, A) is a measurable function of x, and 

for lixed x it is a probability on (S", 8(S")), 
where B(S") is the ftopological o-field on S”. 
Set n(x, D) = Ln=o,n# 1 Pax) Talx, DN S”) for 
DeB(Ŝ), xeS, where S(S) is the topological g- 
field on §. Then there exists a unique branch- 
ing Markov process X = {X,, P,} on satisfy- 
ing the following: (1) The Markov process {X,, 
t <t, P,} on S obtained from X by shorten- 
ing its tlifetime is equivalent to the Markov 
process obtained by ‘killing X at a rate dg, 

(- 261 Markov Processes). (ii) For any DE 
S(S) and 4>0, E [expl —At]; X,eD|X,_]= 
E,[exp[ —4At]|X,_ ]2(X,_, D) as. on {t< œ} 
for every x ES. (iii) With probability one, X, = 
A for allt > lim,;..t,, where t, denotes the nth 
splitting time [10]. This process X = {X,, P,} is 
referred to as the {x,, Ø, ™}-branching Markov 
process. 


Example 1. Let S={a,,a,...,a,}; then S can 
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be identified with Z‘ and § with ZF = Z% U 

{ œ}. Therefore a branching Markov process X 
on § is a Markov process on VAA such that its 

transition probability P(t, i, j), i, je Z*. satisfies 


t, i, (j, Jo. i) St se. sit 


k 
gi I] >» P(t, Er (ji Jas. Ji) 
i=1 pjg- jiez% 
. a . i 
x T ; 


for i=(i,, 1, mek ane ieZ\, 
-SE Dh. 


P(t,oo, o)=1 
S= (S1533 


Then the process X is called a multi (Q-type 
Markov brancbing process. 

Example 2. Let § = [0, co], k(x) be a non- 
negative locally integrable function on [0, 00), 
and p,(x), n = 0, 1, , be a sequence of non- 
negative measurable functions on [0, o0) such 
that © p,(x)= 1 and p,(x)=0. We extend 
k(x) and p,(x) (n #2) as functions on [0, œ] by 
setting them {0 0 at oo, and set p,(0o) = 1. 
Define a stochastic kernel (x, dy) on § x § by 


> Pa(X)9410,0,...,0]} (DN S”) 
n=0 Sv 
= n 
n(x, D) = if x €[0, 00), 
Öit, æ] (DNS?) if x= 00. 


Let X = Ixa P,} be the uniform motion on § 
with the ae {H,\ such that for feC(S) 
H,flx) = flx-+0) if xe[0, 00) and = f(co) if 

x = 0. Set p, = ate „) ds. Then we have a 
{X;, Pp n}- ae a process, and we 
call it an age-dependent brancbing process. 
We now discuss some fundamental prop- 
erties of {x,, œ, n}-branching Markov pro- 
cesses. For the sake of simplicity we assume 
the following: (i) #; = fó k(x,)ds,keC*(S)= 
(FEC(S)| f >0}, Gi) py(x)eC*(S) and 


TAX, D) =x, x... (D) 
ee ped 


for De B(S"). Then {T,} is a strongly con- 
tinuous semigroup on C,(S) = {fe C(S) 
lim, .4 f{x)=0}. Let A be the tinfinitesimal 
generator of {H,} (- 261 Markov Processes) 
and YA) be the domain of A. If fe 2(A)N 
C*(S), then u(t, x)=(T,f))s(x)e P(A), and it 
satisfies ĝu/ôt = Au +k(F(';u) u),u(0+,x)= 
Sx), where F(x; g)= fsĝ(y)z(x, dy) [10]. 

Let .@ be the space of aj] nonnegative 
tRadon measures on § and Z, be the şub- 
space of .# consisting of al] probability Radon 
measures on § endowed with the topology of 
tweak convergence. Set # = M U {6}, where ĝ 
is an extra point and = [0, 00] x Mg. Define 
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a mapping p:.@/3(A, åo) (A, Ap)e-M by 
p(A, Ap) = Ady if À < co and = dif 4 = 00, and 
define the topology of .@ as the strongest of all 
the topologies rendering p continuous. Set 
C**(S)={feC(S)|f>0}, and for feC**(S), 
detine a function ®,(4) on æ by ®,{4) = 
ne (A, He if Ae M and =0 if A=0, where 
) = fs f(x)A(dx). Consider a mapping 4% : So 
p a w(x)=0 if x =6,= di=1 9} paa 
x= [xi X3, c X JES", and = @ ifx=A. Let 
M, be the subspace of .@ consisting of a11 non- 
negative integer-valued Radon measures on S, 
i.e., &,={W(x)|xeS}. We denote by 2 the 
space of all right-continuous mappings v: 
[0, 0)> M, whose discontinuities are at most 
of the first kind and such that v, = 6 for t > 
§ if v, = 6. Consider a branching Markov 
process {X,, P,}, and let Pi be the tprob- 
ability law on 9 of the stochastic process 
{v,=w(X,;0< t< œ} with X,=x, where u= 
w(x). Then the branching property can be 
rewritten as follows: For every t>0, feC**(S) 
and His H2€ Mp, E, +,,[exp[—(, f)]]= 
E, CexplL—(% AIIE,,Lexpl—(, A] (> 
Silverstein, Z. Wahrscheinlichkeitstheorie und 
Verw. Gebiete, 9 (1968); for a treatment of 
branching processes in the framework of 
‘martingale problems — Holley and Stroock, 
Publ. Res. Inst. Math. Sci., 14 (1978)). 

There are analogous processes which pre- 
serve the basic features of the branching 
property. A Markov process { p, P,} on M 
with 6 as a ttrap is called a continuous-state 
branching process if jt satisties the property 
that for every 1>0, feC**(S), and py, 4EM, 
E, +u,Lexp[ —(u, f)]]= E, Lexp[—(#,,f)]] 
E,,,Lexpl — (u, f)]]. This concept was intro- 
duced by Jiiina [ 15] for some special cases. 
For further information — Jifina (Third 
Prague Conf., 1964), Lamperti (Bull. Amer. 
Math. Soc., 73 (1967)), Watanabe (J, Math. 
Kyoto Univ., 8 (1968)), and Fujimagari and 
Motoo (Kédai Math. Sem. Rep., 23 (1971)). 
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A. General Remarks 


R. Brown, an English botanist, observed in 
1827 that the minute particles comprising the 
pollen of plants, when suspended in water, 
exhibit peculiarly erratic movements [ 1]. The 
physical explanation of this phenomenon is 
that haphazard impulses are given to the sys- 
pended particles by collisions with molecules 
of the fluid. Let X(t) be the x-coordinate of a 
particle at time ¢, Then X(t) is treated as a 
trandom variable, and the distribution of X(t) 
-X(s) is a tnormal distribution N (0, D|t—s]), 
with tmean 0 andtyarianceD t-s |, where D 
is a positive constant. To be more exact, such 
a family of random variables {X(t)} is now 
considered as the family of random variables 
determining a tstochastic process. Various 
aspects of the theory were analyzed by A. 
Einstein [2], L. Bachelier, N. Wiener, P. Lévy, 
and others. 
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B. Wiener Processes 


Let T be the real line Rt or a subinterval. A 
tstochastic process {X(t)},,, defined on a 
*probability space (Q, B, P) is called a Wiener 
process on R’ if it satisfies the following three 
conditions: (1) X(t, @)eR*(te T, weQ). (2) The 
R*-valued random variables X(t,), X(t)~- 
X(t;-1), j= 2,3,,n, are tindependent for 
any t;<f,<...<t,, HET (j=1,2,...,n), where 
nis an arbitrary positive integer. (3) If X,(t) is 
the ith component of the vector X(t), then the 
{X,(t)},er (1 SiS d) are independent as sto- 
chastic processes and every increment X,(t) 

— X (s) is normally distributed with mean 0 
and variance t—s|, A Wiener process is also 
called a Brownian motion. A Wiener process 
is a temporary homogeneous additive pro- 
cess. A fseparable Wiener process has continu- 
ous paths with probability 1. Conversely, if 
{X(t)},e7 is a temporary homogeneous addi- 
tive process on R! whose tsample function is 
continuous with probability 1 and the incre- 
ment X(t) = X(s) has mean O and variance 

t —s|, then it is a Wiener process (- 407 Sto- 
chastic Processes, 406 Stochastic Differential 
Equations). Let {F159 be an tincreasing 
family of o-subfields of 23. We assume that 
{F,} is right continuous. A d-dimensional 
continuous process X=(X(t)),,,,, is called a 
d-dimensional {.Y,}-Brownian motion if it is 
t{ ¥,\-adapted and satisfies 


ElexpLi<é, X()— X(s)>]|F] 
=exp[—(t—s)|¢|?/2] 


a.s. for every €€R4, O<s<t, 


where |é| denotes the tnorm of €€R?[9]. Then 
X satisfies conditions (1)-(3) mentioned above, 
and hence it is a Brownian motion on R’ (- 
406 Stochastic Differential Equations). 

Let {X,(@)} (k=0, 1, . ..) be a sequence of 
independent random variables defined on a 
probability space (Q, B, P) such that each X, 
has the normal distribution N (0, 1) with mean 
0 and variance 1. Then the series 


t 2 2 sin nt 
gros [25 xo i 


converges uniformly in t € T with probability 1, 
and its limit, denoted by X (t, w), is a Wiener 
process [3, 10]. 

Let W° = C( [0, 00)—> R’) be the space of all 
continuous functions w: [0, 00)—>Rźt endowed 
with the topology of uniform convergence on 
finite intervals and 8( W°) be the ttopological 
o-field. Let X=(X(t)),,,,, be a Brownian 
motion on Rf and ft be the probability law of 
X(0). The probability law P* of the Brownian 
motion X on ( W%, B( W%)) is called the d- 


te[0,z]=T, 
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dimensional Wiener measure with the initial 
distribution 4 [9, 11,12]. For a probability pz 
on (Rf, 8(R‘)) the d-dimensional Wiener mea- 
sure P, with the initial distribution p isa prob- 
ability on ( W4, ®( W*)) characterized by the 
property that for every 0 = tg <<t,<<t, and 


A;jEB(R’, j= 1,2, uM, 
P,[w(t,)€A,, w(ta)€ Ao, --., W(t,)E An] 


n 1 d 
-hh Liam) 
oar 
xexp| ———— | (dx) dx, ...dx,, 
o| Gi) 0) AX 


where 8(R‘) denotes the topological g-field on 
Rd. 


C. Brownian Motion as a Diffusion Process 


In terms of the general framework of Markov 
processes, Brownian motion is a typical exam- 
ple of a diffusion process (= 115 Diffusion Pro- 
cesses). Let X = {X,(w), Rô, P,\ be a continuous 
Markov process on R (— 26 1 Markov Pro- 
cesses) with the ftransition probability 


i 1 
(2m1) Pe x { ~zlx-al) 


Be®B(R4. 


P(t, x, B)= | 


B 
t>0, xeR4, 


For each xE Rô, the process {X,(w), P} isa 
Wiener process in the sense mentioned above. 
The Markov process X, which is a collection 
of Wiener processes {X,, P,} starting at x, is 
said to be a d-dimensional Brownian motion. A 
d-dimensional Brownian motion possesses the 
‘strong Markov property. Let & be the tgen- 
erator of the tsemigroup T, corresponding to 
X. A bounded uniformly continuous function 
f defined on R’ belongs to the domain of G if 
its partial derivatives ĝf/ôx; and 6 f/0x,0x;,i,j 
= |,2,. d, are bounded and uniformly con- 
tinuous. For such a function f we have Gf(x) 
=(1/2)Af(x), where A is the fLaplacian[8, 12]. 


D. Brownian Motions and Potentials 


For g > 0, the function G,( x) detined by 


Ps ee -4/2 PS 2 
G=, e “(2nt) exp F ix] Ja 
is said to be the cc-order Green’s function. 
Since Brownian motion is tnonrecurrent for 
d > 3, the limit Go+ (x)= lim, o G,(x), xe R, 
exists for > 3, and G,+(x) is equal to K,(x) = 
(F'(d/2 = 1)/424)|x ~**?, which is the kernel 
for the tNewtonian potentials. Brownian 
motion is trecurrent when d < 2 and G,+(x) = 


+00, xe R%. In this case, K,(x) is defined by 
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Ko(x) =lim, )9(G,(x) = G,(X)), and Ko(x) = 
(1/2n)log(1/|x|), when d=2 and |x9|=1. 

This is the kernel for the tlogarithmic poten- 
tials. When d = 1 and x, = 0, K,(x) = —|x|/2. 
Using this relationship, we can express many 
concepts of classical potential theory in an 
elegant form in probability language. Let 

X = {X(t), P,} be the d-dimensional Brown- 
ian motion. Given a set A, set o4 =inf{t {> 

0, X(t)e A}, where the infimum over the 
empty set is understood to be +00. Then 4, is 
called the thitting time of X for the set A (— 
261 Markov Processes). For a Green domain 
(i.e., a domain which is a tGreen space) D in 
R’ (d> 2), set 29 °(t, x, y)dy =P (X (thedy, cap > 
t), x, yE D. Then the right-hand side of this 
equation is the ‘transition probability of the 
Brownian motion on D with the tabsorbing 
barrier ôD, Then G?(x, y) =|? g(t, x, y)dt is 
tGreen’s function of D. If B is a compact sub- 
set of a Green domain D or an open subset 
with compact closure B c D, then the hitting 
probability p(x) = P,(0g < dap) is the equilib- 
rium potential of B relative to D. 

Suppose A is an tanalytic subset of R% 
*Blumenthal’s O-1 law implies that P,(a, =0) 
= 1 or 0. The point x is said to be regular for 
A if this probability is 1 and irregular for A 
otherwise. Let B be a compact subset in R? 
(d > 2). Then *Wiener’s test (— 120 Dirichlet 
Problem) states that x is regular or irregular 
for B according as the following series diverge 
or converge: 


where C(B,) is the Newtonian capacity (the 
logarithmic capacity relative to a bounded 
domain when d =2) of the set B, = {fy 2740 < 
ly —x|<2~*} B [12]. Suppose D is a bounded 
domain in Rf (d > 2). A point x e ĝD is regular 
or irregular for R? — D according as x is regu- 
lar or irregular in the sense of the *Dirichlet 
problem for D. Given a continuous boundary 
function f on dD, u(x) =E (f(X (aap) is the 
solution of the generalized Dirichlet problem 
for D. Given xeD, the distribution h(x, B)= 
P(X (Gap) € B) (BE B(AD)) used in the solution 
u(x) = fan f(y)h(x, dy) of the generalized Diri- 
chlet problem is the tharmonic measure of @D 
as viewed from x. 


E. 1-Dimensional Brownian Motion 


In his monograph [6], P. Lévy gave a pro- 
found description of the fine structure of the 
individual 1-dimensional Brownian path 
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{X(t,@)},>9- Let us set 
o, = the hitting time to the point a in R’, 


m(t,@)= min X(s,@), 


Bo SSK 


M(t,@)= max 


Gg SSK1 


(s, w), 


Yo(t, @) T |X (t, w)|, 


Y,(t,)= e w), ee ola) 
(t, 0) = M(t,@)—X(t,w), t>d9(w), 
and 
X(t), t< Oo (), 
Y,(t, o) = fe w)—m{t,@), t2oo(w). 


Then we have (1) Po(M(t)>a)=2P)(M(t)> 

a, X(t)>a)=2P)(M(t)>a, X(t)<a)= 
P,(|X(t)| >a) (reflection principle of D.Andreé). 
(2) The stochastic process {a,, 0< a< 00, Po} 
is a tone-sided stable process with exponent 
1/2, that is, it is additive and homogeneous 
with the law 


Po(0,— 0, <0) = Po(O,- <t) 
| b-a 
= Jo ./2ns3 


e7 (b-ay?/2s ds, 


O<a<b, t20. 


(3) Let m~!(t,@) be the right continuous in- 
verse function of 


ott, o=] Xto, œ (X (s, w)) ds, 


0 


where Xto, «)(*) is the findicator function of the 
interval [0, œ). Set Y3(t,@)=X(p ~! (t, œ), œ). 
Then the four processes { Y,(t, w), 0<t <%, P,} 
(xe[0, œ), 0<i<3) on [0, œ) have the same 
probability law. Each of them is a diffusion 
process with transition probability 


P(t, x, B) 


1 
eea s (e7 ETI? gt yP/2t) dy, 
o 


BeB([0,«0)), t>0, xe[0, œ), 


and is said to be a Brownian motion on [0, 00) 
with a freflecting barrier at the origin. (4) As a 
consequence of (3), if X(0, œ) = 0 a.s., then, for 

fxed t, M(t), -m(t), and Y,(t)(O<i<3) have a 
common distribution. For example, 


Po(M(t) 2 a) = Po(Ga<t) = 2Po(X(t)> a) 


2 
Z ahs e-* 2t ay 


a>0, 
nt |, 


P (X (t)eda, M(t)e db) 


= |= (2b— ayer 0" dadb, 0<a<b. 
T 
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(5) The diffusion process {X (t, w), 0 <t< do, P,\ 
(x € (0, 00)) obtained from a 1-dimensional 
Brownian motion by shortening its flifetime is 
called a Brownian motion on (0, œ) with an 
‘absorbing barrier at the origin, and its tran- 
sition probability is given by 


P(t, x, B) 


(e7 x79 2 saps \x+yl?/20) dy, 


1 
‘| 
Be B((0, «)), 


The arcsin law is valid for many functionals 
of 1-dimensional Brownian motion. For 
example, 


2 0 
(| fo.alX(6.0)as<0) = arcsin |°, 
0 T t 


0<0<t, 


t>0, xe(0, 00). 


O<st, 


Po(t,(@) < s) =(2/n)arcsin /si/t ; 


where 1,(o)=sup{s| X(s,@)=0,0<s<t}. 
The visiting set #(w)= {t| X(t, @)=0} of a 
Brownian path is a ttotally disconnected 
set. Its fLebesgue measure is 0, and the 
tHausdorff-Besikovich dimension number of 
F(a) is 1/2. 
Consider a -dimensional Wiener process 
{X(t)}o<;<. Starting at the origin. Let us set 


0 i 
B= ra t>0, 
and 
B,(t)=cX(t/c?), t20, (c>0). 


Then the stochastic processes {B,(t)}o<:<. 
and {B,(t)}y <j<q are likewise 1-dimensional 
Wiener processes starting at the origin. Hence 
the properties of the Wiener process starting at 
the origin in a neighborhood of t = 0 (t = œ) 
can be obtained from those in a neighborhood 
of t= œ (t=0). 


F. d-Dimensional Brownian Motion 


Almost all paths of d-dimensional Brown- 
ian motion are continuous but are not of 
tbounded variation on any finite interval. 
Accordingly, they cannot have lengths. A 
positive, continuous, increasing function @ 
defined on [t), 00) with tọ > 0 is said to belong 
to tbe upper (lower) class witb respect to local 
continuity if P,((inf{t || X(£,@)|> Ji (t/t), t> 
0} > 0) = 1 (0). Kolmogorov’s test states that 
~ belongs to the upper class or to the lower 
class with respect to local continuity according 
as 

% 

HEO OR dt 


to 
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converges or diverges. For example, 
p(t)=(2logi, t +(dt 2)logys)t + Zlogayt +. 
+ 2logy,-1t+(2+ d)log,, t)? 


belongs to the upper or lower class with re- 
spect to local continuity according as 6 > 0 or 
6<0, where 10gm t =loglog,,_;)¢ and log,,,t= 
log t. The law of the iterated logarithm for 1- 
dimensional Brownian motion, 


( | X(t, a) — X(s, @)| 
P| imsup ——————-= =1, 
ros ,/2|t—s|loglog 1/|t—s| 


is a special case of this example. For the case d 
= 1, consider the space-time Brownian motion 
{(—t, X(t), Po}. Set D, = {(s, X)| —1/to <s <0, 
/—5 @(-1/s)<x<«}. Then O=(0,0) is a 
regular point of D, or an irregular point of D, 
for the space-time Brownian motion accord- 
ing as @ belongs to the lower or upper class 
with respect to local continuity. (- 261 Mar- 
kov Processes). Thus Kolmogorov’s test is the 
Wiener test for space-time Brownian motion. 
Let w be a positive, continuous decreasing 
function defined on [t, 00) with tọ > 0. Then 


Pa ( int X(t, o< /t yijo.e>0}) >0) 


=0 or 1. 


We have 
T. 
| 7U at =oor<o when d>3, 
Io 
and 
a dt 
—— =o or <0 when d = 2. 
I t|log w(t)| 


To describe uniform continuity of a path on 
the interval [0, 1], take a positive, continuous 
increasing function y defined on [t), 00) with 
to > 0 and set g(t) = Ji /(1/t). Then w is said 
to helong to the upper class with respect to 
uniform continuity if almost all paths X(t, œ) 
(0< t< 1) satisfy the tLipschitz condition rela- 
tive to @, that is, for almost all œ there exists 
an E(w) > 0 such that 0 < t—s|<e(a) implies 
|X (t,0.)—X(s,0)| <¢(|t—s]). And y is said to 
belong to the lower class with respect to uni- 
form continuity if almost all paths X(t, œ) 

(0 <t <1) do not satisfy the Lipschitz condition 
relative to y. Then w belongs to the upper or 
lower class with respect to uniform continuity 
according as 


ie 6) 
| y(t) t2 e702 dt 
t 


(è) 


converges or diverges [ 16]. For example, 
YE=2logt +d +4)log,, t +21088t +. 
+210gn-1)t +(2 + d)log,, D7 
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belongs to the upper or lower class with re- 
spect to uniform continuity according as § > 0 
or § <0. The following theorem on the uni- 
form continuity of 1-dimensional Brownian 
motion is a special case of this criterion: 


P, limsup |X (t,o) —X(s, @)| = | =1. 


Jaso, /2lt—sllog 1/lr—s| 


Now we state some other properties of 
Brownian paths. Let A be a set of zero outer 
capacity in R° (d> 2). Then P(X(t)e A for 
some t > 0) = 0 for any x e Rf, Let A be a plane 
set with positive tinner capacity. Then P,(X (t) 
€ A for intinitely many t larger than any given 
§ > 0) = 1 for anyx € R?. With probability 1, 
the Lebesgue measure of the set {X(t, œ) |0 < 
t <oo}in R'is zero for d> 2. With proba- 
bility 1 this set is everywhere dense in R? 
when d = 2 and nowhere dense in Rf when 
d>3[ 13,15, 17]. Almost all 2-dimensional 
Brownian paths have k-fold multiple points 
for any integer K> 2. In the 3-dimensional 
case, almost all paths have ‘double points but 
cannot have any triple point. In the d( >4)- 
dimensional case, almost al] paths have no 
double point [18-20]. 


G. Ito’s Formula and Brownian Local Time 


The following formula of It6 [21] is of funda- 
mental importance in the theory of stochastic 
processes. 

It6’s formula. Let X={X(t) ={X,(t), X,(t), 
..+,X,(t)), Pe} be a d-dimensional Brownian 
motion and f(x)=f(x,, X2,.., Xa) be aC?- 
function delined on R, Let us set <p(t) = 
{(X(9)—f(X (0). Then {¢9(2)},59 is a continu- 
ous tadditive functional of the Brownian mo- 
tion X (- 261 Markov Processes), and we 
have 


t 


d 
xin) 11x0)=¥ | 


0 


ð 
(Soxo 
Xi 


+5 | aneconds t>0, 


where the first term in the right-hand side is a 
fstochastic integral with respect to the Brown- 
ian motion X [22] (- 406 Stochastic Differ- 
ential Equations). In particular, if f is a C?- 
function defined on R4, Itô’s formula can be 
rewritten as 


where the right-hand side is a stochastic inte- 
gral of the *Stratonovich type with respect to 
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the Brownian motion X [9] (- 406 Stochastic 
Differential Equations). 

Consider a 1-dimensional Brownian motion 
X = (X(t), P,}. By the local time or the sojourn 
time density of X we mean a family of non- 
negative random variables {@(t, x)} (te [0, oo), 
xéR’) such that, with probability 1, the fol- 
lowing holds: 

(a) the mapping [0, 00) x R!ə(t,x)—> (t, x)eR! 
is continuous, 
(b) for every Borel subset A of R' and ¢ >0, 


|, x4(X (s))ds= al p(t, x)dx. 


The notion of the local time of Brownian 
motion was first introduced by Lévy [6]. The 
family of random variables { g(t, x)} (t > 0, 

XE R!) defmed by 


(t,x) =(X()— x)" —(X(0)—x)* 
l 
- l Xix, œX (s)) dX (s) 


Satisties properties (a) and (b) mentioned above 
and hence it is a local time [9, 22]. Here a+ is 
the bigger of a and 0. It is clear that the local 
time g(t, x) of X is given by 
1 t 
p(t, x) = lim 4e | X(x~e,x+0)(X (s)) ds 
for every xER',t>0. 

Furthermore, we have 
lott, b)— p(t, a)| 

dlog 1/6 


<2,./max otta)) = l, 


P ( lim sup 


b-a=6|0,a<b 


aeR! 


and 


t,ô)—ọ(t, 
rts )— 0(¢,0) 


— [S2 E(t, 0))= I. 
510 dloglog 1/6 


Lévy [6] studied the fine structure of the local 
time. Define the visiting sets Z += {t} Y(t, w) 
=0} and Z~ ={t| Y,(t,@)=0}, where Y,(t, œ) 
and Y, (t, œ) are the stochastic processes de- 
fined in Section E above. Then we have 


: TE 
Po (i i= x [the number of flat stretches of 
M(s, a) (0<st) of length 2€] = M(t, a), 
120) =I. 


Since the two diffusion processes X * = 
{Y)(t),0<t< oo, P} and X~ ={Y,(t),0<t< 
oo, P,} define the same probability law on 

W( [0, 00)), there exists a functional @*(t, œ) of 
xt corresponding to M(t, w) of X ~. Here, 
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W( [0, œ)) denotes the space of all continuous 
functions w: [0, 00)->[0, 00), The flat stretches 
of the graph of @ *(t, œ) are the open intervals 
Z, (n> 1) such that ¥,., %,=[0, 0)-2#* 
Then we have 


Po (ip [Fx [the number of intervals 
elO 


¥,<[0, t) of length >4]=9°(40120) =l. 


Furthermore, we have 


P (in fe x [the total length of the inter- 
l0 N 2e 


vals Z, c [0, t) of length <e]= 9 *(t, w), 


1>0)=1 


and 


(ime | wnei 


0 


=9"(t0)t20)= 1 


Let d,(t, w) be the number of times that the 
reflecting Brownian path Y,(s, œ) crosses down 
from £> 0 to 0 before time f, Then 


Po (in éd,(t, w)= p`(t, w), 120) = l. 


H. Flows and Random Distributions 


The flow derived from the 1-dimensional 
Wiener process {X,} ~ 4 << 18 'Kolmogorov’s 
flow. It has tmixing properties of gj] orders 
and is tergodic (- 136 Ergodic Theory, 39.5 
Stationary Processes). The ‘stationary process 
with independent values at every point corre- 
sponding to the tcharacteristic functional 


exp(—3| p(t)? i) 


on the *Schwartz space Y defines the same 
probability law with the stationary process 
obtained by differentiation of the Wiener 
process in the tdistribution sense [23] (- 
395 Stationary Processes, 407 Stochastic 
Processes). 


I. Generalizations of Brownian Motion 


In addition to the Brownian motion described 
above, there are several stochastic processes 
that are also called Brownian motion. A Gauss- 
ian system {X (a)}aer~ defined on a proba- 
bility space (Q, B, P) is said to be a Brownian 
motion witb an N-dimensional time parameter 
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if(i) E(X (a))=0, Gi) E(X (a)X(b))=H({al + 
|bi—|a—b}), (iii) P(X(0)=0)=1. Leta*bethe 
tspherical inversion of a € R“ with respect to 
the unit sphere. Set X*(a)=|a|X(a*) (a40) 
and X*(O) = 0. Then {X*(a) taer" defines the 
same probability law with a Brownian mo- 
tion with an N-dimensional time parameter. 
Almost all paths of a Brownian motion with 
an N-dimensional time parameter are con- 
tinuous. A positive, continuous, increasing 
function @ defined on [tọ, 00) with tọ >0 is said 
to belong to the upper (lower) class with re- 
spect to local continuity if the probability 

that the closure of the set {a |X(a, w)| > /\al 
ọ( L/jal), |a| > 0} contains the origin 0 is equal 
to 0 (1). Then @ belongs to the upper or lower 
class with respect to local continuity according 
as the integral 


t 


0 


[owner tea 
t 


converges or diverges. For example, 


P(t)=(2log,,t+(2N+ — I)log.3)f+ 2logy4yt+ 


+ 2logy,-;,t +(2+ d)log,, t)"? 


belongs to the upper or lower class with re- 
spect to local continuity according as 6 > 0 or 
ô <0. As a special case, we have 


e( limsup ESC 


——] |=. 
a0, /2{alloglog1/a| 


Take a positive, continuous increasing func- 
tion @ defined on [tg, 00) with to > 0, and set 
W(t)= Jt (1/t). If almost all paths X (a, œ) 
(|a| < 1) satisfy the Lipschitz condition relative 
to w(t), then @ is said to belong to the upper 
class with respect to uniform continuity. It is 
said to belong to the lower class with respect 
to uniform continuity if, with probability 1, 
these paths do not satisfy the Lipschitz con- 
dition relative to w(t). Then @ belongs to the 
upper or lower class with respect to uniform 
continuity according as 


0 
| oti te ee dt 
t 


0 


converges or diverges (T. Sirao, Nagoya Math. 
J., 17 (1960)). For example, 


p(t) = (2Nlog t+ (4N + log, t+ 2 log,s)t 
+... +2loge1)t+(2+ Slog, t)" 


belongs to the upper or lower class with re- 
spect to uniform continuity according as ô > 0 
or 6< 0. As a special case, 


i |X (a,m)—X(b, m)! 
P lim su £ 
ja-|+0.lal.161<1,/2N|a—b|log 1/|a—b| 


=1)=1 
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(For general information about Brownian 
motion with a multidimensional time para- 
meter — P. Lévy [6] and H. P. McKean, 
Theory of Prob. Appl. 8 (1963).) 

Let X(t) be a Wiener process. L. S. Ornstein 
and G. E. Uhlenbeck based their investigation 
of the irregular movements of small particles 
immersed in a liquid on Langevin’s equation 


dU (t)= —aU (t)dt + BdX (t), 


where U (t) is the velocity of a particle. The 
first term on the right-hand side of this tsto- 
chastic differential equation (- 406 Stochastic 
Differential Equations) is due to frictional 
resistance or its analog, which is thought to be 
proportional to the velocity. The second term 
represents random external force. The station- 
ary solution of this equation is given by 


U(t) a Be-**- dX (u). 


The stochastic process {U(t)} <<% iS a 
stationary (Gaussian Markov process with 
covariance function y(t) = (B?/2a)e~%"!. This 
process is called Ornstein-Uhlenheck Brownian 
motion [24] (— 176 Gaussian Processes). 

Brownian motion has been defined on state 
spaces that are *Riemannian spaces or +Lie 
groups, and its properties are being investi- 
gated (= 5 Additive Processes, 115 Diffusion 
Processes). 
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Calculus of Variations 


A. General Remarks 


One of the first objects of differential calculus 
was to systematize the theory of the extrema 
of functions of a linite number of indepen- 
dent variables. In the calculus of variations 
we consider functionals (i.¢., real-valued or 
complex-valued functions delined on a func- 
tion space {u} consisting of functions delined 
on a certain domain B), originally so named 
by J. Hadamard. Such a functional is denoted 
by J[u B] (or simply J[u]). A function u, 
which is considered an independent variable of 
the functional, is called an argument function 
(or admissible function). 

Concrete examples of functionals are the 
length of a curve y = f(x) and the area of a 
surface z = z(x, y), which are expressed by 


LLy] -| ' /1+(y'@)y ax 


and 
S[z] =f J1+23+2) dxdy, 
B 


respectively. Furthermore, consider a curve y = 
y(x) connecting two given points (x,, yọ) and 
(x,, y.) with y, >y,. The time in which a parti- 
cle slides down without friction from (x,, y,,) 
to (x1, y1) along this curve under constant 
gravity acting in the direction of the positive 
y-axis is expressed by the functional 


Jtl=k | (1+ y)/(y—yo) dx, 
Xo 
k constant. (1) 


Let F(...) be a known real-valued function 
that depends on a certain number of indepen- 
dent variables and on argument functions of 
these variables, together with derivatives of 
these functions up to a certain order. Then a 
typical problem in the calculus of variations is 
formulated as an extremal problem of a func- 
tional that is expressed by the integral with 
F(...) as the integrand, for example, 


J[u]= ii F(x, u(x), u(x), . . u(x) dx, 


J[u, v] = F(x, y,u(X, y), Uys Uys re 
[u,v] yaar (xX, Y), Urs My yay 


v(x, y), Vys Vysana ) dxdy. 


For instance, a curve minimizing (1) is the 
curve of steepest descent. 

In extremal problems, suitable boundary 
conditions can be assigned to argument func- 
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tions. On the other hand, there are so-called 
conditional problems in tbe calculus of vari- 
ations. A typical example of this sort is the 
tisoperimetric problem, i.e., the determination 
of the curve that bounds a domain with maxi- 
mal area among all curves on a plane with 
given length. In general, an extremal problem 
of a functional under a subsidiary condition 
that the value of another given functional 
remain lixed is called a generalized isoperi- 
metric problem. In addition to these, there are 
Lagrange’s problem, in which a finiteness 
condition is imposed, and Hilbert’s problem, in 
which a condition consisting of differential 
equations is imposed. 

The birth of the calculus of variations was 
almost simultaneous with that of differential 
and integral calculus. Johann Bernoulli, Jakob 
Bernoulli, L. Euler, and others had dealt with 
several concrete problems of the calculus of 
variations when in 1760, J. L. Lagrange intro- 
duced a general method of dealing with vari- 
ational problems connected with mechanics. 
Then an equation bearing the name of Euler 
or Lagrange was introduced. 


B. Euler’s Equation 


As an example, consider the simplest vari- 
ational problem 


x 

J[y]= | F(x, y(x), y’(x))dx = min. (2) 
Xo 

Let the boundary condition y(x) = Y.. y(x,) 

=y, be assigned to the argument function 

y(x). Consider a family of admissible functions 

Y(x; €)= y(x)+ n(x), where n(x) is any lixed 

function vanishing at both endpoints and 

gis a parameter. If y(x) gives the minimum 

of J[y], then the function of ¢, JẸ Y], must 

attain a minimum for ¢ = 0. The condition 

(OJE YV/&),,, = 0 is written in the form 


0 -| (F,n +F, n')dx 


xy d 
-| n( SF ax 


by taking into account the boundary con- 
dition. By making use of the arbitrariness of 
n(x), we conclude that 


0= By =F Epy By Eyy (3) 
holds, in view of the following lemma: Let ọ(x) 
be continuous in [x9, x, ] and y(x) be a func- 
tion of class C? that satisfies n(x) = n'(xo)=- - 

= (Xo)=0, n(x) =n (x)=... = P(x) =0 
(0<q<p). If [¥:n(x)e(x)dx =0 holds for any 
such n(x), then g(x)=0. (Here class C? can be 
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replaced by class C? when q is supposed finite. 
If ye C°”, then q = œ% is admitted.) This is called 
the fundamental lemma in the calculus of vari- 
ations. We call (3) the Euler-Lagrange differ- 
ential equation (or Euler’s equation for the 
extremal problem). Since this equation is of 
the second order, y(x) can be determined by 
means of the boundary condition. 


d 
The quantity [F],=F, =e F; contained in 
x 


equation (3) is called the variational derivative 
of F with respect to y. Furthermore, dy = nde 
and ĝJ = (6J [ Y\/&),,, de are called the first 
variations of the argument function y and of 
the functional J[y], respectively. If n(x) is not 
subject to the condition that it must vanish at 
the endpoints, then the first variation of J 
becomes 


J= | [F] ôydx+ [F, dy]. 


In comparison with an ordinary extremal 
problem f (x1, s Xp) = min in differential 
calculus, [F], and 6J correspond to gradfand 
df, respectively. In general, a solution of the 
Euler-Lagrange differential equation [F], =0 
is called a stationary function for the vari- 
ational problem, and its graph is called a 
stationary curve. 

For a variational problem involving several 
argument functions, we have only to write the 
system of Euler-Lagrange differential equa- 
tions corresponding to them. For a problem 


x 
=| l F(x,y, y’, 00-5 ¥™)dx=min 


Xo 
whose integrand involves derivatives of higher 
orders of an argument function, the Euler- 
Lagrange differential equation is 


m d! 
LF], =e (SI om yo =O. 
For a problem involving a double integral 
J[u] = F(x, y, Uu, Ux, Yyy) dx dy = min, 
S Sg 


the equation is 


7] ô 
0=[F], = Faz Fa 


F; 
Ox 


x By y 

For a generalized isoperimetric problem, for 
example J[y]= min, K [y] = c, the Euler- 
Lagrange equation becomes 


[F +4G],=0 


with the so-called Lagrange multiplier A, where 
F and G denote the integrands of J and K, 
respectively. Two integration constants con- 
tained in a general solution of this differential 
equation of the second order and an undeter- 
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mined constant 4 can be determined, for in- 
stance, by the boundary condition y(X9) = Yo, 
y(x,) = y, and a subsidiary condition K [y] = c. 
As an example, for the classical proper isoperi- 
metric problem F = y, G = ,/1 + y?, the equa- 
tion is 1 —A(y'/./1+ y'?yY = 0, which after 
integration leads to (x —a)? + (y = PY =A? 
(— 228 Isoperimetric Problems). 

Besides the case of fixed endpoints, there is 
a boundary condition, for instance, that an 
endpoint (x,, y,) of the argument function 
y = y(x) must lie on a given curve T(x, y) = 0. 
For the case of such a movable endpoint, the 
extremal function is subject to the condition of 
transversality, 


(F—y'F,)T,—F, T,=9, 


X=X1. 


C. Sufficient Conditions 


A. M. Legendre introduced the notion of the 
second variation, corresponding to differential 
quotients of the second order in differential 
calculus, in order to discuss sufficient con- 
ditions. Concerning the simplest problem (2), 
the inequality F,,,.(x, yo{x), yo(x)) 2 0 is neces- 
sary in order for yọ(x) to give the minimum. 
Conversely, the inequality Fy, > 0 and Jacobi’s 
condition (which is stated below) imply that 
y = yo(x) gives a weak minimum. Here “weak 
minimum” means the minimum when a family 
of admissible functions { yey, <E Y —yyl< 
e} is considered a neighborhood of y,. 
Jacobi’s condition: Let y be a solution of a 
linear ordinary differential equation of the 
second order, 


d du d 
ne Fy = Fy ay fw’ u=0, 
u(Xo) = 0; 


then the smallest zero of ų that is greater than 
Xo (i.e., the conjugate point of xX) is greater 
than the right endpoint x;. 

K. Weierstrass derived suflicient conditions 
for a strong minimum by extending the range 
of admissible functions to { y — yo <€}. Re- 
sults that were obtained until about that time 
constitute the content of what is usually called 
the classical theory of the calculus of variations. 

If for the variations problem (2) there exists 
a unique curve through every point in a do- 
main on the xy-plane that belongs to a one- 
parameter family of stationary Curves of the 
functional 


x) 
J[y]= | F(x, y, y)dx, 
Xo 


then the domain is called a field of stationary 
curves. Let the parameter value of the curve 
through a point (x, y) in such a family of 
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stationary curves y = @(x; a) be denoted by ġ 

= a(x, y. The slope p(x, y) = [e'(x; ®) Ja=acx,yy iS 
called the slope of the tield at the point (x, y) 
or, by regarding x, y as variables, the slope 
function of the field. The value of a curvilinear 
integral 


lc=1[y] 
= | (Fœ) -0 -PE ype 
C 


is then determined and depends only on the 
two endpoints of the curve C. We call I, 
Hilbert’s invariant integral. In view of the 
property mentioned above, we can denote the 
value of the functional J for a function y = y(x) 
representing a curve C by Je. Then any admis- 
sible curve C that passes through a tield em- 
bedding a stationary curve Co satisfies 


0<AJ=Jo—Jc,= | &(x, y; p, y')dx. 

Here i 

(x,y; P-Y) 

= F(x, y, y') ~ F(x, y, p)— (y —p)F,(x, y, p) 


is the &-function introduced by Weierstrass. 
For Cy to give the minimum of J[ y], it suffices 
that & > 0 hold for every point (x, y) in. the field 
and every value y’ {279 Morse Theory). 


D. Optimal Control 


Let a system of differential equations 
AX; /dt = fi(X1,-- s Xay Wis, Uy), 
(ui) u)JEQ; X(t) =x, i=1,..) n, (4) 


be given, where u,,.., u, are parameters. In 
general, a problem of optimal control is to 
determine u; = u,(t) (to St < t,) such that the 


value of a functional 


J= | Fes- rsita sn d 


assumes a minimum, where x(t) are the solu- 
tions of (4) and are considered functions in 
Ui,- - -> Up, and t. Such a problem is a kind of 
conditional variational problem. But since the 
existence region of ų is restricted, certain con- 
ditions in the form of inequalities are imposed, 
and furthermore y is not necessarily continu- 
ous, and in many cases the problem cannot 
be treated within the classical theory of the 
calculus of variations (+ 86 Control Theory). 


E. The Direct Method in the Calculus of 
Variations 


In mathematical physics tvariational principles 
are derived from discussions of formal corre- 
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spondence between a functional J[u] to be 
minimized and Euler’s equation for J [u]. This 
is certainly one of the important methods in 
the calculus of variations, but it is also possible 
to investigate a stationary function ug on the 
basis of its stationary character and indepen- 
dently of Euler’s equation. This is called the 
direct method in the calculus of variations. It 
plays an important role in the theoretical 
treatment of the existence and uniqueness of 
solutions, and it is also significant as a tech- 
nique for approximate or numerical solutions. 
When a differential equation is given inde- 
pendently of the calculus of variations, it is 
possible to apply the direct method if a func- 
tional whose Euler’s equation is the given 
differential equation can be constructed. 

Let D be a bounded domain in m- 
dimensional space and feL.,(D) be a real- 
valued function. Consider the variational 
problem of minimizing the functional 


s= leradužax—2 f fudx. 
D D 


Here we suppose the set of admissible func- 
tions, denoted by Aj, to be the Hilbert space 
obtained by completing the function space 
Cy (D) with respect to the norm 


1/2 
N(u) (| jeradutdx | ar) 
D D 


Utilizing F. Riesz’s representation theorem in 
Hilbert spaces, it can be shown that there 
exists a minimum value | in A, which is 
uniquely realized by certain ug € A;. Since the 
function uy belongs to A,, it can be shown that 
the boundary condition 


Ul ap =9 (5) 


is satisfied in a generalized sense. Furthermore, 
in view of J [uo] < J [uo + o] being valid for 
any peCọ (D), it can be verified that the 
equation 


—Au=f (6) 


is satistied in D in the sense of differentiation 
of tdistributions. In other words, the station- 
ary function ug is a solution in the wide sense 
(a tweak solution) of the classical boundary 
value problem for *Poisson’s equation for- 
mulated by (6) and (5). If a function space A, 
with A, > A, > C%(D) is taken as the set of 
admissible functions, the value |= J [ugo] be- 
cornes the greatest lower bound of J in 4 ye 

In this case, if {u,}”., is any minimizing se- 
quence from A,, that is, ifu,éA,,n=1,2,..; 
J[u,] >l (n- œ), then it converges to Ug in the 
sense that 


N(u,—Up) 70 (n> 00). (7) 


In other words, the solution in the wide sense 
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Ug Of the boundary value problem can be 
constructed as the limit of a minimizing se- 
quence that consists of sufficiently smooth 
functions vanishing on the boundary. In the 
proof of the fact that a solution in the wide 
sense uo coincides with the classical solution of 
the boundary value problem under an assump- 
tion of suitable smoothness for f and ôD, a 
standard argument has been established for 
proving the regularity of a solution in the 
wide sense. 

The technique of obtaining the solution of 
the boundary value problem as the limit of 
a minimizing sequence was proposed by B. 
Riemann concerning the classical *Dirichlet 
problem and was completed by D. Hilbert. 
This pioneering work led to the recent treat- 
ment of boundary value problems by utilizing 
Hilbert spaces. For +Self-adjoint boundary 
value problems, the method stated for the 
above example has been generalized almost 
directly to the cases of differential operators of 
higher order and with variable coefficients (- 
323 Partial Differential Equations of Elliptic 
Type). By making use of some auxiliary argu- 
ments, this technique can be applied exten- 
sively to the construction of several kinds of 
mapping functions in the theory of functions 
of a complex variable, to the solution of 
tintegral equations of the second kind, and 
also in other fields [3,4]. 

The eigenvalue problem, which is formu- 
lated by 


Hu=2u, u#0, (8) 


with a +Self-adjoint operator H in a Hilbert 
space, can also be transformed into a yari- 
ational problem for the Rayleigh quotient 


R[u] =(Hu, u)/||u\l? (9) 


(= 298 Numerical Computation of 
Eigenvalues). 


F. Solution of Differential Equations by the 
Direct Method 


In view of the convergence shown in (7), a 
minimizing sequence can be regarded as an 
approximating sequence for a solution of the 
boundary value problem or a stationary func- 
tion Uy. Let a function 


HSEEE Crest) (10) 


involving an n-vector c =(c,,...,¢,) asa para- 
meter be admissible for any c. If J[u,(-;¢)] = 
F(c), obtained by substituting u, into J, is 
minimized at c = c®, then u,{(-; c°) is considered 
as a function that approximates to most pre- 
cisely within the family y,(+;c). This vector c° 
is obtained, in general, by solving the simulta- 
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neous equations 
éJ[u,(3¢)]/6c;=0, j= l,...,n. (11) 


The function u, appearing in (10) is often taken 
to be a so-called linear admissible function. 
For instance, in the example above, let {o}: 
be a system of independent functions complete 
in A,, and set 


Up =C1 P1 toe. + Cn Qn. (12) 


The method that constructs a minimizing 
sequence uj, u,,. by determining the value of 
the ¢, in (12) by (11) is called Ritz’s method, 
and @, is called a coordinate function in this 
method. As for the rate of convergence in 
approximation by Ritz’s method, as well as 
for the estimation of errors, there are several 
results by the Soviet school in addition to 
those of E. Trefftz [3,5]. (Other methods of 
constructing minimizing sequences are stated 
in detail in [3]; concerning a connection with 
*Galerkin’s method — 304 Numerical Solution 
of Partial Differential Equations.) 

Ritz’s method applied to eigenvalue prob- 
lems is called the Rayleigh-Ritz method. Since 
a stationary value of the Rayleigh quotient is 
itself an eigenvalue in this case, the precision of 
approximation is far better for eigenvalues than 
for eigenfunctions, so that it is a convenient 
method for the approximate computation of 
eigenvalues. 
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Cantor, Georg 


George Cantor (March 3, 1845-January 6, 
1918), the founder of set theory, was born in 
St. Petersburg into a Jewish merchant family 
that settled in Germany in 1856. He studied 
mathematics, physics, and philosophy in Zu- 
rich and at the University of Berlin. After re- 
ceiving his degree in 1867 in Berlin, he became 
a lecturer at the University of Halle and served 
as a professor at that university from 1879 to 
1905. In 1884, under the strain of opposition 
to his ideas and his efforts to prove the fcon- 
tinuum hypothesis, he suffered the lirst of 
many attacks of depression which continued 
to hospitalize him from time to time until 

his death. 

The thesis he wrote for his degree concerned 
the theory of numbers; however, he arrived at 
set theory from his research concerning the 
uniqueness of trigonometric series. In 1874, he 
introduced for the lirst time the concept of 
+Cardinal numbers, with which he proved that 
there were “more” ttranscendental numbers 
than talgebraic numbers. This result caused 
a sensation in the mathematical world and 
became the subject of a great deal of con- 
troversy. Cantor was troubled by the oppo- 
sition of L. tKronecker, but he was supported 
by J. W. R. tDedekind and G. Mittag-Leffler. 
In his note on point-set theory, he wrote, in 
connection regard with his concept of infinity, 
“The essence of mathematics lies in its free- 
dom!” In addition to his work on cardinal 
numbers, he laid the basis for the concepts of 
torder types, ttransfinite ordinals, and the 
theory of real numbers by means of ffunda- 
mental sequences. He also studied general 
point sets in Euclidean space and defined the 
concepts of faccumulation point, ‘closed set, 
and topen set. He was a pioneer of the point- 
set theory that led to the development of 
general topology. 
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A. General Remarks 


The electric capacity of a conductor in the 3- 
dimensional Euclidean space R° is defined as 
the ratio of a given positive charge on the 
conductor to the value of the potential on the 
surface. This definition of capacity is indepen- 
dent of the given charge. The capacity of a set 
as a mathematical notion was defined lirst by 
N. Wiener (1924) and was developed by 0. 
Frostman, C. J. de La Vallée Poussin, and 
several other French mathematicians in con- 
nection with tpotential theory. 


B. Energy 


Let Q be a tlocally compact Hausdorff space 
and @(x, y) be a flower semicontinuous func- 
tion on Qx Q such that -œ <® < «0. A mea- 
sure jt will mean a nonnegative tRadon mea- 
sure with compact fsupport S,. Denote by 
(x, u) the tpotential { B(x, y)du(y) of a mea- 
sure u with kernel ® and by ( Ls, y) the tenergy 
f {Odudu of u. Let X be a set in Q, and denote 
by Uy the class of normalized measures u (ie., 
of measures 4 satisfying u(Q)= 1) with Sc X. 
Let K be a nonempty compact set in Q, Set 
W(K)=inf(p, u) for uE Yg, and W(@)= œ for 
the empty set @. For ®(x, y) = 1/|x—y|in Q 

= R?, the general solution u(x) of the *Dirich- 
let problem (texterior problem) for the bound- 
ary function | in the unbounded component of 
R? —K is equal to the potential of an tequilib- 
rium mass-distribution. Therefore, if S is a 
smooth surface surrounding K and normals 
are drawn outward to S, the integral -( 1/4 r): 
fs(ĉu/ôn)do of the normal derivative is equal 
to 1/W(K). This is the capacity of K delined 
by N. Wiener (J. Math. Phys., MIT, 3 (1924)) 
when K is a closed region. Vallée-Poussin 
(Ann. Inst. H. Poincaré, 2 (1932)) called the 
supremum of u(R?°) the Newtonian capacity of 
a bounded tBorel set E, where u runs through 
the class of measures p with S, c FE whose 
*Newtonian potentials are not greater than 1 
in R?. If E is compact, the Newtonian capacity 
coincides with Wiener’s capacity. For the 
tlogarithmic potential in R*, e “™ is called 
the logarithmic capacity. When *Green’s func- 
tion g(z, 00) with the pole at the point at infin- 
ity exists in the unbounded component of R? 
—K, lim,_,,,(g{z,0©)—log|z|) is called Rohin’s 
constant and can be shown to be equal to 

W (K). (For the relation between Robin’s con- 
stant and treduced extremal distance — 143 
Extremal Length.) In the case of a general 
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kernel it is difficult to define capacity as above 
by means of W(K), and hence the value of 
W(K) itself instead of the capacity of K is often 
used. When W(K)= co, we can say that K is of 
capacity zero. The minimum value of the 
+Gauss integral (u, 4)—2 f f duis a generaliza- 
tion of W(K), where we%, and f is an upper 
semicontinuous function bounded above on K. 


C. Minimax Value 


Suppose that we are given the kernel ® as 
above. For a set X © Q and a measure H, 

set U (u; X) = sup, .y P(x, u) and V(p; X)= 
inf, y P(x, u). Next, for Yc Q, set U(Y) = 

inf U(u;S,), V(Y) = sup V(y; 8,), U(X, Y) = 

inf U( u, X), and V(X, Y) = sup V( w, X), where 
ped. If (x, y) = @(y, x) is taken as a kernel 
instead of @(x, y), then the notations u/(K), 
U(u; X), V(u; X), are used correspondingly. 
For any compact set K the following relations 


hold: 
W(K)= W(K)< CI(K)= u(K) 


U(Q, K) =V(K,Q) 
<{0(Q, K)= V(K,Q) 


2 ee K)=V(K, a 
V(K)= V(K). 


U(K, K)=V(K, K) 


Examples show that all the inequalities can be 
strict. The tminimax theorem in the *theory of 
games plays an important role in the proof of 
these inequalities [7]. Even if the kernel is 
symmetric, the inequalities can be strict except 
for the equality W(K)= U(K). When the kernel 
is positive, we can define the quantities which 
correspond to U(Y), V(Y), U(X, Y), V(X, Y) 
by considering the class of u with S, c Y and 
(u, 4) = 1 instead of Wy. 


D. Transfinite Diameter 


Ask>o, 


D(K)y=kU(k-1)7! inf s Y O(x;, x;) 
Kyser XEK ZY 

decreases and the limit D (K) is equal to W (K). 
For the logarithmic kernel in R?, M. Fekete 
defined D (K) and called e72% the transfinite 
diameter of K (1923). F. Leja and his school in 
Poland studied relations between transfinite 
diameter and *conformal mapping. Next, set 


k 
kR,(X,Y)= sup inf}, O(x, x;). 
Kips tees xEY xeX i=l 
Then R(X, Y) = lim R,(X, Y) exists as k—> 00, 
and we have R(K, Y)= V(K, Y). 

Fekete introduced R(K)=R(K, K) in R? 
(1923). G. Polya and G. Szeg6 computed D (K) 
and R(K) for special K and cc-kernel r “(4 2 0) 
in R? and R*[12]. The equality D(K)=R(K) 
holds for the logarithmic kernel in R? and the 
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Newtonian kernel in R?. The maximum of the 
absolute value on K of tChebyshev’s poly- 
nomial (— 336 Polynomial Approximation) 
of order k with respect to K in R? is equal to 
exp( —kR,(K)). 


E. Evans’s Theorem 


In order that K be of Newtonian capacity 
zero, it is necessary and sufficient that there 
exist a measure ji on K such that the New- 
tonian potential of 4 is equal to % at every 
point of K. This result was proved by G. C. 
Evans and H. Selberg independently (1935) 
and is called Evans’s theorem (or the Eyans- 
Selberg theorem). The corresponding theorem 
in R? is often applied in the theory of functions 
[ 15,16]. A similar potential exists in case of a 
general kernel if and only if R(K, K)= œ. 


F. Nonadditivity of Capacity 


Many kinds of capacity satisfy the inequality 
cap( Un X,) < È, cap X,,, where a capacity is 
denoted by cap. Even the Newtonian capacity 
C is not necessarily additive, but it satisfies 
C(K,UK,)+ C(K, 1K,)<C(K,)+ C(K,) (G. 
Choquet [2]). Choquet [3] proved that X can 
be divided into mutually disjoint sets X, and 
X, such that C,(X) = C,(X,) = C,(X,), where 
C,(X) is the Newtonian inner (or interior) ca- 
pacity delined to be sup, c y C(K) if X # Ø 

and 0 if X = Ø. 


G. Relation to Hausdorff Measure 


There are many studies of relations between 
capacity and tHausdorff measure f 1]. Frost- 
man [6] introduced the notion of capacitary 
dimension and observed that it coincides with 
the Hausdorff dimension. The capacity of 
product sets has been evaluated from above 
and below [ 11]. For compact sets K c R”, K’ 
c R”, their dimensions a, p, and the dimension 
yof K x K’, we have the relation a + <y < 
min(m + a, n+), where the equalities are 
attained by general Cantor sets. There are 
also works on the evaluation of capacities 

of general Cantor sets [10,15]. If K is a con- 
tinuum of logarithmic capacity 1 in a plane, 
then its diameter d satisfies 2 <d < 4, and its 
area A satisfies A<n [S]. Consider the sum K 
={z,+...+2,|z,€K,,1<k<n} ofcontinua 
K,,---», ina plane. The logarithmic capacity 
of K is strictly greater than the sum of the 
logarithmic capacities of K,,, K, except 
when all K, are convex and mutually similar 
[14]. By various fsymmetrizations the log- 
arithmic capacity decreases in general (— 228 
Isoperimetric Problems; also [ 133). 
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H. Capacitability 


The Newtonian outer (or exterior) capacity 
C,(X) is defined by inf C,(G), where G ranges 
over an open set containing X. The inequality 
C,(X)< C,(X) holds, in general. When the 
equality holds, X is called capacitable. Cho- 
quet (1955) [2] proved that all tanalytic sets 
and hence Borel sets are capacitable but there 
exists an analytic set whose complement is not 
capacitable. He himself generalized his result 
on capacitability in the following way [4]: Let 
Q be an abstract space, ~ a nondecreasing 
function defined on the family of all subsets of 
Q, and # some family of subsets of Q that is 
closed under the formation of finite unions 
and countable intersections. Assume that 
0(H,)|@(H) as H,, in # decreases to H and 
that @(X,)to(X) as X,,X. When p(X) is equal 
to sup{p(H)| HEX, HcX}, X is called 

(ọ, #)-capacitable. Choquet detined 4- 
Suslin sets and showed that they are (p, #)- 
capacitable. M. Kishi [9], Choquet [5], and B. 
Fuglede [7] investigated capacitability with 
respect to several kinds of capacity more gen- 
eral than Newtonian capacity. We can discuss 
capacitability with respect to quantities de- 
fined in connection with the *Gauss varia- 
tional problem. 
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Cardinal Numbers 


A. Definition 


The general concept of cardinal number is an 
extension of that of natural number (Cantor 

a 1]). When there exists a tone-to-one corre- 
spondence whose tdomain is a set A and 
whose trange is a set B, this set B js said to be 
equipotent (or equipollent) to A, and this rela- 
tion is denoted by A-B. The relation ~ is 

an tequivalence relation, and each equivalence 
class under this relation is said to be a cardinal 
number. The class of all sets equipotent to a set 
A is denoted by A (or A|) and is said to be the 
cardinal number (power, cardinality, or potency) 
of tbe set A. When A is a finite set, A 1s said to 
be finite, and when A is an infinite set, A is 
said to be infinite (or transfinite). When the 
cardinal number of a set A is m, A is also said 
to consist of m members (or m elernents). In 
this sense, O and the natural numbers are 
considered to express finite cardinal numbers. 
For example, 0= @, 1 = {0}, 2 = {0,1}, etc. 
Examples of infinite cardinal numbers: A set A 
which is equipotent to the set N of all natural 
numbers is said to be countably infinite, and 
the cardinal number of the set N is denoted by 
a. A set A which is finite or countably infinite 
is said to be countable. The cardinal number of 
the set of all real numbers is denoted by c and 
is called the cardinal number of tbe continuum. 
Moreover, the cardinal number of the set of all 
real-valued fynctions whose domain is the 
interval [0, 1] is denoted by f. These three 
cardinal numbers are known to be distinct. 
Henceforth in this article, lower-case German 
letters denote cardinal numbers. For a defi- 
nition of cardinal numbers using the concept 
of ordinal numbers — 312 Ordinal Numbers. 
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B. Ordering of Cardinal Numbers 


m2>n or n<m will mean that there exist sets 
A and B such that m=A, n= B, and A >B. 
A#B does not necessarily imply m #n. For 
example, the cardinal number of the set of all 
positive even numbers is also a. m >n and 
n>m imply m =n (Bernshteïn’s theorem). 
Since the treflexive and ‘transitive laws for the 
relation < between cardinal numbers are 
obvious, the relation is an tordering relation. 
The ‘well-ordering theorem imples that > is a 
total ordering (comparability theorem for 
cardinal numbers). m >n means that m >n 
and m n. When B<m, B is said to be at 
most m. 


C. Sum, Product, and Power of Cardinal 
Numbers 


For cardinal numbers m and n, choose sets A 
and B so that m=A, n= B, and ANB= Ø, 
and put s= AU B. Then s is uniquely deter- 
mined by m and n. The s is said to be the sum 
of the cardinal numbers m and n and is de- 
noted by m +n. If the sets A, B are chosen as 
described above, the cardinal numbers of the 
‘Cartesian product A x B and of the set of 
functions A® are called the product of m and n 
and the nth power of m, denoted by mn and 
m", respectively. These operations are also 
determined by m and n. For these three oper- 
ations, the following laws are valid: commuta- 
tive laws m+n=n+m, mn = nm; associa- 

tive laws (m +n) + p =m + (n + p), (mn)p = 
m(np); distributive law p(m +n)= pm + pn; 
exponential laws m"*?=m"m?®, m™ =(m"}, 
(mn) =m? rn’. In particular, if A =m, then 2" 
is the cardinal number of the tpower set ‘B(A) 
of A. 

Addition and multiplication of more than 
two cardinal numbers can be defined as fol- 
lows. Let A be any set, and suppose that to 
any element 4 of A there corresponds a unique 
cardinal number m,. Let M, be a set such that 
M,=m,, and M,N M, =Ø for 4#/’. Then 
the cardinal number of the fdisjoint union 
1, M, is said to be the sum of all m, and is 
denoted by 2, m,. The cardinal number of the 
Cartesian product [], M, is said to be the 
product of all m,(A€A) and is denoted by 
II, m,. The axiom of choice can be stated as 
follows: If m; #0 for all AeA, then IT, m; 40. 


D. The Continuum Hypothesis 


For a, c, and f defined as before, f=2°>c=2°> 
a. In general, 2" >m holds for any cardinal 
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number m (Cantor). The hypothesis which 
asserts that for any m, there does not exist an 
n such that 2™ >n >m is called the gener- 
alized continuum hypothesis. In particular, this 
hypothesis restricted to the case where m =a 
is called the continuum hypothesis. After Can- 
tor stated this hypothesis (J. Reine Angew. 
Math., 84 (1878)), it remained an open ques- 
tion for many years. In particular, Cantor 
himself repeatedly tried to prove it, and W. 
Sierpiński pursued various related hypotheses. 
Finally, the continuum hypothesis and the 
generalized continuum hypothesis were proved 
to be independent of the axioms of set theory 
by K. Gödel (1940) [3] and P. J. Cohen (1963) 
[4] (— 33 Axiomatic Set Theory). 


E. Cardinality of Ordinal Numbers 


Lower-case Greek letters will stand here for 
tordinal numbers. The cardinal number of 
{€|E<a} will be denoted by z, which is called 
the cardinality of the ordinal number « or the 
cardinal number corresponding to x. When a 
cardinal number m corresponds to some 
ordinal number, the minimum among ordinal 
numbers x with x= m is called the initial or- 
dinal number corresponding to m. An initial 
ordinal number corresponding to an infinite 
cardinal number is called a transfinite initial 
ordinal number. There exists a unique corre- 
spondence fw, from the class of ordinal 
numbers onto the class of all transfinite initial 
ordinal numbers such that £ >y implies œg > 
w,. In particular, wg =, and an ordinal 
number č such that č <œ; is called a count- 
able ordinal number. œ, is called the fth 
transfinite initial ordinal number. The cardi- 
nality of ws is denoted by N; (X is the Hebrew 
letter aleph). In particular, a is denoted by No 
(aleph zero). We have NX; >X, if and only if 
B>y, and in this case Ng +N, =Nz, NN, = 
N,- The axiom of choice implies that every 
infinite cardinal number is an Xz. Hence, in 
this case, the continuum hypothesis can be 
formulated as 2° =N; , and the generalized 
continuum hypothesis can be formulated as 
25" =N p41 for every ordinal number $. 


F. Finiteness and Infiniteness 


Dedekind [5] defined a set A to be infinite if A 
is equipotent to a proper subset of itself, and 
to be finite otherwise. It is also possible to 
define finiteness and infiniteness of sets as 
follows: A set A is finite if there exists a *well- 
ordering of A such that its tdual ordering ts 
also a well-ordering, and A is infinite other- 
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wise. If a set is finite in the latter sense, then it 
is also finite in the sense of Dedekind. Under 
the axiom of choice, these two definitions can 
be shown to be equivalent. 
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Elie Cartan (April 9, 1869—May 6, 1951) was 
born at Dolomieu in the French province of 
Isére. He entered the Ecole Normale Supé- 
rieure in Paris in 1888 and graduated in 1891, 
having at the same time qualified in the agrége 
examination. Beginning his research immedi- 
ately, he completed his thesis on the structure 
of continuous transformation groups [2] in 
1894 at the age of 25. 

Cartan was a professor first at the Univer- 
sity of Montpellier, later at the University of 
Lyon, then the University of Nancy, and fi- 
nally in 1912 at the University of Paris. He 
freely used the tmoving coordinate system 
introduced by J. G. Darboux, and contri- 
buted to many areas, such as the theory of 
tLie groups, the theory of tPfaffian forms, the 
theory of tinvariant integrals, ttopology, tdif- 
ferential geometry (especially the geometry 
of tconnections), and theoretical physics. His 
doctoral thesis is still an object of interest 
among young researchers today, and the con- 
cept of connection introduced by him is funda- 
mental in the field of differential geometry. 
Henri Cartan (1904-) is his eldest son. 
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A. General Remarks 


Catastrophe theory was originally proposed 
by R. Thom [1, 2] in late 1960. This theory 
provides certain mathematical models for the 
evolution of forms in nature, in particular in 
biology and the natural languages. In recent 
years, E. C. Zeeman and others have devel- 
oped and applied the theory to various fields, 
including the physical sciences, medicine, 
economics, and sociology; a detailed bibli- 
ography may be found in [3,9, 10]. 


B. Static Models 


The basic concept of the theory is the static 
model. This is a family of potential functions 
f,:X >R, where X is a subset of R”, containing 
a neighborhood of the origin, and the para- 
meter u lies in a neighborhood U of the origin 
in R”. 

We regard R” as the internal space or state 
space, which is parametrized by the various 
variables that are relevant to the process under 
study, and R’ is the external space or control 
space, which can be either the physical space- 
time continuum in which the process under 
consideration takes place or a space of control 
parameters that govern the process. As a rule, 
we assume that the dimension r <4, although 
the dimension n can be arbitrary large. 

The static model is local in nature, and any 
one of the local minima of f,, called a local 
regime at ue U, is a candidate for the state 
of the model corresponding to the control 
point u. 

Mathematically, a static model is a germ of 
C®-functions f:R” x R’->R at 0 that is an 
unfolding (r-dimensional extension family) of a 
germ of C®-functions y = f |R" x {0}:R">R at 
0; further details are given in Section D. 
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C. Classification of Singularities 


Let &(n, m) be the vector space of tgerms of 
C”-functions f:R">R” at 0, and let B(n)c 
&(n, n) be the subset of invertible germs R"> 
R" mapping 0 to 0. 

The germs y, €€&(n, m) are called right 
equivalent if there exists an he &(n) such that 
noh=€. Let &(n)=4(n, 1). Then it follows in 
the usual manner that &(n) is a local algebra 
with the unique maximal ideal Æ =.@(n)= 
{ne &(n)|n(0)=0}. A germ ne 6(n) such that 
4(0) = Dn(0)=0 is called a singularity. For any 
singularity 7, we define the codimension of n by 


codim y = dima(.W@/<6n/EX;) sim) 


where <0n/0X;> iq 18 the ideal of &(n) gen- 
erated by 0n/0x,,...,6n/0x, and x =(x,,...,X,) 
denotes the coordinate system of R”. 

The following result was proved by J. N. 
Mather [4]: Up to the addition of a nondegen- 
erate quadratic form in other variables, and up 
to multiplication by +1, a singularity of codi- 
mension <4 and 21 is right equivalent to one 
of the y appearing in Thom’s list of the seven 
elementary catastrophes below. 


D. Unfoldings 


Let 4 be a singularity. An r-unfolding of y is a 
germ fe.@(n+r) such that f |R” x {0} =n; this 
unfolding is denoted by (r, f). Let (r, f) and 
(s,g) be unfoldings of 7. A morphism (o, ®, £): 
(r, f )—>(s, g) consists of (i) a germ pe S(n+r, 
n +s) such that ọ |R” x {0} =identity, (ii) a 
germ Me4(r,s) such that z,.0@~=Mo7z,, (ili) a 
germ ee M(r) such that f=gogm+eor,, where 
m: R” x R’>R’ is the projection. In this case 
we say that the unfolding (r, f) is induced by 
(p, ®, £) from (s, g). A morphism is an isomor- 
phism if ọ and © are diffeomorphic germs. 

The addition of unfoldings (r, f) and (s, g) of 
n ts defined by (r, f}+(s, g)=(r+s, f+g—n), 
where the last term on the right is given by 
(f+g—n)(x, u, v) = f(x, u) + g(x, v) — n(x). 

Thus if the constant unfolding (s, 4) of 7 is 
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defined by n(x, u)=n(x), we have (r, f)+(s,n)= 
(r+s, f). 

An unfolding (r, f) of 4 is said to be versal if 
any unfolding of 7 is induced by (r, f) and a 
suitable morphism. A versal unfolding (r, f) 
with minimal r is said to be universal. The 
following facts have also been proved by 
Mather [4]: 

A singularity 7€.@(n) has a versal unfolding 
if and only if codim y is finite. Any two r-versal 
unfoldings of n are isomorphic. Every versal 
unfolding is isomorphic to (r, f)+ constant, 
where r=codimy and (r, f) is a universal 
unfolding defined as follows: If {b,,...,b,} < 
Mn) is a system of representatives for a basis 
of .M(n)/<0n/0X;> ém, then the unfolding f of 
y is defined by f(x, u)=n(x)+5,(x)u,+...+ 
b,(x)u,. 


E. The Seven Elementary Catastrophes 


Let f and g be germs in &(n+r). We say that f 
and g are equivalent as r-unfoldings if there 
exist he A(r), a family of H € A(n), where ue 
UcR’, and an ce€.“(r) such that f(x, y)= 
g(H,(x), h(u)) + €(u). 

We say that a static model (r, f) is stable if 
any small perturbation (r, g) of (r, f) in é(n +r) 
(with the Whitney C”-topology) is equivalent 
to (r, f). The following is the main result ob- 
served by Thom and proved by Mather and 
others (for references see the bibliography in 
[3]). Suppose that r <4. Then the set of stable 
static models (r, f} is an open dense subset of 
&(n+r), and up to the addition of a nondegen- 
erate quadratic form and multiplication by 
+1, any stable static model (r, f) is equivalent 
to one of the models with the standard poten- 
tials F, which are the universal unfoldings 
of singularities y in Table 1 (x, y denote inter- 
nal variables, and u, v, w, t denote external 
variables). 

The static models with these standard po- 
tentials are known as the elementary catas- 
trophes and can serve as qualitative models 
of various natural processes. 


Table 1. Thom’s List of the Seven Elementary Catastrophes 


r Singularity n Standard Potential F Name 

1 x? x? + ux Fold 

2 x? x4 + ux? + ox Cusp (Riemann-Hugo-Niot) 
3 x x? + ux? + ox? + wx Swallowtail (Dovetail) 

3 x? +y? x? +y? + uxy + ox + wy Hyperbolic umbilic 

3 x? — xy? x? — xy? + u(x? +y?) + ox + wy Elliptic umbilic 

4 xÉ xÉ + tx4+ ux? + vx? + wx Butterfly 

4 x?y+y* x?yty4+ ux? + oy? +wx + yy Parabolic umbilic 
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F. Catastrophe Sets 


A process for the static model feé&(n+r) is a 
subset s of X x U, where X and U are neigh- 
borhoods of the origin in R” and R’, respec- 
tively. If s is a process and ue U, we define s, = 
sN(X x {u}). We say that ue U is a regular 
point for s if there exists a neighborhood V of 
uin U and a homeomorphism h: X x V-+X x 
V such that z,0h=2, on X x V and h(s N 

(X x V))=s, x V. A catastrophe point is a non- 
regular point in U. The set of all catastrophe 
points is called the catastrophe set. 

There are several conventions with regard to 
the definition of a process for the unfolding 
(r, f). One of these is the Maxwell convention, 
which requires that s, be a least local regime at 
ue U. The catastrophe set of the model (r, f) 
according to the Maxwell convention consists 
of those points u of the control space R” where 
the least minimum of f, is attained for at least 
two points or where this minimum is attained 
at a unique point but ts not stable. 

Another is the perfect delay convention, 
which assigns to each path t in U a mapping 
(possibly discontinuous) m,:t— X x t such that 
7,(m,(u)) =u, (m_(u), u) is a local regime, and m, 
remains continuous for a maximum interval 
on the path t. Consider the set 


A= {(x,u)eER" x R’|dz f(x, u) is degenerate} 


and its image B =m, (A), called the bifurcation 
set, under the projection z,:R” x R’>R’. Then 
the points of the bifurcation set B are impor- 
tant candidates for catastrophe points of the 
static model with respect to the perfect delay 
convention. 

For geometrical studies of the elementary 
catastrophes — [6-8]. The static models have 
been generalized to metabolic models, the 
structure of which largely remains an open 
question [2]. In this connection, the bibli- 
ographies of [3,9, 10] appear to be fairly 
complete. 
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A. Categories 


Consider the family of all tgroups. Given two 
groups X and Y, denote the set of all thomo- 
morphisms of X to Y by Hom(X, Y). If X, Y, 
and Z are groups and if f:X > Y and g: Y> 
Z are homomorphisms, we can compose 
them to get a homomorphism go f: X >Z. 

In general, suppose that we are given, as in 
this example, (1) a family IN of mathematical 
objects, and (2) for every pair (X, Y) of objects 
in W, a set Hom(X, Y) whose elements are 
called morphisms; and let fe Hom(X, Y) 
and ge Hom(Y, Z) determine a morphism 
go feHom(X, Z), which is called their com- 
posite. A morphism feHom(X, Y) is also 
written f: X > Y. Suppose further that these 
morphisms satisfy the following axioms: (1) if 
f: X—>Y,g:Y>Z,and h:Z— W are mor- 
phisms, then (hog)of=ho(go f); (2) for 
each object X EM there exists a morphism 
1,:X—X such that for any f:X >Y and 
g:Z—>X we have foly=f and l,og=g; 
(3) Hom(X, Y) and Hom(X’, Y’) are disjoint 
unless X = X’ and Y= Y’. Then we call the 
whole system (i.e., the family of objects Mt, the 
morphisms, and the composition of mor- 
phisms) a category. The elements in IN are 
called the objects of the category. 

By axioms (1) and (2), the set Hom(X, X) is 
a semigroup (with respect to the composition 
of morphisms) which has 1, as the identity 
element. Hence 1, is determined uniquely by 
X. On the other hand, axiom (3) implies that 
a morphism f determines the objects X and 
Y such that fe Hom(X, Y). From these facts 
we can give an alternative definition of cate- 
gory using only the morphisms and their 
composition. 

The totality of the objects (or morphisms) in 
a category @ is denoted by Ob(%) (or FI(@); 
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the notation Fl comes from the French word 
flèche). The relation xe Ob(@) is often ab- 
breviated to xe, while Hom(X, Y) is written 
Hom,(X, Y) if necessary. A subcategory of a 
category @ is a category 6” with Ob(@)c 
Ob(@), such that for X, Yeg’ we have 
Hom,,.(X, Y)c Hom, (X, Y) and the compo- 
sition in @’ is the restriction of @ to 6”. If 
Hom,,(X, Y)=Hom,(X, Y) for all X, Yeg”, 
we say that @’ is a full subcategory of @. 

We define the product category @, x 6, of 
two categories in the canonical way, using the 
pairs of objects and the pairs of morphisms. 


B. Examples of Categories 


(1) Taking all sets as the objects, all mappings 
as the morphisms, and the composition of 
mappings as the composition, we obtain a 
category called the category of sets, denoted by 
(Sets) (or (Ens) from the French ensemble). For 
the empty set @ we make the convention that 
Hom(@, Y) contains just one element for any 
Y and that Hom(Y, @) is empty if¥ + Ø. 

(2) As we have seen, taking all groups as the 
objects and the homomorphisms as the mor- 
phisms, we get the category of groups, written 
(Gr). If we limit the objects to *Abelian groups, 
we get the category of Abelian groups (Ab) as a 
full subcategory of (Gr). 

(3) Fix a ring R. The left R-modules and 
their R-linear mappings define the category of 
left R-modules, which we denote by pæ. The 
category of right R-modules, Æg, is defined 
similarly. When R is tunitary, we usually 
limit the objects of pæ and Mpg to funitary 
modules. If R is commutative we can identify 
rM with Mg. When R=Z (the ring of ra- 
tional integers), p% can be identified with 
(Ab). When R is a field, p% is also called the 
category of linear spaces over R. 

(4) Taking rings as objects and homomor- 
phisms of rings as morphisms, we obtain the 
category of rings. The subcategory consisting 
of unitary commutative rings and unitary 
homomorphisms is called the category of 
commutative rings and is denoted by (Rings). 

(5) If we take ‘differentiable manifolds as 
objects and differentiable mappings as mor- 
phisms, we obtain the category of differentiable 
manifolds. Similarly, for tanalytic manifolds 
and analytic mappings we obtain the category 
of analytic manifolds. 

(6) Taking topological spaces as objects and 
continuous mappings as morphisms, we get a 
category called the category of topological 
spaces and denoted by (Top). On the other 
hand, if we take the thomotopy classes of 
continuous mappings as morphisms and 
define their composition in the natural way, 
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we obtain another category, which is called the 
homotopy category of topological spaces. 

(7) Fix a tpreordered set 7. Taking the ele- 
ments of J as the objects and the pairs (x, y) of 
elements of J with x< y as the (unique) mor- 
phism from x to y, we get a category, in which 
we define the composite of the morphisms 
(x, y) and (y, z) to be (x, z). 

In examples (1) through (6), the totality of 
the objects Ob(@) is not a set, but a fclass 
(— 381 Sets G; for the logical foundation of 
category theory — [3,9]). 


C. Diagrams 


If a set of arrows {A,} and a set of points {Bp} 
are given in such a way that each arrow A, has 
a unique initial point and a unique endpoint, 
then we say that {A,, B,} is a diagram. (Usu- 
ally, we consider the case where each point B, 
is the initial point or the endpoint of at least 
one A, (Fig. 1).) Let @ be a category and 

{A,, Bs} a diagram. If we associate a mor- 
phism f, in @ with each arrow A, and an 
object Z,e@ with each point Bz, so that 
f,€Hom(Z,, Z,) whenever A, has the initial 
point B, and the endpoint B,, then we say that 
{ Ja Zg} is a diagram in the category @ (Fig. 2). 
Suppose, furthermore, that the following con- 
dition is satisfied: For any pair of points B, 
and B,, and for any sequence of adjacent 
arrows A,,,...,A,, Starting at By and ending 
at B, (i.e., the initial point of A, is Bẹ, the 
endpoint of A, is the initial point of A, _, and 
the endpoint of A, is B,), the composite 
Fam © Sem 1 O Ofa, E Hom(B,, B,)) depends 
only on B, and B,. Then the diagram in @ is 
said to be a commutative diagram. For exam- 
ple, commutativity of Fig. 2 is equivalent to 


Bofi=hoh=fs. 


B, A, B 
oe xX a X, 
Ås 1 
Sate NG 
X, — > X 
B, è ě Ř Ea 
Fig. 1 Fig. 2 


D. Miscellaneous Definitions 


A morphism f:X > Y in a category @ is called 
an isomorphism (or equivalence) if there exists a 
morphism g: Y>X such that fog=ly,gof= 
1,. In this case, g is determined uniquely by 

f and is itself an isomorphism. We call g the 
inverse morphism of f. Then the inverse of g 

is f. An isomorphism is sometimes written 
f:X 5Y. Two objects X and Y are said to be 
isomorphic if there is an isomorphism X >Y, 
and then we write X = Y. The composite of 
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isomorphisms is again an isomorphism. In 
particular, an isomorphism X >X is an inver- 
tible element of the semigroup Hom(X, X), 
and is called an automorphism of X. Examples 
of isomorphisms are bijections in (Sets), group 
isomorphisms in (Gr), the R-isomorphisms in 
gM, ring isomorphisms in the category of 
rings, tdiffeomorphisms in the category of 
differentiable manifolds, and homeomorphisms 
in (Top). 

A morphism f:X > Y is called a monomor- 
phism (or injection) if for any object Z and for 
any morphisms u, v:Z->X(u#v) we have 
fou#fov. Dually, f:X — Y is called an epi- 
morphism (or surjection) if for any u, v: Y> 
Z(u#v) we have uo f #vo f In the category 
of sets the monomorphisms and the epimor- 
phisms coincide, respectively, with the injec- 
tions and the surjections as mappings (— 381 
Sets). A monomorphism which ts at the same 
time an epimorphism is called a bijection. An 
isomorphism is always a bijection, but the 
converse is false in some categories. 

Two monomorphisms f,:X,—X and 
f,:X 7X (with the same X) are said to 
be equivalent if there exist g: X, >X, and 
g,:X,7X, such that fı =f,0g, and fa = 
fi Og> (Fig. 3). An equivalence class with 
respect to this equivalence relation is called a 
subobject of X. Similarly, we define a quotient 
object of X as an equivalence class of 
epimorphisms X > X;. 


Fig. 3 


An object e of a category @ is called a 
final object of @ if for every object Y of @, 
Hom(Y,e) contains one and only one element. 
Dually, an object e’ ts called an initial object 
(or cofinal object) if Hom(e’, Y} contains one 
and only one element for every Ye@. If e; and 
e, are final objects, then there is a unique iso- 
morphism e; Se, and similarly for initial 
objects. A set with only one element is the 
final object in (Sets), and a space with only 
one point is the final object in (Top). In the 
category (Gr) (resp. (Ab)), the trivial group {1} 
(or {0}) is the final object and the initial object 
at the same time. In the category of commuta- 
tive rings, the zero ring {0} is the final object 
and the ring of rational integers Z ts the initial 
object. 


E. Product and Coproduct 


Let X, and X, be objects of a category @. We 
say that a triple (P, p,, p,) consisting of an 
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object P and morphisms p;: PX; (i= 1, 2) is 
the product (or direct product) of X, and X, if 
for any pair of morphisms f;: X > .X; (i= 1, 2), 
there exists a unique morphism f:X—P with 
pio f = f; (i= 1,2) (Fig. 4). If (P', p4, p) is an- 
other product of X, and X,, then by virtue of 
this definition there is a unique morphism 
f:P->P such that p;o f =p; (i= 1,2), and f is 
an isomorphism. The product is unique in this 
sense. The product (or any one of the prod- 
ucts) of X, and X, is denoted by X, x X, or 
by X, H X. 


Fig. 4 


The product in the categories of sets, of 
groups, of rings, and of topological spaces 
coincides with the notion of direct product in 
the respective systems. In a general category, 
the product does not always exist. Suppose the 
product X x X exists for an object X; then 
there is a unique morphism Ay: X > X x X such 
that 1y=p,oAy=p,0A,, which is called the 
diagonal morphism of X. Let f: X > X; (i= 1, 2) 
be morphisms and assume that the products 
(X1 X X,P1,P2),(X4 x X2, P1, P2) exist. Then 
there is a unique morphism f:X, x X, > 
X; x X; satisfying p;o f = fio p; (i= 1,2). This 
f is denoted by fı x f2. On the other hand, if 
gi: X >X;(i=1,2) are given, the unique mor- 
phism g: X >X, x X, with p;og=g; (i= 1,2) is 
denoted by (g1, 92). We have (g;,92)=(g1 X 
gr)OAy, if X x X exists. 

The dual notion of product ts coproduct. 
We say that a triple (S,j,,j,.) of an object S and 
morphisms j;: X;—>8S (i= 1, 2) is the coproduct 
(or direct sum) of X, and X, if for any mor- 
phisms f;:X;—>X (i=1, 2) there exists a unique 
morphism f:S—>X with foj;=/f; (i= 1,2) 

(Fig. 5). The coproduct, like the product, is 
uniquely determined up to canonical iso- 
morphisms. It is denoted by X, + X, or by 

X, U X,. The coproduct in (Gr) is the tfree 
product. In (Ab), or more generally in g.%, the 
product of two objects can be identified with 
the coproduct (= direct sum) (— 277 Modules 
F). The coproduct in the category of com- 
mutative rings is the ttensor product over Z. 


X 
YEN 
LASN 


Xi h Je Xo 


Fig. 5 


Product and coproduct can also be defined 
for a family {X;}ier of objects. Namely, the 
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product of {X;};.; is an object P together with 
a family of morphisms p;: P> X; (i€ I) having 
the property that for any family of morphisms 
f.: X 3X; (ied), there exists a unique morphism 
f:X—>P such that p;o f = f; (ie /). The product 
is unique up to canonical isomorphisms, and 
similarly for the coproduct (— Sections F, L). 


F. Dual Category 


In the theory of categories we often encounter 
the dual treatment of notions and propo- 
sitions. To be precise, we may define the no- 
tion of the dual category @° of a category @ as 
follows: The objects of @° are those of @, i.e., 
Ob(@°) = Ob(@); for any objects X and Y we 
put Hom,.(X, Y)=Hom,(Y, X); if f: xX > Y 
and g: Y>Z in @° (e., f: Y>X and g:Z> 

Y in @), then the composite go f in @° is de- 
fined to be fog in @. It is clear that @° then 
satisfies the axioms of a category. Quite gener- 
ally, given a proposition concerning objects 
and morphisms we can construct another 
proposition by reversing the directions of the 
morphisms, and we call the latter the dual 
proposition of the former. The dual propo- 
sition of a proposition in @ coincides with a 
proposition in @°. For instance, a monomor- 
phism (epimorphism) in @ is an epimorphism 
(monomorphism) in @°, and the final (initial) 
object in @ is the initial (final) object in @°. 
The product (coproduct) in @ is the coproduct 
(product) in @°. Although the notion of the 
dual category is defined quite formally, it is 
useful in describing relations between specific 
categories. The dual category of (Ab), for 
instance, is equivalent to the category of 
commutative compact topological groups 
(tPontryagin’s duality theorem). 


G. Categories over an Object 


Fix a category @ and an object Se@. A pair 
(X, f) of an object X €@ and a morphism f:X 
—§ is called an object over S or an S-object, 
and f is called its structure morphism. We 
often omit f and simply say “an S-object X” if 
there is no danger of misunderstanding. If 
(X, f) and (Y,g) are S-objects, a morphism 
h:X >Y such that f=goh is called an S- 
morphism from (X, f) to (Y,g). The category 
whose objects are the S-objects and whose 
morphisms are the S-morphisms is called the 
category of S-objects in @, and is denoted by 
E/S. It has (S, 1s) as the final object. The prod- 
uct of two S-objects X and Y, taken in @/S, is 
called the fiber product of X and Y over S (in 
€), and is denoted by X xs Y or XIIs Y. The 
dual notion of the fiber product is called fiber 
sum (or amalgamated sum). Thus for two 
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morphisms f:S—X and g:S—Y, the fiber 
product of X and Y over S in @°/S is the fiber 
sum of X and Y (with respect to S); tt is de- 
noted by X J], Y. 

Let @ be the category of commutative rings 
and Keg. Then the family of K-objects in @° 
is precisely the family of commutative K- 
algebras. The fiber product A x, B in @°, i.e., 
AlIx B in @, is the tensor product A @x B of 
algebras. 


H. Functors 


Let @ and @’ be categories. A covariant functor 
F from @ to g'is a rule which associates (1) 
with each object X in @, an object F(X) in @’, 
and (2) with each morphism f:X >Y in @, 

a morphism F(f):F(X)>F(Y) such that 
F(gofJ=F(g)OF(S), F(1x)= 1px A con- 
travariant functor is defined dually, by mod- 
ifying this definition to F(f):F(Y)> F(X), 
F(gof)=F(f)o F(g). Thus a contravariant 
functor from @ to @’ is the same as a covariant 
functor from the dual category 6° to @ (or 
from @ to @’°). Functor is a general term for 
both covariant functors and contravariant 
functors, but some authors use the word ex- 
clusively in the sense of a covariant functor. 

A functor in several variables is defined to be 
a functor from the product category of the 
categories in which the variables take their 
values. 

A covariant functor F:@-' is said to be 
faithful (fully faithful) if for any X, Yeg, the 
mapping Hom(X, Y)->Hom(F(X), F(Y)) in- 
duced by F is injective (byective), and simi- 
larly for contravariant functors. A faithful 
covariant functor F:@-»@’ which maps dis- 
tinct objects of @ to distinct objects of @” is 
called an embedding, and in this case @ can 
be identified with a subcategory of @” by F. 

A fully faithful covariant functor F:@>@’ is 
called an equivalence (between the categories) if 
it satisfies the condition that for any object X’ 
of 6”, there exists an object X of € such that 
F(X)= X’. In this case we can consider the 

two categories essentially the same. A con- 
travariant functor from @ to @’ which defines 
an equivalence from ¢° to @’ is called an 
antiequivalence. 


I. Examples of Functors 


(1) Let € be the category of groups (or rings). 
For any X €@ let F(X) be the underlying set of 
X (ie., the set obtained from X by “forgetting” 
its structure as a group or ring), and for any 
homomorphism f put F(f)= f. Then we get 

a faithful covariant functor (often called the 
forgetful functor) F :@ — (Sets). 
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(2) Let @ be any category and fix an object 
X of @. Then we get a covariant functor hy:@ 
—(Sets) as follows: With each Ye@ we associ- 
ate the set Hom(X, Y), and with each mor- 
phism f: YY’ in @ the mapping f o(where 
fo:Hom(X, Y)>Hom(X, Y’) is defined by 
(fo)(g)=f og). Similarly we define a con- 
travariant functor h*:@ — (Sets) by h*(Y)= 
Hom(Y, X) and h*(f)=of. 

(3) Let p: A>B be a homomorphism of 
rings. With each left A-module M associate 
the tscalar extension p*(M)= B®, M, and 
with each A-homomorphism / associate the 
B-homomorphism p*(f)=1, © f. Then we get 
a covariant functor p*: ,@—>,.%. 

(4) Let R be a ring. With each left R-module 
M associate its dual module M* =Hom,(M, 
R), and to each R-linear mapping f associate 
its tdual mapping 'f =o f. Then we get a con- 
travariant functor pM —>Mpg, and similarly for 
Mp pM. 

(5) For each differentiable manifold X let 
F(X) denote the commutative ring of the dif- 
ferentiable functions on X, and for each dif- 
ferentiable mapping f: X > Y let F(f) be the 
ring homomorphism o f: F(Y)> F(X). Then F 
is a faithful contravariant functor. 

(6) Fix an Abelian group A. By associating 
with each topological space X the cohomology 
group H(X, A) and with each continuous map- 
ping f: X > Y the homomorphism H(Y, A)> 
H(X, A) induced by f, we obtain a contrava- 
riant functor from (Top) to (Ab). 

(7) Fix a topological space X, and let T(X) 
be the set of the open sets in X. Then T(X) is 
ordered by inclusion, so it is a category (— 
Section B, no. 7). The contravariant functors 
from T(X) to (Ab) are precisely the tpre- 
sheaves of Abelian groups over X. We can use 
any category instead of (Ab) to define a pre- 
sheaf over X (— 383 Sheaves). 


J. Natural Transformations 


Let @ and @’ be categories, and denote by 
Hom(@%, @’) the collection of all covariant 
functors ¢>@’. Let F, GeHom(@, @’). A nat- 
ural transformation (or functorial morphism) 
from F to G is a function which assigns to 
each object X of € a morphism (X): F(X)> 
G(X) in @’ such that for any morphism f:X > 
Y in @, the equation G(f)o g(X)=@(Y)oF(f) 
holds; in other words, the accompanying dia- 
gram is commutative: 


(xX) 
X F(X) > G(X) 


f F(f) G(f) 
yoy As { 
Y  F(Y)—> GY) 
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A natural transformation between contrava- 
riant functors is defined similarly. For in- 
stance, let A and B be Abelian groups and let 
H'(-, A) and H4(-, B) be the contravariant 
functors of tcohomology viewed as functors 
(Top)—(Sets). Then the natural transform- 
ations between them are the tcohomology 
operations. 

Let 9: F>G be a natural transformation, 
and suppose that o(X): F(X) G(X) is an 
isomorphism for every X €@. Then the inverse 
transformation G—>F of exists, and ¢ is 
called a natural equivalence (functorial isomor- 
phism or isomorphism) and is written g: FSG. 

Suppose that Ob(@) is a set. Then the collec- 
tion Hom(F, G) of all natural transformations 
F—>G is also a set, and hence we can consider 
Hom(@, @’) a category in which the objects are 
the covariant functors € >@’, the morphisms 
are the natural transformations, and the com- 
position of morphisms is the natural one. Then 
Hom(€°, @’) is the category of contravariant 
functors from @ to @’. In particular, the cate- 
gory Hom(@°, (Sets)) is sometimes denoted 
by @. 

Given a category @, a covariant (resp. con- 
travariant) functor F:@—(Sets), and an object 
X e@, we can define a canonical bijection 
®,:Hom(h,, F)3 F(X) (resp. Hom(h*, F)5 
F(X)) by ®y(@) = o(X)1,y. (The functors hy 
and h* were defined in Section I.) The inverse 
mapping of ®, assigns to če F(X) the natural 
transformation ọ:hy—>F defined by g(Y)u= 
F(u)é(Ye@). In particular, if we take F = 
hy(h*), we obtain a canonical bijection 
Hom(h,, hy) SHom(Y, X) (Hom(h*, h) 5 
Hom(X, Y)). It follows that there is a fully 
faithful contravariant (covariant) functor € > 
Hom(@, (Sets)) (€ +Hom(@°, (Sets)) = 4) 
which associates h,(h*) with X €@. 


K. Adjoint Functors 


Let F:€@>@' and F':@’ >@ be covariant 
functors. Suppose that there is a rule which 
assigns to each pair of objects Meg and 
M'e@' a bijective mapping Oy, m : Hom,(M, 
F’(M'))SHom,.(F(M), M’) such that for any 
pair of morphisms NM in € and M’>N’ 
in @’, the following diagram induced by the 
morphisms is commutative: 


Homeo(M, F’(M’)) = Home( F(M), M’) 


| | 


Hom,(N,F’(N’)) ““S Home(F(N), N’) 


Then we say that F is a left adjoint functor of 
F’ and that F' is a right adjoint functor of F. 
We can regard Hom,.(F(M), M’) as a functor 
from @ x @’ (contravariant in the variable 
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Me@, covariant in the variable M'€@’) to the 
category of sets, and similarly for Hom,(M, 
F’(M’)). This commutativity of the diagram 
means that these two functors are isomorphic 
(— Section J). 

For instance, let A and B be rings and La 
fixed B-A-tbimodule. Let F: 4M —>pM and 
F’: pM — 4M be the functors defined by 


F(M)=L®,M, F'(M’)=Hom,(L,M’), 


where the assignment for morphisms is defined 
in the natural way. Then F is the left adjoint of 
F' and F’ is the right adjoint of F. In parti- 
cular, let 9: A>B be a homomorphism and 
consider the case L= B. Then F is the functor 
p*: 4M —>pgM and F’ is the functor py: gM — 
aM, so that p* is the left adjoint of p, (and 

p, is the right adjoint of p*) (— 277 Modules, 
K, L; for more examples of adjoint functors — 


[11]). 


L. Representation of Functors 


We begin by discussing an example. Let T be 
a set, and consider the following problem: Is 
it possible to find a group X and a mapping 
€:T—X such that, for any group Y and for 
any mapping 4: T—> Y, there exists a unique 
homomorphism u: X >Y with uo č =n? The 
answer is yes; it is enough to take the tfree 
group X generated by T and the canonical 
injection č: TX (Fig. 6). On the other hand, 
let F(Y) be the set of all mappings T> Y, and 
for each group homomorphism f: Y> Y’ 
define the mapping F(f): F(Y)— F(Y’) by 
F(f)n=fon (ye F(Y)). Then we get a co- 
variant functor from the category @ of groups 
to the category of sets, F : € —(Sets). We can 
now reformulate the condition on X €@ and 
¢e F(X) as follows: 

For any Ye@ and for any ne F(Y), there 
exists a unique morphism u: X > Y such that 
F(u)o=n. 


T. 
| 9 
ae 


Fig. 6 


Proceeding to the general case, let 6 be an 
arbitrary category and let F:@—(Sets) be a 
functor. If there exist an object X of @ and an 
element é of F(X) satisfying the condition just 
stated (with the modification u: YX in the 
contravariant case), then we say that the pair 
(X, č) represents the functor F, or less specifi- 
cally, that X represents F, and we call č the 
canonical element of F(X). We also say that F 
is representable. The condition stated above is 
a formulation of the so-called universal map- 
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ping property. If (X’, ¢’) also represents F, the 
unique morphism u: X > X’ (or X'>X) with 
F(u)€=€' is necessarily an isomorphism. 

When (X, é) represents F, the natural trans- 
formation g:hy—>F (h*-—F in the contrava- 
riant case) which corresponds to č by the 
canonical bijection Dy: Hom(hy, F) 3 F(X) ts 
an isomorphism. Conversely, if there is a func- 
torial isomorphism :hy—>F (or h* > F) for 
some X €@, then the object X represents F, 
with the canonical element of F(X) the element 
which corresponds to @ by the canonical bijec- 
tion Py, i.e., €=p(X) 1. 

We have already seen the example of a free 
group; here we list a few more examples. (1) 
Let {X;};-, be a family of objects in a category 
@. For each Yeg we put F(Y)=T],;-, Hom(Y, 
X,), and for each morphism f: Y— Y’ we define 
the mapping F(f):F(Y')>F(Y) by FIA) = 
(fio f). Then we get a contravariant functor 
F:@—-(Sets). A pair (X, č) which represents 
F (where če F(X)=II,.,; Hom(X, X;)) is the 
product of {X;}. Thus representability of F is 
equivalent to the existence of the product of 
{X;}, and similarly for the coproduct. 

(2) Let R be a ring, M a right R-module, and 
N a left R-module. For each Abelian group Y 
let F(Y) denote the set of the R-balanced map- 
pings M x N >Y (— 277 Modules J). Since a 
homomorphism f: Y> Y’ induces a natural 
mapping F(f):F(Y)> F(Y’) by composition, 
we obtain a covariant functor F:(Ab)—(Sets). 
This functor is representable: the pair consist- 
ing of the tensor product M @,; N and the 
canonical mapping M x N>M @,N repre- 
sent it. 

(3) Let R be a commutative ring and Sa 
subset of R. For each commutative ring Y, let 
F(Y) denote the set of homomorphisms R >Y 
that map the elements of S to invertible ele- 
ments of Y. As in the preceding example, we 
obtain a covariant functor F :(Rings)—(Sets). 
This functor is represented by the tring of 
quotients S-*R and the canonical homomor- 
phism R>S7'R. 


M. Groups in a Category 


Let @ be a category with a final object e, and 
assume that a finite product always exists in @. 
If an object Geg and morphisms «:G x G> 

G, B: GG, ¢:e-G are given such that the 

the diagrams of Fig. 7 are commutative, then 
(G, a, B, £) is called a group in @ (group object in 
€ or 6-group). 

If @ is the category of sets, then « defines a 
law of composition in the set G, and the image 
of e by e is the identity element and f(x) is the 
inverse of x, so that G is an ordinary group. If 
@ is the category of topological spaces (analy- 
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tic manifolds, algebraic varieties, ‘schemes) 
then G is a ttopological group (tLie group, 
talgebraic group, tgroup scheme). 

We can also define the @-group by lifting 
the group concept in (Sets) to the category @ 
by means of the functor h*. Namely, let G be 
an object of @, and suppose that for each Ye@ 
the set h°(Y)=Hom(Y, G) is equipped with a 
group structure and that for each morphism 
f: YY’ the induced mapping h°(Y')>h°(Y) 
is a group homomorphism. In other words, 
suppose that h° is a contravariant functor 
from @ to the category of groups. Then the 
object G with the additional structure on h® is 
called a @-group. This definition is equivalent 
to the one given above. 


GXGXG+G6xG 
igXa a 
GXG 7 G 
lexe (15,2) 
GX eG XG G—GXG 
oe a | |. 
G e a G 


N. Additive Categories 


A category @ is called an additive category if 
for each pair X, Ye@, the set of morphisms 
Hom(X, Y) has the structure of an additive 
group such that (1) the composition of mor- 
phisms is distributive in both ways: ho(f +9) 
=hofthog,(ft+tg)oh=foh+goh; (2) 
there exists an object 0’ with Hom(0’,0’) = {0}; 
(3) the product (or the coproduct) of any two 
objects exists. Then the object 0’ in (2) is a final 
and initial object, and is called the zero object. 
Both the product and the coproduct of any 
two objects exist and can be identified. The 
dual category of an additive category is also 
an additive category. A functor F from an 
additive category to another is called an addi- 
tive functor if F( f+g)=F(f)+F(g) holds 

for morphisms. In an additive category @, 
Hom(X, Y) is an additive functor from @ to 
(Ab) in each variable. 

For any ring R, the category of left (or right) 
R-modules is an additive category. The follow- 
ing definitions are generalizations of the corre- 
sponding concepts in the theory of modules. 
The kernel of a morphism f:4—>B is a pair 
consisting of an object A’ and a monomor- 
phism i: A’>A with foi=0, such that any 
morphism u: X >A with fou=0 is divisible 
by i (that is, u=iov for some v: X >A’). Dual- 
ly, the cokernel of f is a pair consisting of 
an object B’ and an epimorphism j: B> B’ 
with jo f =0 which divides any morphism 
u:B—X with uo f=0. We write A' = Ker f, B’ 
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= Coker f. The kernel of j: B> Coker f is 
called the image of f and is denoted by Im f; 
the cokernel of i: Ker f> A is called the 
coimage of f and is denoted by Coim f. If all 
these exist, it follows from the definitions that 
there is a unique morphism Coim f>Im f 
such that the composite of A+Coim fIm f 
— B is equala to f. 

An additive category @ is called an Abelian 
category if it satisfies the following conditions: 
(1) every morphism has a kernel and a coker- 
nel, (2) for every morphism f, the morphism 
Coim f>Im f just mentioned is an isomor- 
phism. The dual category of an Abelian cate- 
gory is also Abelian. The categories of Abel- 
ian groups, of R-modules, and of sheaves 
of @-modules on a tringed space (X, @) are 
important examples of Abelian categories. 
Many propositions which are valaid in (Ab) 
remain valid in any Abelian category. In parti- 
cular, the notion of an texact sequence is de- 
fined in an Abelian category in the same way 
as in (Ab), and the fiber product and fiber sum 
of a finite number of objects always exist in an 
Abelian category. A functor between Abelian 
categories which carries exact sequences into 
exact sequences is called an exact functor; 
(such a functor is automatically additive). If € 
is a category of which Ob(@) is a set and if 6” 
is an Abelian category, then Hom(@, @’) is an 
Abelian category. Given an Abelian category 
€ and a subcategory ©’ which satisfies certain 
conditions, one can construct an Abelian 
category @/@' which is called the quotient 
category (Serre’s theory of classes of Abelian 
groups; — [8]). 

If @ is an Abelian category of which Ob(@) 
is a set, there is an embedding of € into the 
category p.-@ of modules over some ring R by 
a fully faithful flat exact covariant functor (full 
embedding theorem, B. Mitchell, Amer. J. 
Math., 86 (1964)). This remarkable theorem 
enables us to extend results obtained for 
modules to the case of Abelian categories. 

The notions of category and functor were 
introduced in [7] and were applied first in 
topology and then in homological algebra and 
algebraic geometry (— 200 Homological 
Algebra). 
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Cauchy, Augustin Louis 


The French mathematician Augustin Louis 
Cauchy (August 21, 1789-May 25, 1857) 
graduated from the Ecole Polytechnique in 
1807 and from the Ecole des Ponts et Chaus- 
sees in 1810, to become a civil engineer. In 
1816, his mathematical works were recognized, 
and he was appointed a member of the Aca- 
démie des Sciences while a professor at the 
Ecole Polytechnique. After the July revolution 
in 1830, he refused to pledge loyalty to Louis- 
Philippe and fled to Turin; he later moved to 
Prague. He returned to France after the revo- 
lution of 1848 and became a professor at the 
University of Paris, where he remained until 
his death. He was a Catholic and a Royalist all 
his life. 

His scientific contributions were numer- 
ous and covered many fields. In algebra, he 
did pioneering work in tdeterminants and in 
the theory of ‘groups. He also made notable 
achievements in theoretical physics, optics, 
and the theory of elasticity. His main field 
was analysis. He was interested in making 
analysis rigorous by giving calculus a solid 
foundation in such works as Cours d'analyse 
de l'Ecole Polytechnique (1821). In his paper 
“Memoire sur les intégrales définies prises 
entre les limites imaginaires” (1825), he proved 
the main theorem of the theory of functions of 
a complex variable. Another important work 
is his proof of the existence theorem for the 
solutions of tdifferential equations in the cases 
of real and complex variables 
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Let Q be a fquaternion algebra over a field K 
of characteristic zero. A general Cayley algebra 
@ 1s a 2-dimensional Q-tmodule Q + Qe with 
the multiplication (q + re)(s+te)=(qs+ytr)+ 
(tq+rs)e, where q, r, s, t€Q, y is a given ele- 
ment in K, and t, s are the tconjugate quater- 
nions of t, s, respectively. The elements of @ 
are called Cayley numbers; @ is a nonassocia- 
tive, talternative algebra of dimension 8 over 
K (— 231 Jordan Algebras). The map a= 
q+re=>aā =q —re is an tantiautomorphism 

of €. Define two maps @>K by N(a)=aa=4Ga 
(norm of a) and T(a)=a-+d (trace of a). Then 
every a in @ satisfies the equation x? — T(a)x + 
N(a)=0. Furthermore, N(ab)= N (a)N (b) for 

a, bin @. The tquadratic form N (x)= T(xx)/2 
characterizes @. In particular, any two (non- 
associative) general Cayley algebras over the 
same field K which are not falternative fields 
are isomorphic. 

In order for @ to be an alternative field, 
either of the following two conditions is neces- 
sary and sufficient: (i) N(a)=0 implies a=0; 

(ii) Q is a noncommutative division algebra 
and y cannot be expressed in the form g? — 
ÀE? — py? + Al? (0, €,n, Ce K). (For the mean- 
ing of A, u with respect to Q — 29 Associative 
Algebras D.) Every alternative field of finite 
dimension is a general Cayley algebra. 

In particular, when Q is the fquaternion 
field over the real number field with 2 = u = 
—1, the general Cayley algebra over Q with 
y= —1 is called the Cayley algebra. When K 
is an talgebraic number field of finite degree, 
there are only a finite number of nonisomor- 
phic general Cayley algebras over K. 

The Lie algebra D(@) of all tderivations of a 
general Cayley algebra @ is a tsimple Lie 
algebra of type G,. If K is the real number 
field, the identity component of the group of 
all tautomorphisms of the Cayley algebra @ is 
a compact simply connected tsimple Lie group 
of type G,. The Cayley algebra @ is the unique 
alternative field over the real number field K. 
This last fact is important because of the fol- 
lowing proposition: In the theory of tnon- 
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Desarguesian projective planes, the field which 
gives rise to the coordinates is an alternative 
field. Let QP, be the set of all 3 x 3 fHermitian 
matrices A over the Cayley algebra @ such 
that tr A =1, A? = A. Then we can define a 
structure of a projective plane on &P,, which 
with this structure is called the Cayley projec- 
tive plane. Furthermore, let 3 be the set of all 
3 x 3 Hermitian matrices over @, with a multi- 
plication in 3 defined by A: B=(1/2)(BA+ 
AB). The identity component G of the group 
of all automorphisms of 3 is a compact simply 
connected simple Lie group of type F,. This 
group G acts on LP, transitively, and &P, = 
G/Spin(9) (— 249 Lie Groups; Appendix A, 
Table 5.IIT). 
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A. General Remarks 


The motions of planets, comets, the moon, and 
satellites in our solar system are the main 
topics in celestial mechanics (— 392 Spherical 
Astronomy). However, studies in this subject 
can also include motions of fixed and binary 
stars in our galaxy, equilibrium figures of 
celestial bodies, and rotational motions of the 
earth and the moon. 

Celestial mechanics is usually based on 
*Newtonian mechanics, with the effects of 
tgeneral relativity sometimes taken into ac- 
count to determine corrections to the orbits of 
celestial bodies. Therefore the main task of 
celestial mechanics is to solve differential 
equations of motion based on Newtonian 
mechanics. However, since the equations for 
the problem of n> 2 bodies cannot be solved 
rigorously (— 420 Three-Body Problem), 
appropriate methods have been devised where- 
by we may obtain approximate solutions of 
the equations with accuracy comparable to 
that of observations. 

The two-body problem, which concerns the 
behavior of two celestial bodies regarded as 
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points exerting mutual interactions, can be 
reduced to a one-body problem with reference 
to a central force, since integrals of motion of 
the center of gravity for the system exist. The 
*Hamilton-Jacobi equation for the one-body 
problem is of tseparable type and can be 
solved completely. The orbit for the two-body 
problem is a tconic with one of its ffoci at the 
center of gravity. The majority of celestial 
bodies in the solar system actually perform 
elliptic motions. *Kepler’s orbital elements for 
elliptic motion are functions of the integration 
constants in the solution of the Hamilton- 
Jacobi equation and are determined by the 
initial conditions. 


B. Perturbations 


In studying the tn-body problem, we first solve 
certain two-body problems and then apply the 
method of *perturbations, i.e., the method of 
variation of constants, in order to obtain solu- 
tions developed as tpower series of small 
parameters. The parameters are ratios of the 
masses of planets to that of the sun for plane- 
tary motions and the ratio of the geocentric 
lunar distance to the solar distance for lunar 
motion. 

Electronic computers have made it possible 
to compute planetary coordinates for long 
intervals of time by solving numerically dif- 
ferential equations of motion including all 
possible interactions. However, in discuss- 
ing the stability of the solar system, analytic 
methods are more effective, particularly the 
method of obtaining secular perturbations by 
eliminating short-periodic terms by canonical 
transformations. This is one of the averaging 
methods of solving differential equations. 
However, as the solutions obtained by means 
of perturbation methods are not always con- 
vergent, most important problems related 
to the stability of motion have not yet been 
solved rigorously. Secular perturbations for 
planetary motions can be derived by solving 
differential equations that are linearized by 
neglecting cubic powers of orbital eccentric- 
ities and inclinations to the ecliptic (— 309 
Orbit Determination), which are small quan- 
tities. The teigenvalues of these linear differen- 
tial equations correspond to mean angular 
velocities of the perihelion and the ascending 
node. The equations for the eigenvalues are 
called secular equations. 


C. Artificial Satellites 


To discuss motions of artificial satellites close 
to the earth, the latter cannot be regarded as a 
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point or as a sphere but must be assumed to 
be an oblate spheroid, i.e., an tellipsoid of 
revolution. The effects of oblateness on the 
motion of satellites can be derived as pertur- 
bations of the theoretical elliptic motions ob- 
tained as the solutions of this two-body prob- 
lem under the assumption that the earth is 
spherical. Also, by utilizing a special potential 
very close to the geopotential, we can find a 
Hamilton-Jacobi equation of separable type 
which is solvable. This special potential ap- 
pears in the problem of two fixed centers with 
equal masses situated on an imaginary axis. 
When the geopotential is assumed to be axi- 
ally symmetric, the equations of motion for the 
satellites have two degrees of freedom; there- 
fore there appear two fundamental frequencies 
related to the special potential. When these 
two frequencies are equal, the problem is 
called a critical inclination problem and is 
important from the mathematical point of 
view. Theories for satellites can be applied to 
the motions of fixed stars in the galaxy. 


D. Equilibrium Figures 


There is a large literature concerning equilib- 
rium figures and the stabilities of celestial 
bodies assumed to consist of spinning fluids. 
The two-body problem with tidal interactions 
is particularly important; problems concerning 
the evolution of the earth-moon system are 
special cases of such a problem. 

The theory of rotation of the earth as it is 
affected by tprecession, ‘nutation, and latitude 
variations is also a part of celestial mechanics; 
for this theory, elastic theory and geophysics 
are applied. 

For the n-body problem — 420 Three-Body 
Problem. 
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A. General Remarks 


The theory of characteristic classes arose from 
the problem of whether or not there exists a 
ttangent r-frame field on a tdifferentiable 
manifold (E. Stiefel [5]) (— 105 Differentiable 
Manifolds). The importance of the character- 
istic class as a fundamental invariant of tvec- 
tor bundle structure is now fully recognized 
(— 114 Differential Topology, 147 Fiber 
Bundles, 427 Topology of Lie Groups and 
Homogeneous Spaces). 


B. Stiefel-Whitney Classes 


Let € =(E, B, R") be an n-dimensional real 
tvector bundle (called an R"-bundle) with 
tparacompact Hausdorff tbase space B, fiber 
R”, and forthogonal group O(n) as the fstruc- 
ture group. Then the element w;(č) of the i- 
dimensional tcohomology group H'(B; Z,) 
(i=1,2,...,n) of the base space B with coeffi- 
cient in Z, = Z/2Z, called the i-dimensional (or 
ith) Stiefel-Whitney class, and the element w(¢) 
=1+w,(&)+...+w,(€) of the tcohomology 
ring H*(B; Z.), called the total Stiefel-Whitney 
class, are defined as follows. First, we deal with 
the case n= 1. We call the infinite-dimensional 
real projective space the tinductive limit P*(R) 
= lim P”(R) of the finite-dimensional real tpro- 
jective space P”(R). The nontrivial tline bundle 
y, over the infinite-dimensional real projective 
space P”(R) is a universal R'-bundle (tuniver- 
sal bundle for the orthogonal group O(1)); 
therefore any R'-bundle é =(E, B, R') is equiv- 
alent to an tinduced bundle from the universal 
bundle y, by a classifying mapping f: B> 
P”(R):č= f*y,. We define the 1-dimensional 
universal Stiefel-Whitney class w,(y,) to be 

the generator of H'(P”(R); Z,) and set w,()= 
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fw,(y,). For general n, we consider the prin- 
cipal O(n)-bundle (P, B, O(n)) tassociated with 
the given R"-bundle č =(E, B, R”). Let Q, be 
the subgroup of O(n) consisting of all diag- 
onal matrices. Then the orbit space Y= P/Q, 
is the base space of the tprincipal Q,-bundle 
n=(P, P/Q,,Q,). Let p: Y>B=P/O(n) be the 
natural projection. Then the R"-bundle p*é 
over Y induced by p ts associated with y and 
is equivalent to the tWhitney sum of n line 
bundles (F. Hirzebruch [4]), p*¥é=é,0...0 
č. Moreover, the homomorphism p*: H*(B; 
Z,)~> H*(Y, Z,) is injective (A. Borel [1]). 
Therefore we can uniquely define the total 
Stiefel-Whitney class w(é) of the R’-bundle by 
the relation p* w(¢)=w(€,)... w(G,). The Stiefel- 
Whitney classes defined above are compatible 
with bundle mappings f:w(f*é)= f*w(é) (— 
147 Fiber Bundles M). The Stiefel-Whitney 
class w,(y,) € H‘(Bom; Z2) (1 <i<n) of the 
universal R"-bundle y, over the ‘classifying 
space Bom is called the i-dimensional univer- 
sal Stiefel-Whitney class. For the Whitney sum 
of vector bundles, we have w(é ® yn) = w()w(). 
In order for an R"-bundle é to be forient- 
able, namely, for č to have an SO(n)-structure, 
it is necessary and sufficient that w,(€)= 
0. For an oriented R”-bundle č, the Euler- 
Poincaré class X,,(¢) is defined to be the tpri- 
mary obstruction X,(¢)¢H"(B; Z) for con- 
structing a tcross section of the tassociated 
(n—1)-sphere bundle. In particular, the Euler- 
Poincaré class of the universal bundle for 
SO(n) is called the universal Euler-Poincaré 
class. X,,(€) mod 2 is equal to w,,(€). If n is odd, 
we have 2X,(€)=0. 


C. Chern Classes 


We consider an n-dimensional complex vector 
bundle w=(E, B, C”) (called C"-bundle in the 
following) with a paracompact Hausdorff base 
space B, fiber C”, and unitary group U (n) as 
the structure group. The cohomology class 
c(w)e H?"(B; Z) (i= 1,2,...,n), called the 2i- 
dimensional (or ith) Chern class, and the total 
Chern class c(w)=1+c¢,(@)+...+¢,(@)e 
H*(B; Z) are defined as follows. In the case 
n=1, let C” =lim C” (the inductive limit of 
the *complex Euclidean spaces C”), S” be 

its unit sphere, and P”(C) be the infinite- 
dimensional complex projective space consist- 
ing of all complex lines through the orgin O of 
C”. Then the natural mapping S” > P(C) 
defines a universal principal U(1)-bundle 

(S°, P”(C), U(1)). Let y, be its associated 
universal C'-bundle. Then we define the 1- 
dimensional universal Chern class c,(7,)é 

H? (P(C); Z) to be the cohomology class that 
takes the value —1 on the cycle $?(2P!(C)c 
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P”(C)) with the natural orientation. Since a 
general C'-bundle €=(E, B, C!) is induced 
from y; by a classifying mapping f.: B> 
P”(C), we set c,(€)= f-*c,(y,). When n> 1, let 
(P, B, U(n)) be the principal U(n)-bundle asso- 
ciated with the given C"-bundle é =(E, B,C’). 
Let T, be the subgroup of U (n) consisting of 
all diagonal matrices (which is a tmaximal 
torus of U(n)). Then the quotient space Y= 
P/T, is the base space of the principal T,- 
bundle y =(P, P/T,, T,). Let p: Y>B=P/U(n) 
be the natural projection. Then the C"-bundle 
p*€ over Y is associated with y and is equiva- 
lent to the Whitney sum of n complex line 
bundles: p*Fé=€,@...H &,. Moreover, 
p*:H*(B; Z)— H*(Y; Z) is a monomorphism 
(Borel [1], Hirzebruch [4]). Therefore we can 
uniquely define the total Chern class c(€) of the 
C"-bundle č by the relation p*c(é)=c(é,)... 
c(€,). The Chern classes, as defined above, 
are compatible with bundle mappings f (i.e., 
c(f*o)=f*c(é)) (— 147 Fiber Bundles N). 
The Chern class c,(y,)€ H*!(Bygy; Z) (1 <i<n) 
of the universal C"-bundle y, over Bym is 
called the 2i-dimensional universal Chern class. 
Let č, y be complex vector bundles over B. 
Then we have c(é ® y) =c(€)c(y). By the nat- 
ural inclusion U(n)< SO(2n), we can identify 
a C"-bundle œ with an oriented R*”-bundle 
Wz. Then we have c,()mod 2 = w3;(@p), 

Wirt (Op) =0 (i=0, 1, ..., n), C(O) = Xano). 


Examples. Let (S*"*!, P"(C), U(1)) be the tHopf 
bundle, and let yf be the associated complex 
line bundle. Then the classifying mapping of y? 
is the natural inclusion P"(C)— BU(1)= P“(C), 
and c(y7)=1—g,, where g,¢ H*(P"(C); Z) is the 
cohomology class dual to the homology class 
represented by the hyperplane P”'(C). On the 
other hand, for the complex line bundle €7 = 
{P"~1(C)} determined by the tdivisor (— 72 
Complex Manifolds) P”-1(C)< P"(C), we have 
e(€{)=1+4+g,, and €7, y7 are dual to each other. 
¢} is called the canonical line bundle over the 
complex projective space. Moreover, the Whit- 
ney sum t ® eg, of the tangent bundle 7(P"(C)) 
of the n-dimensional complex projective space 
P"(C) and the trivial C’-bundle £; is equivalent 
to the Whitney sum of (n+ 1) copies of &7. 
Therefore we have c(t(P"(C)))=c(é7 0 7 @ 

n DEDS (L Hg. 


D. Pontryagin Classes 


Utilizing the inclusion mapping O(n) < U (n) of 
structure groups, we can associate a C"-bundle 
Eo=€@,/—1 with an R"-bundle é. We 
define the 4i-dimensional (or ith) Pontryagin 
class of the R"-bundle é by p,(€)=(—1)'c,(é¢) 
(e H*'(B; Z)) (i= 1,2, ..., [n/2]) (Hirzebruch 
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[4]). In particular, the Pontryagin classes of 
the universal bundle for O(n) are called the 
universal Pontryagin classes. We also define 
the total Pontryagin classes p(é)= 1+ p,(é)+ 
<- + Pini2(¢). We have 2c);4;(€¢) =0. For R"- 
bundles č and y, p(č ® n)=p(é)p(y) modulo 2- 
torsion elements. We have p,(é) (mod 2)= 
(w2:(6))? and p,(2)=(X2,(€))? for the oriented 
R?"-bundle €. Moreover, for a complex vector 
bundle w, we have (Wu [8]) 


( —1)*p,(@p) = p> ( —1)'c(@)co4-(o), 


where we put c)(w)=1. 
All such classes as defined in Sections B-D 
are called characteristic classes. 


E. Other Definitions of Characteristic Classes 


Axiomatic Definition. (1) For a C’-bundle é 
over a paracompact Hausdorff base space B, 
Chern classes c;(č)e H*'(B; Z) (i> 0) are de- 
fined, and we have co(&)= 1, c,(€)=0 (i>n). (2) 
For the total Chern class c(6)=D2.9¢,(2), we 
have c( f*¢)= f*c(€) for each bundle mapping 
f. (3) For the Whitney sum, we have c(é @ 4) = 
c(€)- e(n). (4) Normalization condition: For 
the canonical line bundle €7, we have c(é")= 1 
+g, (— Section C). We can verify the existence 
and uniqueness of c;(&) satisfying these four 
conditions, so Chern classes can be defined 
axiomatically by these conditions (Hirzebruch 
[4]). We can similarly defined Stiefel-Whitney 
classes axiomatically. 


Definition in Terms of Obstruction Classes. 
When the base space is a (CW complex, we 
can define the Chern class of a C"-bundle é 
over B as follows: Let V,,,-441(C)=U(n)/I,-441 
x U(q—1) be the tcomplex Stiefel manifold of 
all orthonormal (n—q + 1)-frames in C” with 
Hermitian metric. Then V, „-4+;(C) is *(2q—2)- 
connected, and its (2g — 1)-dimensional 
*homotopy group nag-1(Vn,n-q+1(C)) =Z. Let 
¢' be the tassoctated bundle of é with fiber 
Vn—g+i(C). Then the tprimary obstruction 

(€ H?4(B; Z)) to constructing a ‘cross section of 
¢' coincides with the Chern class c,(&). Anal- 
ogously, we can interpret w,(€) for an R"- 
bundle č as an obstruction class (— 147 Fiber 
Bundles). 


Definition in Terms of Schubert Varieties. 

We denote by C* the subspace defined by 

Zk+1 =Zk+2 = ++» = Zp}y = 0 of the space C"**¥ = 
{Zis -> Znan) lz:€C,i=1,...,n+ N}, and fix 
the sequence of subspaces C! c C? c... c CYN, 
The set of all complex n-planes X through 

the origin O in C”*" forms the tcomplex 
Grassmann manifold Mp+yN.n(C). We denote 
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by E(y,’) the set of all pairs (X, v), where 
X€EM,4y.,(C) and v is a vector in X. Then 
we can define a 2N-universal complex n- 
dimensional vector bundle y with base space 
M,,+n.n(C), with total space E(y,%) and projec- 
tion (X,v)>X. Let o=(@(1), ...,@(n)) be a 
sequence of integers satisfying the condition 
O<a@(1)<...<@(n)<N. Then the set e, of all 
n-planes X <C"** through the origin O satis- 
fying dim(X NC) =i, dim(X N C!+ 2-1) = 
i—1,i=1, 2,...,n, forms a real (2 È- œ(i))- 
dimensional open tcell. The set of all these 
open cells e„ gives a tcellular subdivision of 
Mi+n.n(C) as a CW complex. The closure @,, 
of e, is a cellular subcomplex of M,,,y,,(©), 
called a Schubert variety. This is a tpseudo- 
manifold with canonical orientation and 
represents a (2 È- w(i))-dimensional inte- 
gral cycle, called a Schubert cycle. We denote 
ê, by (w(1), ..., a(n). All these homology 
classes form the basis of the homology group 
H,(M,+n,n(C); Z). The cocycle dual to the cycle 
(0,...,01,..., 1) represents the Chern class 

eae 

n—q q 
can E H74(M,,4y.n(C); Z). For the real Grass- 
mann manifold M,+y,„(R), we can analogously 
define the universal Stiefel-Whitney classes. 


Thom’s Definition. Let é be an R"-bundle over 
B, B; be its Thom space, and Ue H"(B.; Z3) 
be the ffundamental class of B;. Let j: B>B; 
be the inclusion induced from the zero cross 
section and @: H*(B; Z3) = H**"(B.; Z,) be the 
tThom-Gysin isomorphism. Then we have j* U 
=w,(¢), o~ ' (Sq U) = wi(¢) (0<i<n), where 

Sq’ is the Steenrod square (R. Thom [6]). 


Definition in Terms of Differential Forms. 

Let B be a tdifferentiable manifold and č = 
(P., B, U(n)) be a differentiable principal U(n)- 
bundle over B. Let Q=(Q,,) be the tcurvature 
form corresponding to the tconnection form 
o=(q,), i,j=1,...,n, on P,. Then Q,, is a 
complex-valued 2-form, and Q,,= —Q,,. For 
the matrix Q, we consider the differential form 


w=) w,=det|1+(2x./—-1)'QO}, 


where J is the unit matrix, the multiplication 
in the determinant is the texterior product, 
and w, is the part of degree 2q in y. Then y is 
defined as a real form independent of the con- 
nection w. We have dy, =0, and the coho- 
mology class of (—1)*, in H*4(B; R) is the 
Chern class c,(€) with real coefficients (Borel 
and Hirzebruch [2], S. S. Chern [3]). 


Definition in Terms of Symmetric Polynomials. 
(— 427 Topology of Lie Groups and Homoge- 
neous Spaces.) 
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F. Characteristic Classes of Manifolds 


For a differentiable (complex or almost com- 
plex) manifold M, the characteristic classes of 
its tangent bundle are called characteristic 
classes of the manifold M. We shall denote 
Stiefel-Whitney classes, Pontryagin classes, 
Euler-Poincaré classes, and Chern classes of M 
by w,(M), p;(M), X,,(M), and c;(M), respec- 
tively. These are invariants of differentiable 
structures, orientations, or (almost) complex 
structures of a manifold M if M is a differ- 
entiable, oriented differential, or (almost) com- 
plex manifold. By the Stiefel-Whitney num- 
bers of an n-dimensional manifold M, we 
mean the values of n-dimensional monomials 
of Stiefel-Whitney classes of M on the fun- 
damental homology class ((w,(M)"!w,(M)”... 
w,(M)")[M]éZ.,, where r, + 2r,+...+nr,= 
n, r;20. We can define integer-valued Pon- 
tryagin numbers and Chern numbers similarly. 
These numbers are called generally character- 
istic numbers of the given manifold. In partic- 
ular, X,,(M)[M]=yx(M) is the tEuler-Poincaré 
characteristic. 

In the case of topological manifolds, we 
can define characteristic classes in the follow- 
ing sense. Let M be a closed n-dimensional 
topological manifold and X” the genera- 
tor of H"(M; Z.). By defining X'(Y"™')= 
(X'¥"-)[M]eZ, for X'¢H'(M;Z,), Y" ‘ec 
H"-'(M;Z,), we have an isomorphism 
H'(M;Z,)=Hom(H" ‘(M, Z,); Z,). The ele- 
ment u;e H'(M; Z), corresponding to the 
homomorphism Y” '—>Sq' Y" [M] under this 
isomorphism is called the Wu class of M, 
where Sq' is the tSteenrod square. Moreover, 
we call w,= $j- Sq u;e H/(M; Z3) the Stiefel- 
Whitney class of the topological manifold M. 
Then for any tdifferentiable structure 2, we 
have w(M, 2)=w;. Therefore Stiefel-Whitney 
classes of differentiable manifolds are topolog- 
ical invariants (more precisely homotopy type 
invariants) (Thom [6], W. T. Wu [8]). J. W. 
Milnor [9] proved that Pontryagin classes of 
differentiable manifolds are not topological 
invariants. The image of p;(M) by the homo- 
morphism H*'(M; Z)—> H*'(M; Q) induced by 
the inclusion Z <Q (the rational number field) 
is called the rational Pontryagin class. In 1966, 
S. P. Novikov [10] proved the topological 
invariance of the rational Pontryagin class 
(— Section H). 


G. Index Theorem for Differentiable 
Manifolds 


Let M be an oriented closed manifold of di- 
mension 4k. Putting f(x, y)=x: y[M] for 
elements x, y of the 2k-dimensional real coho- 
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mology group H?*(M;R), we obtain a bilinear 
form on H7*(M;R). The tsignature of the 
quadratic form f(x, x) (namely, (number of 
positive terms)—(number of negative terms) in 
its canonical form) is a topological invariant 
(homotopy type invariant) of the manifold M. 
We call it the index or the signature of the 
manifold M and denote it by t(M). If the di- 
mension of M is not divisible by 4, we define 
t(M)=0. For the product of manifolds we 
have t(M x N)=1(M)-1(N). Also t(M) is an 
invariant of the tcobordism class of M (Thom 
[6]). 

The index t of a differentiable manifold 
gives a homomorphism of the tcobordism ring 
Q into the ring Z of integers. Hirzebruch inves- 
tigated the multiplicative property of t and 
gave its expression by means of Pontryagin 
numbers. Let P, be the ith telementary sym- 
metric function of indeterminates f,,..., Br. 
Then a homogeneous part of the formal power 
series 


> Ji 


i=1 tanh./B; 


of B,,..., 8, is a symmetric polynomial of 
B,,---,B,, and therefore a polynomial of P, 
with rational coefficients. For k <n, we de- 
note the homogeneous part of degree k by 
L,(P,,..., P,). Specifically, if P, are the Pontrya- 
gin classes p,(M**) of a 4k-dimensional closed 
differentiable manifold M**, then L,(P,,..., P,) 
is a 4k-dimensional cohomology class of M**. 
Then we have the formula 


t(M**)=L,(P,, ...,P,) LM“, 


called the index theorem of differentiable mani- 
folds (or Hirzebruch index theorem). For exam- 
ple, L, =(1/3)P,, La =(1/45)(7P, — P?), and 
L,=(1/945)(62P, —13P,P,+2P3),...(— 114 
Differential Topology). Later this index theo- 
rem was generalized to the tAtiyah-Singer 
index theorem (— 237 K-Theory). 


H. Combinatorial Pontryagin Classes 


Let K be an oriented n-dimensional compact 
thomology manifold, and let 2” be the bound- 
ary of an oriented (r+ 1)-simplex, namely, the 
combinatorial r-sphere. Let f: K>2" *' be a 
tpiecewise linear mapping. Then for almost all 
points y of £" *, f~'(y) is an oriented 4i- 
dimensional compact homology manifold, and 
its index t( f ~'(y)) is independent of y. We 
denote this by t(f). The (f) is an invariant of 
the homotopy class of f. Let o be the funda- 
mental class of H" *(2""*!; Z). Then for n> Bi 
+2, there exists a unique cohomology class 

I, =1(K)e H*'(K; Q) such that for any piece- 
wise linear mapping f: K >Z" “', we have 
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(L f*o)[K]=t(f). We can remove the restric- 
tion n>8i+2 if we take K x X" for K and 
define /,(K) to be I,K x 2”) for sufficiently 
large m. If K is a *C!-triangulation of a dif- 
ferentiable manifold M, l;(K) coincides with 
the class L,(P,,..., P) defined by Hirzebruch 
(R. Thom [7]; V. Rokhlin and A. Shvarts), 
where P,=p;(M) is the Pontryagin class of 

M. Since the variable P, can be expressed as 

a polynomial with rational coefficients of 

L (Pi, P), j <i, we define the combina- 
torial Pontryagin class p;(K) of a homology 
manifold K as the polynomial of [,(K) with 
rational coefficients. Therefore, if K is a C}- 
triangulation of a differentiable manifold M, 
we have p,(K)= p;(M) (modulo torsion ele- 
ments). The class /,(K) and consequently p,(K) 
are important combinatorial invariants of K. 
These classes are topological invariants [10]. 
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A. Mathematics in the Chao, Han, and Tang 
Dynasties (3rd Century B.C.— 10th Century A.D.) 


In ancient China, the art of divination, called 
yi, was used in government administration. 
This was a kind of calculation that used pieces 
called tse. The book embodying it, called the 
I-Ching, is still popularly used. It shows that 
“numbers” or mathematics was seriously uti- 
lized in China at that time. The multiplica- 
tion table for numbers up to nine (called the 
Pythagorean table in the West) was known in 
China from the legendary period. However, 
mathematics in the Greek sense, that is, math- 
ematics as a logically systematized science, was 
unknown in ancient China. 

Suanching-Shihshu, or the Ten Books on 
Arithmetic—namely, Choupi-Suanching, 
Chiuchang-Suanshu, Haitao-Suanching (edited 
by Liu Hui), Suntzu-Suanching, Wutsao- 
Suanching, Hsiahouyang-Suanching, 
Changchiu-Suanching, Wuching-Suanshu, 
Chiku-Suanching (edited by Wang Hsiao- 
Tong), and Shushu-Chiyi (edited by Hsu Yue) 
—came into being between the 2nd century 
B.C. and the 6th century A.D., from the Chao 
to the Han eras, with the exception of the 
Chiku-Suanching compiled in the Tang era. 
These are the only mathematical texts from 
this early period whose authors and times of 
publication are known. They were used in the 
civil service examination for selecting adminis- 
trators up to the beginning of the Sung era 
(960 a.D.). The most important among them is 
Chinchang-Suanshu, or the Book of Arithmetic, 
which contains nine chapters. It treats positive 
and negative fractions with laws of operations 
on signed numbers, equations, and the elemen- 
tary mathematical knowledge of daily life. The 
Chiku-Suanching contains a number of prob- 
lems reducible to equations of the 3rd and 
4th degrees. 

There were also two works called San Tung 
Shu (edited by Tong Chuan) and Chui Shu 
(edited by Tsu Chung-Chih), but no copies of 
them are extant. Later works, one from the Sui 
era (published in 636) and another from the 
Tang era, tell us that the latter contained the 
result 3.1415927 >n > 3.1415926 and the ap- 
proximate values 355/113 and 22/7 for z. 

In the 1st century a.D. Buddhism was intro- 
duced from India, and paper was invented. 
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However, despite the communication with 
India, neither the Indian numeration system, 
written calculation, nor the abacus was at that 
time widely used in China. The extraction of 
square or cube roots was done with calculat- 
ing rods. 


B. Mathematics in the Sung and Yuan 
Dynasties (10th—14th Centuries) 


In the Sung and Yuan periods contact was 
made with the Arab world. In the 13th cen- 
tury, a mechanical algebra utilizing calculating 
rods made remarkable progress; this can be 
attributed to Arab influence. Toward the 

end of the Sung era appeared the Shushu- 
Chiuchang by Ch’in Chiu-Shao and the Yiku- 
Yentan by Li Chih. The former gives a method 
like Horner’s for approximate solution of 
equations, and the latter gives the principle of 
tienyuan-shu, i.e. the mechanical algebra of 
this period. The principle of tienyuan-shu was 
further expounded in the Suanhsueh Chimeng 
(1295) and the Suyuan Yuchien (1303) by Shih 
Shih-Chich, the Yanghui Suanfa by Yang Hui, 
and other works. These were introduced into 
Japan and they influenced the wasan (Japanese 
mathematics) of early times. Until recently, no 
further original mathematical ideas appeared 
in China. 


C. Mathematics after the Ming Era (15th 
Century) 


In this epoch, European renaissance civiliza- 
tion began to influence the Orient. In 1607, 
Matteo Ricci (1552—1610) translated Books I- 
IV of Euclid’s Elements into Chinese with the 
aid of Hsu Kuang-Chi. In 1592 Suanfa Tang- 
tsung by Ch’éng Ta-Wei appeared, which 
dealt with the use of the abacus. This book 
had great influence upon wasan. 

No development was seen in the indigenous 
mathematics of the Ching era, that is, after 
the 17th century, but science and technology 
were imported by Christian missionaries. This 
brought about calendar reform from the lunar 
to the solar method. On the other hand, new 
editions of classical works such as the Ten 
Books on Arithmetic began to appear in this 
period. Emperor Kang Hsi-Ti (1655-1722), 
who was in correspondence with Leibniz, 
asked Ferdinand Verbiest (renamed Nan Huai 
Jen in Chinese) to compile Shuli-Chingwen 
(Advanced mathematics), whose 53 chapters 
were completed in 1723. This book dealt with 
European-style algebra and trigonometry. In 
the latter half of the 19th century, Alexander 
Wylie translated a number of Western mathe- 
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matical books into Chinese, including Books 
VII-XIII of Euclid’s Elements and some works 
on calculus. Many current Chinese and Japa- 
nese mathematical terms originated with this 
translation. 
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A. General Remarks 


An example of a tC®-function is a treal ana- 
lytic function, which is defined to be a function 
that can be expressed as a power series that 
converges in a neighborhood of each point 

of the domain where the function is defined. 
Many examples, however, show that real 
analytic functions form a rather small subset 
of the C”-functions. Sometimes C®-functions 
(not real analytic functions) play essential roles 
in the development of theories of analysis (— 
105 Differentiable Manifolds S). On the other 
hand, there is a subfamily of C*-functions 
having some remarkable properties in com- 
mon with the family of real analytic func- 
tions. This family is called the family of quasi- 
analytic functions. It has been an important 
object of study since the beginning of the 20th 
century. On the other hand, the Gevrey class 
functions, which are no longer quasi-analytic 
in general, also constitute an important subset 
of C®-functions. The first part of this article 
deals with C®-functions, the second part with 
quasi-analytic functions, and the third part 
with Gevrey class functions. 


B. C°-Functions 


Let Q be an open set of the n-dimensional real 
Euclidean space R”. A real-valued function 
f(%1,.--,X,) defined on Q is called a function 
of class C” on Q (or C”-function on Q) if 

f(x ,---,X,) is continuously differentiable up to 
any order. The totality of C®-functions defined 
on Q is denoted by C” (Q). It is an fassociative 
algebra over the real number field R. A contin- 
uous function f defined on some closed set F 


217 


of R” is called a C”-function on F if there exist 
an open neighborhood U of F and a function 
geC”(U) such that f=g|F. This definition is 
equivalent to the following (H. Whitney [6]): 
For any multi-index «=(«,,...,@,), we can find 
a continuous function f%(x,,...,x,) on F such 
that (i) f°(x,,...,x,)=f(%;, -+> Xn) and (ii) for 
any positive integer r and for every multi- 
index a with |a|<r, 


lim line f) 
=E 


=e x JET fo 


la+Bl<r p! 


where || || denotes the Euclidean norm of R”. 


C. Local Theory of C”-Functions 


We shall now introduce an equivalence rela- 
tion ~ in C”(R"), defined as follows: f ~g 
<> f|U =g|U for some open neighborhood 

U of the origin. Let £, denote the quotient set 
C”(R")/~, which naturally inherits the struc- 
ture of an associative algebra from C”(R”). An 
element of &, is called a germ of a C”-function 
at the origin. We denote the germ of fe C”(R”) 
by J. To feg, we assign the formal Taylor 
expansion >) (D%/(0)/a!)x* around 0, where 

D*f means (8*7 ™f)/(0"'x, ...0""x,) for «= 
(ri-ra) This assignment induces a homo- 
morphism t from 6, to the tring of formal 
power series R[[x,,...,x,,]] of n variables. 
The homomorphism <z is surjective but not 
injective. Put A,=1t~'(0)<4,. A function f 
whose germ f belongs to A, is called a flat 
function. The function g(x) defined by g(x)= 
exp(—1/x?) when x 40 and g(0)=0 is an 
example of a flat function on R+. A close study 
of the relationship between &, and R[[x,, 
...>X,]] leads to the preparation theorem for 
C”-functions, which can be stated as follows: 
Let F(x,,...,x,)€6, satisfy F(0,...,0,x,)= 

x? G(x,) (GE€,, g(0) 40). Then any feg, can 

be expressed as f = FO + R, where Õe g, and 


R= Deb r(x 1, -3 Xp—1) x) with re &,_, (B. 
Malgrange [3]). 
Let f(x,,...,X,) be a symmetric function in 


(Xis --- Xn) of class C”. Then there exists a 
germ ĝe é, such that F(x, wig Ry Ong ix, 
o,), where o,,...,0, denote elementary sym- 
metric functions with respect to x,,...,X, (G. 
Glaeser, Malgrange). Let feg, satisfy f(x)= 
f(—x). Then there exists a germ ge é, such 
that f(x)=9(x?) (H. Whitney). 


D. Global Results 


Case of n Variables. C* (Q) becomes a *Fréchet 
space when it is endowed with the topology of 
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uniform convergence on compact sets for all 
partial derivatives. Let J and J, be two closed 
tideals of C°(Q). Then we have J =J, if and 
only if t,(J)=1,(J,) for each x EQ, where t, is 
the mapping from C% (Q) to the ring of the 
formal power series that assigns the formal 
Taylor series of f around x (Whitney). 


Case of One Variable. In the case of one vari- 
able, further information can be obtained from 
various points of view. In the following, f 
denotes a C”-function defined on the unit 
interval 7 =[0, 1]. If f satisfies f(1)= 1, f’(0) 
=... = f7 1(0)=0, then we have 


1 1 1 
Pa Ta + a k, 
where m;=sup{| f(x)||xe/} and where k 
is some constant independent of the choice 
of f and r (E. Borel). Similar kinds of inequal- 
ities were obtained by A. N. Kolmogorov, 
A. Gorny, and H. Cartan. Let A be an arbi- 
trary countable set of real numbers. If for any 
xel we can find an integer r(x) such that 
f(x)e A, then such a function f is neces- 
sarily a polynomial. The interval I can be 
divided into three disjoint subsets: SV, SY), 
and S$’. These are characterized as follows: 
For xeS the formal Taylor series t,( f) of f 
around x converges to f in some neighbor- 
hood of x. For xe SY”, t (f) diverges. And for 
xe SY, t,(f) converges in some neighbor- 
hood of x but does not tend to f. Then S$’ is 
an open set and S¥ is a G,-set, while SY” is 
an F,-set of the tfirst category. Conversely, let 
I=S,+8,+5S, be any partition of J into an 
open set S,, a G;-set S,, and an F -set S, of the 
first category. Then there is some feC* (1) 
with S,;=S (i= 1, 2, 3) [5]. 


E. Relations between C~-Functions and Real 
Analytic Functions 


Let C°() be the set of real analytic functions 
on I. Then C°(/) is a subalgebra of C(I). 
Applying the above result in the case of S, = 
Ø, we find a function feC”(/) that admits 
no real analytic function coinciding with f 

in a subinterval of J. Actually, functions with 
such a property are distributed densely in 
C®™(I). A necessary and sufficient condition for 
a function fe C®™(I) to belong to C°(J) is that 
for suitable constants A and k, | f(x)|< 
Ak"n!, xel, n=0, 1, 2,..., be valid (Pring- 
sheim’s theorem). If f(x) 20 for all xeJ and 
n=0,1,2,...,then feC°(J) (S. N. Bernshtein). 
For any open set Q (c R”), the set C?(Q) of 
real analytic functions on Q is dense in C”(Q) 
(polynomial approximation theorem). This re- 
sult is true even when the topology of C”(Q) 
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is replaced by a stronger one (Whitney [6]). 
Let fe C”(Q) and pe C”(Q). Then we can 
find ge C”(Q) satisfying f =gọ if and only if 
for any xEQ, t,( f) is divisible by 1,(@) in the 
ring of formal power series (S. Lojasiewicz, 
Malgrange [3]). 


F. Quasi-Analytic Functions 


The investigation of quasi-analytic functions 
began with the attempt to obtain an intrinsic 
characterization of analytic functions. Borel 
defined monogenic functions as functions 
differentiable on their domains of definition, 
which can be any subset of the complex plane, 
not necessarily assumed to be open (— 198 
Holomorphic Functions Q). Similar to com- 
plex analytic functions, monogenic functions 
are uniquely determined by their values on 
any curve. While quasi-analyticity can be 
defined by such properties, it is customary to 
approach quasi-analytic functions from an- 
other aspect, that is, the behavior of higher 
derivatives of C”-functions. 

Generally, a subset B of C®(J) is called a set 
of quasi-analytic functions if the mapping t,: 
B-—R[[x]] defined in Section D is injective at 
each point xeJ. The functions belonging to B 
are called quasi-analytic (with respect to B). 
Here, an important problem is to characterize 
a set of quasi-analytic functions B by specializ- 
ing the image of t,(B). 

Now let {M,,} be a sequence of positive 
numbers. Let C(M,) be the subset of C” (I) 
consisting of f such that 


[f(x)|<Ak"M,, xel, n=0,1,2,..., 


where A= A(f) and k=k(f) are constant. 
Then Pringsheim’s theorem simply asserts that 
C(n!)= CCD). 

In 1912, J. Hadamard raised the problem of 
determining the condition that the sequence 
{M,} should satisfy so that C(M,) becomes a 
set of quasi-analytic functions [9]. A. Denjoy 
showed that if 


M, =(nlog! nlog*n...log?n)", 
where 
log! n=logn, log?n=log(log? 'n), 
PER 33 x, 


then C(M,,) is a set of quasi-analytic functions 
[10]. Later he derived an improved result that 
the condition © M, "= œ is sufficient. T. 
Carleman first gave a necessary and sufficient 
condition for C(M,) to be a set of quasi- 
analytic functions, and later A. Ostrowski and 
T. Bang gave another version of the same 
condition [11—13]. The condition states essen- 
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tially the following: A necessary and sufficient 
condition for C(M,) to be a family of quasi- 
analytic functions in the interval (a, b) is given 
by either (i) X p, * = +00, where f, =inf,,,, Mj’* 
(Carleman), or (ii) f” (log T(r))/(r?) dr = œ, 
where T(r)=sup, , (r"/M,) (Ostrowski, Bang). 
S. Mandelbrojt and T. Bang also gave another 
condition [13, 14]. (The simplest proof of these 
results is found in [13] or [15], where the 
proof follows Bang’s idea.) 

Related to the above theorem, we also have 
the following: Let {M,} be a sequence of posi- 
tive numbers with Y(M,/M,.,)<0o. For «>0 
we can find fe C(M,) defined on (—00, 00) such 
that f(0)>0, f(+«)=0. Moreover, for 0< 
a< ß there exists feC(M,) such that f(0)>0, 
f™=0(a<x<f,n=0,1,2,...) [16]. 

Suppose that we are given an interval J and 
increasing sequences {v,} and {M,} of positive 
numbers. Then we have the problem of finding 
suitable conditions on {v} and {M,} under 
which the mapping f>{ f"(xo)} gives an 
injective mapping from C(M,) to the sequences 
above. When {v,} and {M,} satisfy the above 
conditions, then a function belonging to C(M,,) 
is called quasi-analytic (v,) in the generalized 
sense. The study of the inclusion relation be- 
tween two families C(M,) and C(M,) also 
deserves attention. In [15] the relation be- 
tween C(M,) and C(n!)=C°(J) is discussed in 
detail. There are many open problems con- 
cerning the relationship between C(M,) and 
C(M,;) in general. 

Quasi-analytic functions are closely related 
to problems in various branches of analysis, in 
particular the theories of complex analytic 
functions, Fourier series, Fourier integrals, 
Dirichlet series, and asymptotic expansions 
[8, 15, 17]. 


G. Gevrey Class Functions 


This class of functions has its origin in the 
study of tparabolic equations. A C®-function 
J (x) defined in a domain of R” is called of 
Gevrey class s (1 <s < œ) if for every com- 
pact set K in that domain there exist positive 
constants A, and Cx such that 


max |D*f(x)|< Ax Cf a!" 


for all multi-indices a (|eJ=a,+...t+4a,, a!= 
a,!a5!...a,!). A typical example is 


ot exp(—I/x) for x>0, 
om 0 for x <0, 


which is of Gevrey class 2. Let us denote the 
function space of a Gevrey class s by y. Evi- 
dently y cy? ifs<s’. Let f(xjey™ and 
F(y)ey; then xt F(f(x))ey. In particular, 
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the sum and the product of two functions in 
y® belong also to y®. Furthermore, the tim- 
plicit function theorem holds in this class. 
Unlike the class of analytic functions, we can 
use this class of functions of class s (> 1) fora 
tpartition of unity, and several problems in 
partial differential equations can be treated 
within this function space. Gevrey class func- 
tions are discussed in a complete form in [20] 
(— also [19]). 
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A. History 


The notion of a class field was first introduced 
by D. Hilbert (1898). Let k be an talgebraic 
number field and K a tGalois extension of k. 
Hilbert called such a field K a class field over k 
(or K/k was called a class field) when the fol- 
lowing property was satisfied: A tprime ideal 

p of k of absolute degree 1 (i.e., a prime ideal 
whose fabsolute norm is a prime number) is 
decomposed in K as the product of prime 
ideals of K of absolute degree t if and only if 
p is a tprincipal ideal. (Such a field K is now 
said to be an absolute class field over k in 
order to distinguish it from a class field defined 
later more generally by T. Takagi as explained 
below.) Hilbert conjectured the following theo- 
rems (1)—(4) together with the principal ideal 
theorem (— Section D), and proved them in 
some special cases. (1) For any algebraic num- 
ber field k there exists one and only one class 
field K/k. (2) A class field K/k is an tAbelian 
extension whose tGalois group is isomorphic 
to the fideal class group of k. Hence the degree 
n=(K:k] is equal to the tclass number h of k. 
(3) The trelative different of a class field K/k is 
the principal order; thus K/k is an tunramified 
extension. (4) Let p be a prime ideal of k, and 
let f be the smallest positive integer such that 
p/ is a principal ideal. Then p is decomposed 
in the class field K/k as p=, 2B, ... Py, 
NxalBi)= pl, fg=n. 

Hilbert was led to these conjectures by the 
analog to the theory of talgebraic functions in 
one variable. Theorems (1)—(4) were proved by 
P. Furtwängler (Math. Ann., 63 (1907)), but 
these results were subsumed under the class 
field theory of Takagi, who generalized the 
notion of class field and proved that every 
Abelian extension of k is a class field over k (J. 
Coll. Sci. Imp. Univ. Tokyo, (9) 41 (1920)). Since 
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then, the arithmetic of Abelian extensions of k 
has developed through this theory. In Takagi’s 
paper, L. Kronecker’s problem concerning 
Abelian extensions of an imaginary quadratic 
field (— 73 Complex Multiplication) was 
solved simultaneously; this had been an open 
problem since the 19th century. Later, E. Artin 
proved the general law of reciprocity (Abh. 
Math. Sem. Univ. Hamburg, 5 (1927)) which put 
class field theory into its complete form. The 
original proof by Takagi was rather com- 
plicated, and H. Hasse, Artin, J. Herbrand, C. 
Chevalley, and others tried to simplify it. In 
particular, Chevalley introduced the notion 

of tideles and gave a purely arithmetic proof. 
On the other hand, attempts are also being 
made to generalize this theory to non-Abelian 
extensions. We mention here the results of 

G. Shimura [15] and Y. Ihara [16]. Today, 
class field theory is considered one of the 

most beautiful theories in mathematics. 


B. Definition of a Class Field 


Let k be an algebraic number field. For the 
definition of a general class field over k, we 
need a generalization of the ideal class group 
of k (— 14 Algebraic Number Fields H). Let m 
be an tintegral divisor of k, and let 3(m) be the 
multiplicative group of all tfractional ideals of 
k which are trelatively prime to m. For the 
rest of this article, we mean by an ideal of k 

a fractional ideal of k. Denote by S(m) the 
tray modulo m. Let H(m) be an fideal group 
modulo m, that is, a subgroup of (m) con- 
taining S(m). A Galois extension K of k is said 
to be a class field over k for the ideal group 
H(m) if the following property is satisfied: A 
prime ideal p of k of absolute degree 1 which is 
relatively prime to m is decomposed in K as 
the product of prime ideals of K of absolute 
degree 1 if and only if p belongs to H(m). The 
absolute class field of Hilbert is the case where 
m=(1) and H(m) ts the group of all principal 
ideals of k. 

A class field K/k for an ideal group H is 
uniquely determined by H (uniqueness theo- 
rem). The fconductor f of H is said to be the 
conductor of the class field for H. The ideal 
group H corresponding to the class field K/k is 
determined by K as follows: H(f)/S(f) is the 
union of all cosets C of 3(f) modulo S(f) such 
that C contains a frelative norm Nx,(W) of 
some ideal Y of K which is relatively prime to 
f. In general, let K/k be a Galois extension and 
m be an integral divisor of k. Let H(m) be the 
union of all cosets C of 3(m) modulo S(m) such 
that C contains a relative norm Nx (2) of 
some ideal W of K which is relatively prime to 
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m. Then H(m) is a multiplicative subgroup of 
3(m), and the index h=(3(m): H(m)) is not 
greater than the degree n=[K:k]. We have 
h=n if and only if K/k is the class field for 

H. Hence a class field K/k can be defined as 
a Galois extension of k such that h=n for a 
suitable integral divisor m of k. 


C. Fundamental Theorems in Class Field 
Theory 


(1) Main theorem: Any Abelian extension K/k 
is a class field over k for a suitable ideal group 
H. 

(2) Existence theorem: For any ideal group 
H(m) there exists one and only one class field 
for H(m). 

(3) Composition theorem: Let K, and K, be 
class fields for H, and H,, respectively. Then 
the composite field K, K, is the class field over 
k for H, N H}. Consequently, K, >K, if and 
only if H, cH. 

(4) Isomorphism theorem: The Galois group 
of a class field K/k for H(m) is isomorphic to 
3(m)/H(m). In particular, every class field 
K/k is an Abelian extension of k. 

(5) Decomposition theorem: Let f be the 
conductor of the class field for H. If p is a 
prime ideal of k relatively prime to f and f is 
the smallest positive integer with p/ eH, then 
p is decomposed in K/k as p =R, P,... P, 
Nx B) = pl, fg=n. 

(6) Conductor-ramification theorem: Let f be 
the conductor of a class field K/k. Then f is not 
divisible by any prime divisor that is unrami- 
fied for K/k, and f is divisible by every prime 
divisor that ramifies for K/k. Let f=[Tf,, 

Îp = p°. Then j, coincides with the tp-conduc- 
tor of K/k, and the exponent c can be explicitly 
expressed by the order of the tramification 
groups and the tramification numbers of p for 
K/k (— 14 Algebraic Number Fields P). 

(7) Let p be a prime ideal of k that rami- 
fies for K/k. Let H, be the ideal group of k 
such that (i) the conductor of H, is relatively 
prime to p and (ii) H, is the minimal ideal 
group of k containing H with property (i). 

Let n=[K:k], e=(H,:H), and p’eH,, where 
p*(d<f)¢H,. Then p is decomposed in K/k as 
p =(P; B2... Boy Nka B) = p’, n=efg. 

(8) Translation theorem: Let K/k be the class 
field for an ideal group H(m), and let Q be an 
arbitrary finite extension of k. Then KQ/Q is 
the class field for H*, where H* is the ideal 
group of Q consisting of all ideals b of Q with 
Noy (b)€ H(m). In particular, the conductor of 
KQ/Q is a divisor of the conductor of K/k. 

(9) Artin’s general law of reciprocity: Let K/k 
be the class field for an idea group H with the 
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conductor f. We denote the Artin symbol of 
an ideal a of k that is relatively prime to f by 


(K/a)= (=) . 
a 
Let a mapping ® from 3 (f) to the Galois 
group G of K/k be defined by ®(a)=(K/a) for 
ae 4(f). Then ® induces the isomorphism 
M/H) =G. Namely, the isomorphism men- 
tioned in (4) is explicitly given by the Artin 
symbol. Also, the ideal group H(f) is character- 
ized as the set of all ideals a such that ae %(f) 
and (K/a)=1. From this theorem we can prove 
all the known laws of reciprocity for power- 
residue and norm-residue symbols (— 14 
Algebraic Number Fields O, Q, R). 

From the general results of class field theory 
we can systematically derive all the known 
theorems concerning the arithmetic of qua- 
dratic fields, cyclotomic fields, and Kummer 
extensions. 


D. Principal Ideal Theorem 


Let K/k be an absolute class field. Then the 
extension of any ideal of k to K is a principal 
ideal of K. This theorem is called the principal 
ideal theorem. It was conjectured by Hilbert, 
formulated by Artin as a theorem of group 
theory, and proved by Furtwängler (Abh. 
Math. Sem. Univ. Hamburg, 7 (1930)). Later a 
simple proof was given by S. lyanaga (Abh. 
Math. Sem. Univ. Hamburg, 10 (1934)). This 
theorem was also generalized to the following 
general principal ideal theorem (Iyanaga, 
Japan. J. Math., 7 (1930)): Let K/k be the class 
field for the ray S(f), and let f= %D, where D is 
the relative different of K/k. Then the exten- 
sion to K of any ideal of k that is relatively 
prime to f belongs to S(%). Put = [I W”. 
Then v is equal to the ramification number 

v, +1 (— 14 Algebraic Number Fields K). For 
an absolute class field K/k, let the extension of 
an ideal a to K be (O(a)). Then we can choose 
O(a)e K such that O(a)O(b)" O(ab)“' ek, 
where o(a)=(K/a) is the Artin symbol for a 
(T. Tannaka, Ann. Math., 67 (1958)). This 
result can also be generalized for the class 
field for S(f). 


E. Theory of Genera 


Let K/k be a Galois extension and let H(m) be 
an ideal group of k. The set of all ideals 2 of K 
relatively prime to m such that Nx,(20) be- 
longs to H(m) forms an ideal group of K. This 
ideal group is said to be the principal genus for 
H. Each coset of 3(m) modulo the principal 
genus for H is said to be a genus for H. In 
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particular, let K/k be a cyclic extension with 
the conductor f, and let H(f) be the ideal group 
of k generated by Nx,(A) (Ae K) and S(f). 
Then the principal genus for H(f) is the ideal 
group formed by the ideal classes of K of 

the form C!~’, where ø is a generator of the 
Galois group of K/k (— 347 Quadratic Fields 
F). In general, let K/k be an Abelian extension, 
and let f be the conductor of K/k. Then for the 
ideal § =I] $B" of K defined in Section D, 

Nx (S(m)) = S(mf) for an arbitrary integral 
ideal m of k. In particular, let K/k be a cyclic 
extension, and let H = S(mf). Then the prin- 
cipal genus for H is the ideal group consist- 
ing of all cosets of 3(m}) modulo S(m¥) of 
the form B! -° (Herbrand; Iyanaga, J. Reine 
Angew. Math., 171 (1934)). 


F. Class Field Tower Problem and 
Construction Problem 


Furtwangler considered the following prob- 
lem: Let k be a given algebraic number field, 
k=kyck,ck,... be the sequence of fields 
such that k; is the absolute class field over k;_,, 
and K „ be the union of all the k;. Is K, a 
finite extension of k? The answer is yes if and 
only if k, is of class number 1 for some n. This 
problem is called the class field tower problem. 
Artin remarked that if for every algebraic 
number field F of degree n we have the inequal- 
ity | Dp| >(2/4)?"2(n"/n!)? > (ne?/4)"/(2nne'/") 
for the tdiscriminant Dp, then K ,,/k is always 
finite [1, p. 46]. E. S. Golod and I. R. Shafa- 
revich (1964) solved the class field tower prob- 
lem negatively; they proved that K ,,/k is 
infinite if k; (i=1,2,...) is the maximal unrami- 
fied Abelian p-extension of k;_, for a fixed 
prime number p and if the inequality y > 3 + 
2,/p+2 holds, where y is the minimal num- 
ber of generators of the p-component of the 
ideal class group of k and p is the rank of the 
unit group of k. (We call an extension K/k a p- 
extension if the degree [K :k] is a power of a 
prime number p.) For example, the class field 
tower K „/k is actually infinite if k is an imagi- 
nary quadratic field (p = 1) and y >27 for p=2, 
for example, k=Q(/—3:5:7:11-13-17-19). 
Construction problem. Let k be a given alge- 
braic number field and G a finite group. The 
construction problem asks us whether there 
exists a Galois extension K/k such that its 
Galois group Gal{K/k) is isomorphic to G. If G 
is Abeltan the problem can be solved affirma- 
tively by using class field theory. This problem 
was also solved affirmatively for p-groups by 
A. Scholtz and H. Reinhardt in 1937, and for 
general solvable groups by Shafarevich in 1954 
(Izv. Akad. Nauk SSSR, ser. mat. 18). 
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G. Class Field Theory for Algebraic Function 
Fields and Local Class Field Theory 


F. K. Schmidt developed an analog of class 
field theory for Abelian extensions over an 
algebraic function field in one variable with 
finite coefficient field (1930; — [8]). An arith- 
metic proof was given by M. Moriya (1938). 
An analog of class field theory for local fields 
with finite residue-class fields, called local class 
field theory (— 257 Local Fields) was first 
developed by Hasse, and later Chevalley gave 
an algebraic derivation (1933). 


H. Cohomology of Groups and Class Field 
Theory 


For the purpose of simplifying the proof of the 
main theorems in class field theory, the theory 
of ‘Galois cohomology was developed by T. 
Nakayama, G. Hochschild, A. Weil, Artin, J. 
Tate, and others. In particular, Artin and Tate 
[9] constructed class field theory on the basis 
of the cohomology theory of finite groups as fol- 
lows: Let G be a finite group, A a tG-module 
(or a multiplicative commutative group with 
the operator domain G), and H"(G, A) the nth 
tcohomology group (n=0, +1, +2,...) of 

G with coefficients in A (— 200 Homolog- 

ical Algebra N). Then we have H°(G, A)= 
A®/N,(A), where A® is the set of all G-invariant 
elements in A and N,(A) is the set of all ele- 
ments of the form Ng(a)= 2 ,-¢ga (ae A). We 
can consider Z a G-module by defining on =n 
(neZ,ceG). Let A, B, C be G-modules such 
that a G-bilinear mapping (A, B)—>C is de- 
fined. Then we can define the tcup product 
(x, B) a ~ B (we A"(G, A), BE AS(G, B), a~ Be 
Ñ" *(G, C)) for r, se Z with the usual prop- 
erties. Let A be a G-module and H a sub- 
group of G. Then the trestriction homomor- 
phism Rg: Ĥ”(G, A)>Ĥ”"(H, A) and the 
tinjection homomorphism Inj,,,¢:H"(H, A)—> 
Ĥ"(G, A) are defined for ne Z. If H is a 

normal subgroup of G, then the tinflation 
homomorphism Infic;nyc: 4"(G/H, A”)> 
H"(G, A) can be defined for n> 1 (— 200 
Homological Algebra M). 

Let k be an algebraic number field, and let 
K be a Galois extension of k of degree n with 
the Galois group G =G(K/k). The multiplica- 
tive group K* = K — {0}, the tidele group J, of 
K, and the idele class group Cx of K are multi- 
plicative commutative groups with G as their 
operator domain. The fundamental formulas 
in Galois cohomology for class field theory are 


À' (G, Ck)=0, (1) 
Â?’ (G, Cp) = Z/nZ. (2) 
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It is possible to realize the isomorphism of 
(2) by the invariant inv, ,,: H?(G, Ch) 5 {(r/n) 
(mod Z)|r=0,1,...,n— 1} in such a way that 
the following properties hold, where the canon- 
ical cohomology class for K/k is the element 
Ex, of A? (G, Cx) such that inVga Ega =(1/n) 
(mod Z): (i) for kc le K, G=G(K/k), H = 
G(K/l), the relation Rese; Čka = Ex holds; 
(11) if l/k is also a Galois extension with F = 
G(l/k), then we have Infp.g Čik = EK, (m= 
CK :1]); (iit) for a cyclic extension K/k we 
have inVxr Čr = Ly iNV,(Ex,), (mod Z), where 
p runs over all prime divisors of k and inv, is 
the invariant in the local theory (— 257 Local 
Fields E). By these properties, the canonical 
cohomology class ¢,,, is uniquely determined. 
After these preliminaries we can state Tate’s 
theorem, from which the fundamental theo- 
rems in class field theory follow. 

Tate’s theorem. Let K/k be a Galois exten- 
sion with the Galois group G. Then we have the 
isomorphism ®,:H"~?(G, Z) = H"(G, Cy) (n= 
0, +1, +2,...) that is given explicitly by 
®, (2) = Cx, ~ x, where €x,,€H?(G, Cx) is the 
canonical cohomology class for K/k (Ann. 
Math., (2) 56 (1952). 

Corollary 1. Since H~*(G, Z)=G/[G, G] and 
À? (G, Ck) = C,/Ngn(Cx), we have the isomor- 
phism ®,:G/[G, G] = C,/Nxj(Cx). Let f(t, 0) 
(t,o €G) be a 2-tcocycle belonging to ¢x,. 
Then by the explicit expression for the cup 
product we obtain the isomorphism 


®:a( modta, G])> 


I] f(t,0) + (mod Naa(Cr)) . 
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This is an analog of the result in local theory 
that was proved earlier by T. Nakayama and 
Y. Akizuki (Math. Ann., 112 (1936)). 

Corollary 2. For an Abelian extension K/k 
we have the isomorphism ®y:G& C,/Nxj(Cx). 
>| has the property of being the norm- 
residue symbol for C,, and from this iso- 
morphism we can prove immediately Artin’s 
law of reciprocity. Thus we can prove the main 
theorems in class field theory by cohomology- 
theoretic methods [9]. 

We can also see, by generalizing this iso- 
morphism to infinite Abelian extensions, that 
the Galois group of the maximal Abelian 
extension of k over the ground field k with 
tKrull topology is algebraically and topologi- 
cally isomorphic to C,/D,, where D, is the 
connected component of the unit element in 
C,. The structures of D, and C,/D, were ex- 
plicitly determined by Artin and T. Kubota, 
respectively (— 6 Adeles and Ideles D). 

If we assume the fundamental formulas (1) 
and (2) stated above and several other simple 
assumptions as axioms for an infinite ex- 
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tension of a fixed ground field, we can de- 
velop the results stated in this section purely 
cohomology-theoretically. Such a system is 
called a class formation (Artin; — [9]). In ad- 
dition to the cases of algebraic number fields, 
algebraic function fields in one variable with 
finite coefficient fields, and local fields with 
finite residue-class fields, which we have men- 
tioned already, we also know several other 
cases for which analogies of class field theory 
are valid. These analogies can be explained 
systematically by using class formation the- 
ory (Y. Kawada, Duke Math. J., 22 (1955)). 
Examples are (1) the theory of unramified 
Abelian extensions of an algebraic function 
field in one variable with algebraically closed 
constant field of characteristic 0 (Tate and 
Kawada, Amer. J. Math., 77 (1955)); (2) the 
theory of Kummer extensions over a field k 
such that (i) the characteristic of k is 0, (ii) k 
contains all the roots of unity, and (iii) for any 
Galois extension K/k, Nxj(K) =k; (3) the 
theory of Abelian p-extensions of a field of 
characteristic p (E. Witt, J. Reine Angew. 
Math., 176 (1963); I. Satake and Kawada, J. 
Fac. Sci. Univ. Tokyo, 7 (1955)); (4) the theory 
of unramified Abelian p-extensions of an 
algebraic function field in one variable with 
algebraically closed constant field of char- 
acteristic p (Hasse and Witt, Monatsh. Math., 
43 (1936), H. L. Schmid, I. R. Shafarevich, 
Kawada, T. Tamagawa); and (5) the theory 
of Abelian extensions of a local field with 
algebraically closed residue-class fields (J.-P. 
Serre, Bull. Soc. Math. France, 89 (1961)). 

An analogy of class field theory for infinite 
Abelian extensions was considered by Her- 
brand, Moriya, M. Mori, and Kawada. 
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A. Introduction 


The general linear groups, unitary groups, 
orthogonal groups, symplectic groups, etc., 
that are described below are all called classical 
groups (— 13 Algebraic Groups, 151 Finite 
Groups, 248 Lie Algebras, 249 Lie Groups). 


B. General Linear Groups 
Let V be a tlinear space of dimension n over a 


tfield K, and let GL(V) denote the set of all 
‘linear mappings of V onto V (hence they are 
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all bijections). Then GL(V) is a group under 
the composition of mappings. This group is 
called the general linear group (or full linear 
group) on V. Let e,,...,e, be a basis of V over 
K, and let (CH) be the matrix associated with an 
element A of GL(V): Ae;=,a/e;. Then the 
mapping A -+(«;) is an isomorphism of GL(V) 
onto the multiplicative group GL(n, K) of all 
nxn finvertible matrices over K. We can 

thus identify the group GL(V) with GL(n, K). 
GL(n, K) is called the general linear group of 
degree n over K. Consider the homomorphism 
A-|A|{|A| is the determinant of A) of GL(V) 
onto the multiplicative group K* = K — {0}. Its 
kernel SL(V) is a normal subgroup of GL(V) 
and is called the special linear group (or uni- 
modular group) on V. The subgroup SL(n, K) 
={A|AeGL(n, K), |A|=1} of GL(n, K) corre- 
sponds to SL(V) under the above isomorphism 
GL(V)=GL(n, K). SL(n, K) is called the special 
linear group of degree n over K. Unless n=2 
and K is the tfinite field F, = GF (2), SL(n, K) is 
the tcommutator subgroup of GL(n, K). The 
tcenter 3 of GL(n, K) coincides with the set 

of all scalar matrices a! (xe K*), and the 

center 3) of SL(n, K) is a finite group given by 
30 SL(n, K)={al|aeK,a"=1}. 

Now let P(V) be the projective space of 
dimension n—1 obtained from a linear space 
V of dimension n. Namely, P(V) is the set 
of all linear subspaces of dimension 1. Then 
there exists a natural homomorphism @ of 
GL(V) into the group of all projective trans- 
formations of P(V), and the kernel of o 
coincides with the center 3 of GL(V). Hence 
o(GL(V))=GL(V)/3. This group is written as 
PGL{(V) and is called the projective general 
linear group on P(V). Similarly, PGL(n, K) 
= GL(n, K)/3 is called the projective general 
linear group of degree n over K. The quotient 
group SL(n, K)/3o of SL(n, K) by the center 3, 
is called the projective special linear group and 
is written as PSL(n, K) or LF (n, K) (linear 
fractional group). 

The groups GL(n, K), SL(n, K), etc., are also 
written as GL,(K), SL,(K), etc. In particular, 
when K is the ffinite field F,, these groups 
are denoted by GL(n, q), SL(n; q), PGL(n, q), 
PSL(n, q), LF (n, q). 


Simplicity of PSL(n, k). When n=2 and K = 
F,, PSL(2, 2)= S, (the symmetric group of 
degree 3). When n=2 and K =F;, PSL(2,3)= 
N, (the alternating group of degree 4). Except 
for these cases, the group PSL(n, K) (n> 2) is 
a noncommutative tsimple group (— 151 
Finite Groups I). 

Suppose that K is the finite field F,, and let 
a(n, q), B(n, q), y(n, q), O(n, q) denote the orders 
of GL(n, q), SL(n, q), PGL(n, q), PSL(n, q), res- 
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pectively. Then we have 

a(n, q)=(q"—1)(q"—q)...(q"—q" '), 
B(n, q)=y(n, g)= a(n, g\/(q — 1), 

d(n, q)= y(n, q)/d, 


where d =(n, q — 1) (the greatest common divi- 
sor of n and q— 1). 


C. Properties as Lie Groups 


If the ground field K is the field R of real num- 
bers (the field C of complex numbers), the 
above groups are all tLie groups (fcomplex Lie 
groups). In particular, SL(n, C) is a tsimply 
connected, tsimple, and fsemisimple complex 
Lie group of type A,_,, and PSL(n, C) is the 
‘adjoint group of the complex simple Lie 
algebra of type A,_,. 


D. Determination of the Rational 
Representations of GL(V) 


In Sections D and E, the field K is assumed to 
be of characteristic 0. Let p be a homomor- 
phism of GL(V)=GL(n, K) into GL(m, K} (p: 
A =(aj)-+ B=(B?)). Then if each £? is a rational 
function (or polynomial or analytic function) 
in (a1 ,05,...,07) over K, p is called a rational 
representation (or polynomial or analytic repre- 
sentation) of degree m of GL(V). (We suppose 
that K is R or C when we consider analytic 
functions.) For example, every rational repre- 
sentation of degree 1 can be expressed as A> 
| A|° (e is an integer). In particular, if K is the 
field C of complex numbers, every analytic 
representation of GL(n, C) is a rational repre- 
sentation. Since GL(n, C) is the tcomplexifica- 
tion of the tunitary group U (n), there exists a 
one-to-one correspondence between the com- 
plex analytic representations of GL(n, C) and 
the continuous representations of U(n); this 
correspondence preserves equivalence, irre- 
ducibility, ttensor product, and direct sum 

of the representations (— 249 Lie Groups). 
Hence, determining the rational representa- 
tions of GL(n, C) is equivalent to determining 
the continuous representations of U(n). In the 
general case, the rational representations of 
GL(V)=GL(n, K) are all completely reducible. 
For any rational representation p of GL(V), 
there exists a natural number e such that the 
representation p’: A—|A|* p(A) is a polynomial 
representation. Hence in order to determine 
the rational representations of GL(V), it is 
sufficient to determine the irreducible poly- 
nomial representations of GL(V), which, as 
described below, can be obtained by decom- 
posing the representations on the ttensor space 
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V™=V®...® V of degree m (m copies of V) 
(m=1,2,...). For AEGL(V), define D,,(A)e 
GL(V™) as the tensor product 


D,,{A)= A Q ... © A (m copies of A). 
Namely, for v,,...,0,,€V, we have 
D,,(A)(v, © ... © 0m = Av, ©... O Avy. 


The mapping GL(V)3 AD,,(A)EGL(V™) is a 
polynomial representation of degree n™ of 
GL(V). Now let 2£(V) be the tassociative 
algebra of all linear mappings of V™ into V™ 
(total matrix algebra), and let A be the sub- 
algebra of 2(V™) generated by {D,,(A)| AE 
GL(V)}. Next, for an element o of the symmet- 
ric group G,, of degree m, define B, Ee GL(V™) 
by B,(v, @ ... @ Vm) = 05-11) @ -O Valim) 
Then the mapping o— B, is a representation 
of S,, on V". Thus we obtain a representation 
t of the tgroup ring KLS] of S,, over K on 
V”: K[S,,]72(V”). Set t(K(S,,]) =B. Then 
A and 8B are fcommutors of each other in 
2(V"), i.e., A= {X eL(V™)| XB = BX (for all 
BeB)}, B={XeEL(V")| AX = XA (for all 
Aed)}. 

Now for a right ideal r of B, let r(V™) be the 
subspace of V” composed of all the finite sums 
of the form È Bx (Ber, xe V"). Then the fol- 
lowing statements hold: 

(1) r(V”) is invariant under Y; hence it is a 
subspace of V™ invariant under GL(V). Con- 
versely, for any subspace U of V™ invariant 
under GL(V), there exists a unique right ideal r 
of B such that U =r(V™). 

(2) Let r,, r, be right ideals of B, and put 
U, =r; (V), U,=1r,(V"). Then r; Sr, (as right 
S,,-modules) if and only if U, = U, (as repre- 
sentation spaces of GL(V)). 

(3) The mapping r~1(V”) is a lattice iso- 
morphism of the tlattice of right ideals of B 
onto the lattice of GL(V)-invariant subspaces 
of V”. Hence if r =r; +r, (direct sum), then 
U = U, + U, (direct sum). Also, r(V™) gives an 
irreducible representation of GL(V) if and only 
if r is a minimal right ideal of B. 

Since the algebra K[G,,] is a tsemisimple 
algebra, B can be considered as a two-sided 
ideal of K[S,,,]. Hence a minimal right ideal r 
of B is also a minimal! right ideal of K [Sn], 
and the tidempotent element e which generates 
r is a ‘primitive idempotent of K[S,,]. From 
the theory of symmetric groups (— 362 Repre- 
sentations H) the primitive idempotents of 
K[G,,] are all given (up to isomorphism) by 
*Young’s diagrams T(/,,....f) i 2h2-.-2 
f,>0, m=f,+...+f,). In this setting, we have 

(4) Let e=e(f,,...,f,) be the primitive 
idempotent determined by Young’s diagram 
T( fis -s fi). Then eK [S,,] < 8 if and only if 
k <n. In this case, put eK [S,,] =r, r(V")= 
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e(V™=V"(T(S,, ---,f,)) and denote the 
irreducible representation of GL(V) on 
VT fi- fd) by AP D(A; fis -3 Ja). We call 
(fis ---, fa) the signature of this irreducible 
representation. 

(5) The representation D(A; f,,..., fa) 1s an 
irreducible polynomial representation of 
GL(V). Furthermore, for any irreducible poly- 
nomial representation p of GL(V), there exists 
a unique D(A; fi, ---, J) equivalent to p. For ex- 
ample, if k= 1, then fi =m, e=(m!) E csn, 
and V" (T(m)) is the space of ‘symmetric ten- 
sors of degree m. If fi =...=f,=1, then k=m, 
e=(m!) t. Dc, (sgno)o, and V"(T(1,..., 1)) 
is the space of talternating tensors of degree m. 

(6) Let x(4; f,,...,f,) be the tcharacter of the 
irreducible representation D(A; fi, ..., f,). Then 


WAS fi> codi) 


el elz tee ey 
eli 62 ae gi 
pipi err’ By. i 
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gn gt 2 E i] 


where €,,...,&, are the feigenvalues of A and 
L=f,t+(n—-1), l,=f,+(n—2),...,1,=f, (set 
fusp = +. =f, =0). Hence the degree d of 
D(A; f,,.--,f,) is expressed as 


d=D(I,,...,1,)/D(n—1,..., 1,0), 


where D(x,,...,X,)=T]j;<;(x;— x). 

(7) In particular, denote the character 
of D(A; m) by Pm= Pml A). Then they satisfy 
|[—zA|'=pot+p,zt+p227 +... and 


XA; fis. Ji) 


Pr, Pr +t Pf, +in-1) 
—|Pfz-1 Py, Pf +in-2) 
Pfa=in-1) Pfaz-2) ° Ph 
where we put fe+1 =Jk+2 =- = fa =9, P-1 = 
p-2=...=0. This matrix is simply written as 


|Pi—(n—1)s «+» Pil With the convention that in 
each row, we set 1, =f, +(n—1),...,1,-1= 
hats. 


E. Determination of the Rational Represent- 
ations of SL(V) 


The rational representations of SL(V) are 
completely reducible. By restricting any 
irreducible representation D(A; fis <-s fa) 
(fi2fh2...2f,20) of GL(V) to SL(V), we 
get an irreducible rational representation 
D(A; f,,...,f,) of SL(V). Furthermore, any 
irreducible rational representation of SL(V) 
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can be obtained in this way. D(A; f,,...,f,) 
and D(A; fy,...,.f') are equivalent representa- 
tions of SL(V) if and only if fi~ firi =f — firi 
(i=1,...,n—1). 


F. Unitary Groups 


The set U(n) of all n x n tunitary matrices with 
complex elements is a group under multipli- 
cation (— 269 Matrices). This group U (n) is 
called the unitary group (or unitary transfor- 
mation group) of degree n. The subset of U(n) 
consisting of all matrices of determinant 1 is a 
normal subgroup of U(n). This group is called 
the special unitary group and is denoted by 
SU(n). 

U(n) and SU(n) are subgroups of GL(n, C) 
and SL(n, C), respectively, and can be obtained 
from these groups through the tunitary restric- 
tion. Hence they are both compact, connected 
Lie groups; in particular, SU(1) is composed 
only of the identity and U(1) is the multiplica- 
tive group of all complex numbers of absolute 
value 1. The center 3 of U(n) is the set of all 
diagonal matrices AI(AeEC, |A|= 1), and we 
have 


g=U(L), 3: SU(n)=U(n), 


U(n)/SU(n=U(1). 


Moreover, for n>2, SU (n) is a simple, semi- 
simple, and simply connected Lie group, which 
gives one of four infinite series of simple com- 
pact Lie groups. 

U(n)/3 is denoted by PU (n) and is called the 
projective unitary group. We have the relations 
PU (n)= SU (n)/3 N SU (n), 3N SU (n) = Z/nZ. 
Hence PU (n) is locally isomorphic to SU (n). 


G. Irreducible Representations of U (n) 


Restricting the irreducible representation 
D(A; fi», fa) of GL(n, C) on SU (n), we obtain 
a continuous irreducible representation of 

SU (n), and conversely, all continuous irreduc- 
ible representations of SU (n) are obtained in 
this manner. Similarly, any continuous irre- 
ducible representations of U (n) are given by 
A>] A| D(A; f,,...,f,), where e is an integer. 
Since both U (n) and SU (n) are compact, any 
continuous representation of these groups 
can be decomposed into a direct sum of the 
irreducible representations mentioned above 
(— 69 Compact Groups). 

The representation theory of U (n) and SU (n) 
is important as the most typical and concrete 
example of the representation theory of gen- 
eral compact Lie groups (— 69 Compact 
Groups, 248 Lie Algebras, 249 Lie Groups). 
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H. Unitary Groups over General Fields 


A unitary matrix and the unitary group can 
also be defined over some fields other than the 
field C of complex numbers. Namely, let P be 
a field and K a quadratic extension field of P; 
for an element € of K, let Ë be the tconjugate 
of č over P. Then a matrix of degree n with 
entries in K is called a unitary matrix of K 
(relative to P) if it leaves invariant the tHer- 
mitian form €,¢,+¢,¢,+...+¢,€,. The multi- 
plicative group consisting of all unitary mat- 
rices is called the unitary group over K (relative 
to P) and is denoted by U(n, K, P); its sub- 
group consisting of all unitary matrices of 
determinant 1 is called the special unitary 
group over K and is denoted by SU(n, K, P). 
The quotient group of SU(n, K, P) by its 
subgroup consisting of all 27 (A"=1,|A| =1) is 
called the projective special unitary group over 
K and is denoted by PSU (n, K, P). In partic- 
ular, when K and P are the finite fields F, 

and F, (4 =p"), U (n, K, P), SU (n, K, P), 

PSU (n, K, P) are written simply as U (n, q), 

SU (n, q), PSU (n, q). Then for n > 3, each 

PSU (n, q) is a noncommutative simple group, 
except for PSU (3,2) (— 151 Finite Groups I). 


I. Orthogonal Groups 


The set of all torthogonal matrices of degree n 
(with real entries) forms a group under multi- 
plication. This group O(n) is called the ortho- 
gonal group (or orthogonal transformation 
group) of degree n. The subset of O(n) consist- 
ing of all orthogonal matrices of determinant 1 
forms a normal subgroup of O(n) of index 2. 
This group SO(n) (also denoted by Ož ) is 
called the rotation group (special orthogonal 
group or proper orthogonal group) of degree n. 
Geometrically, O(n) is the set of all orthogonal 
transformations leaving a point in Euclidean 
space of dimension n fixed, and SO(n) is com- 
posed of all rotations around the point. 

Both O(n) and SO(n) are compact Lie 
groups, and SO(n) coincides with the con- 
nected component of O(n) which contains the 
identity. For n=3 or n>5, each SO(n) is a 
simple and semisimple Lie group. Following 
the theory of Lie algebras, we divide the set 
of all SO(n) (n>3 but n#4) into two classes 
according as n is even or odd, and we thus get 
two of the four infinite series of simple and 
semisimple compact Lie groups (for SO(4), for 
example, see [1]). 

Although SO(n) (n> 3) is a connected Lie 
group, it is not simply connected. The simply 
connected Lie group which is locally isomor- 
phic to SO(n) is called the spinor group and is 
denoted by Spin(n). SO(n) is isomorphic to the 
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quotient group of Spin(n) by a normal sub- 
group of order 2. Let 3 be the center of Spin(n). 
Then 3 = Z/2Z for odd n, 32 Z/4Z for n=2 
(mod 4), and 3=(Z/2Z) @ (Z/2Z) for n=0 
(mod 4) (— 61 Clifford Algebras). 

The group O(n, C) of all complex ortho- 
gonal matrices is called the complex ortho- 
gonal group, and the group SO(n, C) of all 
matrices in O(n, C) of determinant 1 is called 
the complex special orthogonal group. SO(n, C) 
(n> 3,n#4A4) is a simple and semisimple com- 
plex Lie group. 


J. Irreducible Representations of Orthogonal 
Groups 


In the same way as for GL(n, K), the irreduc- 
ible representations of O(n) can be obtained 
by decomposing the tensor product D,,(A) 
=A®...@A ofm copies of an orthogonal 
matrix A using Young’s diagram. Namely, con- 
sider the Young’s diagram T( fi, fo, ...5f,) 
such that the sum of the lengths of the first 
column and of the second column is not 
greater than n, and call it an O(n) diagram. 
Then to any O(n) diagram T=T(f,,...,f,), 
there corresponds an absolutely irreducible 
representation D°(A; fi, fo, ...,f,), and the 
representations D°(A; f,, f2, ..., f,) correspond- 
ing to two distinct O(n) diagrams are mutually 
inequivalent. D,,(A) can be decomposed into 
the direct sum of those representations D°(A; 
Sis fos- fa) such that f= fi +... + fy takes the 
values m, m—2, m—4, .... Furthermore, any 
continuous irreducible representation of O(n) 
is equivalent to a D°(A; fi, f2, ---, f,) obtained 
from some O(n) diagram T= T(f,, fo, --- fa) 
In general, two O(n) diagrams T and 7” are 
called mutually associated diagrams if the sum 
of the lengths of their first columns is equal to 
n and if the lengths of each column other than 
the first one coincide. In particular, if T= 
T(fhi>fo,--->f,) and 2k=n, then T is said to 
be self-associated. The set of all O(n) diagrams 
can be divided into pairs of mutually asso- 
ciated T, T’ (and self-associated T= T’). 
Suppose that we are given mutually asso- 
ciated diagrams T and T’ and that the length 
k of the first column of T= T(fi, fo, ....f,) IS 
not greater than n/2. Then the character 7;(A) 
of D°(A; fi, f2, ---, f) corresponding to T and 
the character y7-(A) of the irreducible repre- 
sentation corresponding to T’ are given by 


%7(A)= Peay — Pi-wt+iy 
Pi-(w—2) — Pi-w+2) +++ Pi — Pi-2vh 
Xr (4)=|A|xr(4) v=[n/2], 


where p; and |Pi-w-1)—Pi-w+1) -| have the 
same meaning as in the formula for the char- 
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acters of irreducible representations of 
GL(n, K). 

The irreducible representations of SO(n) can 
be obtained immediately from those of O(n). 
Namely, if T= T( fi; fo, ..., f,) 1s not self- 
associated, D°(A; fi, fo, ..-, fa) is irreducible as 
a representation of SO(n), and the represen- 
tations of SO(n) derived from T and the asso- 
ciated T’ coincide. If T is self-associated, 
D°(A; fis 25...» fa) can be decomposed into 
two irreducible representations of SO(n) of the 
same degree over the field of complex num- 
bers. Furthermore, the irreducible represen- 
tations of SO(n) obtained in this way from 
different pairs of associated diagrams are 
mutually inequivalent, while any continuous 
irreducible representation of SO(n) is equiva- 
lent to one of these representations. For the 
representations of SO(3) (the rotation group of 
degree 3) — 353 Racah Algebra. 

Since SO(n) is isomorphic to the quotient 
group of Spin(n) by a normal subgroup N of 
order 2, a continuous representation of Spin(n) 
which is not the identity representation on N 
can be considered as a double-valued repre- 
sentation of SO(n). This representation is called 
the spin representation and is important in the 
field of applied mathematics. 

The orthogonal group O(n) consists of all 
nxn real matrices which leave invariant the 
quadratic form €7+...+?, while the group of 
all n xn real matrices which leave invariant the 
quadratic form €?+...4+ ¢2—£?,,—...—& of 
signature (r,n—r) is called the Lorentz group 
of signature (r,n—r). The case for n=4 and r= 
3 is used in special relativity (+ 359 Relativ- 
ity). Let Gy be the connected component of the 
identity of the Lorentz group of signature 
(3, 1). Then Gg is called the proper Lorentz 
group. For o =(g;;)€G, we have |a|= +1 and 
Ga4 = | Or gag S — 1. Moreover, we have Go 
={o||ol=1, g44> 1}, G/Go =(Z/2Z) ® (Z/2Z) 
(tfour group), and Go = SL(2, C)/{ +1}. 


K. Orthogonal Groups over General Fields 


Orthogonal groups can also be defined over 
other general fields than the field of real num- 
bers as follows: Fix a tquadratic form Q(¢, č) 
=" ,=1%6;€; (lay) #0) over a field K. Then a 
linear transformation of &,(i=1, 2, ...,m) over 
K which leaves Q invariant is called an ortho- 
gonal transformation with respect to Q. The set 
of all orthogonal transformations forms a 
group. This group is denoted by O(n, K, Q) or 
simply O(Q) and is called the orthogonal (trans- 
formation) group over K with respect to Q. In 
particular, the normal subgroup of all trans- 
formations in O(n, K, Q) of determinant 1 is 
called the special orthogonal group over K with 
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respect to Q and is denoted by SO(n, K, Q) (or 
simply SO(Q)). O(n) and SO(n) are special cases 
of O(n, K, Q) and SO(n, K, Q), where K is the 
field R of real numbers and Q(€, €) is the unit 
quadratic form č? + éZ +... + €?. 

Let Q(n, K, Q) be the tcommutator subgroup 
of O(n, K, Q). Then this subgroup coincides 
with the commutator subgroup of SO(n, K, Q). 
If K is of characteristic 42, and ifn>5 and 
the tindex v of Q > 1, then Q(n, K, Q)/3 (3 is the 
center of Q(n, K,Q)) is a simple group, where 3 
= {I} or 3={ +1} (L. Dickson, J. Dieudonné). 
Suppose that K is a finite field F, (of character- 
istic #2). Then we have v=m if n=2m+ 1, and 
v=m or m—1 ifn=2m. Hence v>2 ifn>5. If 
v=0 and K =R, we have Q(n, R, Q) =SO(n) 
and, as mentioned before, SO(n)/3 is simple for 
n 25. The same proposition also holds when 
K is an talgebraic number field (M. Kneser, 
1956). If K is of characteristic 2, then O(n, K, Q) 
= SO(n, K, Q), 3= {1}, and Q(n, K, Q) is a sim- 
ple group in many cases (Dieudonné [5]). For 
the case where K is a finite field (Dickson) 

— 151 Finite Groups I. 


L. Symplectic Groups 


Let či, €5,.-.,€2, and 4,,%>,...,>, be two sets 
of variables, and suppose that the same linear 
transformation A over a field K acts on them 
(from the left). If A leaves the tbilinear form 
Èr- (Čz:-1 2; — €5;:42;_1) invariant, this linear 
transformation (or the corresponding matrix) 
A is called a symplectic transformation (sym- 
plectic matrix) of degree 2n. The set of all sym- 
plectic transformations (or matrices) of degree 
2n over K forms a group denoted by Sp(n, K) 
and called the symplectic group (symplectic 
transformation group, complex group, or 
Abelian linear group) over K. 

Any matrix in Sp(n, K) is always of determi- 
nant 1, and the center 3 of Sp(n, K) consists of 
I and —I. The quotient group PSp(n, K) of 
Sp(n, K) by 3 is called the projective symplectic 
group over K. Except for the three cases n= 1, 
K=F,;n=1, K =F;; and n=2, K=F,, the 
group PSp(n, K) (n> 1) is always simple. 


Properties of Symplectic Groups as Lie Groups. 


When K is the field C of complex numbers or 
the field R of real numbers, Sp(n, K) is a Lie 
group. The intersection of the complex sym- 
plectic group Sp(n, C) and the unitary group 
U(2n), namely, the unitary restriction of 
Sp(n, C), is denoted by Sp(n) and is called the 
unitary symplectic group (or simply symplectic 
group). Sp(n, C) is a simple and semisimple 
complex Lie group, and both Sp(n, R) and 
Sp(n) are simple and semisimple Lie groups. 
Moreover, Sp(n) is compact and simply con- 
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nected and gives one of the four series of 
simple, semisimple and compact Lie groups 
(— 249 Lie Groups). 

Let H” be the linear space of dimension n 
over the tquaternion field H. Define the inner 
product of two elements x=(x,,...,x,) and y 
= (Vy, -+> Yn) in H” by (x, y) = x17, +. + XpFn 
(y; is the tconjugate quaternion of y;), and 
consider the group of all linear transforma- 
tions which leave this inner product invar- 
iant. Then this group is isomorphic to Sp(n). 
Sp(n) is thus compared with the orthogonal 
group O(n), which leaves invariant the inner 
product of a linear space over the field R of 
real numbers and with the unitary group U(n), 
which has the same property over the field C 
of complex numbers (C. Chevalley [4, ch. 1]). 


M. Irreducible Representations of Symplectic 
Groups 


In the same way as for GL(n, K), the represen- 
tation D,,(A)=A ®... ® A (tensor product of 
m copies of A) of Sp(n, C) can be decomposed 
into irreducible components using Young’s 
diagram. Namely, for any Young’s diagram 
T=T(h,, fo, ---, f) (k <n) such that the num- 
ber k of rows is not greater than n, an irreduci- 
ble representation D*(A; f,,..., f) of Sp(n, ©) 
is determined. These D*(A; fi, ..., fa) are mutu- 
ally inequivalent, and D,,(A) can be decom- 
posed into the direct sum of representations 
D*(A; fis fo, -> f) such that f=f,+...+f, is 
equal to any of the values m, m—2, m—4,.... 
The character of D(A; fi, fa, ..., Jfa) is given by 


X1(A)= [Pints Pi-n+2 + Pi-n «+> Pi + Pi-an+al, 


where p; and |Pi-n+1sPi-n+2 + Pi- ---| have the 
same meaning as in the formula for the char- 
acters of the irreducible representations of 
GL(n, K). 

For the matrices A in Sp(n), D°(A; fis fo, ..-; 
fi) gives rise to a continuous irreducible 
representation of Sp(n). Furthermore, any con- 
tinuous irreducible representation of Sp(n) is 
equivalent to a representation D*°(A; fi, fo, ..-, 
fa) corresponding to some diagram T. 


N. Relations among Various Classical Groups 


There are some isomorphisms (homomor- 
phisms) among the classical groups mentioned 
above. For general fields K — [1,5]. For finite 
fields K — 151 Finite Groups I. For K =R or 
C, the following isomorphisms hold: SO(3)= 
SU (2)/{ +1}, SU(2)= Sp(1), SO(S) = Sp(2)/ 
{+I}, SO(6)=SO(4)/{ +1} (— 248 Lie Alge- 
bras, 249 Lie Groups). 
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O. Classical Groups over Noncommutative 
Fields 


Let V be a right linear space over a non- 
commutative field K. Then the set of all linear 
transformations of V forms a group under the 
multiplication defined by the composition of 
mappings. This group GL(V) is called the 
general linear group on V. It is isomorphic to 
the multiplicative group of all n x n invertible 
matrices with entries in K. The commutator 
subgroups SL(V) and SL(n, K) of GL(V) and 
GL(n, K), respectively, are called the special 
linear group of degree n on V and over K, 
respectively. Now, suppose that an element A 
of GL(V) leaves each element of a subspace U 
of dimension n—1 of V fixed. Choose an ele- 
ment x of V which does not belong to U, and 
set Ax = xg (mod U); xe K depends not only on 
A but also on the choice of x. However, the 
conjugate class ¢= {Aad ~! |2eK*} of a in the 
multiplicative group K* of K is determined 
only by A. In particular, if ¢={1} and A #1, 
then A is called a transvection. For a tmatrix 
unit Ej, B,(«)=1+«E,, is a transvection if 
i#j and «40. SL(V) coincides with the sub- 
group of GL(V) generated by all transvections. 
This fact also holds when K is a commutative 
field, except when n=2 and K =F,;. In this 
case, transvections generate the whole GL(2, 2), 
which is isomorphic to the symmetric group 
©, of degree 3 and does not coincide with 

the commutator subgroup. The center 3 of 
GL(n, K) consists of all scalar matrices corre- 
sponding to nonzero elements in the center of 
K. Let C be the commutator subgroup of the 
multiplicative group K* of K. Then for n> 2, 
GL(n; K)/SL(n, K) is isomorphic to K*/C. This 
isomorphism can be obtained by appropriately 
defining, for Ae GL(n, K), an element det A of 
K*/C which is called the determinant of A 

[6, 9]. The center 39 of SL(n, K) is {al |a"e C}. 
The quotient group PSL(n, K)=SL(n, K)/3 is 
called the projective special linear group of 
degree n over K. If K is a noncommutative 
field, then PSL(n, K) (n> 2) is always a simple 
group [5,8]. 

Next, let K be any field (commutative or 
noncommutative), and let V be a right linear 
space of dimension n over K. Consider a Her- 
mitian form f(x, y) (— 256 Linear Spaces) on 
V relative to an tinvolution J of K. If for a 
fixed element ¢ in the center of K we have 
f(x, y)=ef(y, x), then f is called an c-Hermitian 
form. For the rest of this article, f is assumed 
to be an e-Hermitian form on V. Let W be a 
subspace of V. If f(x, y)=0 for any x, ye W, 
then W is called a totally isotropic subspace. 
The largest dimension m of the totally iso- 
tropic subspaces of V is called the index of f. 
We always have 2m<n. If f(Ax, Ay) = f(x, y) 
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for any x, ye V, then A is called a unitary trans- 
formation relative to f. 

The set U(n, K, f) of all unitary transforma- 
tions relative to f forms a subgroup of GL(V). 
This group is called the unitary group relative 
to f. Also, the group SU (n, K, f)= U(n, K, f)N 
SL(n, K) is called the special unitary group. 
When J=1 and e=1, a unitary transformation 
(unitary group) is called an orthogonal trans- 
formation (orthogonal group), and U(n, K, f) is 
written as O(n, K, f). Also, when J =1 and 
€= — 1, a unitary transformation (unitary 
group) is called a symplectic transformation 
(symplectic group), and U (n, K, f) is written as 
Sp(n, K). In fact, in these cases, for arbitrary 
choice of f, the corresponding groups are 
mutually isomorphic. 

An -Hermitian form f is called an ¢-trace 
form if for any xe V, there exists an «e K 
which satisfies f(x, x) =œ +ga". If J=1,¢=—1 
(hence K is commutative) or £= 1 and K is of 
characteristic #2, then any e-Hermitian form 
is an -trace form. If f is an -trace form, a 
linear mapping B of any subspace W of V into 
V such that for any x, ye W, f(Bx, By) = f(x, y) 
can be extended to an element 4 of the unitary 
group U(n, K, f) relative to f (Witt’s theorem). 
In particular, U(n, K, f) acts transitively on the 
maximal totally isotropic subspaces, and their 
dimensions are equal to the index m of f. Now, 
let P be a Pythagorean ordered field (an or- 
dered field which contains square roots of 
any positive element). If K =P and J =1, or if 
K= P(/-1 ), or if the noncommutative field 
K is a *quaternion algebra over P and J is the 
operation of tconjugation of K, then for two 
Hermitian forms f, f’, their unitary groups 
U(n, K, f) and U(r’, K, f’) are isomorphic if 
and only if n=n’, and the indices of f and f’ 
are equal. In this case, U(n, K, f) can be writ- 
ten as U(n,m, K), where m is the index of f. If 
the field K is a tquaternion algebra over P and 
fis an tanti-Hermitian form, there exists an 
orthogonal basis (e;) of V such that f(e; e)=j 
(quaternion unit), 1 <i<n. Hence, in this case, 
the unitary group U (n, K, f) relative to f is 
determined only by n and K. 

Suppose that we are given an -trace form f 
over a general field K whose index m is not 
equal to 0. We exclude the case where J = 1 
and ¢=1. Then the unitary group U(n, K, f) 
contains transvections. Let T(n, K, f) denote 
the subgroup of U (n, K, f) generated by trans- 
vections which are unitary transformations. If 
m=> 2, then T(n, K, f) is the commutator sub- 
group of U(n, K, f). The center W, of T(n, K, f) 
coincides with the intersection of T(n, K, f), 
and the center 3 of GL(n, K). Ifn2>3 and K 
contains more than 25 elements, then the quo- 
tient group T(n, K, f)/W, is a simple group [6]. 
Also, if K is commutative and n>2, m>1, 
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J #1, then T(n, K, f)=SU(n, K, f), except for 
the case where n=3, K =F}. 

If K is the field R of real numbers, the field 
C of complex numbers, or the quaternion field 
H, then GL(n, K), SL(n, K), and U(n, K, f) are 
all Lie groups. In particular, SL(n, K) and 
U(n, K, f) are simple Lie groups except in the 
following three cases: (1) n=1, K =R or C; (2) 
n=2, K=R, J=1, €=1; (3)n=4, K=RorC, 
J=1,¢=1, m=2. In cases (1) and (2) they are 
commutative groups, and in case (3) they are 
locally direct sums of two noncommutative 
simple groups. 

Suppose that K =H. Since H contains C as 
a subfield, a vector space V of dimension n 
over H has the structure of a vector space of 
dimension 2n over C. From this fact, GL(n, H) 
can be considered as a subgroup of GL(2n, C) 
in a natural way. 

Each of the complex classical simple groups 
G=SL(n, C), SO(n, C), Sp(n, C) has the struc- 
ture of an talgebraic group defined over R 
(— 13 Algebraic Groups). The real forms of 
G, i.e., the algebraic subgroups of G whose 
scalar extension to C is G, can be realized as 
SL(n, K), U(n, K, f) corresponding to K =R, C, 
H. Namely, a real form of a complex classical 
group G is conjugate in G to one of the follow- 
ing groups: (i) The real forms of SL(n, C): 
SL(n, R) (type AD); SL(k, H) only for n=2k 
(type AH); and the special unitary group 
SU(n, m, C), O0<m<[n/2], relative to a Her- 
mitian form f of index m (type AIID). (ii) The 
real forms of SO(2n + 1, C): the proper ortho- 
gonal group SO(2n+ 1,m,R), O0<m<n, relative 
to a quadratic form of index m on a space of 
dimension 2n + 1 (type BI and BII). (iii) The 
real forms of SO(2n, C): SO(2n, m, R), O<m<n 
(type DI and DIT); and U (n, H, f) relative to an 
anti-Hermitian form f on H (type DIID). (iv) 
The real forms of Sp(n, C): Sp(n, R) (type Cl); 
the unitary group U (2n, m, H), 0<m<n, rela- 
tive to a Hermitian form f of index m on H 
(type CII); and Sp(n) corresponds to the special 
case m=0. The quotient groups of these real 
forms by their centers can all be realized as the 
groups of automorphisms of semisimple alge- 
bras with involutions J which commute with 
J (A. Weil [10]). 


References 


[1] B. L. van der Waerden, Gruppen von 
linearen Transformationen, Erg. Math., 
Springer, 1935 (Chelsea, 1948). 

[2] H. Weyl, The classical groups, Princeton 
Univ. Press, 1939, revised edition, 1946. 

[3] N. Bourbaki, Eléments de mathématique, 
Algèbre, ch. 9, Actualités Sci. Ind., 1272a, 
Hermann, 1959. 


230 


[4] C. Chevalley, Theory of Lie groups I, 
Princeton Univ. Press, 1946. 

[5] J. Dieudonné, Sur les groupes classiques, 
Actualités Sci. Ind., Hermann, 1948. 

[6] J. Dieudonné, La géométrie des groupes 
classiques, Springer, 1955. 

[7] C. Chevalley, The algebraic theory of 
spinors, Columbia Univ. Press, 1954. 

[8] M. Eichler, Quadratische Formen und 
orthogonale Gruppen, Springer, 1952. 

[9] E. Artin, Geometric algebra, Interscience, 
1957. 

[10] A. Weil, Algebras with involutions and 
the classical groups, J. Indian Math. Soc., 24 
(1960), 589-623. 


61 (111.16) 
Clifford Algebras 


A. Definitions and Basic Properties 


Let V be an n-dimensional tlinear space over a 
field K, and let Q be a tquadratic form on V. 
Denote the ttensor algebra over V by T(V), the 
tensor multiplication by ®. Let /(Q) be the 
two-sided ideal of T(V) generated by the ele- 
ments x ® x —Q(x): 1 (xe V). The resulting 
tquotient associative algebra T(V)/I(Q) is then 
denoted by C(Q) and is called the Clifford 
algebra of the quadratic form Q. The elements 
of C(Q) are called Clifford numbers. 

The composite of two canonical mappings 
t:V>T(V), 6: T(V)>C(Q) is a linear injection 
cot: V>C(Q). Hence we can regard V asa 
linear subspace of C(Q) via cot. Then C(Q) 
is an associative algebra over K generated 
by 1 and V. Furthermore, x? = Q(x): 1 for 
every x in V. 

Indeed, C(Q) is the universal associative 
algebra with these properties. That is, let A be 
any associative algebra with a unity element, 
and let f: VA be a linear mapping such that 
f(x)? =Q(X):1 for every x in V. Then f can be 
extended uniquely to an algebra homomor- 
phism Í :C(Q)> A with f (1)=1. Furthermore, 
let ® be the tsymmetric bilinear form associ- 
ated with 0: (x, y)=Q(x + y)—Q(x)— Q(y), 

x, ye V. Then xy+ yx = (x, y): 1 for every 
x, y in V. C(Q) is of dimension 2” over K. If 
€;,...,€, IS a basis of V, then 


1, e; ee; (i<j), -p C102 + Cn 


form a basis of C(Q). In particular, if {e;} is an 
orthogonal basis relative to Q, we have 


e? =Q(e,)- l; 
ij=1,...,7, iÆj. (1) 


e;ej= — ee}, 
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In this case, C(Q) can be defined as an as- 
sociative algebra (with a unity element) gen- 
erated by the {e,} together with the defining 
relations (1). In particular, for Q =0, C(Q) is 
the texterior algebra (tGrassmann algebra) 
over V. 


B. The Principal Automorphism and the 
Principal Antiautomorphism of C(Q) 


There exists a unique automorphism « of the 
algebra C(Q) such that a{x)= —x for every x in 
V. This automorphism «g is called the principal 
automorphism of C(Q), and we have «? = 1. 
Also, there exists a unique antiautomorphism 
B of the algebra C(Q) such that f(x)=x for 
every x in V. This antiautomorphism f is 
called the principal antiautomorphism of C(Q), 
and we have $° = 1. 

For the rest of this article we assume that 
the discriminant of Q is 40. We also assume 
for the sake of simplicity that the characteristic 
of Kis #2. LetC*=C*(Q)=K-14+V7+V*4 
+...,and C=C (Q)=V+V34+V%H+.... 
Then C(Q) is the direct sum of the linear 
subspaces C*(Q) and C7 (Q). Furthermore, 
CeCe uC C eC Co cC, and 
C7 C~ <C*. Thus C(Q)=C*+C™ has the 
structure of a tgraded algebra with the index 
group {+1}, and C* is a subalgebra of C(Q). 
The elements of C*(Q), C7 (Q) are called even 
elements and odd elements, respectively. We 
have dim C*(Q)=dimC (Q)=2""'. 


C. The Structure of C(Q) and C*(Q) 


C(Q) and C*(Q) are both tseparable, tsemi- 
simple associative algebras over K. Suppose n 
is even: n=2r. Then C(Q) is a tsimple algebra 
with K as its center; the center Z of C+ (Q) is 2- 
dimensional over K. Let e,,...,¢, be an ortho- 
gonal basis of V. Then 1 and z=2’e, ...e, form 
a basis of Z, and we have 


2? =2"(—1)'Q(e,)... O(e,) =(—1YD, 


where D is the tdiscriminant of ® relative to 
the basis {e,;}. Thus if (—1YD has a square root 
in K, Z= K @ K (direct sum), and so C*(Q) is 
decomposed into the direct sum of two simple 
algebras. If (—1)’D does not have a square 
root in K, then Z is a field and C*(Q)isa 
simple algebra. In particular, if the tindex of Q 
(i.e., the dimension of a maximal ttotally sin- 
gular subspace of V (— 348 Quadratic Forms)) 
is r, C(Q) is isomorphic to the ttotal matrix 
algebra of degree 2” over K, and C*(Q) is 
isomorphic to the direct sum of two copies of 
the total matrix algebra of degree 2’"' over K. 
Now suppose that n is odd: n=2r+1. Then 
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C*(Q) is a simple algebra with K as its center. 
(In particular, if Q is of index r, then C* (Q) is 
isomorphic to the total matrix algebra of 
degree 2’ over K.) The center Z of C(Q) is 2- 
dimensional over K, and we have C(Q)= 

Z @,x C*(O). If e,,...,e, is an orthogonal 

basis of V, then 1 and z=e, ...e, form a basis 
of Z. Putting z’=2'*'z, we have z? =2(—1)'D, 
where D is the discriminant of ® relative to 
{e:i}. Thus if 2(—1)’D has a square root in K, 
C(Q) is the direct sum of two 2*"-dimensional 
simple algebras. If 2(—1)’D has no square root 
in K, then C(Q) is a simple algebra. 


D. The Clifford Group 


Let G be the set of all invertible elements s in 
C(Q) such that sVs-' = V. Then G forms a 
group relative to the multiplication of C(Q). 
This group G is called the Clifford group 

of the quadratic form Q. The subgroup G* 
=GN C+ (Q) is called the special Clifford 
group. The linear transformation ¢(s):x— 
sxs~' of V induced by seG belongs to the 
torthogonal group O(Q) of V relative to Q. 
Moreover, the mapping s— ¢(s) is a homomor- 
phism from G into O(Q). Thus @ is a frepre- 
sentation of G on V. This representation ¢ is 
called the vector representation of G. The tker- 
nel of g consists of invertible elements in the 
center Z of C(Q). If xe GNV, then Q(x) #0 and 
— ~(x) is the reflection mapping of V relative 
to the hyperplane orthogonal to x. If n=dimV 
is odd, g(G)= y(G*)=SO(Q). If n is even, p(G) 
= 0(Q), o(G*)=SO(Q). 

Exploiting the principal antiautomorphism 
f of C(Q), we obtain a homomorphism N:G* 
— K* (the multiplicative group of K) defined 
by N(s)= B(s)s (seG*), and N(s) is called the 
spinorial norm of se¢G~*. The normal subgroup 
of G* defined as the kernel of N is denoted by 
Go and is called the reduced Clifford group (of 
Q). The subgroup (Gù ) of SO(Q) is denoted 
by Og (Q) and is called the reduced orthogonal 
group. 

In particular, when the ground field K is the 
real number field R, Oğ (Q) coincides with the 
tidentity component of the tLorentz group 
O(Q). Furthermore, if Q is definite, Of (Q)= 
SO(n), so that the identity component Spin(n) 
of Gj is a tsimply connected fcovering group 
of SO(n) via the covering homomorphism o 
(with each point in SO(n) covered twice). The 
group Spin(n) is called the spinor group of 
degree n. 


E. Spin Representations 


In this section we assume that the ground field 
K is the complex number field C and that n 
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= dim V > 3. Then we have Of (Q)= SO(n, C), 
so Gg is a simply connected covering group of 
SO(n, C) via the covering homomorphism ø. In 
this section we denote Gg by Spin(n, C) and 
call it the complex spinor group of degree n. 
Spin(n, C) is the tcomplexification (— 249 Lie 
Groups) of the compact Lie group Spin(n) and 
is a complex analytic subgroup of the tcom- 
plex Lie group C(Q)* consisting of all inver- 
tible elements of C(Q). With the bracket 
operation [x, y] =xy— yx, C(Q) becomes the 
tLie algebra of C(Q)*. Furthermore, the Lie 
subalgebra of C(Q) associated with the com- 
plex analytic subgroup Spin(n, C) is given by 
Z<; Cee, where e,,...,e, is an orthogonal 
basis of V. The spin representations of the 
group Spin(n, C) are defined as follows: 

(1) When n is odd: n=2r + 1. Since C* (Q) is 
isomorphic to a total matrix algebra of degree 
2’ over C, C*(Q) has a unique (up to equiva- 
lence) tirreducible representation ø, which is of 
degree 2’. The restriction of 6 on Spin(n, C) (on 
Spin(n)) defines an irreducible representation p 
of degree 2” of Spin(n, C) (of Spin(n)); p is called 
the spin representation of the group Spin(n, C) 
(of Spin(n)). The elements in the representation 
space of p are called spinors. Thus we can say 
that a spinor is a quantity with 2’ components 
which obey the transformation law according 
to the spin representation (— 258 Lorentz 
Group). This representation p defines a rep- 
resentation of the Lie algebra so(n, C) of 
Spin(n, C) (note that so(n, C) is a tcomplex 
simple Lie algebra of type B,). This represen- 
tation of so(n, C) is also called the spin repre- 
sentation of so(n, C). Note that p is not well 
defined on SO(n, C) or on SO(n); p is of valence 
2 on SO(n, C) or on SO(n). 

(2) When n is even: n= 2r. Since C(Q) is 
isomorphic to a total matrix algebra of degree 
2’ over C, C(Q) has a unique (up to equiva- 
lence) irreducible representation 6, which is of 
degree 2”. The restriction of 6 on Spin(n, C) (on 
Spin(n)) defines a representation p of degree 2” 
of Spin(n, C) (of Spin(n)); p is called the spin 
representation of the group Spin(n, C) (of 
Spin(n)). This representation p is, however, not 
irreducible; p is decomposed into the direct 
sum of two irreducible representations p, and 
p_. They are not equivalent to each other, and 
both are of degree 2’"'. By taking a suitable 
minimal left ideal L of C(Q) as the representa- 
tion space of the representation 6, we obtain 
the representation spaces L*, L7 of p4, p-, 
respectively, by putting L* = LN C* (Q) and 
L~ =LNC (Q). The representation p* (or p`) 
is called the half-spin representation of the 
group Spin(n, C) or of the group Spin(n). The 
elements in the representation space of p* (or 
p`) are called half-spinors. Again, p~ and p~ 
are not well defined on SO(n, C) or on SO(n). 
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They are of valence 2 on these groups. The 
representations of the Lie algebra of the Lie 
group SO(n, C) (note that this Lie algebra is a 
tcomplex simple Lie algebra of type D,) asso- 
ciated with p*, p~ are also called half-spin 
representations of this Lie algebra. 
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A. Cluster Sets of Functions Meromorphic in 
an Arbitrary Domain 


Let D be an arbitrary domain in the complex 
z-plane, I its boundary, and E a ttotally dis- 
connected closed set contained in I’. Let w= 
f(z) be a single-valued tmeromorphic func- 
tion defined in D. Then for each point zg in F, 
we can define the following sets related to the 
mapping w= f(z) in the complex w-plane (or 
on the complex w-sphere C). 


The Cluster Set. A value « is called a cluster 
value of f(z) at zo if there exists a sequence of 
points {z„} such that 


F (Zp) >a. 


The totality Cp( f, Zo) of all the cluster values of 
f(z) at zo is called the cluster set of f(z) at zo 
or, more precisely, the interior cluster set. It is 
a nonempty, closed, but not necessarily con- 
nected set. 


Z,€D, 2,20, 


The Boundary Cluster Set. The set of all values 
a such that there exist a sequence of points 
feaz of T — {Zo} (resp. F — {zo} — E) and a se- 
quence of points {w,} in the complex w-plane 
satisfying 
EnZo WaEColS En) Wa 


is called the boundary cluster set of f(z) at zo 
and is denoted by Cr (f, Zo) (resp. Cr-el f, Zo)). 
These are closed sets, and 


Cr-el f 20) < Crh 20) € Col f Zo). 


If zoel — E or zy is an isolated point of E, then 
Cr-el f. Zo) =Cr( f Zo). Furthermore, Cr f, Zo) 
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(resp. Cr-g( J, Zo)) is empty if and only if Zp is 
an isolated boundary point (resp. zp) is an 
exterior point of I — E). 


Range of Values. The set of values « such that 


D 


is called the range of values of f(z) at zọ and is 
denoted by Rp(f, Zo). In other words, Rp( f, Zo) 
is the set of values « assumed by f(z) infinitely 
often in any neighborhood of zo and is a tG;- 
set. 


ZED, 2425 


The Asymptotic Set. Let z, be an taccessible 
boundary point of D. If f(z) converges to a 
value « as z tends to Zo along a simple arc in D 
terminating at Zo, then « is called an asymp- 
totic value of f at zo. The totality Ap{f Zo) 

of asymptotic values of f at zo is called the 
asymptotic set of f at Zo. If zp is an inacces- 
sible boundary point, we let Ap{ f, Zo) be the 
empty set. 


B. Iversen-Beurling-Kunugui Theorems 


Suppose now that E is empty, and put 
Q= Cpl f; Zo) — Cf Zo). 


tIversen’s theorems in the case where D is the 
unit disk |z|< 1 and Zp is a point on |z|=1 are 
generalized as follows. 
(1) Q— Rpl f, Zo) =€ Ap S, Zo) (K. Noshiro, 1936). 
(2) First Beurling-Kunugui theorem: If zo is 
not an isolated boundary point, then Q is an 
open set. 
(3) Second Beurling-Kunugui theorem: Sup- 
pose that the open set Q is not empty. Then 
f(z) assumes every value belonging to each 
component Q,, of Q infinitely often, with two 
possible exceptions belonging to Q,, that is, 
Q,(C — Rp(f, Zo)) consists of at most two 
values (an extension of tPicard’s theorem on 
an isolated essential singularity). 

Next suppose that E is not empty and of 
tlogarithmic capacity zero, and put Q= 
Colf, 20) — Cre f, Zo). If we Q— Rolf, Zo), then 
either « is an asymptotic value of f(z) at zo or 
there exists a sequence of points ¢,¢ E (n= 
1,2,...) converging to z, such that « is an 
asymptotic value of f(z) at each C, (Noshiro, 
1937). Furthermore, if Zo is contained in the 
closure of I — E, then Q is an open set (which 
may be empty), and Q— Rp( f, Zo) is at most of 
logarithmic capacity zero (M. Tsuji, 1943). If E 
is contained in a single component I of the 
boundary T, zp is contained in the closure of T 
— E, and Q is nonempty, then w= f(z) assumes 
every value belonging to each component Q, 
of Q infinitely often, with two possible excep- 
tions belonging to Q, (Noshiro, 1950). In par- 
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ticular, if f is bounded in a neighborhood of 
Zo, the number of such exceptional values is at 
most 1. This is still true if each point of E is 
contained in a boundary component that is a 
tcontinuum containing at least two points and 
Q is not empty (M. Herve, 1955). However, 
this conclusion does not hold if we remove the 
hypothesis on the set E (K. Matsumoto, 1960). 


C. Cluster Sets of Functions Meromorphic in 
the Unit Disk 


Let D be the unit disk {|z|<1}, zp =e" bea 
fixed point on the unit circumference T, A be 
an open arc of F containing zo, and E be a set 
of tlinear measure zero such that zoe Ec A. 
With every e® e A—E we associate an arbi- 
trary simple arc A, in D terminating at e® and 
the curvilinear cluster set C,,(, e”), defined as 
the set of all values w such that z„€ Ay, Z, >€", 
f(z) 7%. We put 
Ch elf Zo) = N M,, 

r>0 
where M, denotes the closure of the union 
Cy, (fe) for all e" in the intersection of A — E 
with |z— zo| <r. 

By using the above cluster set C#_;-(f Zo) 
instead of the boundary cluster set Cr-g( f, Zo), 
we obtain results similar to those in the pre- 
ceding section (M. Ohtsuka, 1950; Noshiro, 
1955). Many interesting results have been 
obtained by F. Bagemihl and W. Seidel, E. F. 
Collingwood, O. Lehto and K. I. Virtanen, 
and others concerning the cluster sets of func- 
tions meromorphic in the unit disk. They 
studied functions of Seidel’s class U, normal 
meromorphic functions, and other functions 
where the class U is the totality of regular 
bounded functions in the unit disk possessing 
almost everywhere on |z|=1 radial limits of 
the constant modulus 1; a nonconstant mero- 
morphic function f(z) in |z|< 1 is normal if 
the family { f(T(z))} is tnormal in the sense 
of Montel, where T(z) is an arbitrary con- 
formal mapping of |z|< 1 onto itself. 


D. The More General Case 


The definitions of cluster sets are also available 
for arbitrary functions for which neither ana- 
lyticity nor continuity is assumed. If there exist 
two simple arcs A, and A, in the unit disk D 
terminating at a point z=e” such that 


Crh e*)N Crh e®)= Ø, 


then z=e* is called an ambiguous point of f. 
Bagemihl proved the following: Fhe set of 
ambiguous points of an arbitrary complex- 
valued function defined in the unit disk D is at 
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most countable [5]. Under the same hypoth- 
esis, the set of points e? such that Cp(f,e”) # 
C-(f, e®) is at most countable (Collingwood, 
1960). This result shows the importance of 
introducing the cluster set C#_,(f, Zo) that was 
previously mentioned. 


E. History 


The theory of cluster sets originated from the 
tvalue distribution theory of analytic functions 
in the neighborhood of their essential singular- 
ities. The first systematic results were those 

of F. Iversen and W. Gross, obtained about 
1920. Subsequent significant contributions 
were made by Seidel, J. L. Doob, M. L. Cart- 
wright, A. Beurling, and others. Since 1940, 
some important results have been obtained 

by K. Kunugui, S. Irie, Y. T6ki, Y. Tumura, 

S. Kametani, Tsuji, Noshiro, and other Japa- 
nese mathematicians. Many results have been 
extended to tpseudoanalytic functions. As can 
be seen from the Bagemthl ambiguous point 
theorem, some properties of cluster sets are 
not intrinsic to analytic mappings [2, 5]. On 
the other hand, it seems to be an interesting 
problem to extend the theory of cluster sets to 
the case of analytic mappings between open 
Riemann surfaces. 
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A. General Notions of Coding Theory 


When we wish to store, to search for, or to 
send information in the presence of noise 
efficiently and with the least error, we can 
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apply various bounds to the efficiency (— 213 
Information theory). Sophisticated coding 
operations are required in order to achieve 
efficiencies as close as possible to the bounds. 
In the abstract sense, the information is gener- 
ally understood to be a choice of an element 
from a finite set X. For implementation, we 
take a set K of q elements called alphabets. 
Each element of K is called a letter. We con- 
sider the direct product K”, i.e., the set of all 
sequences of n letters. An injection y from X 
into K” is called encoding. The sequence (x) 
for xe X is called a code word, and the image 
W(X) (a collection of code words) is called a 
code. Such a code is also called a block code. 
The noise is represented by a mapping œw: K” 
— K", but usually it is restricted to a certain 
subset of the set of such mappings with 
special properties. For example, we usually 
assume that the sequences xe K” and (x) are 
different only at less than d letters, where d is a 
preassigned constant. The inverse mapping 9, 
i.e., a mapping ọ:K”—>X satisfying pow(x)=x 
for all xe X, is called the decoding of y. A code 
y satisfying the property wow(x)¢W(X) for all 
xeéX and for all weQ is called error-detecting 
with respect to the noise Q. If y has the decod- 
ing ọ satisfying powo(x)=x for all xe X 
and for all weQ, y is called error-correcting 
with respect to the noise Q. To discuss such 
properties, we can assume X = W(X) (c K”) ` 
without restricting the generality, so hereafter 
we assume this condition. Also, we take g=2 
unless explicitly stated otherwise. Because 

in many communication systems q=2 is 
commonly adopted, and the generalization 

for other prime powers q can be obtained 
naturally. 


B. Bounds for the Size of Codes 


Let x=(x,,.--, Xn) V=(V1, -3 Yn E K". The 
Hamming distance d(x, y) = dy(x, y) between 
the elements x and y is the total number 

of unequal bits (x;#y,). We put dmin = 

min {dy(x, y)|x, yeX,x#y}. When e= 

max {dy(x, w(x))|@EQ, xe X} is less than dain» 
the errors due to Q can be detected, and if 
dnin = 2 + 1, the errors can be corrected. The 
maximal integer t satisfying d min > 2t + 1 is 
often called the error-correcting capability. 
Several important relations are known among 
daim t n (the length of the code), and |X| (the 
size of the code word) as follows. Hamming 
bound: | X |< 2"/>'_, (7). A code satisfying the 
equality here is called a perfect code [6]. Plot- 
kin bound: d,,;,,<1|X |/(2(|X|—1)). On the 
other hand, if the Varsharmov-Gilbert-Sacks 
bound 2” *> S4m»"2(""1) is satisfied, there 
exists an (n, k)-linear code (Section C). 
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C. Linear Codes 


Let K =GF(2)= {0, 1}, K” be an n-dimensional 
vector space over K, and X be a k-dimensional 
linear subspace. Then X is called a group code 
or (n, k)-linear code. In the present case, we can 
take a suitable basis of K for which there exists 
a k x(n—k) matrix P over K such that every 
vector x =(x,,...,X,) representing a code word 
is expressed as x =(zP,z) by a suitable vector zZ, 
and conversely, every vector of this form is a 
code word, i.e., a vector xe X is a code word if 
and only if 


»X,)P=0. (*) 


(xi, E E E O A E aoe 


Introducing the matrix H=[J,_,, — P7], where 
I„— is the unit matrix of order n—k and 7 
indicates the transpose, the condition (*) is 
equivalent to xH” =0. Therefore, H is called 
the parity check matrix, X,,_,4),...,X, are 
called the information bits, and x,,...,x,_; 

are called the check bits. If x is deformed to 

y = ox by a noise œ, i.e., the original signal x 

is transmitted as signal y, we calls=yH? = 
(y—x)HT the syndrome of the transmitted 
signal y. If we have an algorithm to determine 
the error vector e= y—x from the syndrome s, 
we have a decoding with error-correcting 
property. For the linear code, it is evident that 
dmin = Min {the number of nonzero elements of 
x|x(#0)e X}. 

The Hamming code is given by n=2”—1 {m 
being an integer > 2), k=n— m, where the (i, j)- 
component h; (i=1,...,m;j=1,...,n) of the 
parity check matrix H is given by the ith bit of 
the number j-— 1 expressed in the binary (2- 
adic) number system. The check bits are x; 
(j=2°,2!,...,2"71), and all other bits are the 
information bits. This code has the error-cor- 
recting property with respect to the noise w 
for which the nonzero component of e = wx — 
x is at most 1. In fact, from the syndrome s= 
(Sy, ..-55m), We Compute j=s,+2s,+...4 
2”-'s,, and put e;= 1, e,=0 for all | #j: if j=0, 
we can put e=0. The Hamming code is a per- 
fect code. 


D. Cyclic Codes 


The cyclic code is a special case of a linear 
code. This is the object of one of the most 
important applications of the theory of finite 
fields. By using this theory, we can actually 
construct the codes with high for error- 
correcting capability in which the encoding 
and the decoding operations are performed 
algebraically. 

An (n,k)-linear code X over GF(2) is called a 
cyclic code if x =(x,,X>,-..,X,)€X implies (x,, 


Xis Aysa Nyai) (Xn-1> Xn Xis -> Meee riggs 
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-Xp X1)E€ X. To each xe X, we can associate 
a polynomial of one variable x()=x,+x,€+ 
ve +X,6""| over GF(2). Let gy(é) be the poly- 
nomial of the least degree n—k among the 
polynomials corresponding to xe X. gx(€) is 
called the generator of X, because the property 
xe X and the property that x(€) is divisible by 
gx(é) are equivalent. g,(é) also divides č" — 1. 
Let the quotient be hy(€)=(E"— 1)/gy(é). Then 
xéX is equivalent to x(é)hy(2)=0 (mod(é" — 
1)). hy(€) is called the parity check polynomial. 
The BCH code (Bose-Chaudhuri-Hocquenghem 
code) defined below is a typical example of a 
cyclic code. 

Let « be an element of order n in GF(2”) 
(the finite field with 2” elements), and let g,(Z) 
be the polynomial of the lowest degree in 
GF (2)[é] for which a’ is a root (i= 1, 3,..., 
2t— 1). The BCH code is given by the least 
common multiple g(é) of the polynomials 
gı (é), 93(€), ---, 921-1 (č). The BCH code has at 
least n— mt information bits and satisfies 
Amin 2 2t + 1, and algebraic decoding methods 
are known [3,5]. The Hamming code is a 
BCH code when t=1, n=2"—1. 


E. Other Codes 


Other important types of codes include con- 
volutional codes (not block codes) for correct- 
ing errors in consecutive digits (burst errors) 
and Goppa codes (an extension of the BCH 
class). For correcting burst errors, special 
cyclic codes or their mixtures are also used. 
This field of research is closely connected to 
information theory, algebra, and various ap- 
plications of combinatorial analysis, such as 
texperimental design [3]. 
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Cohomology Operations 


A. General Remarks 


The notion of cohomology operations was 
introduced by L. S. Pontryagin and N. E. 
Steenrod in order to solve homotopy classifi- 
cation problems (— 202 Homotopy Theory). 
Since then, numerous works have proved the 
importance of cohomology operations as 
applied to thomotopy theory, ‘differential 
topology, and other branches of topology. In 
fact, the use of cohomology operations is 
indispensable in studying problems related to 
thomotopy groups, characteristic classes of 
manifolds, etc. 

We denote by the symbol H*(X; A)= 
» H"(X; A) the tsingular cohomology ring of 
a topological space X with coefficients in an 
tAbelian group A. 


B. Primary Cohomology Operations 


A (primary) cohomology operation (or simply 
an operation) ¢ is a tnatural transformation 


o:a Aa) >] [ H" ;B,) 


between the cohomology functors defined on 
the tcategory of topological spaces and con- 
tinuous mappings. That is, ọ is a family of 
mappings satisfying the following conditions: 
(1) For each space X, ọ defines a mapping 


p:| | H"(X; A,)>[] H"4X; B,) 
À á u 


that is not necessarily additive. 

(2) For each mapping f: X >Y, the commu- 
tativity f*o@=q@of* holds in the diagram 
[| HX; 4)ST] W(X; B) 

À 1 H 1 
[T 4"(¥; 4)>]] H”(Y; B). 
A u 


We list here two trivial examples. 

(I) Addition of cohomology groups deter- 
mines an operation @: H'(X; A) x H'(X; A)> 
H'(X; A). 

(II) The tcup product determines an 
operation 


p:H"(X;A,)x H??(X; A,)> H"*2(X; Ay ® A2) 
denoted by 


g(a, B)=a~ B. 

The composite of two cohomology opera- 
tions is defined in the obvious way. Among 
cohomology operations the most important 
ones are operations of one variable. A co- 
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homology operation of type (A, l; B, m) is a 
natural transformation 


gp: H'( ;A)>H"( ; B). 


D(A, l; B,m) denotes the Abelian group consist- 
ing of all such operations. 

Denote by H(A, l; B) the mth tcohomology 
group of an tEilenberg-MacLane space K(A, 1), 
and let ue H'(A, I; A) be the *fundamental class 
of K(A, 1). If X is a CW complex, by assigning 
f*u to each f:X >K(A,1), we obtain a one- 
to-one correspondence between the set of the 
homotopy classes z(X, K(A, D) and the co- 
homology group H'(X; A) (> 70 Complexes 
F). Hence by utilizing condition (2), it can 
be shown that the value of ọ on H'(X; A) is 
uniquely determined by its operation on 
H'(A, l; A). Thus the assignment p> pu defines 
the isomorphism O(A, l; B, m) = H” (A, l; B). 
Here we should remark that the isomorphism 
n(K(A, 1), K(B,m))= H(A, l; B) defines a one- 
to-one correspondence O(A, l; B,m)>n(K(A, l), 
K(B,m)). In some cases H(A, |; B) vanishes, 
for example, when 0<m<I; l=] <m, A=Z; 
l=2, m=2w +1, A=Z;1=2I'<m=2m' +1, 
A=Z:B=Q;1=2l'+1<m, A=Z, B=Q;A 
is finite, B= Q; ete. 

The following four types of operations to- 
gether with the two above are called elemen- 
tary operations: 

(I) Homomorphisms induced by a coeffi- 
cient homomorphism: There are homomor- 
phisms 4, : H'(X; A) H'(X; B) induced by a 
homomorphism y: A> B. 

(IV) Bockstein (cohomology) operation: This 
operation is given by the tconnecting homo- 
morphisms 6*: H'(X;C)-H'*!(X; A) asso- 
ciated with a short exact sequence 0> A> B- 
C0 of coefficient groups (— 200 Homo- 
logical Algebra). For example, the coefficient 
sequence 0+ Z4Z—Z,, 0 (Z, = Z/nZ) defines 
a Bockstein operation (or Bockstein homomor- 
phism), which is usually denoted by (1/n)6 or 
Ap- 

(V) Steenrod (or reduced) square operations 
Sq’ (i=0):Sq' are sequences of operations 
defined by the following five axioms [2, 5]: 

(V1) For each pair of integers i>0 and />0, 


Sqi: H'(X; Z,)> H'*(X;Z,) 


is a natural transformation of functors that is a 


homomorphism. 

(V2) Sq°=1. 

(V3) If degx =i, then Sq'x=x~ x (cup 
product). 

(V4) If degx <i, then Sq'x=0. 

(V5) (Cartan formula) 


q'(x~y)= 2 Sq/x~ Sq'y. 
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These five axioms imply the following two 
formulas: 

(V6) Sq is the Bockstein operation $, of the 
coefficient sequence 0>Z,>Z,-+Z,—0. 

(V7) (Adem relations) If 0<i< 2/, then 


Sqio Sqi= ix a l sato Sq* 
Ko \i—2k 
The binomial coefficient is taken mod 2. 

We can extend the definition of the Sq' so 
that they operate on the relative cohomology 
groups. Now (V1), (V2), and (V5) imply: 

(V8) If ô: H'(Y;Z,)>H'*!(X, Y;Z,) is the 
tcoboundary homomorphism, then ôo Sq' = 
Sqio6. 

(V’) Steenrod pth power operations 7 (i> 0): 
Let p be an odd prime. Then #' is a sequence 
of operations defined by the following five 
axioms [5]. 

(V1) For each pair of integers i>0 and />0, 


P: H(X; Z,)> H'*7#-Y(X; Z,) 


is a natural transformation that is a homo- 
morphism. 

(V2) P°=1. 

(V3) If degx = 2k, then P*x = x”, 

(V'4) If deg x <2k, then 7*x =0. 

(V’S) (Cartan’s formula) 


Pix y=} Pix Piy. 
j=0 

These axioms imply the Adem relations for 
# and the Bockstein homomorphism £, asso- 
ciated with the coefficient sequence 0>Z,> 
Z,2>Z, 0 (— Appendix A, Table 6.11). 

We can extend the definition of to the 
relative cohomology groups too, and we ob- 
tain d0oP=F'06. 

(VI) Pontryagin pth power operations .. 
Let p be a prime. $, is a system of operations 
satisfying the following five conditions [3]: 

(VI1) For each pair of integers />0 and 
h>1, 


P,:H'(X; Zp)—>H”'(X; Z p+) 


is a natural transformation. 
(VI2) If 7: Zp —> Zp is a homomorphism 


defined by n(1)= 1, then we have y, 0 P,x =x”. 


(VI3) If p:Z p >Z is a homomorphism 
defined by p(1)= p, then we have 


P, (x+y) 


p-l 
=P, x+ P y+ 2 (Or) ye 


(VI4) B(x ~ y)= P, x — By. 

(VIS) If p>2 and degx=2k+ 1 (odd), then 
$x =0. 

Let A, B be finitely generated Abelian 
groups. Then the computation of H*(A, 1) 
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shows that each element of O(A, I; B, m) can be 
written as the composite of a finite number 

of the operations of the types (I)-(VI) (H. 
Cartan). 

Here we show some examples. Let u be the 
fundamental class of K(Z,, 2). Then the ele- 
ments of H*(Z,,2;Z,)=Z», H? (Z3, 2; Z4) = 
Z, H*(Z,, 2; Z4) = Z4, and H5(Z,, 2;Z4)= 
Z, + Z, correspond to the cohomology opera- 
tions A2,, ån O (8/2), P2, and un, o(ô/4)o 
P, +12,0 8q? 0 Sq!, respectively, where A, u 
denote integers satisfying 0<4 <1, 0< 
u<3and ņ:Z>Z,,2:Z, >Z, stand for the 
homomorphisms defined by n(1)=1, 2(1)=2, 
respectively. 

Suppose that we are given sets of integers l; 
and m,. We define the stable (primary) coho- 
mology operation p with respect to these sets 
of integers l; and m; as a system of natural 
transformations 


p: [R ; A>] H"*"™( ; B,) 
A H 


satisfying the following condition, for all 
integers n>0; 

(3) Let S: H'*! (SX; A)> H'(X; A) denote the 
tsuspension. Then the commutativity S o ọ = 
pos holds in the diagram 


I] H"*4(X; A) ST] H"*™9(X; B) 

: ts * 18 

[ [ah {SX; ASTI He (SX; B). 
A H 


This condition is equivalent to the commuta- 
tivity with the coboundary homomorphisms. 
For example, the cohomology operations 


(—1)"B,: H"(X; Z)> H" (X; Z) 


define a stable cohomology operation £. Sq’, 
Pt are also examples of stable cohomology 
operations. 

A stable cohomology operation ọ of type 
(A, B) and of degree q is a sequence of coho- 
mology operations of type (A, n, B,n +q) de- 
fined for all integers n>0. (A, B), denotes the 
Abelian group consisting of all stable coho- 
mology operations of type (A, B) and of degree 
q. When A= B, W(A)= DF» UA, A), is a 
tgraded algebra, where multiplication of two 
operations is given by their composition. Let p 
be a prime. Then A(Z) is called the Steenrod 
algebra mod p and is denoted by Y(p). (2) is 
the augmented graded algebra over Z, gen- 
erated by Sq' subject to the Adem relations. 
Suppose that we are given a sequence of non- 
negative integers I =(i,,i,,...,i,). We call Z 
an admissible sequence if i,_, > 2i, holds for 
2<s<k. We write Sq! =Sq'' o Sqi20...0Sq't. If 
I is an admissible sequence, we say that Sq! is 
an admissible monomial. The admissible mono- 
mials form an additive basis for 2{(2), which 
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has the structure of a tHopf algebra whose 
fcomultiplication y : W(2) + W(2) ® U(2) is 
given by WSq' = Xi. Sq/ ® Sq‘. The dual 
space 2(2)* = Homz (2 (2), Z2) gives a Hopf 
algebra that is the polynomial algebra gen- 
erated by é; of degree 2'—1, where €; is the dual 
element of Sq?’ Sq?" ... Sq! with respect 

to the additive basis. The comultiplication 

p* :A(2)* + W(2)* @ A(2)* is given by p*é; 
ayer Se Ci: 

QI(p) has properties similar to Y(2) (p is an 
odd prime). In particular, (p) is a Hopf 
algebra generated by #! and f subject to the 
Adem relations with comultiplication y given 
by WB=6@14+1@ Band pP'=Yj-.P/ Q 
P>, An additive basis of W(p) is given by 
{ Bo P': pe... PB), where ¢,=0 or 1, i,,> 
Pim+1 t&m (m= 1) and i, > 1. Such a sequence 
1 =(€9, 41,6), -> igs &) is also called an admis- 
sible sequence. Denote the dual element of 
PPE? P and PPP... P! B by či 
and 7;, respectively; then the dual algebra 
QI(p)* is isomorphic to the tensor product of 
the polynomial ring over Z, generated by ¢,, 
€,,... and the exterior algebra over Z, gen- 
erated by t,, t,.... The comultiplication @* is 
given by p*(¢)) = Li-06?-;@ Gj and g*(t,)= 
7,@ 14+ Dio EP; @ r [5] (— Appendix A, 
Table 6.1ID). 


C. Secondary Cohomology Operations 


Here we restrict our attention to a special 
type of operation treated by J. F. Adams [4] 
that has been proved to be powerful in 
applications. 

For a specified prime p, we write H*(X)= 

£ H'(X;Z,). We can regard H*(X)asa 
tgraded left module over QI(p). Now let C, 
(s=0, 1) be a pair of tleft free modules over 
YI (p), and let d:C, +C, be a graded homomor- 
phism over Ap). Suppose that we are given an 
element ze C, such that dz =0. For future pur- 
poses, Cy and C, are assumed to have bases 
{co,,} and {c,_,} with degcy ,=1, and dege, , 
=m, respectively, in terms of which d has the 
representation dc, = Li,4,,4Co,,- Then z is 
expressed in the form z= $, bpCi p 

We say that ® is a stable secondary coho- 
mology operation associated with the pair (d, z) 
if it satisfies the following four axioms: 

(1) Let D"(d, X) be the module consisting of 
homomorphisms ¢:Cy—> H *(X) over U(p) of 
degree n such that ed=0. Putting e(cg ,)=X,, 
we can assume that D"(d, X)={e=I],x,€ 
T1,H"*"(X)| Ea, 2X, =0}. Let Q"(z, X) be 
the submodule of H *(X) consisting of ele- 
ments of the form &(z), where č is an Q(p)- 
homomorphism: C, >H +(X) of degree n— 1. 
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In other words, Q"(z, X)= È „b, H" ™ *(X). 
Then for each n>0 and each X, Pisa 
mapping 

®:D”(d, X)—> H +(X)/Q”"(z, X). 


(2) For each mapping f:X > Y, the commu- 
tativity f* oD=@o f* holds in the diagram 


D”(d, X)Š H+ (X)/Q"(z, X) 
tf 1 
D"(d, Y)>H*(Y/O"(z, Y). 


(3) Let S: H"*!(SX)—>H"(X) denote the 
suspension. Then the commutativity So®= 
MoS holds in the diagram 


D"(d, X)>H*(X)/Q"(z, X) 
TS Ts 
D"*! (d, SX) H*(SX)/O"*"(z, SX). 


(4) Let i: YX be an injection such that 
i* oe=0. That is, i*x,=0. We can then find 
homomorphisms (over A(p))}n:Co>H +(X, Y) 
of degree n and (:C,>H*(Y) of degree n— 1 
such that the following diagram commutes: 
H'(Y)ËH+(X)ËH (X, Y)ËH*(Y)ËH +(X) 

tn 1 
E 
Then for any such pair (4, ¢) we have i* o Pe = 
€zmod i*Q"(z, X), where 6* denotes the homo- 
morphism defined by ô*y =(— 1)"'™” dy. 

For each pair (d, z), there is at least one 
associated operation ®. Existence is proved by 
means of a tPostnikov system. Let ® and ®’ be 
two operations associated with the same (d, z). 
Then they differ by a stable primary operation 
in the sense that there is an element ~eEC,/dC, 
such that O’e= Pe + pe mod Q(z, X), where 
p=>,4,Co,, moddC, means pe= La, x}. 

For example, define the action of Y(2) on 
Z, by the rule a-v=0 if dega>0 and l-v=y 
for each ve Z,. We consider a minimal projec- 
tive resolution 


E d d 
0-Z, -CC 4C,- yak 


(— 200 Homological Algebra). First we take 
Cy = U(2) and define e(1)=1. Next we take C, 
to be free over (2) with generators c;, where 
0<iand d,c,=Sq”. Furthermore, we take C, 
to be free over A(2) with generators c; ; with 
O<i<j,j#i+1 and 


d5(c;,))=Sq? ¢j+ yj Ree 
O<k<j 


with b;, ; x€ A(2). Here the d,c;=Sq” form 

a minimal set of generators of 21(2), and 

the equations 0=d; dzc; ;= Sq? 0 Sq” + 
Lo<x<jbj,;,x5q" form a minimal set of gen- 
erators over 2[(2) of the Adem relations. Let 
C,(j) be the submodule over W(2) generated 
by c, with 0<k <j in C,, and let d,(j) be the 
restriction of d, on C,(j). We write z; ,=d¢;,;. 
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Let ®; ; denote an operation associated with 
(d,(J), 2;,). Then ®, ; is defined on the sub- 
module D; (X) consisting of elements xe 
H"(X) such that Sq?*x =0 for O0<k <j and 
takes values in H"*?'*?’-1(X) modulo the 
submodule 


PAX \= Sq? H NY) 
$ X b; ; pH" 1X). 


o<k<j 

For example, ®p o: Ker 8, —> Cok $, is the 
generalized Bockstein homomorphism defined 
by 6/2? and ®, , is the operation discovered 
by J. Adem that appears as the ftertiary ob- 
struction of S” for n>2. If k>3, we have the 
relation 


(moa 5 aQ). 
O<i<j<k 


J#it1 


These formulas can be applied to prove the 
nonexistence of an element with fHopf- 
invariant 1 in z,,_,(S") unless n= 1,2,4,8 
(Adams [4]). Analogous formulas also hold 
for Wp) with p> 2. We have no satisfactory 
theories concerning cohomology operations of 
orders higher than 2. For cohomology oper- 
ations in generalized cohomology theory — 
237 K-Theory. 
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A. Introduction 


First of all, we explain some notions in geo- 
metric combinatorial topology, also called PL 
(piecewise linear) topology, in order to distin- 
guish between the usual topological view- 
point and the combinatorial (or PL) one. By 
an n-dimensional simplicial complex K we 
mean an n-dimensional, tlocally finite and 
trectilinear simplicial complex in the Euclidean 
N-space R, i.e., an n-dimensional *Euclidean 
simplicial complex (= 70 Complexes). The 
subspace P =|K| of RY covered by all sim- 
plexes of K is called an n-dimensional poly- 
hedron, and K is referred to as a simplicial 
division or simply a division of P. A poly- 
hedron covered by a fsubcomplex of a division 
of P is called a subpolyhedron of P. A poly- 
hedron contained in a polyhedron P is a sub- 
polyhedron of P if and only if it is a closed 
subset of P. An open subset of a polyhedron P 
is always a polyhedron but not necessarily a 
subpolyhedron of P [13]. For arbitrary divi- 
sions K, and K, of a polyhedron there is a 
common ‘subdivision of K, and K,. Thus a 
property of a simplicial complex K which is 
invariant under subdivision is called a com- 
binatorial property of K or of a polyhedron 
|K]. Two polyhedra P and Q are said to be 
combinatorially equivalent if they have tisomor- 
phic divisions K and L, respectively. Then a 
homeomorphism f:P—Q which is induced 
from an isomorphism f:K >L is called a 
combinatorial equivalence. Since a combina- 
torial equivalence f: PQ is an isomorphism 
in the category of polyhedra and PL (*piece- 
wise linear) mappings, (PL maps) which is 
called the PL category, it is also called a PL 
homeomorphism or a PL isomorphism, and P 
and Q are said to be PL homeomorphic or PL 
isomorphic. A ttopological space X which 

is homeomorphic with an n-dimensional 
polyhedron P is said to be triangulable and 

is referred to as an n-dimensional topological 
polyhedron. Then a division K of P, ora 
homeomorphism t:P— X, is called a triangula- 
tion of X. A division K of P is a triangulation 
of P. However, there is a compact polyhedron 
of dimension n (> 5) which has a triangulation 
K not combinatorially equivalent to its divi- 
sion ( — 70 Complexes; for triangulation prob- 
lems on manifolds — Section C below). The 
notion of a simplicial complex or cell complex 
had been introduced as a tool to describe 
topological invariants of triangulable mani- 
folds (H. Poincaré [1]). The notion of an n- 
dimensional combinatorial manifold (i.e., a 
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polyhedron locally combinatorially equivalent 
to an n-dimensional simplex) was established 
as an important geometric object in com- 
binatorial topology by means of J. H. C. 
Whitehead’s theory of regular neighborhoods 
[2, 1939, 1940]. The 1960s saw remarkable 
results in the study of combinatorial manifolds 
together with the topological study of dif- 
ferentiable manifolds, which gave rise to con- 
temporary combinatorial or PL topology 
[3,4]. The fundamental conjecture (Haupt- 
vermutung) for combinatorial manifolds and 
the combinatorial triangulation problem of 
manifolds were negatively solved by R. C. 
Kirby and L. C. Siebenmann [5, 1969]. In the 
1970s, important results about general trian- 
gulations were obtained by T. A. Chapman [6] 
R. D. Edwards, and J. W. Cannon [7]. 


B. Pseudomanifolds and Homology Manifolds 


In this section we explain some topological 
properties of triangulable manifolds which can 
be defined by means of incidence relations 
among their triangulations. A tpolyhedron M 
=|K| is called an n-dimensional pseudomani- 
fold if K satisfies the following three con- 
ditions: (1) Every simplex of K is either an n- 
simplex or a face of an n-simplex; (ii) each 
(n—1)-simplex is a face of at most two n- 
simplexes; (iii) for any two n-simplexes ø, t of 
K, there exists a finite sequence of n-simplexes 
O =p, 01,.--,6,=T such that c; and o,,, have 
an (n— 1)-face in common. 

Consider the set S of (n— 1)-simplexes of K, 
each of which is a face of only one n-simplex. 
Then the set of all ceS together with their 
faces forms a subcomplex L of K. The poly- 
hedron |L| of L is called the boundary of the 
pseudomanifold M =|K| and is denoted by 
OM as well as by ôK = L. The boundary 
of a pseudomanifold is not necessarily a 
pseudomanifold. 

Let |K| be an n-dimensional polyhedron (or 
tcell complex). A point p in |K| is called a 
regular point of | K| if it has a neighborhood in 
|K| homeomorphic to an n-dimensional sim- 
plex; otherwise, p is called a singular point. A 
pseudomanifold without singular points is a 
topological manifold, called a triangulated 
manifold. An n-dimensional polyhedron |K| is 
a pseudomanifold if and only if the set of all 
regular points in |K| is dense and connected 
and the set of all the singular points is of di- 
mension less than n—1. Thus a connected 
triangulated manifold is a pseudomanifold 
whose boundary coincides with the boundary 
as a topological manifold: 

Let t,, T} be oriented n-simplexes of an n- 
dimensional! pseudomanifold M =|K|. Suppose 
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that they have an (n— 1)-face o in common. If 
the tincidence numbers of these simplexes with 
a satisfy the relation [t,,¢]= —[t,, 0], then t, 
and t, are said to be coherently oriented. We 
call M=|K| with all its n-simplexes oriented 
an oriented pseudomanifold if any two n- 
simplexes t, and t, are coherently oriented 
whenever they have an (n—1)-face in common. 
For an oriented pseudomanifold, the formal 
sum of all its oriented n-simplexes forms an 
integral cycle of the pair (M, 0M), which is 
called the fundamental cycle of (M, ôM). Its 
homology class, or fundamental class, gener- 
ates the homology group H,(M,0M; Z)=Z. 
When M =|K| is nonorientable, the funda- 
mental class with coefficient Z, (Z, = Z/2Z) is 
similarly defined. A pseudomanifold is orient- 
able if and only if the topological manifold 
consisting of all its regular points is orientable. 
If for each point p of a polyhedron M=|K| 
the local homology groups are H;(p)=0 (i+ 

n) and H,(p)=Z or 0, we call M=|K| an n- 
dimensional homology manifold, where H,(p)= 
H,(M, M —p; Z). If M is connected, it is also 

a pseudomanifold. The set ôM of points p of 
M =|K| with H,(p)=0 is called the boundary 
of M and is an (n—1)-dimensional homology 
manifold without boundary. 

Let M=|K| be an n-dimensional homology 
manifold. Given simplexes a, t of K, 0<t 
means that a is a face of t and o ¥t. By defi- 
nition, a vertex of the tbarycentric subdivision 
Sd K of K is the barycenter of a simplex of K, 
and a k-simplex t of Sd K is defined to be 
ldo, ...G,|, where Go, 0,,..-, 0, are simplexes 
of K such that 69 <0, <... <0, and 6; is the 
barycenter of o;. In the following we denote 
Sd K by K’. For a q-simplex o of K, we de- 
note by K (ø) the union of all (n — q)-simplexes 
|66,...6,-,| Of K’ such that o <6; <...<6,_q. 
Then K(o) is an (n—q)-dimensional homology 
manifold and pseudomanifold contained in M. 
K (o) is called the (n — q)-dual cell or simply the 
dual of g. Let K, be the subcomplex of K’ 
consisting of simplexes of K’ contained in 
K(a). Thus we have |K,|=|K(c)|. We denote 
by (K(o)) the union of simplexes of K, which 
do not intersect with o. The set of dual cells 
K*={K(o)|cEK} of simplexes of K is called 
the dual complex of K; and K* is also called 
the dual subdivision of | K|. K* satisfies the 
following conditions: 
(i) Ueex K(o)=|K|. Each K(a) is the tjoin 
6*(K(c)) of the barycenter é and (K(o)). 
(ii) ¢<t<>(K(o)) > K(t) (6, te K), (K(o)) = 
a<rex K(7). 
(iii) If K(o) 0 K(t)# Ø, then there exist sim- 
plexes of K with o and q as faces, and for the 
least simplex p of K with o and 7 as faces, it 
holds that K(a)M K(t)= K(p). 

The boundary 6(K(o)) of the (n— q)- 
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dimensional homology manifold K (ø) satisfies 
6(K(e))=(K(o)) U((6K)(a)), where (0K)(o) 
denotes the dual cell of o in 0K, and we under- 
stand that (¢K)(o)= Ø if o¢ 0K. K* U(0K)* is 
called the dual complex of (K, ôK) and is de- 
noted by (K,0K)*. For an r-simplex o (O<r< 
n—1), it holds that 


H{(K(o)),(6K());Z)=0,  i#n—r—1, 
H,-,-1((K(9)), (@K (0); Z) = Z. 


Let |K*|? be the union of i-dual cells of 

(K, 0K)* = K* U(0K)* such that i<q. Let 
C,(K*) = Â (|K*|4/|K* |"), and let 6,:C,(K*) 
—C,_,(K*) be the tconnecting homomorphism 
04: A (I K* M K* A, ([K*[ 1/1 K* |?) 
in the treduced homology exact sequence 

of (|K*|4,|K*|?"',|K*|* 7) (— 201 Homol- 
ogy Theory F). Then C,(K*) is a free Abe- 
lian group generated by the q-dual cells 

of (K, 0K)*, and C(K*)={C,(K*), @,} is a 
tchain complex. It follows from the property 
of H,((K(c)),(@K(a)) ; Z) as above that 
H,(C(K*))= H,(|K|) as in the case of a *CW 
complex. 

Now suppose that K is oriented. For a q- 
simplex o =|aga; ...a,| we give an torientation 
[a9,a,,...,a,]. Furthermore, we choose a,,,, 
g+2,+++,4, SO that [do,a,,...,a,] is the orien- 
tation of |aga, ...a,| induced from the orien- 
tation of K, and we give K (øo) the orientation 
determined by [¢,¢,,...,6,-,], where o;= 
|4p a, ...a,,;|. We define o: K(o)=1 for o and 
K(o) thus oriented, and t: K(a)=0 if t 4a. 
Let c =È 4;0;€ C (K) and c* = È y,K(o;). Then 
we define c: c* eZ by c- c*= D; jinlar: K(0;)). 
The integer c- c* is called the intersection 
number of c and c*. For homology classes 
[z]¢H,(K) and [2*]eH,_,(K*), the intersec- 
tion number z: z* is independent of the choice 
of representing cycles, and thus we define the 
intersection number [z]: [z*] =z: z*. 

When X is compact, the isomorphism of 
the Poincaré-Lefschetz duality theorem 
H(X, 0X; R)= H" 4(X;R) is given in terms of 
intersection numbers. 


C. Combinatorial Manifolds and PL Manifolds 


A polyhedron which is PL homeomorphic 
with a k-dimensional simplex or its boundary 
is called a PL k-ball or a PL (k-1)-sphere and is 
denoted by B*, S*~!, respectively. A simplicial 
complex K or a polyhedron M =|K] is called 
an n-dimensional combinatorial manifold if 
the tstar of each vertex in K is a PL k-ball. If 
M is an n-dimensional combinatorial mani- 
fold, then so is an open subset of M. An n- 
dimensional PL manifold is a topological 
manifold X with a distinguished maximal 
coordinate system 7 = {(U,,h,)}, called a PL 
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structure on X, which consists of homeomor- 
phisms h;: U;— V; from open subsets U; onto 
open subsets V; of an n-dimensional simplex 
such that hjoh;*:h(U,0 U)—>h;(U;N U,) are 
PL homeomorphisms. For a PL manifold 

(X, 7) there is a triangulation t: M >X froma 
combinatorial manifold M such that for each 
(U; hen, h;ot|t~'(U) is a PL homeomor- 
phism from an open subset t~'(U;) of M onto 
h;(U;). Thus the notions of combinatorial mani- 
fold and PL manifold are essentially the same. 
For a tsmooth manifold (X, ø), there is a 
unique PL homeomorphism class of a com- 
binatorial manifold by means of a tsmooth 
triangulation. In this sense, smooth manifolds 
are combinatorial manifolds; but the converse 
of this is not true (— 114 Differential Topol- 
ogy C). A triangulation of a topological mani- 
fold which is a combinatorial manifold is 
called a combinatorial triangulation. The trian- 
gulation problem for n-dimensional topolog- 
ical manifolds, especially the existence and 
uniqueness problem of their combinatorial 
triangulations, a long-standing problem, is 
stated as follows. CT,, (Combinatorial Triangu- 
lation Problem): Is every n-dimensional topo- 
logical manifold homeomorphic with a com- 
binatorial manifold? CH, (Hauptvermutung for 
combinatorial manifolds): Homeomorphic n- 
dimensional combinatorial manifolds should 
be PL homeomorphic. When n <3, CT, and 
CH,, hold; this was shown for n=2 by T. Rado 
[15] and for n=3 by E. E. Moise [16]. More- 
over, in this case, topological manifolds admit 
unique smooth structures, and homology 
manifolds are combinatorial manifolds. For 
n>5, tsurgery theory was used and developed 
as an obstruction theory for “homotoping” 

a homeomorphism h: M >M’ between n- 
dimensional combinatorial manifolds to a PL 
homeomorphism by D. Sullivan (1967). By the 
discovery of the torus-unfurling method for 
“isotoping” h: M >M" to a PL homeomor- 
phism, Kirby reduced CT, and CH, for n25 
to CH, for some specific manifolds, such as 
thickened tori modulo the boundary, for 
which surgery methods had been sufficiently 
developed (for isotopy — Section D). The 
solutions (1)-(9) can be stated as follows. 

(1) Classification of combinatorial trian- 
gulations. Assume n> 5. Existence: An n- 
dimensional closed topological manifold X 
admits a combinatorial triangulation if and 
only if an obstruction A(X)e H*(X; Z,) van- 
ishes. Uniqueness: A homeomorphism h: M —> 
M’ between n-dimensional closed combina- 
torial manifolds is isotopic (— Section D) to a 
PL homeomorphism if and only if an obstruc- 
tion A(h)e H*(X; Z,) vanishes (Kirby and 
Siebenmann [5]). 

In fact, for each n> 5, there is an n- 
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dimensional combinatorial manifold which ts 
homeomorphic to a torus S$! x ... x S! but is 
not PL homeomorphic, and there is an n- 
dimensional closed manifold which is not 
homeomorphic to a combinatorial manifold 
(Siebenmann [18]). 

As a special case, the following conjecture is 
proved for n> 5. 

(2) The n-dimensional annulus conjecture. 
The closed region bounded by two mutually 
disjoint tlocally flat (n — 1)-dimensional topo- 
logical spheres S, and S, in an n-dimensional 
sphere $” is homeomorphic with an n- 


dimensional annulus $"~! x [0,1] (Kirby [19]). 


Regarding the uniqueness problem for (not 
necessarily combinatorial) triangulations of 
topological manifolds, we mention the follow- 
ing deep result. 

(3) Double suspension theorem for homology 
spheres. The double suspensions of homology 
n-spheres are homeomorphic with $"*?, where 
a homology n-sphere is a closed topological 
manifold with the same homology groups as 
those of $” (Cannon [7]). 

For each n> 3, there is a homology n-sphere 
P” which is a combinatorial manifold with 
the nontrivial tfundamental group (for n=3, 
see Poincaré [1, 1904] and for n>4, see M. 
Newman, 1948). The suspension of P” is an 
example of an (n + 1)-dimensional homology 
manifold which is not a topological manifold. 
The double suspension of P” is not a com- 
binatorial manifold but is homeomorphic 
with S"*?. The problem of whether every n- 
dimensional topological manifold is homeo- 
morphic to a polyhedron is still open for n> 
4. This problem is reduced to a problem in 
homology 3-spheres (T. Matumoto; D. E. 
Galewski and R. J. Stern). 

In the following we list some results on 
combinatorial manifolds focusing on White- 
head’s theory of regular neighborhoods [2]. 
Let M =|K| be an n-dimensional combina- 
torial manifold without boundary. The dual 
complex K* consists of PL balls. Thus the 
division K of M gives rise to a decomposition 
into PL thandles H(K) of M in such a way 
that for each k-dimensional simplex ce K, 
the star St,..(6) of its barycenter ĉin the sec- 
ond barycentric subdivision K” of K is a PL 
handle of index k. If L is a subcomplex of K or 
K*, then the star St,..(|L|) of |L| in K” isa 
subhandlebody of H(K) or of its dual han- 
dlebody H*(K)= H(K*), denoted by Hx(L) 
or Hx+(L), respectively. In particular, if M 
is closed, orientable, and of dimension 3, 
then U=H,(K"”) and V= Hyx+(K&)) are 
PL homeomorphic with a PL boundary- 
connected sum of some copies of a solid torus 
S! x B?, and M=UUV, UN V=6éU =éY, 
where K“ and Ké, stand for the k-*skeletons 
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of K and K*, respectively. This decomposition 
(U, V;@U =V) of M is called a Heegaard 
decomposition (or splitting) of M [12]. We say 
that a simplicial complex L collapses elemen- 
tarily to a subcomplex L, of L, referred to as 
an elementary collapse L N L,, if L- L| = 
{o,tT}, where o is a k-dimensional simplex of L 
and z is a (k — 1)-dimensional face of g which is 
not a face of any simplex in L — {0,1}. We say 
that a polyhedron P collapses to a subpoly- 
hedron Q of P or Q expands to P, referred to 
as a collapse P N Q or expansion Q 7 P, if there 
is a division L of P containing a subcomplex 
L, dividing Q so that there is a sequence of 
elementary collapsings L\L,\...\L,. Fora 
subpolyhedron P of a combinatorial manifold 
M, asubpolyhedron U of M is called a regular 
neighborhood of P in M if (1) U is a combina- 
torial manifold which is a closed neighbor- 
hood of P in M and (2) U NP. In general, for a 
subcomplex L of a simplicial complex K, the 
‘star St,(|L|) of |L| in the second barycentric 
subdivision K” of K 1s called a second barycen- 
tric derived neighborhood or simply a derived 
neighborhood of | L| in |K| (E. C. Zeeman [4]). 

(4) Regular neighborhood theorem. Let P be 
a compact subpolyhedron of a combinatorial 
manifold M. Existence: Every derived neigh- 
borhood of P in M is a regular neighborhood. 
Uniqueness: Any two regular neighborhoods 
of P in M are PL homeomorphic by a map- 
ping keeping P pointwise fixed [2, 1939]. On 
one hand, this theorem is regarded as a com- 
binatorial counterpart to the ‘tubular neigh- 
borhood theorem of differential topology. For 
a topological manifold X with boundary 0X, 
there are a neighborhood U of ðX in X anda 
homeomorphism h:éX x [0,1] U, called a 
collar of 6X in X, such that h(x,0)=x (xe X). 
In particular, for a combinatorial manifold 
M =X we can take U as a derived neighbor- 
hood of 6M in M and has a PL homeomor- 
phism, called a PL collar of ôM in M. (For 
more about normal bundle theory — 147 
Fiber Bundles Q). On the other hand, it is 
also regarded as a combinatorial deforma- 
tion theorem of a handlebody decomposition 
Hy. (L) of a derived neighborhood St,..(|L]). 

Two polyhedra P and Q are said to be 
simple homotopy equivalent if Q is obtained 
from P by a finite sequence of collapsings and 
expansions. A composite mapping f: PQ of 
the inclusion mappings and deformation re- 
tractions associated naturally to the expan- 
sions and the collapsings from P to Q is called 
a simple homotopy equivalence. Moreover, if P 
and Q are subpolyhedra of a combinatorial 
manifold M and those collapsings and expan- 
sions take place in M, then their regular neigh- 
borhoods are PL homeomorphic [2, 1940]. 

(5) A simply homotopy equivalence is clear- 
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ly a homotopy equivalence. Conversely, we 
have the simple homotopy theorem: For the 
homotopy class of a homotopy equivalence 
f{:P—@Q between compact connected poly- 
hedra P and Q, there is a well-defined element 
W(f), called the Whitehead torsion of f, in the 
*Whitehead group Wh(G) of the fundamental 
group G=7,(P) such that f is homotopic to a 
simple homotopy equivalence if and only if 
W(f)=0, ([2, 1950]; — also J. W. Milnor 
[20]). 

Let W be a compact combinatorial mani- 
fold of dimension n+ 1. If @W consists of two 
connected components M and N such that 
the inclusion mappings Mc W and N c. W are 
simple homotopy equivalences, then (W; M, N) 
is called an s-cobordism of dimension n+ 1. 
The following theorem of D. Barden, B. Mazur, 
and J. Stallings is the nonsimply connected 
PL version of Smale’s th-cobordism theorem. 

(6) s-Cobordism theorem. For n> 5, an s- 
cobordism (W; M, N) of dimension n+ 1 1s PL 
homeomorphic with (M x [0,1]; M x0, M x 1) 
(B. Mazur [21]; M. Kervaire [22]; C. Weber 
[23]; J. F. P. Hudson [8]). 

(7) Topological invariance of simple homo- 
topy types. Two homeomorphic compact 
polyhedra are simple homotopy cquivalent 
(T. A. Chapman [6]). 

The famous Poincaré conjecture that a sim- 
ply connected closed manifold of dimension 3 is 
homeomorphic to a 3-sphere is still unsolved 
despite much effort by many mathematicians. 
This conjecture is generalized for dimension n 
as follows. 

(8) Generalized Poincaré conjecture in dimen- 
sion n. Any homotopy n-sphere 2” is homeo- 
morphic to an n-sphere, where a homotopy 
n-sphere is defined to be an n-dimensional 
topological manifold homotopy equivalent 
to an n-sphere. 

This has been affirmatively answered for 
n>5. The first proof was given by S. Smale 
(— 114 Differential Topology F, K). From the 
viewpoints of collapsing and general position 
(— Section D), J. Stallings [3, 1960] and E. C. 
Zeeman [11, 198—204] proved the following. 

(9) Engulfing lemma. For a compact poly- 
hedron P of dimension m in the interior M = 
M—OM of an n-dimensional combinatorial! 
manifold M, there is a PL n-ball in M contain- 
ing P in its interior (i.e., engulfing P), provided 
that M is m-connected (i.e., 7;(M)=0 (i<m)) 
and n—m2>3. (For the topological version — 
M. H. A. Newman [24].) If a combinatorial 
manifold M =|K| is a homotopy n-sphere 
(n> 5), then by the engulfing lemma, there are 
PL n-balls B, and B, engulfing H,.(K >) 
and Hx.(Ki,) so that B, U B, = M. By the 
generalized Schoenflies theorem (— Section 
G), B, — B, = M—B, is homeomorphic to 
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an n-ball. It follows that by the Alexander 
trick (— Section D) M ts homeomorphic to an 
n-sphere. Since CH, holds for S” (n > 5), the 
homotopy PL n-sphere M is actually PL 
homeomorphic with $”. 


D. Embeddings of Combinatorial Manifolds 


An injective PL mapping f: P—>Q between 
polyhedra is called a PL embedding if f(P) is a 
subpolyhedron of Q (— Section A). Namely, 

a PL embedding f:P—Q is a PL mapping 
which is induced from a simplictal injection 
f:K—>L, called a division of f, for some divi- 
sions K and L of P and Q, respectively. In the 
following we explain PL embeddings from an 
m-dimensional combinatorial manifold M into 
an n-dimensional combinatorial manifold N. 
For a PL embedding f: M>N, c=n—mis 
called the codimension of f. If f-'(¢N)=0M, 
f is said to be proper. Since an injective PL 
mapping f:M >N is a PL embedding if and 
only if it is proper as a mapping between topo- 
logical spaces M and N (i.e., the preimages of 
compact sets are compact), this term “proper” 
for PL embeddings should not be confused 
with the term “proper” for mappings. Two PL 
embeddings f: M—N and f':M'>N' are said 
to be (PL) equivalent if there is a PL homeo- 
morphism h: NN’, called an equivalence 
from f to f’, such that ho f(M)=f'(M’). When 
M = M', f and f’ are said to be (PL) isomor- 
phic if there is an equivalence h: NN’ from 

f to f' such that ho f= f’. A submanifold of 
N is defined as a subpolyhedron of N which 

is a combinatorial manifold. Thus the equiv- 
alence class of a PL embedding f: M >N is 
nothing but the PL homeomorphism class of a 
pair (N, f(M)) of N and its submanifold f(M). 
For a submanifold of N, the same terms as 
those for the inclusion mapping are used. For 
every division f: KL of a (proper) PL em- 
bedding f: MN and for xe M we havea 
(proper) PL embedding f|St,(x) from a PL m- 
ball St,(x) into a PL n-ball St, (f(x)) which 
represents the tgerm of f at xe M. A PL em- 
bedding of a PL m-sphere into a PL n-sphere 
is called a PL (n, m)-knot or (n, m)-sphere pair 
and a proper PL embedding of a PL m-ball 
into a PL n-ball is called a PL (n, m)-ball knot 
or (n,m)-ball pair. A PL (n, m)-knot (a PL 

(n, m)-ball knot) is said to be trivial or un- 
knotted if it is equivalent to the inclusion 
mapping dI"*! x 0c ð+ (I" c I"), where I= 
[—1,1]. Then we have the following state- 
ments (1)—(5). 

(1) Zeeman’s unknotting theorem. Every PL 
(n,m)-knot and every PL (n, m)-ball knot are 
unknotted, provided that n-m >23. 

In general, an injection g:X > Y between 
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topological manifolds is said to be locally flat 
if for each point xe X, there is a neighborhood 
U of x in X so that the local germ of f at x ts 
equivalent to the inclusion mapping U x0cU 
x I°. Then f is necessarily proper, i.e., f 1(dY) 
=0X.A PL embedding f: M—N is said to be 
locally flat if it is locally flat in the PL cate- 
gory. Namely, f|St,(x) is equivalent to I” x 
0<J". Conversely, Zeeman’s unknotting theo- 
rem implies the following. 

(2) Every proper PL embedding of codimen- 
sion > 3 is locally flat. 

Since every locally flat PL embedding f:M 
>N admits a normal tblock bundle, the classi- 
fication of F: M >U up to isomorphism is 
reduced to the classification of block bundles 
(— 147 Fiber Bundles Q), where U is a derived 
neighborhood of f(M) in N. (For the general 
classification of locally flat PL embeddings by 
surgical methods — C. T. C. Wall [25, Corol- 
lary 11.3.1]). 

In the codimension 2 case, there exist non- 
trivial knots (the classical knots) (— 235 Knot 
Theory) and there is a proper PL submanifold 
which is not locally flat. For example, a com- 
plex projective curve V defined by x” +y"! z 
=0 can be regarded as a PL 2-sphere in the 
complex projective plane CP?, and for the 
inclusion mapping i: Vc.CP?, at 0=(0:0: 1), 
i|St,(0) is PL equivalent to a cone of a torus 
knot k: S! <S? of type (n, n—1). For further 
results in higher-dimensional knot theory (— 
235 Knot Theory) we mention the works of M. 
Kato and Y. Matsumoto [26] and S. Capell 
and J. Shaneson [27]. 

Here we consider only proper PL embed- 
dings of a fixed m-dimensional combinato- 
rial manifold M into a fixed connected n- 
dimensional combinatorial manifold N, so 
that a codimension 0 PL embedding is a PL 
homeomorphism. Two PL embeddings fj, 
f,:M-N are said to be pseudoisotopic if there 
is a proper PL embedding F: M x [0,1] > N 
x [0, 1], called a pseudoisotopy from fo to 
fi, such that F(x, 0)=( f(x), 0) and F(x, 1)= 
(fi, 1) (xe M). Moreover, if F satisfies the 
level-preserving condition F(M x {t})<N x {t} 
(O<t<1), then fo and f, are said to be isotopic, 
and F or a homotopy { f,} defined by F(x, t)= 
(f(x), £) ((x,t)e M x [0, 1]) is called an isotopy 
from f, to fi. When M =N, a PL homeomor- 
phism which is isotopic to the identity, or the 
isotopy of the identity itself, is called an am- 
bient isotopy of N. Two PL embeddings from 
M to N are said to be ambient isotopic if they 
are isomorphic by an ambient isotopy of N. 

Since J* (I=[—1,1]) is a k-dimensional 
tconvex cell and is a cone 0 » d/* of é1* from 
the center 0, each point x of /*— {0} can be 
written uniquely as x=t-u for 0<t<1 and 
uedl*. Thus a PL embedding f:ôI” I" can 
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be extended to a PL embedding F: I" 1", 
called a cone extension or simply a cone of f, 
by setting F(0)=0 and F(t- u)=t: f(u) for 
t-uel™—{0}. This method of cone extension is 
often referred to as Alexander’s trick. By using 
Alexander’s trick and the uniqueness of regu- 
lar neighborhoods, one can show without 
much difficulty that (i) the isotopy classes of 
PL homeomorphisms of PL spheres or PL 
balls onto themselves are classified by their 
tdegrees (= +1)(V. K. A. M. Gugenheim, 
1953), and (ii) every PL (n,m)-knot or PL 

(n, m)-ball knot is equivalent to the standard 
one 617! x0c6I"*! or I" x Oc!" up to iso- 
topy, provided that n—m > 1. This is why we 
define the notion of ambient isotopy in a dif- 
ferent way from the differentiable category, 
where the smooth isotopy is always covered by 
the ambient isotopy. However, for codimen- 
sion >3, because of Zeeman’s unknotting 
theorem, the following holds. 

(3) The pseudoisotopy class, the isotopy 
class, and the ambient isotopy class of a PL 
embedding of codimension > 3 are the same 
(J. F. P. Hudson and E. C. Zeeman, 1964). 

In order to construct a PL embedding from 
a continuous mapping within its homotopy 
class, the notion of general position is useful. 
A PL mapping ọ:M ->N is said to be in gen- 
eral position if dim o ‘(y)<0(yeN) and 
dim S(~)<2m—n, where S(q@) is the closure of 
{xeM|o ‘(o(x))#{x}} in M (J. F. P. Hudson 
[8]). 

(4) General position theorem. Every con- 
tinuous mapping yw: M—N can be approxi- 
mated by a PL mapping @: MN which is in 
general position. 

The following theorem can be proved by 
sharpening the general position theorem and 
the engulfing lemma. 

(5) Irwin’s embedding theorem. Let y: M—>N 
be a continuous mapping from a compact 
combinatorial manifold of dimension m into a 
combinatorial manifold of dimension n such 
that w|0M is a PL embedding from ôM into 
ON. Assume that (i) M is d-connected, (ii) N is 
(d+ 1)-connected, and (iil) n—m > 3, where d= 
2m—n. Then wy: MN is homotopic to a 
proper PL embedding f: M->N relative to 
OM. Moreover, if M and N are closed and if 
(i*) M is (d+ 1)-connected and (ii*) N is (d+ 2)- 
connected, then two homotopic PL embed- 
dings from M into N are ambient isotopic (M. 
C. Irwin, 1965). 


E. 3-Manifolds 


There has been a great deal of research done 
on 3-manifolds, and we have, among others, 
the following results (1)—(4). 
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(1) Sphere theorem. Let M be an orientable 
3-manifold with z,(M) #0. Then there exists a 
PL embedding f of a 2-sphere into M such 
that f is not homotopic to 0 in M (C. Papa- 
kyriakopoulos [28}]). 

(2) Dehn’s lemma. Let D be a 2-cell with 
self-intersections in a 3-manifold M, with its 
boundary a simple polygon C. If some neigh- 
borhood U of C in D has no singularities, then 
there exists a 2-cell in M without singularities 
whose boundary is C. In 1910, M. Dehn as- 
serted this lemma, but his proof was incom- 
plete. In 1957, Papakyriakopoulos and T. 
Homma proved the lemma independently 
[28,29]. 

(3) Loop theorem. Let N be the boundary 
of a 3-manifold M and U be an open set 
in a component of N. If there is a closed 
curve in U which is homotopic to 0 in M 
but not in N, then there exists a simple closed 
curve in U homotopic to 0 in M but not 
in N (Papakyriakopoulos [30], J. Stallings 
[31)). | 

(4) Unique decomposition theorem for a 3- 
manifold (J. W. Milnor [32], H. Kneser). We 
assume that all manifolds are connected, 
oriented, and triangulated 3-manifolds without 
boundaries and that all homeomorphisms are 
piecewise linear. Two manifolds M, M’ are 
said to be isomorphic (M = M’) if there exists 
an orlentation-preserving homeomorphism of 
M onto M’. Removing an open 3-cell from 
each of two 3-manifolds M, M’ and identifying 
the boundaries of these removed cells, we 
obtain a 3-manifold M # M’, called the con- 
nected sum of M and M’. A manifold that is 
not isomorphic to the 3-sphere S? is called 
nontrivial. A nontrivial manifold P is called 
prime if P cannot be decomposed as P = 
M, # M, with M, and M, both nontrivial. 

A manifold M is called irreducible if every 
2-sphere in M bounds a 3-cell. Then from 
the sphere theorem the following results can 
be deduced: If a compact 3-manifold M is 
nontrivial, then M is isomorphic to a con- 
nected sum P, # P, # ... # P, of prime mani- 
folds P; (i= 1,2, ..., k), where P,, P,,..., P, 
are determined uniquely up to order. Every 
irreducible 3-manifold M has z ,(M)=0. Con- 
versely, if a 3-manifold M has z,(M) =0, then 
M is irreducible, provided that the Poincaré 
conjecture is correct. (For further results 

on 3-manifolds — F. Waldhausen [10] and 
J. Hempel [12].) 

Irreducible sufficiently large 3-manifolds 
[10] can further be decomposed into Seifert 
fibered spaces and “simple manifolds” which 
contain no essential tori (W. Jaco and P. B. 
Shalen, K. Johannson). Existence of hyperbolic 
structures on “simple manifolds” has been 
extensively studied by W. Thurston. 
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F. Wild Spaces 


Nonclosed 3-manifolds behave very differently 
from closed 3-manifolds. For example, there 
exists an open 3-manifold U that has the same 
thomotopy type as an open 3-cell but is not 
homeomorphic to it (M. H. A. Newman and J. 
H. C. Whitehead, 1937). The construction of 
this and similar examples generally involves 
infinite processes. Such examples, which do 
not exhibit the properties we expect, are com- 
monly termed pathological (or wild) spaces. 
For the manifold U above, the product U x Rt 
is found to be homeomorphic to the Euclidean 
4-space R4. R. Moore proved that the quotient 
space of the Euclidean 2-space R? by any 
upper semicontinuous decomposition consist- 
ing of continua not separating R? is homeo- 
morphic to R°. On the other hand, R. H. Bing 
(1959) gave an example of a similar upper semi- 
continuous decomposition of R? for which the 
quotient space B is not homeomorphic to 

R? and is not even a manifold. This quotient 
space has the property B x R! =R*. Bing 
(1957) also proved that at most countably 
many copies of a wild closed surface can 
simultaneously be embedded in R°. On the 
other hand, Stallings (1960) proved that there 
exists a wild disk of which a set of copies with 
the power of the continuum can simulta- 
neously be embedded in R? [11]. 

The study of the quotient space of n- 
dimensional manifolds by cell-like decompo- 
sitions has been fully generalized by Edwards 
and Cannon [7] for n> 5 to prove the double 
suspension theorem of homology spheres (— 
Section C (3)). 


G. The Schoenflies Theorem 


In 1906, A. M. Schoenflies proved that every 
tsimple closed curve in the plane is the bound- 
ary of a 2-cell. This result is sharper than 

the tJordan curve theorem and is called the 
Schoenflies theorem. However there was a gap 
in the proof, and the complete proof was given 
by L. E. J. Brouwer in 1910. 

The following higher-dimensional analog of 
the above theorem is called the Schoenflies 
problem: Is every (n— 1)-sphere embedded in 
the n-dimensional Euclidean space R” (n> 3) 
the boundary of an embedded n-disk? 

For the case where the embedding is topo- 
logical, L. Antoine constructed a counter- 
example, called Antoine’s necklace, for n=3 in 
1921 and solved this problem negatively. Also, 
J. W. Alexander constructed another counter- 
example, called Alexander’s horned sphere, in 
1924 (Fig. 1). A similar example was given by 
R. H. Fox and E. Artin in 1948 by making use 
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of knot theory (Fig. 2). The study of wild em- 
beddings began with these examples, and 
many interesting results were obtained (Bing). 

For the case where the embedding is PL or 
smooth, the Schoenflies problem has been 
solved affirmatively for n= 3 (Alexander). 

M. Brown (1960) and B. Mazur (1959) 
proved the following: Let h: S"! x [—1, 1]S" 
be a topological embedding; then S"—h(S""! x 
{0}) consists of connected open sets D, and 
D_ of S", and the closures of D, and D_ are 
both homeomorphic to the n-disk. This result 
solves the Schoenflies problem affirmatively 
for n>5 when the embedding is PL and lo- 
cally flat or smooth. 


Fig. 2 
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A. General Review of Combinatorics 


Combinatorics is concerned mainly with prob- 
lems of discrete sets, such as the enumeration 
of subsets satisfying certain conditions, or their 
actual construction and the selection of an 
optimal subset with respect to a suitable crite- 
rion. Generally speaking, both the theoretical 
analysis and the construction of discrete sets 
are much more difficult than those problems in 
analysis concerning infinite sets with suitable 
topologies. 

The main emphasis and the name of this 
field have changed from time to time and from 
person to person. Other names such as com- 
binatorial analysis, combinatorial theory, or 
combinatorial mathematics are commonly used 
and mean almost the same thing. Recently, 
the expression “discrete mathematics” has 
also been used to describe the same field. In 
many cases a problem originally conceived as 
number-theoretic, algebraic, analytic, or geo- 
metric has eventually turned out to be com- 
binatorial (— 102 Design of Experiments, 241 
Latin Squares). 
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tGraph theory, originally a combinatorial 
study of 1-dimensional topological tcomplexes, 
has developed into an important branch of 
mathematics. 

In this article we discuss various topics not 
yet considered to be established branches or 
organized fields of mathematics. The topics 
treated under the title “combinatorics” are 
at present not unified, and consist of a mix- 
ture of isolated techniques. However, recent 
progress with electronic computers is play- 
ing an important role in the solution of 
various combinatorial problems arising from 
large systems, and the study of combinatorics 
is now quite active both in theoretical develop- 
ment and application. We now have special 
journals in this field, such as the Journal of 
Combinatorial Theory (Academic Press), Com- 
binatorica (Elsevier), the European Journal 
of Combinatorics (Academic Press), Discrete 
Mathematics (North-Holland), and Discrete 
Applied Mathematics (North-Holland). 


B. Three Fundamental Principles 


The most fundamental among the various 
principles in combinatorics are the following 
three principles for enumeration problems. Let 
Q be a finite set and let |A| denote the number 
of elements in a subset A of Q. 

(1) Rule of sums: If A N B= Ø, then |A + B| = 
|A|+|BI. 

(2) Rule of products: | A x B|=|A| x |B], where 
A x B is the fdirect product of the two sets A 
and B. 

(3) Principle of inclusion and exclusion: 


tA WAsU UA 
= 4l- $} |4:N Al + » |A;9 A;N Al 


i<j i<j<k 
Stay e E Aar A) 


These principles can be generalized as state- 
ments concerning various measures. 


C. Möbius Inversion Formula 


Let P be an ordered set whose arbitrary inter- 
val is a finite set. Let [P x P] = {(x, y)e P x 
P|x<y} be the set of intervals of P. A func- 
tion satisfying the following two conditions is 
determined uniquely and is called the Möbius 
function over P: 
(i) u(x, x)= 1 for all xe P; 
(ii) Èz. x<z<y U(x, z)=0 for all x, ye P with x< y 
and x# y. 

When K is a field, for a mapping f: [P x P] 
>K, we can define another mapping g: [P x 
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P]>K by 
g(x y= $ f(x,z). 


ze[x, y] 


Then we have the Möbius inversion formula 


fœy= $} g, zul, y). 

ze[x, y] 
This formula implies, as its special cases, many 
important formulas, such as the *M6bius in- 
version formula in number theory, the prin- 
ciple of inclusion and exclusion, and the in- 
verse relation of difference and summation. 


D. Some Special Sequences 


In the enumeration problem, many important 
sequences {a„} are given by the tgenerating 
function f(x)= a,x". Typical examples are as 
follows: (1) Binomial coefficients: (1 +x)" = 
È) x". (2) Multinomial coefficients: (x, +... 
+X)" = Ln rn) X1 Xp”. (3) Multiple combi- 


nation a,=,,H,=("*""'):(1 —x) "=,,(,H,)x". 


(4) Bell numbers: exp(e*— 1)= È B,x"/n! (5) 
The Stirling number of the second kind S% is 
the total number of bijections from N, to N,, 
divided by m!, where N, = {1,2,...,2}. This is 
equivalent to the total number of partitions of 
N, into m nonempty blocks: (e* — 1)"/m! = 
Eom ie aie 8 

In general, when the generating function 
f(x) is an analytic function of one or several 
variables as above, we can obtain recurrence 
formulas for the sequence from the functional 
equations or the differential equations which 
f(x) satisfies; or we can get asymptotic esti- 
mates by using the tsaddle point method. 


E. Pélya’s Enumeration Theorem 


Let U be a finite set, V be a (finite or infinite) 
set, G be a transformation group operating on 
U, and w be a function assigning to each ele- 
ment of V a “weight.” A weight is an element 
of a ring and is usually defined by a suitable 
generating function. The weight of a function 
f:U-V is defined by w(f)=T1,-y w(f(w)). 
Two functions f, and f, are considered iden- 
tical (f, = f2) if there exists a ne G such that 

fi= hon. Put F= VY (E), the set of all equiv- 
alence classes of the relation =. w( f ) is well- 
defined for fe F, because f, = fa implies w(f,) 
=w( f2). Then we have the equality E zep w( f) 
= Pe(Lvev W(0), Doey WO), -< Doev WO). 
Here, Po(¥1,V2.--+» Ym) is the cycle index of G, 
which is given by n! Epe pM pLO ... yim, 
where n=|G|, m=|U|, and j,(z) is the number 
of cyclic permutations of length k contained in 
x. This is called Pélya’s enumeration theorem. 
When G=S,, (the tsymmetric group), Ps (v1, 
oes Ym) = Daz e0.sha=m(A!2"225!...m*9A,1) yi 
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... yan. When G = Am (the talternating group), 
P4,(V15-++> Ym) = Lead, kag=m.Eay=even2(Ay 127223! 
mh!) yt... yam. When G=C,, (the tey- 
clic group), Pe (Yis -s Yn) =M! Lem Plk) y, 
where ø(k) is the tEuler function, i.e., the num- 
ber of integers less than k and coprime to k. 
This theorem is widely applicable to various 
problems with suitable choices of weights. 
Several generalizations have also been studied 


(see, e.g., [3]). 


F. The Notion of Polymatroids 


The theory of matroids was originally intro- 
duced as an abstraction of the notion of flinear 
dependence in a vector space or of some prop- 
erties of tgraphs or of the notion of falge- 
braic dependence in the theory of fields (see, 
e.g., [4,5]). But recently, as many important 
applications were discovered, it has become 
an important branch of combinatorics (see 
[6-8]). 

Let u be a function from a tlattice L toa 
totally ordered module R. x is called sub- 
modular if 4(x)+ u(y) > u(x v y) + w(x y) for 
every pair of elements x, ye L. The set of ae L 
such that u(a)=min,ez (x) constitutes a sub- 
lattice in L. Let L be the family of all subsets 
of a finite set E. The sublattices K in L=2° 
and the pseudo-orders over E correspond in a 
one-to-one manner as follows (here pseudo- 
order means a binary relation taken as an 
‘ordering without assuming the tantisym- 
metric law): (i) The class of a partition P of E 
corresponds to the difference of two subsets 
which are the immediate predecessor and 
immediate successor in K, or the complement 
with respect to E of the maximum element in 
K, or the complement of the minimum ele- 
ment. (ii) The complement of the maximum 
element in K is maximum in the order of P, 
and the minimum element in K is minimum in 
the order of P. (iti) Two classes €,, č, in P, 
given as the difference of two elements in K, 
satisfy the relation č, > č, with respect to the 
order of P if and only if all elements of K such 
that x>€, in E satisfy x >€,. 

Now let E be a finite set, L=2F, and let R be 
the real number field with the usual addition 
and order relations. If a submodular function 
p:2°R satisfies the following conditions, 
then P=(E, n) is called a polymatroid over E, 
and u is called the rank function over P. (i) 
u(@)=0, where @ is the empty set. (1i) x cy 
(cE) implies u(x) < u(y). A function u: ER is 
automatically extended to u:27>R by u(x)= 
Loex ule) (xc E), which is also denoted by 
the same symbol u. If a function u satisfies 
0<u(x)< p(x) for every xe E, then u is called 
an independent vector over P. An independent 
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vector u with u(E)= p(E) is called a base of P. 
A function u: E>R is identified with an ele- 
ment in R”, where n=|E|. Then a set of inde- 
pendent vectors in P forms a tconvex poly- 
hedron in R”. If we denote the polyhedron by 
P, then such a P is characterized by the follow- 
ing two conditions: (P1) If we P then each v 
satisfying 0 <v(e)<u(e) for every ee E also 
belongs to P. (P2) If two u, ve P satisfy u(E) < 
v(E), then there exists we P, w #u, satisfying 
u(e)< w(e)<max{[u(e), v(e)] for every ee E. If 
there are two polymatroids P, =(E, p) and 
P, =(E, u) over the same finite set E, then we 
have max {u(E)|weP,NP,}=min{ p, (x)+ 
H(E— x)| xc E}. According to tinformation 
theory, an example of a polymatroid (E, u) ts 
given by the following: p(x), xe E, is the simul- 
taneous tentropy of a subset x of the infor- 
mation source E with correlation. 


G. Matroids 


A polymatroid P =(E, u) is called a matroid if 
the values of u are always integers and p(x)< 
|x|=the cardinality of the set x. A set x CE is 
called an independent set if (x) =|x|, and 

a dependent set otherwise. An independent 
set with maximal cardinality is called a base. 
A minimal dependent set is called a circuit. 
The set y with max{|y|| y >x, u(x) =p(y)} is 
uniquely determined by x, which is called 

the closure of x and denoted by cl x. Several 
axioms stated in terms of these notions exist, 
and can be used to characterize matroids. 
Some of them are: 

(1) The axiom for the family of independent 
sets £: (i) Zef; (it) xef and yc x implies 
yeI; (in) If x, ye ¥ and |x|<|y|, then there 
exists an ee E in y—x such that xU fetes. 

(2) The axiom for the family of bases 2: (1) Z 
is not empty; (i1) If b, b'e 2 and b #b', then for 
every eeb—b', there exists an e’ such that e'e 
b'—b and (b—{e})U fete. 

(3) The axiom for the family of circuits @: 
(i) @ eg; (it) For every pair x, ye@, y is nota 
proper subset of x; (111) If x, ye@ and x ¥y, 
then for every e€ xy there exists a zeg such 
that zoxUy—{el. 

(4) The axiom for the closure function cl: 
2% +2: (i) xcclx=cl(cl x) for every xe E; (ii) 
xc y implies cl xccl y; (iti) If eecl(x U {e'})— 
clx, then e'ecl(xU {e})— clx. 

The following are typical examples of ma- 
troids: (1) Let E be the family of finite set of 
vectors in a vector space V and -¥ be the 
family of sets of linearly independent vectors. 
(2) Let E be the set of edges of a tgraph and 
€ be the set of edges constituting a fundamen- 
tal circuit. (3) Let E be a finite set and S be a 
family of subsets of E. x={e,,...,e,} CE is 
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defined to be independent if there exist differ- 
ent elements s4, ..., S, in S such that e;€s; 
for every i=1,...,r. 

Let a weight function w: E—>R be defined 
for every element e€ E in a matroid (E, u). To 
obtain a base b of the matroid with minimal 
weight w(b)= Łe w(e), we can apply the fol- 
lowing procedure, called the greedy algorithm: 
(1) Arrange the elements e,,...,e, of E in such 
a manner that w(e,)<...<w(e,). (2) Put b®:= 
Ø, and i:=0. (3) If x=bU fe,,,}e%, then 
b@*):= x; otherwise, b+) := 5. (4) Repeat 
process (3) for i=0, 1,...,n—1. Then the final 
b™ is the base of minimal weight. 


H. Operations on Matroids 


For a matroid M =(E, u) with the family 2 of 
its base, B* = {E — x| xe 2} also satisfies the 
axioms of the base. The matroid M* =(E, p*) 
with Z* as its base is called the dual of M. 
Here we define p*(x)=|x|+ u(E —x)— u(E). 
When F cE, M|F =(F, up) is called the reduc- 
tion of M =(E, u) to F, where up is the restric- 
tion of u on 2°. A matroid M x F=(M*| F)* is 
called the contraction of M to F. A matroid 
given by the form (M | F) x F’ is called a minor 
of M. From two matroids M, =(E, u,) and M, 
=(E, u) over the same set E, we can construct 
a matroid M = M, v M, =(E, u) whose rank 
function u(x) is defined by the minimum of 

{a (y) + u(y) +|x— yl |y= x}. This is called 
the union of M, and M,. 

Let M =(E, u) be a matroid and « be a posi- 
tive constant. The function (x)= u(x)—a|x| 1s 
submodular. The set L of x which minimizes 
fi(x) is a sublattice of 2". The partition of E 
and the order relation of its classes defined by 
L are called the principal partition of M with 
respect to the parameter a. When the value of 
the parameter « is not specified, æ is taken to 
be 2. 

As was seen in example (1) at the end of the 
preceding section, the family of vectors in 
a vector space V over a field K is a matroid 
M. A matroid isomorphic to such an M is 
called linearly representable over K. If K = 
GF(p), the finite field with p elements, then it 
is called a p-ary matroid. A matroid linearly 
representable over any field is called regular. If 
[G] is a matroid isomorphic to the matroid 
defined by the family of edges of a graph as 
shown in example (2), it is called graphic. If the 
dual M* is graphic, M is called cographic. 
Graphic and cographic matroids are always 
regular. There exist matroids not linearly 
representable over a field K, or linearly repre- 
sentable over a particular field K but not 
regular, or regular but neither graphic nor 
cographic. If [G] is a matroid isomorphic to 
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the matroid of the type in example (3), it is 
called transversal. Such a matroid is linearly 
representable when the cardinality of the field 
K is sufficiently large. 
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A. General Remarks 


A ring R (— 368 Rings) whose multiplication 
is commutative is called a commutative ring. 

Throughout this article, we mean by a ring a 
commutative ring with unity element. 


B. Ideals 


Since our rings are commutative, we need not 
distinguish right or left ideals from tideals. A 
subset a of a ring R is an ideal of R if and only 
if a is an R-submodule of R (— 277 Modules), 
if and only if a is the tkernel of a ring homo- 
morphism from R into some ring (except for 
the case a= R). Given an ideal a of a ring 

R, the set of elements which are tnilpotent 
modulo a, i.e., {xe R|x"ea (An)}, is called the 
radical of a and is often denoted by Ja. The 
radical of the zero ideal is called the radical 
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of R, or more precisely, the nilradical of R 
(— 368 Rings H). 

For a subset S of a ring R, the smallest ideal 
a containing S is called the ideal generated by 
S, and S is called a basis for a; if S is a finite 
set, then S is called a finite basis. When a, 
(AeA) are ideals of R, the sum of these ideals, 
denoted by > a,, is defined to be the ideal 
generated by the union of the ideals a, (AEA). 
If A is a finite set, say {1,2,...,n}, then the sum 
is denoted also by a, +... + a„, and in this case, 
the sum is also called a finite sum. Note that 
a,+...+4,={a,+...+4a,|a;€a;}. The product 
a,...a, of a finite number of ideals a,,...,a, iS 
defined to be the ideal generated by the set 
{a,...a,|a;€a;}. The intersection of an arbi- 
trary number of ideals is an ideal. When a 
is an ideal of a ring R and S is a subset of R, 
the quotient a: S is defined to be the ideal 
{xeR|xSca}. If a,b, c,d, (AEA) are ideals, we 
have (a:b):c=a:be,a:5D,=( ),(a:b,). 


C. Prime Ideals 


An ideal p of a ring R is called a prime ideal if 
R/p is an tintegral domain; p is a prime ideal if 
and only if p#R and also abep (a, be R) im- 
plies aep or bep (some literature includes the 
ring R itself in the set of prime ideals). Let § be 
a multiplicatively closed subset of a ring R, i.e., 
a nonempty tsubsemigroup of R with respect 
to multiplication. A maximal member among 
the set of ideals which do not meet S is called a 
maximal ideal with respect to S. Such a mem- 
ber is necessarily a prime ideal; when S= {1}, it 
is called a maximal ideal of R. An ideal m is a 
maximal ideal of R if and only if R/m is a field. 


D. Jacobson Radical 


The intersection J of ali maximal ideals of a 
ring R is called the Jacobson radical of R; in 
some cases, this intersection J is called the 
radical of R (— 368 Rings H). Let N be an R- 
submodule of a finite R-module M. If MJ + 
N =M, then M =N (Krull-Azumaya lemma 
or Nakayama lemma). 


E. Krull Dimension 


For a prime ideal p of a ring R, the maximum 
of the lengths n of tdescending chains of prime 
ideals p=po 2 Pı F --- ZP, which begin with p 
(or œ if the maximum does not exist) is called 
the height or rank of the prime ideal p. For an 
ideal a, the minimum of the heights of prime 
ideals containing a is called the height of the 
ideal a. The maximum of heights of prime 
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ideals of R is called the Krull dimension (or 
altitude) of the ring R. For an ideal a of R, the 
Krull dimension of R/a is called the Krull 
dimension (or depth) of the ideal a. (The mean- 
ings of the terms rank, dimension, and depth 
now depend on the writings in which they 

are found; standardization of definition of 
these terms is becoming more and more of a 
necessity.) 


F. Primary Ideals 


For an ideal q of R, if q#R and every zero divi- 
sor of R/q is nilpotent, then q is called a pri- 
mary ideal. (If R is included in the set of prime 
ideals, then R is regarded as a primary ideal.) 
In this case, p = Ja is a prime ideal, and q is 
then said to belong to p or to be a p-primary 
ideal. The intersection of a finite number of pri- 
mary ideals belonging to the same prime ideal p 
is p-primary. Assume that an ideal a is ex- 
pressed as the intersection a=q,1...q, ofa 
finite number of primary ideals q,,...,q,,. If 
this intersection is irredundant, that is, if none 
of the q; is superfluous in the expression of a, 
then the set of Jai (i=1,...,n) is uniquely 
determined by a. The £ q; are called prime 
divisors (or associated prime ideals) of the ideal 
a; a minimal one among these is called a mini- 
mal (or isolated) prime divisor of the ideal a, 
and those which are not minimal are called 
embedded prime divisors of the ideal a. A max- 
imal one among the prime divisors of a is 
called a maximal prime divisor of the ideal a. 
(For definitions of these concepts in the case 
where a is an arbitrary ideal, see [4].) If, in the 
expression of a as above, n is the smallest 
occurring in similar expressions, then the 
expression is called the shortest representation 
of the ideal a by primary ideals. In this case, 
each q; is called a primary component of the 
ideal a; if ./q; is isolated, then q; is called an 
isolated primary component of the ideal a, and 
otherwise q; is called an embedded primary 
component of the ideal a. Isolated primary 
components are uniquely determined by a, but 
embedded primary components are not. 


G. Rings of Quotients 


Let R be a ring. Then the set U of elements of 
R which are not zero divisors is multiplica- 
tively closed. In the set R x U={(r,u)|reR, 
ue U}, we define a relation = by (r,u)= 
(r’,u')<>ru'=r'u. Then = is an equivalence 
relation, and the equivalence class of (r, u) is 
denoted by r/u. In the set Q of these r/u, addi- 
tion and multiplication are defined by r/u+ 
rju =(ru'+r'u)/uu’, (r/u)(r'/u')=rr'/uu’. Then 
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QO becomes a ring, and r/1 can be identified 
with r. Thus Q ts a ring containing R, gen- 
erated by R and inverses of elements of U. This 
property characterizes Q, which is called the 
ring of total quotients of the ring R. If R is an 
integral domain, then Q is a field, called the 
field of quotients of the integral domain R. Let 
S be a multiplicatively closed subset of R such 
that O¢S, and let n be {xe R|xs=0 (JseS)} 
and ¢ be the natural homomorphism R-> R/n. 
Then none of the elements of (S) is a zero 
divisor. The subring of the ring of total quo- 
tients of R/n, generated by R/n and inverses of 
elements of ~(S), is called the ring of quotients 
of the ring R with respect to S, and is denoted 
by Rs (R[S~'] or RS~!). When M is an R- 
module, M @ x Rs is called the module of quo- 
tients of the R-module M with respect to S. It 
is significant that Rg is tR-flat. There is a one- 
to-one correspondence between the set of pri- 
mary ideals q of R which do not meet S and 
the set of primary ideals Q of R, such that q 
corresponds to Q if and only if Q=qR; (q= 
QN R). When p ts a prime ideal of R, the com- 
plement R—p is multiplicatively closed, and 
Rpg-, is called the local ring of p or the ring of 
quotients of the ring R with respect to the prime 
ideal p, and is denoted by R, (— 284 Noe- 
therian Rings C, D). A ring of quotients is also 
called a ring of fractions. 


H. Divisibility 


In a ring R, if a=bc (a,b, ce R), then we say 
that b is a divisor (or factor) of a, and that a is 
a multiple of b, or a is divisible by b. We denote 
this by b|a. This relation between a, be R is 
called divisibility relation in R. If, in this situ- 
ation, c has its inverse in R, we say that a is an 
associate of b. A factor b of a is called a proper 
factor if b is neither an associate of a nor tin- 
vertible. An element which has no proper fac- 
tor is called an irreducible element. A nonzero 
element which generates a prime ideal ts called 
a prime element. 

If in an integral domain R every nonzero 
element is a product of prime elements (up to 
invertible factors), then we say that the unique 
factorization theorem holds in R, and that R is 
a unique factorization domain (or simply u-f.d.). 

Let A={a,,...,a,} be a set of nonzero ele- 
ments of a ring R. A common divisor of A is an 
element which is a factor of a;. A common 
multiple of A is defined similarly. The greatest 
common divisor (G.C.D.) of A is a common 
divisor which is a multiple of any common 
divisor; the least common multiple (L.C.M.) of 
A is a common multiple m which is a factor of 
any common multiple. Thus, the G.C.D. and 
L.C.M. exist if R is a u.f.d. 
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1. Integral Dependence 


Let R be a subring of a ring R” sharing a unity 
element with R”. An element ae R” is said to 
be integral (or integrally dependent) over R if 
there are a natural number n and elements c; 
of R such that a"+c,a" '+...+c,=0. If every 
element of a subset S of R” is integral over R, 
we say that S is integral over R. (When R has 
no unity element, a similar definition is given 
under an additional condition that c;e R'. An 
important special case is where R is an ideal. 
See D. G. Northcott and D. Rees, Proc. Cam- 
bridge Philos. Soc., 50 (1954); M. Nagata, Mem. 
Coll. Sci. Univ. Kyôto, 30 (1956).) The set R of 
elements of R” which are integral over R is a 
ring and is called the integral closure of R in 
R". If R=R, then R is said to be integrally 
closed in R”. If R is integrally closed in its ring 
of total quotients, we say that R is integrally 
closed. An integrally closed integral domain is 
called a normal ring. (In some literature, an 
integrally closed ring is called a normal ring.) 
An element ae R” is called almost integral over 
R if there is an element b of R such that b is 
not a zero divisor and a"be R for every natural 
number n. If an element a of the ring of total 
quotients of R is integral over R, then a is 
almost integral over R. R is said to be com- 
pletely integrally closed if its ring of total quo- 
tients contains no elements which are almost 
integral over R except the elements of R itself. 


J. Group Theorem 


Let Q be the ring of total quotients of a ring R. 
An R-submodule a of Q is called a fractional 
ideal of R if there is a non-zero-divisor c of R 
such that cac R. The product of fractional 
ideals is defined similarly as in the case of 
products of ideals. The inverse a! of the 
fractional ideal a is defined to be {xeEQ|xa 
c R}. If a contains an element which is not a 
zero divisor, then a ' is also a fractional ideal. 
When R is completely integrally closed, we 
define fractional ideals a and b to be equiva- 
lent ifa~'=b~!. This gives rise to an equiva- 
lence relation between fractional ideals. The 
set of equivalence classes of fractional ideals 
which contain non-zero-divisors forms a 
group. This result is called the group theorem. 
An integral domain R is called a Krull ring if 
(i) for every prime ideal p of height 1, the ring 
R, is a tdiscrete valuation ring; (ii) R is the 
intersection of all the valuation rings R, and 
(ili) every nonzero element a of R is contained 
in only a finite number of prime ideals of 
height 1. In a Krull ring R, for an arbitrary 
nonzero fractional ideal a, there is a uniquely 
determined product of powers of prime ideals 
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of height 1 which is equivalent to a in the sense 
stated above (— 439 Valuations). 


K. Dedekind Domains and Principal Ideal 
Domains 


A ring R is called a Dedekind domain if (i) R 

is a Noetherian integral domain, (ii) R is a 
normal ring, and (iii) the Krull dimension of R 
is 1. For an integral domain R which is not a 
field, R is a Dedekind domain if and only if the 
set of all nonzero fractional ideals is a group, 
if and only if every nonzero ideal of R is ex- 
pressed as the product of a finite number of 
prime ideals and such-expression is unique up 
to the order of prime factors. An important 
example of a Dedekind domain is the ring of 
all ‘algebraic integers, i.e., the tprincipal order 
of an falgebraic number field of finite degree. 
In general, if R is a Dedekind domain with 
field of quotients K and L is a finite algebraic 
extension of K, then the integral closure of R 
in L and any ring R’ such that RC R'ẸK are 
Dedekind domains. 

An ideal generated by an element is called 
a principal ideal; a fractional ideal generated 
by an element is called a principal fractional 
ideal (or simply principal ideal). In a Dedekind 
domain, the set P of nonzero principal frac- 
tional ideals is a subgroup of the group J of 
nonzero fractional ideals; J/P is called the ideal 
class group of R, and a member of it is called 
an ideal class. The torder of I/P is called the 
class number of R (— 14 Algebraic Number 
Fields). There are many Dedekind domains 
whose class numbers are infinite. 

A ring R is called a principal ideal ring if 
every ideal is principal; furthermore, if R is an 
integral domain, then R is called a principal 
ideal domain. A principal ideal ring is the 
direct sum of a finite number of rings of which 
each direct summand is either a principal ideal 
domain or a flocal ring whose maximal ideal is 
a principal nilpotent ideal. A principal ideal 
domain which is not a field is a Dedekind 
domain and a u.f.d. We consider, for an arbi- 
trary natural number n and a principal ideal 
ring R, the set M(n, R) of all n x n matrices 
over R. Given an element A of M(n, R), there 
exist elements X, Y in M(n, R) such that (i) 
Xt, Y~! are in M(n, R), and (ii) denoting by 
b; the (i,j)-entry of XAY, we have b,,R>b,,R 
>...2b,,R and bj=0 if i#j. The nonzero 
members of the set {b,,,53,...,D,,} are called 
the elementary divisors of the matrix A. Apply- 
ing this to a finite module M over the principal 
ideal ring R, we see that M is the direct sum of 
m,R,...,m,R (m;e M) such that, with a;= 
{xe R|m,x =0}, we have a, ca,c...ca,(— 2 


Abelian Groups B; 269 Matrices E). 
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L. Euclid Rings 


A ring R is called a Euclid ring if there is a 
map ọ of R — {0} into a twell-ordered set W 
(the set of natural numbers is a special case) 
satisfying the condition that if a, be R, a#0, 
then there are r,geéR such that b=aq +r and 
either r=0 or g(r) < o(a). 

Every Euclid ring is a principal ideal ring. 
Besides the ring Z of rational integers, there 
are several familiar examples of Euclid rings, 
such as Z[,/—1], Z[L,/—2], Z[w] (w? = 
1,@#1), and the polynomial ring of one 
variable over a field. 
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68 (XII.10) 
Compact and Nuclear 
Operators 


A. General Remarks 


Let X be a finite-dimensional linear space. 
Then a linear operator in X ts surjective 

if and only if it ts injective. If X is infinite- 
dimensional, this is no longer the case in 
general. For an operator of the form 1+ K 
with an integral operator K of continuous 
kernel, I. Fredholm [1] developed the the- 

ory of determinants and retrieved the above 
equivalence (— 217 Integral Equations). Later, 
F. Riesz [3] simplified the proof and showed 
that the equivalence holds if K is a compact 
operator in a Banach space. I. Ts. Gokhberg 
and M. G. Krein [4] reformulated the result as 
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the stability of indices of operators. The con- 
cept of compact operators itself was introduced 
by D. Hilbert (who used the terminology, “com- 
pletely continuous operators”). The Hilbert- 
Schmidt class is also due to him. These classes 
of operators are employed in the spectral 
theory of integral operators [2]. The trace 
class of operators in Hilbert spaces is defined 
as the class of operators for which the trace 
has meaning. A. Grothendieck [5] introduced 
nuclear operators, extending the definition of 
trace class operators to general Banach spaces. 
Nuclear operators and related classes of 
operators, such as integral operators and 
absolutely summing operators, play important 
roles in the theory of topological tensor prod- 
ucts [5], determinants [6], vector measures, 
and measures in linear spaces [7]. 


B. Compact Operators 


A ‘linear operator T from a tBanach space X 
to a Banach space Y is said to be compact (or 
completely continuous) if T maps any tbounded 
set of X to a trelatively compact set of Y. In 
other words, T is compact if, for any bounded 
sequence {x,} in X, the sequence {Tx,} in Y 
contains a fstrongly convergent subsequence. 
A compact operator is necessarily tbounded 
and hence continuous. A compact operator 
from X to Y maps any tweakly convergent 
sequence in X to a strongly convergent se- 
quence in Y. If X is treflexive, the converse is 
also true. 

In this article the set of all bounded (resp. 
compact) linear operators from X to Y are 
denoted by B(X, Y) (resp. B(X, Y)). 


C. Examples of Compact Operators 


(1) Degenerate operators. An operator 

Te B(X, Y) is said to be degenerate or of 
finite rank if the frange R(T) of T is finite- 
dimensional. A degenerate operator is neces- 
sarily compact. The tidentity operator in X is 
compact (Y= X) if and only if X is finite- 
dimensional. (2) tIntegral operators with con- 
tinuous kernel. Let E and F be bounded closed 
regions in the Euclidean spaces R” and R”, 
respectively, and let k=k(t,s), te F, se E, bea 
continuous function defined on F x E. Then 
the integral operator T with tkernel k, 


ayo-] k(t,s)x(s)ds, te F, (1) 


is a compact operator from the Banach space 
C(E) to C(F). (For the notation for various 
function spaces — 168 Function Spaces.) (3) 
Integral operators of Hilbert-Schmidt type. 

In example (2) let E and F be tLebesgue mea- 
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surable sets, and let ke L,(F x E). Then the 
integral operator T determines a compact 
operator from the tHilbert space L,(E) to the 
Hilbert space L,(F). (4) Let Q be a bounded 
open set in R”, s<t two real numbers, and 
H*(Q) the tSobolev space of order s (— 168 
Function Spaces). Then the natural injection 
from H'(Q) into H*(Q) is compact (F. Rellich’s 
lemma). 


D. Properties of Compact Operators 


Any linear combination of compact opera- 
tors is again compact. If a sequence {T,} of 
compact operators converges in the norm 
(i.e., in the tuniform operator topology), then 
the limit T is compact. Thus B(X, Y) is a 
closed subspace of the Banach space B(X, Y) 
with the toperator norm. Any product of 

a compact operator and a bounded opera- 
tor is compact. Namely, Ae B(X, Y), Be 
B(Y, Z), and CeB(Z, X) imply BAcB(X, Z) 
and ACeB(Z, Y). In particular, B® (X) = 
B(X, X) forms a closed ttwo-sided ideal 

of B(X)=B(X, X). An operator Te B(X, Y) 

is compact if and only if its tdual operator 
T'eB(Y', X^) is compact. The range of a com- 
pact operator is always tseparable. Let X and 
Y be Hilbert spaces. Then for any TeB©(X, Y) 
there exist forthonormal sets {¢,} in X and 
{Way in Y and a sequence {c,} of nonnegative 
numbers with lim c, =0 such that 


Tu=> cpu, PW, UEeX. 


Consequently, any compact operator between 
Hilbert spaces X, Y can be approximated by 
a sequence of degenerate operators in the 
operator norm. However, there are Banach 
spaces X and Y for which the statement is no 
longer true. In fact, a Banach space X (resp. 
the dual Y’ of a Banach space Y) has the tap- 
proximation property (— 37 Banach Spaces L) 
if and only if every Te B“(Y, X) is the limit in 
norm of a sequence of degenerate operators 
for any Banach space Y (resp. X). 


E. The Riesz-Schauder Theorem 


Let Te B(X) and consider a pair of linear 
equations 


u—Tu=f, fex, (2) 


p—Tp=g, gex’, (3) 


where T’¢ B(X’) is the dual operator of T in 
the tdual space X’ of X. Put M = {ue X |u= 
Tu} and .@’={geX'|g=T'g}. Then one 

and only one of the following two cases (i) and 
(ti) occurs. (i) Æ ={0}, M’ = {0}; for any fex 
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equation (2) has a unique solution; and for any 
gé X' equation (3) has a unique solution. (ii) 
dim Æ = dim æ’ =m, 1<m< œ; (2) has a 
solution if and only if f is orthogonal to M’ 
(i.e., p(f)=0 for any pọ € M’), and (3) has a 
solution if and only if g is orthogonal to æ 
(i.e., g(u)=0 for any ue M), This is called the 
Riesz-Schauder theorem. In particular, when T 
is an integral operator in a suitable function 
space, this theorem is also called Fredholm’s 
alternative theorem for integral equation (2). 


F. Fredholm Operators and Their Indices 


Let T be a tclosed linear operator from a Ba- 
nach space X to a Banach space Y. T is called 
a Fredholm operator or an operator with index 
if both Ker T (= null space N(T)) and Coker T 
(= Y/R(T)) are finite-dimensional. Then the 
integer ind T=dim Ker T— dim Coker T is 
called the index of T. If a closed linear opera- 
tor T has Coker T of finite dimension, then 
the range R(T) is closed. Moreover, if R(T) is 
closed and the domain D(T) is dense, then 
dim Coker T= dim Ker T’, where T’ is the dual 
of T, and hence ind T=dim Ker T—dim Ker T”. 
A linear operator K from X to Y is said to be 
T-compact if the domain D(K) contains D(T) 
and if K from D(T) with the tgraph norm into 
Y is compact, i.e., for any bounded sequence 
{x,} in D(T) with sup || Tx,|| < œ, the se- 
quence {Kx,} contains a strongly convergent 
subsequence. 

The following are basic properties of Fred- 
holm operators [4, 11]: (1) If T is Fredholm 
and K is T-compact, then T+ K is Fredholm 
and ind(7+ K)=ind T. (2) If T is Fredholm 
and another linear operator S is sufficiently 
close to T in (graph) norm, then S is also Fred- 
holm and ind S = ind T. (3) If T is a Fredholm 
operator from X into Y and S is a Fredholm 
operator with dense domain in Y into Z, then 
ST is Fredholm and ind ST=ind S + ind T. If 
D(T) is dense, then D(ST) ts also dense. (4) 

Let T be a closed linear operator with dense 
domain; then T is Fredholm if and only if the 
dual T’ is Fredholm, and then ind T’ = —ind T. 

Fredholm operators and their indices were 
first studied by Russian mathematicians in 
connection with boundary value problems of 
differential equations and singular integral 
equations. M. F. Atiyah and I. M. Singer have 
proved that an felliptic linear differential 
operator P of order m on a compact differenti- 
able manifold M is a Fredholm operator from 
the Sobolev space H*(M) into H* "(M) for 
any s and that its index is computed from the 
tsymbol of P and the fcharacteristic classes of 
M (= 237 K-Theory). Similarly, the indices 
of linear ordinary differential operators (and 
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more generally of maximally overdetermined 
systems of linear partial differential operators) 
are computed in various spaces of functions 
and generalized functions (H. Komatsu, B. 
Malgrange, J.-P. Ramis, M. Kashiwara). 


G. Spectra of Compact Operators 


The following structure of the tspectrum of 
compact operators Te B“(X) is derived from 
the Riesz-Schauder theorem. The spectrum 
o(T) of T consists of at most countably many 
points and has no accumulation points except 
possibly for 0. Any nonzero point of o(T) 

is an eigenvalue of T. When X is infinite- 
dimensional, 0 always belongs to a(T) but is 
not necessarily an eigenvalue of T. Each non- 
zero eigenvalue of T has finite (algebraic) 
tmultiplicity, and hence the teigenspace .#,(T) 
= {u| Tu= 2y}, AAO, is finite-dimensional. If 2 
is an eigenvalue of T, then å is an eigenvalue 
of the adjoint operator T* of T with the same 
(either algebraic or geometric) multiplicity as 
that of A. 


H. Spectral Representations of Compact 
Normal Operators 


Let T be a compact tnormal operator in a 
Hilbert space H. Then we can find a tcomplete 
orthonormal set consisting solely of eigen- 
vectors of T. Namely, for each nonzero eigen- 
value 4, of T, take an orthonormal basis {f} 
of the eigenspace associated with /,. Rearrange 
all the of’ into a sequence {¢,} and add to it, 
if 0 is an eigenvalue, a complete orthonormal 
set of the eigenspace associated with 0. Then 
we obtain a desired complete orthonormal 

set of H. Let u, be the eigenvalue associated 
with @,. Then the sequence { y,' is precisely 
an enumeration of nonzero eigenvalues of T 
with repetitions according to multiplicity. 

In terms of {¢g,} and {y,}, a tspectral repre- 
sentation of T is given as 


Tu=} u,(U,Q,)Q,,  uEH. 
a=1 


The eigenvalue problem of a compact non- 
negative ‘self-adjoint operator T can be solved 
by means of the following Rayleigh prin- 
ciple. Consider Rayleigh’s quotient R(x)= 
(Tx, x)/||x||*, x #0. The largest eigenvalue u, 
is obtained as #4, =max,,y R(x) and any vector 
x, which attains this maximum is an eigen- 
vector. When the largest n—1 eigenvalues 
fly, +++, Hn-1 and eigenvectors x,,...,X,-1 are 
determined, the nth eigenvalue p, is the maxi- 
mum of R(x) on the subspace orthogonal to 
X1,-++>Xq_—-, and any vector which attains the 
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maximum is an eigenvector. A more direct 
characterization of y,, involving no previous 
eigenvectors, is given by 


Sises Sn-EX | (x, f)=0 
Sises fn-1 #0 \ JHl, cn 


U,= min ( max ae 


This formula is referred to as the minimax 
principle. 


I. Classification of Compact Operators in 
Hilbert Spaces 


Let H and K be Hilbert spaces. Then a Te 
B(H, K) is compact if and only if (Te,, f,) 0 
for any orthonormal sequences e, in H and 

J, in K. J. von Neumann and R. Schatten 
(Ann. Math., 49 (1948)) classified operators 
Te B(H, K) in the following way. The opera- 
tor A=(T*T)'” is a compact nonnegative 
self-adjoint operator. Let «, >a, 2... be the 
enumeration in decreasing order of the posi- 
tive eigenvalues of A, with each repeated ac- 
cording to its multiplicity. The «, =«,(T) are 
sometimes called the characteristic numbers of 
T. For any p>0 the set of all Te B“(H) such 
that 


© 1/p 
iri,=(3 ar) < +0 
n=1 
is denoted by B,(4) (or simply B,). Among 
these classes, B, and B, are most important. 
B, is called the trace class and B, the Hilbert- 
Schmidt class. Correspondingly, || T ||, and 
|| T ||, are called the trace norm and Hilbert- 
Schmidt norm of T, respectively. B, is also 
called the nuclear class and any operator 
TeB, a ‘nuclear operator. The norm ||T ||, can 
also be defined more directly as follows. (i) If 1 
<p<©, ||T||,=sup ||(Te,, fa); (i) If0<p<2, 
T= inf | (|| Te, I)h; Gii) If2<p<oo, |Tip= 
sup ||(|| Tell, where e, and f, range over 
the orthonormal sequences in H and K, and e, 
over the orthonormal bases in H. The class B, 
is a two-sided * ideal in the Banach algebra B. 
Moreover precisely, Te B, and ReB imply 
|RTp<IR| fT lp. [TRU < IRI Tl], and 
|T*||,=IT||,. When 1 <p<oo, the class B, 
becomes a Banach space with the norm ||7'||,. 
It is always a tquasi-Banach space, in which 
the set of all degenerate operators is dense. 
The norm ||T||, (#0) is a decreasing func- 
tion of p. Hence B, cB, if p<q. Also, TeB, 
and SeB, imply TSeB,, where 1/r=1/p+1/q. 
Let TcB,, and let {y;} be an enumeration of 
eigenvalues of T with repetitions according to 
(either geometric or algebraic) tmultiplicity. 
Then 


Lla <T Ue. 
J 
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Hilbert-Schmidt class. For an arbitrary 
complete orthonormal set {u,}, we have 


ITI2=9 || Tull’. 


Thus B, can be defined as the set of all Te B 
such that the sum on the right-hand side is 
finite for a certain complete orthonormal set 
{u,}. A linear operator T: L,(E)>L,(F) is of 
Hilbert-Schmidt class if and only if it is an 
integral operator of Hilbert-Schmidt type 
(example (3) in Section C). The space B, with 
the norm ||T ||, becomes a Hilbert space with 
the inner product defined by 


(T, S), = > (Tu Su,), 


where {u,} is as above. 
Trace class. For an operator TeB,, the 
trace tr(T) of T is defined as 


tr(T) =. (Tu, u,). 


Here the right-hand side converges absolutely 
and does not depend on the complete ortho- 
normal set {u,}. The trace is a bounded linear 
functional on the Banach space B,. The prod- 
uct of two operators of Hilbert-Schmidt type 
belongs to the trace class, and the converse is 
also true. If 1 <p, q< œ satisfy 1/p+1/q=1, 
then the inner product <T, S> of TeB,(H, K) 
and SeB,(K, H) is defined by <T, SX =tr(ST). 
Under this inner product the dual of the 
Banach space B,(H, K) is identified with 
B,(K, H). Similarly, the dual of B©(H, K) 

is isomorphic to B,(K, H), and the dual of 

B, (K, H) to B(H, K) (J. Dixmier, Schatten). 


J. Volterra Operators 


A compact operator T in a Hilbert space is 
said to be a Volterra operator if it is tquasi- 
nilpotent, i.e., its fspectral radius is 0. The 
integral operator 


b 


(Tx)(t)= | k(t,s)x(s)ds, xeL,(a,b), 


t 

appearing in the integral equation of Volterra 
type is a Volterra operator. Conversely, a 
Volterra operator satisfying a suitable con- 
dition is unitarily equivalent to such an inte- 
gral operator (M. S. Livshits, Gokhberg and 
Krein [13]). A Volterra operator admits an 
abstract triangular representation in a manner 
similar to Jordan’s canonical form (— 390 
Spectral Analysis of Operators H). 


K. Nuclear Operators 
Extending the definition of trace class to 


operators in Banach spaces, A. Grothendieck 
[5] defined a nuclear operator (or Fredholm 
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operator according to [6]) from a Banach 
space X into a Banach space Y to be a linear 
operator T: XY that is represented as Tx = 
È A;<x, a; b; with a sequence A,>0 in l}, a 
bounded sequence a; in X’, and a bounded 
sequence b, in Y. In other words, a linear 
operator T: X >Y is nuclear if and only if it is 
decomposed as the product 


X41, A1,4Y, (4) 


where A and B are bounded linear operators 
and A is multiplication by å; in /,. The in- 
fimum of || A|| ||4;ll,, || B|| is called the nuclear 
norm of T and is denoted by ||T||,. When 
X and Y are Hilbert spaces, this coincides 
with the trace norm. The integral opera- 
tor T ‘defined by (1) is nuclear with || T |}, = 
{sup, k(t, s)|ds. The totality B,(X, Y) of nuclear 
operators T: X —=Y forms a Banach space 
under the nuclear norm. If T is nuclear and 
A is bounded, then || TA||, SITI || A|| and 
|AT ||, SIAI || TH ,. If T is nuclear, then the 
dual T’ is nuclear, and || T’||, < || T||,. Suppose 
that X and Y are Banach spaces satisfying one 
of the following conditions: (i) Y is the dual of 
a Banach space; (it) X’ has the approximation 
property; (iii) Y” has the approximation prop- 
erty. Then conversely a Te B(X, Y) is nuclear 
if the dual T” is nuclear, and ||T||,=||T"||,. 
However, this is not necessarily the case in 
general (T. Figiel and W. B. Johnson). 
Replacing l, by /,, we obtain the definition 
of operators T: X > Y of summable pth power. 
Grothendieck [5] considered the case O<p<1l 
and showed that if T,,..., T, are of summable 
Pis- Path power, then the product T, ... T, is 
of summable rth power, where r is given by 


l1/r=(X 1/p)— (n+ 1/2. 


L. Traces and Determinants of Operators 


Let T be a linear operator of tfinite rank in a 
linear space X. There are a finite number of 
elements a;e X’ and b;e X such that 


Tx= 3 <x, abi. (5) 
i=1 


Then tr(T)= > <b;,a;> and det(1 —T)= 
det(ô; ;— <b;,a;>) are independent of the repre- 
sentation (5). Let X be a Banach space. If 
TeB,(X) is represented as Tx = 11; x, a;>b;, 
then it seems reasonable that the trace tr(T) 
be defined by X 4;(5;,a;>, but the sum may 
depend on the representation. It ts known that 
a Banach space X has the approximation 
property if and only if the sum does not de- 
pend on the representation for any nuclear 
operator T. However, if T is of summable 
(2/3)rd power, the trace tr(T) is always de- 
fined uniquely. If X or T satisfies the afore- 


257 


mentioned condition, then the determinant 
det(1 — T) is also defined uniquely as the limit 
of det(1— T,), where T, is the nth partial sum. 
det(1 —zT) is an entire function of z and its 
zeros are exactly the reciprocals of the nonzero 
eigenvalues of T. If det(1—zT) #0, then the 
resolvent (1—zT)~! is given by Fredholm’s 
formula extending tCramer’s rule. Eigen- 
vectors are also computed explicitly [6]. 

Let H be a Hilbert space, and let TeB, (A). 
Then det(1 —zT)=[](1 —z4,), where 4; are the 
nonzero eigenvalues of T, and is an entire 
function of genus 0. Moreover, let p>2 be an 
integer. Then the modified determinant 


det,(1-—zT)=|] fa -zije 5 k`! aa) | 
k=1 


J 


is defined for Te B,(H) and is an entire func- 
tion of genus p— 1. This type of determinant 
was introduced by T. Carleman (Math. Z., 9 
(1921)) for the Hilbert-Schmidt class of oper- 
ators (— also Hilbert [2]). It is utilized to 
prove the completeness of the troot vectors of 
T and many other facts [9, 12, 13]. 


M. Weakly Compact Operators 


A linear operator from a Banach space X to 
a Banach space Y is said to be weakly com- 
pact if T maps any bounded set of X to a rela- 
tively weakly compact set. Weakly compact 
operators are bounded. They have proper- 
ties similar to those of compact operators. 
The following are equivalent conditions for a 
bounded linear operator T: X >Y. (1) T is 
weakly compact; (2) T’: Y’—> X’ is weakly 
compact; (3) T”: X” > Y” maps X” into Y; 
(4) There is a reflexive Banach space Z and 
bounded linear operators $: X >Z and R: 
Z— Y such that T= RS. The last characteri- 
zation is due to W. J. Davis, Figiel, Johnson, 
and A. Pełczyński. Any linear combination of 
weakly compact operators is weakly compact. 
The uniform limit of a sequence of weakly 
compact operators is weakly compact. The 
product of a weakly compact operator and a 
bounded operator is weakly compact. 
Dunford-Pettis theorem ([9]; R. S. Phillips, 
Trans. Amer. Math. Soc., 48 (1940)). Let (Q, 2) 
be a o-finite measure space and X a Banach 
space. If T:L,(Q)—X is a weakly compact 
linear operator, then there exists a bounded 
strongly measurable function g on Q with 
values in X such that 


T= | foto 


Then T maps weakly convergent sequences in 


L,(Q) into strongly convergent sequences in X. 


In particular, the product of two weakly com- 
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pact linear operators in L,(Q) is compact. 
Similarly, Grothendieck and R. G. Bartle, N. 
Dunford, and J. Schwartz (— [9]) have proved 
the following: Let Q be a compact Hausdorff 
space and X a Banach space. If T: C(Q)--X is 
a weakly compact linear mapping, then there 
is a fvector measure p defined on the *Borel 
sets in Q with values in X such that <u, x’ isa 
*tRadon measure for any x'e X’ and 


Tf= f (t)du(t). 


T maps weakly convergent sequences in C(Q) 
into strongly convergent sequences in X. The 
product of two weakly compact linear opera- 
tors in C(Q) is also compact. 


N. Absolutely Summing Operators 


A linear operator T from a Banach space X to 
a Banach space Y 1s called an integral operator 
if there is a positive Radon measure u on the 
product A x B of the unit ball A in X’ and 

the unit ball Bin Y” both equipped with the 
weak* topology such that 


(Tx, y) = | CD W'S") AUX y”), 


or equivalently if T:X > Y” is decomposed 

as (4) with /,, and /, replaced by L,,(M) and 
L,(M) for a suitable compact space M with a 
Radon measure. Nuclear operators are clearly 
integral. Integral operators are weakly com- 
pact, but they are not necessarily compact or 
nuclear. However, the product of an integral 
operator and a weakly compact operator is 
nuclear, and hence every integral operator in 
a reflexive Banach space is nuclear. 

If a compact (resp. weakly compact) opera- 
tor T from a Banach space X to a Banach 
space Y maps a closed linear subspace X, of 
X into a closed linear subspace Y, of Y, then 
the restriction T,:X,— Y, and the induced 
operator T': X/X,- Y/Y, are also compact 
(resp. weakly compact). The corresponding 
result does not hold for nuclear operators and 
integral operators. The following conditions 
are equivalent for a bounded linear operator 
T:X >Y: (i) There is a Banach space Y > Y 
such that T: X > ¥ is integral; (ii) there is a 
positive Radon measure yp on the unit ball 
A of X’ such that 


|| Tx|| < [lc x'>| dulx’); 


(iii) there is a constant C such that 


ȘI Tx;l| <Csup{¥|<x;,x’>|] Ix] <1} 


holds for any finite set {x,,...,x, in X; (iv) if 
È x; is an tunconditionally convergent series 
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in X, then È || Tx;|| converges. Grothendieck 
called such an operator a right semi-integral in 
view of property (i), and A. Pietsch called it 
absolutely summing in view of (iv). Similarly, 
a bounded linear operator T: X —> Y is called a 
left semi-integral if there is a Banach space ¥ 
such that X = X/X;, for a closed subspace X,, 
and the product T: X >Y is integral. The 
product of two right (resp. left) semi-integral 
operators is nuclear. A bounded linear opera- 
tor in a Hilbert space is right (or left) semi- 
integral if and only if it is of Hilbert-Schmidt 
class. A bounded linear operator T from a 
Banach space X into a Banach space Y is 
integral if and only if the dual T’ is integral. 
It is a right (resp. left) semi-integral if and only 
if T’ is a left (resp. right) semi-integral. 

For other related classes of operators — 
Grothendieck’s paper in Boletin Soc. Mat. São 
Paulo, 8 (1953). 
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A. Compact Groups 


A ‘topological group G is called a compact 
group if the underlying topological space of G 
is a ‘compact Hausdorff space. The ttorus 
group T"=R"/Z’ (n= 1, 2, ...) (commutative 
group), the forthogonal group O(n), the tuni- 
tary group U(n), the tsymplectic group Sp(n), 
and the additive group Z, of tp-adic integers 
are compact groups (— 60 Classical Groups; 
for other compact Lie groups — 249 Lie 
Groups, 248 Lie Algebras). Let C(G) be the 
‘linear space formed by all the complex-valued 
continuous functions f, g, h, ... defined on a 
compact group G; C(G) is a *Banach space 
with the norm 


iI fll =sup | f(x)]. 
xeG 

Since a compact group G is tlocally com- 
pact, there exists a right-invariant *Haar mea- 
sure on G. Because of the compactness of G, 
the total measure of G is finite, and the mea- 
sure is also left-invariant. By the condition 
that the total measure is 1, such a measure is 
uniquely determined. The integral of f in C(G) 
relative to this measure is called the mean 
value of f. Since for f, ge C(G), f(xy ')g(y) is 
continuous in two variables x, y, the tconvo- 
lution f x g(x)= f f(xy ')g(y)dy also belongs 
to C(G). C(G) constitutes a ring under the 
multiplication defined by the convolution. 
This ring can be considered as an extension of 
the notion of a group ring for finite groups; it 
is called the group ring of the compact group 
G. The function x > f(x~') will be denoted by 
f*, and the inner product in the tfunction 
space L(G) will be written as (f, g). 


B. Representations of Compact Groups 


Let G(E) be the group of units of tbounded 
linear operators on a Banach space E, and 
suppose that we have a homomorphism U ofa 
topological group G into G(E). The homomor- 
phism U is called a strongly (weakly) contin- 
uous representation on E of G if, for any ac E, 
the map x U(x)a of G into E is continuous 
with respect to the strong (tweak) topology on 
E. When E is a *Hilbert space, a strongly con- 
tinuous representation U such that every 
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U (a) is a unitary operator is called a tuni- 

tary representation. Let U be any strongly 
continuous representation of a compact 

group G on a Hilbert space E; for a, b in E, let 
<a, b> be the mean value of the tinner product 
(U(x)a, U(x)b) on E. Then U is a unitary repre- 
sentation on the Hilbert space E with the new 
inner product <a, bY. 

A representation U of G on a Banach space 
E is said to be irreducible if E contains no 
closed subspace other than {0} and E, which is 
invariant under every U (x)(xe G). If a weakly 
continuous representation of a compact group 
on a Banach space E is irreducible, then E is 
finite-dimensional. Moreover, any unitary 
representation U of a compact group on a 
Hilbert space E can be decomposed into a 
discrete ‘direct sum of irreducible representa- 
tions. Namely, there exists a family {E,},<4 of 
irreducible (hence finite-dimensional) invariant 
subspaces E, of E which are orthogonal to 
each other such that E=),.., E, In particular, 
any continuous representation of a compact 
group on a finite-dimensional space is tcom- 
pletely reducible. In L,(G) with respect to the 
tHaar measure on a ‘locally compact group G, 
the representation U defined by (U(x) f)(y)= 
fx (x, ye G, fe L,(G)) ts a unitary repre- 
sentation of G. This representation U is called 
the (right) regular representation of G. Decom- 
position of the regular representation of a 
compact group G into irreducible represen- 
tations is given by the Peter-Weyl theory, 
described later. 

In the rest of this article, the representations 
under consideration will be (continuous) repre- 
sentations by matrices of finite degree. Let 
D, (x) =(di})(x)) and D,(x) =(d{}(x)) be irredu- 
cible unitary representations which are not 
mutually tequivalent. Then from *Schur’s 
lemma follow the orthogonality oe 
(dP, d?)=0 and (n, dP, / ny dD) = Sim Sin 
(where n, is the degree of the ea 
D,). From each ‘class D, of irreducible repre- 
sentations of G, choose a unitary representa- 
tive D,(x) = (df;(x)), and let n, denote its de- 
gree. Then from the orthogonality relations it 
follows that the Jn a, (x) form an tortho- 
normal system of L,(G). 

Let he C(G) and consider the map H: f-> 
hx fof C(G) to C(G). Then H is a compact 
operator in C(G). Since C(G) is contained in 
L(G), we can define the inner product in C(G). 
By (hx f g) =(f,h* x g), h=h* implies (Hf, g) 
=( f, Hg); that is, H is a tHermitian operator. 
For a given f in C(G), there exists h(=h*) in 
C(G) such that h x fis uniformly arbitrarily 
near f. From the theory of compact Hermitian 
operators, Hf can be uniformly approximated 
by linear combinations of the teigenfunctions 
of H. Since the teigenspace of H is finite- 
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dimensional and invariant under U (a), the 
eigenfunctions of H are linear combinations of 
a finite number of Jn, a, (x). Hence any func- 
tion f in C(G) can be uniformly approximated 
by linear combinations of a finite number of 
Jn, d;,(x). This fact, like the similar result in 
tFourier series, is called the approximation 
theorem. From this, it follows that the or- 
thogonal system { /n, d(x)} } is tcomplete; i.e., 
if an element of C(G) is orthogonal to each 
element in this system, then it is 0. 

Since C(G) is dense in L(G), {n d3(x)} is 
a ‘complete orthonormal system of the Hil- 
bert space L(G). Hence for any feL,(G), its 

“Fourier series” $, Xi; Ch / Ng d(x) (where c;; 
=(f,./n, d})) converges to f in the mean of 
order 2 (i.e., with respect to the tmetric of 
L,(G)). In particular, if G is a compact *Lie 
group and f is sufficiently many times dif- 
ferentiable, then this series converges uni- 
formly to f. 

The space V,* of dimension n, spanned by 
the elements dj(x) (1 <j<n,) in the ith row of 
the matrix D,(x) is invariant under the right 
regular representation U. The representation 
on V,* given by U can only be D,. Then the 


fact that {n d?(x)} is an orthonormal system 
of L(G) means that the regular representation 
U of a compact group G is decomposable into 
a discrete direct sum of finite-dimensional 
irreducible representations. Each irreducible 
representation D, is contained in U with multi- 
plicity equal to its degree n,. 

If a function (x) in C(G) satisfies p(y 'xy) 
= (x) for any x, y, then it is called a class 
function. The set K(G) of all the continuous 
class functions coincides with the tcenter of the 
group ring C(G). The tcharacter of an irredu- 
cible representation of G is a class function, 
and the set {z,(x)} of all characters plays 
the same role as the orthogonal system 
{./n, dz(x)} in C(G). Namely, {7,(x)} is a 
complete orthonormal system in (the tcom- 
pletion of) K(G), and any class function can be 
uniformly approximated by linear combi- 
nations of a finite number of these characters. 

The preceding paragraphs give a brief de- 
scription of the Peter-Weyl theory. If G is the 
1-dimensional torus group T' = R/Z, namely, 
the compact group of real numbers mod 1, 
then this is actually the theory of Fourier 
series concerning periodic functions on the 
line. (For concrete irreducible representations 
of O(n), U(n), Sp(n), and formulas for charac- 
ters — 60 Classical Groups. For represen- 
tations of compact Lie groups — 249 Lie 
Groups, 248 Lie Algebras.) The theory of 
compact groups was completed by F. Peter 
and H. Weyl (Math. Ann., 97 (1927)), and J. 
von Neumann’s theory concerning almost 
periodic functions in a group (1934) united the 
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theory of compact groups with H. Bohr’s 
theory of almost periodic functions (— 18 
Almost Periodic Functions). 


C. Structure of Compact Groups 


Let a be an element of a compact group G 
different from e. Since the underlying space of 
a topological group is a tcompletely regular 
space, there exists a function in C(G) such that 
f(a)# f(e). Hence there exists a representation 
D(x) of G such that D(a) is not equal to the 
unit matrix. This means that any compact 
group G can be expressed as a tprojective limit 
group of compact Lie groups. Beginning with 
this fact, von Neumann (1933) showed that a 
tlocally Euclidean compact group is a Lie 
group (— 423 Topological Groups N). 


D. Set of Representations 


The set G' = {D} of representations of G by 
matrices admits the following operations: (i) 
D, ® D, (tensor product representation); (ii) 
& 0 ) 
D,@D,= (tdirect sum representa- 
0 D, 
tion); (iii) P~' DP (equivalent representation): 
and (iv) D (complex conjugate representation). 
Let M be a subset of G’ such that De M when- 
ever De M and the irreducible components 
of D, ® D, are in M whenever D,, D, eM. 
Then M is called a module of representations 
of G. There is a one-to-one correspondence 
between closed normal subgroups H of G and 
modules M formed by all the representations 
of G/H. 

A representation of G’ is a correspondence 
which assigns to each D a matrix A(D) of the 
same degree as that of D and preserves the 
operations of G’: A(D, ® D,)=A(D,) ® A(D3), 
A(D, ® D2)= A(D,) ® A(D2), A(P DP) = 
P~'A(D)P, and A(D) = A(D). Let G” be the 
set of all the representations of G’. Define 
the product of A,, A,eEG” by A, A,(D)= 
A,(D)A,(D) and a topology on G” by the 
tweak topology of the functions A(D) of D. 
Namely, a typical neighborhood of A, is of the 
form U(Ap; Dy, ...,.D.;£)={A| || A(D;)— Ao(D)) || 
<e,i=1,...,s}. G” is a topological group 
under this multiplication and topology. Then 
the Tannaka duality theorem states that G” = 
G holds (T. Tannaka, Téhoku Math. J., 45 
(1939)). Let R(G) be the talgebra over the 
complex number field C formed by the set of 
all the linear combinations of a finite number 
of df,(x), and let Aut R(G) be the automor- 
phism group of this algebra R(G). Let G* 
be the set of all the elements o in Aut R(G) 
which commute with every left translation 
L(x) (L(x) f)(y) = f(xy)) and which satisfy 
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o(f )=o(f). Then G* is a topological group 
with respect to the weak topology, and the 
Tannaka duality theorem implies that the 
correspondence which assigns to each xe G 
the right translation R(x) (the restriction of 
U(x) to R(G)) is an isomorphism of G onto 
G* as topological groups. For the case where 
G is a compact Lie group, C. Chevalley re- 
stated the Tannaka duality theorem as one 
giving a relation between compact Lie groups 
and complex algebraic groups (— 249 Lie 
Groups U). 
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A. General Remarks 


The notion of complexes was introduced by H. 
Poincaré to study the topology of tmanifolds 
by combinatorial methods (— 65 Combina- 
torial Manifolds). Various kinds of complexes 
have been introduced in the course of the 
development of topology. These will be dealt 
with individually in the sections that follow. 


B. Euclidean Simplicial Complexes 


Let R^ be the N-dimensional tEuclidean 
space, and let ay, a,,...,a,, be points of R” 
with coordinates a; = (af), a!”’, ..., a) for i=0, 
1,...,m. For real numbers Ay, f1, ---, Ams WE 
denote by Agdy +A, a, +... + Amam the point 
(Zioa, Lio Ajal”, ..., LiLo 4af) of R”. 

A set of n+1 points do, a,,...,a, of RY is 
said to be independent or in general position if 


vectors Ajay, G9az,...,a 9d, are linearly inde- 
pendent. For given independent points ao, 
i, -âp let |aga, ...a,| denote the subset of 
R“ given by 


[aoa] ...a,|={Agag tA, a, 4+... 
+A, Ay|AgtA,+...+4,=1,4; 20}. 


|@ga, ...a,| is called an n-simplex with vertices 
do, 4;,...,4,. Simplexes are denoted by A, A’, 
a, t, etc. For example, a 0-simplex |ag} is a 
point dy, a 1-simplex |a,a,| is a segment aoa, 
and a 2-simplex |aya,a,| is a triangle Aaga; az. 
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We attach to Aoao +4,4a,+...+4,a, the co- 
ordinates (Ap, A,,...,4,), called the barycentric 
coordinates. If |a)a, ...a,| is an n-simplex, 
every subset {d;,,d;,,..-,d;,} Of {ap, 41, ...,dy} 
is independent. A q-simplex |a;,a;, ....a;,| is 
called a q-face of |aga, ...a,|. 

A set 8 of simplexes in a Euclidean space R“ 
is called a Euclidean (simplicial) complex in R“ 
if Ę satisfies the following three conditions: (i) 
Every face of a simplex belonging to § is also 
an element of K. (ii) The intersection of two 
simplexes belonging to S is either empty or a 
face of each of them. (itt) Each point of a sim- 
plex belonging to & has a neighborhood in R“ 
that intersects only a finite number of sim- 
plexes belonging to 8. A Euclidean complex is 
also called a geometric complex (or rectilinear 
complex). Each 0-simplex in K is called a ver- 
tex in 8. We define the dimension of & to be 
n if R contains an n-simplex but no (n+ 1)- 
simplex, and oo if R contains n-simplexes for 
all n>0. 

By a subcomplex of a Euclidean complex & 
we mean a Euclidean complex that is a subset 
of 8. For a Euclidean complex §, its r-section 
(or r-skeleton) is defined to be the subcomplex 
of & consisting of all n-simplexes (n <r) in K. 

If 8 is a Euclidean complex in R”, we de- 
note by |K] the set of points in R” belonging to 
simplexes in K. This set || is called the Eucli- 
dean polyhedron of &. 

By subdivision 8’ of a Euclidean complex & 
we mean a Euclidean complex such that |8'] 
=|] and each simplex in &’ is contained in a 
simplex in K. 

Specifically, we can construct a subdivision 
K'of K utilizing tbarycenters of simplexes in 8; 
namely, we let &’ be the set of all r-simplexes 
whose vertices consist of barycenters of the 
series AJC A, c... cÅ, of simplexes in 8. Then 
S is a subdivision of K, called the barycentric 
subdivision of 8 and denoted by Sd K. 

Given a Euclidean complex & and a subset 
A of |K], we define the star of A in 8 to be the 
subcomplex of & that consists of simplexes {A} 
and their faces such that AN A # @. Further- 
more, we define the open star of A in 8 as 
the union of topen simplexes (the interiors of 
simplexes) of 8 whose closures intersect A. We 
denote by Stg(A) the star of A in 8 and by 
O,(A) the open star of A in 8; then O,(A) is an 
open set whose closure is |Stg(A)|. 

The notion of Euclidean simplicial com- 
plexes can be generalized to that of Euclidean 
cell complexes; this is done by replacing the 
term simplex by tconvex cell in the definition of 
Euclidean simplicial complex. For a Euclidean 
cell complex K, the notions of vertex, dimen- 
sion, subcomplex, r-section, and subdivision 
are defined similarly as in the case of Eucli- 
dean simplicial complexes. 
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C. Simplicial Complexes 


Given a Euclidean simplicial complex K, let K 
denote the set of all the vertices in K, and let 
Z denote the set consisting of those subsets 
{Uo,.-.,v,} of K for which there exist simplexes 
A in K such that {vo,...,v,} coincide with the 
set of vertices of A. Then we have (1) if seX 
and s>s’, s' # @, then s'e X; (2) every set con- 
sisting of a single element in K is in X, and the 
empty set is not in 2. 

In general, if a pair (K,2) of a set K anda 
set X consisting of finite subsets of K satisfy (1) 
and (2), then the pair (or the set K) is called an 
abstract simplicial complex (or simply sim- 
plicial complex). If K is a simplicial complex, 
each element of the set K is called a vertex in 
K, and each set of Z is called a simplex in K. A 
simplex consisting of n+ 1 vertices is called an 
n-simplex. We say that a simplicial complex K 
is finite if it consists of a finite number of ver- 
tices; it is locally finite if every vertex of K 
belongs to only finitely many simplexes in K. 
We define similarly countable simplicial com- 
plexes and locally countable simplicial com- 
plexes. The dimension and r-section of a sim- 
plicial complex are defined as in the case of 
Euclidean complexes. By a subcomplex of a 
simplicial complex K we mean a simplicial 
complex Ky such that each simplex of Ky is a 
simplex in K. 

Let K and L be simplicial complexes. A 
mapping g: KL is called a simplicial map- 
ping (simplicial map) of K to L if the following 
condition is satisfied: If vg, v,,...,v, are ver- 
tices of a simplex of K, then @(v9), øv), ..., 
o(v,) are vertices of some simplex of L. Two 
simplicial complexes K and L are said to be 
isomorphic if there exist stmplicial mappings 
yg: K>L, y:L-K such that yog and gow 
are the identity mappings. 

Given a simplicial complex K, let |K| denote 
the set of all functions x from the set of ver- 
tices of K to the closed interval J =[0, 1] satis- 
fying the following conditions: (i) The set 
{ve K | x(v) 40} is a simplex of K. (ii) E ex x(v) 
= |. The value x(v) is called the barycentric 
coordinate of the point xe|K| with respect to 
the vertex v. Each vertex v of K is identified 
with the point of |K| whose barycentric co- 
ordinate with respect to the vertex v is 1 and is 
called a vertex in |K|. For a simplex s={v9,0,, 
..+,U,} in K, we define |s|={xe|K||x(v)=0 
(v¢s)}, which is called a simplex in |K]. We 
call {xe]s||x(v) >0 (i=0, 1,...,n)} an open 
simplex of | K| or the interior of |s|. We remark 
here that for an arbitrary simplex sin K a 
point xe|s| can be written in the form x= 
Yves x(V): v. We can define a metric d on |K] 
by d(x, y)=(Lvex(x(v) — y(v))?)"”. However, | K| 
is usually supplied with a tstronger topology 


70C 
Complexes 


defined as follows: (1) Each simplex |s| in |K| 
has the topology given by the metric d. (2) A 
subset U of |K| is open if and only if U N |s| 

is an open subset of |s| for each simplex s of 
K. Henceforward, by the topology of |K| we 
mean the topology just defined unless other- 
wise stated. The set |K| with such a topology 
is called the polyhedron of K. The topology 
of |K| coincides with the above metric topol- 
ogy if and only if K is locally finite. If sim- 
plicial complexes K and L are isomorphic, 
then |K] and |L| are homeomorphic. If K is 
the simplicial complex defined by a Euclidean 
simplicial complex K, then | Kj and |K] are 
homeomorphic. When K is finite, there exists 
an Euclidean simplicial complex 8 whose sim- 
plicial complex is isomorphic to K. Accord- 
ingly |K| is homeomorphic to the Euclidean 
polyhedron ||. 

If K and L are simplicial complexes, a map- 
ping f:|K|—>|L] satisfying the following con- 
dition is said to be linear: If s={v9, v,,...,0,} 
is a simplex in K and xX=Agto t+... +AU, (Ag + 
$4,=1,4;20), then f(v9), ..., f(v,) belong 
to a simplex in L and f(x)=Apf(vo)+...+ 
Anf (va). The linear mapping determined by 
a simplicial mapping @: KL is denoted by 
|~p|:|K|—|L] and is also called a simplicial 
mapping and denoted by the same letter ¢. 
Let K and K’ be simplicial complexes. If 
there exists a linear mapping /:|K’|-|K], 
which is a homeomorphism, then we identify 
|K| and |K’| by / and call K’ a subdivision of 
K. The barycentric subdivision Sd K of a sim- 
plicial complex K ts defined as in the case of 
Euclidean complexes. We also have notions of 
star St,(A) and open star O,(A) for a simpli- 
cial complex K and a subset A of |K|. If K 
and L are simplicial complexes, a mapping 
f:|K|—|L| is called a piecewise linear map- 
ping if there exist subdivisions K’ and L’ of 
K and L, respectively, such that f:|K’|>|L’| is 
linear. 

Given an open fcovering M={M,} ex ofa 
topological space X, the index set K becomes 
a simplicial complex if we consider each finite 
nonempty subset s of K such that (),.;M,# 
@ (empty) to be a simplex. The resulting sim- 
plicial complex K is called the nerve of the 
open covering M. Furthermore, if Pt={M,} <x 
and M={N,} yer are open coverings of a set X, 
N is a trefinement of the covering W, and L is 
the nerve of Yt, then a simplicial mapping o: 
L-K is defined by sending each vertex w in 
L to a vertex v in K such that Nc M,,. 

Given two disjoint simplicial complexes K 
and L, a simplicial complex K * L, called the 
join of K and L, is defined by the following: (1) 
the vertices of K » L are the vertices of K and 
the vertices of L. (2) A nonempty subset of 
vertices is a simplex of K « L if and only if its 
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subsets in K and L are empty or simplexes 
there. In particular, the join of a simplicial 
complex K and a single point is called the cone 
of K. 

A simplicial complex K is said to be ordered 
if a tpartial ordering is given in the set of ver- 
tices in K such that the set of vertices of each 
simplex is ftotally ordered. Given ordered 
simplicial complexes K and L, an ordered 
simplicial complex K x L, called the Cartesian 
product of K and L, is defined by the following: 
(1) The vertices in K x L are pairs (v, w), where 
v and w are any vertices in K and L, respec- 
tively. (2) A set of vertices (v9, Wo), -.-, (Un, Wp) 
such that vo <...<v, and wọ S... Sw, is a 
simplex in K x L if (vo, .-., Va) and (Wo, ..., Wp) 
are simplexes in K and L, respectively; all 
simplexes in K x L are obtained in this man- 
ner. (3) (vı, w1) < (v2, w2) if and only if v; <v, 
and w; S w3. 

Assume that either K or L is locally finite or 
that both K and L are locally countable. Let 
X and Y be topological spaces and J be the 
closed interval [0, 1]. We define an equivalence 
realtion ~ in the topological space X U(X x Y 
x DU Y by x~ (x, y,0) and y~ (x, y, 1), where 
xexX, ye Y. The quotient space of XU(X x Y 
x I)U Y by this relation is called the join of X 
and Y and is denoted by X « Y. Then the poly- 
hedron |K x L| is homeomorphic to the prod- 
uct space | K| x |L| of the topological spaces 
|K| and |L|. And the polyhedron |K + L| is 
homeomorphic to | K|» |L]. 

By a triangulation T of a topological space 
X we mean a pair (K, t) consisting of a sim- 
plicial complex K and a homeomorphism 
t:|K|>X. A triangulation is also called a 
simplicial decomposition. If T=(K, t) is a trian- 
gulation of X, the various concepts defined for 
K can be transferred to X by means of the 
mapping t. For example, by a simplex of the 
triangulation T we mean the image of a sim- 
plex of |K| under t. We say that a triangula- 
tion T=(K, t) is finite if K is a finite simplicial 
complex. If T=(K, t) is a triangulation and 
(K’, 1) is a subdivision of K, then T’=(K’,tol) 
is called a subdivision of T. If T, =(K,,t,), T, 
=(K,,t,) are triangulations of topological 
spaces X,, X, respectively, a mapping f:X, 

— X, is called a simplicial mapping relative to 
T, and T, if t;'o fot,:|K,|>|K,| is a sim- 
plicial mapping. The following two problems 
on triangulations are famous: (1) Under what 
topological conditions ts it possible for a given 
topological space to be supplied with a trian- 
gulation? (2) Given two triangulations T}, T> 
of a space X, are there subdivisions T; = 
(Kiti) T> =(K3,t2) of T, and T}, respec- 
tively, such that K; and K; are isomorphic? 
Concerning the second problem, the conjec- 
ture asserting the existence of subdivisions Tý 
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and T; as above is known as the fundamental 
conjecture (Hauptvermutung) in topology. 
Every 3-dimensional manifold is triangulable, 
and any two of its triangulations admit sub- 
divisions satisfying the condition in (2) (E. E. 
Moise [7]). 

In 1961, J. Milnor showed that the funda- 
mental conjecture is not true for polyhedra 
[8]. The triangulation problem and the funda- 
mental conjecture for topological manifolds 
were negatively solved by R. Kirby and L. 
Siebenmann [9] (— 65 Combinatorial Mani- 
folds). Any ‘differentiable manifold is triangu- 
lable, and the fundamental conjecture holds 
for its *C’-triangulations (r > 1) [6] (— 114 
Differential Topology). 

Let T,, T, be triangulations of topological 
spaces X,, X}, respectively, and let f: X, >X, 
be a continuous mapping. Then a simplicial 
mapping @: X,—X, relative to T, and T, is 
called a simplicial approximation to f if, for 
each xe xX, the image (x) lies on the simplex 
of T, whose interior contains f(x). The follow- 
ing existence theorem is called the simplicial 
approximation theorem: For every continuous 
mapping f: X,—X>, there exist a subdivision 
T, of T, and a simplicial mapping ọ: X, >X, 
relative to Tj and T, that is a simplicial ap- 
proximation of f. If the triangulation T, is 
finite, then for a sufficiently large n we can 
choose Sd" T, as the T; above (where Sd? T = 
T and Sd" T =Sd(Sd""! T) (n>1)). If 9: X,> 
X, is a simplicial approximation to a continu- 
ous mapping f, then f and ọ are thomotopic. 


D. Cell Complexes 


Let V” be the tunit n-disk, S"! be the tunit 
(n—1)-sphere, and X be a Hausdorff space. For 
a subset e of X, let e be the closure of e in X, 
and let é=e—e. A subset e of the space X is 
called an n-cell, or open n-cell in X if there is a 
relative homeomorphism ¢:(V", S""')>(@, ê), 
i.e., a continuous mapping g@:V"—é such that 
o(S""')céand o: V"—S" 1e—é is a homeo- 
morphism. For example, S"— {p} (pe S”) is an 
n-cell. A set {e,|4€A} of cells in the Haus- 
dorff space X is called a cellular decomposition 
of X if the following three conditions are satis- 
fied: (i) e, Ne, is empty if 4# p; (ii) X =( Jien ez; 
(iii) If the dimension of e, is n+ 1, then é; € 
X", where X” is the union of all the cells e, 
(eA) whose dimensions are not greater than 
n. For example, the n-sphere S” has a cellular 
decomposition consisting of a single 0-cell 
and a single n-cell. 

A Hausdorff space X together with its cellu- 
lar decomposition {e,} is called a cell com- 
plex, and each e, ts called a cell in the cell 


70D 
Complexes 


complex X. For a cell complex, the notions of 
vertex, -section, and dimension are defined in 
the same way as the corresponding notions in 
Euclidean complexes. Let X be a cell complex 
and A a topological subspace of X such that 
the closure of each cell of X intersecting A is 
contained in A. Then the set of cells e such 
that eN A # Ø forms a cellular decomposition 
of A. The set A together with this cellular 
decomposition is called a subcomplex of the 
cell complex X. A cell complex X with its cells 
{e,} is said to be finite if the number of e, is 
finite. If each point in a cell complex X is an 
interior point of some finite subcomplex of X, 
then X is said to be locally finite. We define 
similarly a countable cell complex and a locally 
countable cell complex. If the closure of each n- 
cell of a cell complex X is homeomorphic to 
V” (n=0,1,...), X is said to be regular. If X 
and Y are cell complexes, a continuous map- 
ping f:X—Y such that f(X")c Y” (n=0,1,...) 
is called a cellular mapping. If X and Y are cell 
complexes, the set of ttopological products 

e, Xe, where e,, e, run over all cells of X, Y, 
respectively, is a cellular decomposition of the 
product space X x Y. The resulting cell com- 
plex X x Y is called the product complex of the 
cell complexes X and Y. 

A cell complex X is said to be closure finite 
if each cell in X is contained in a finite sub- 
complex of X; and X is said to have the weak 
topology if a subset U c X is open if and only if 
U Ne is relatively open in é for each cell e of 
X. We call a cell complex a CW complex if it is 
closure finite and has the weak topology. The 
cellular decomposition of a CW complex is 
called a CW decomposition. A locally finite cell 
complex is a CW complex. 

Fundamental properties of CW complexes 
are as follows: (i) A CW complex is a tpara- 
compact (hence *normal) space and is tlocally 
contractible. (ii) A subcomplex A of a CW 
complex X is a closed subspace of X, and A 
itself is a CW complex. (iii) A mapping f:X > 
Y of a CW complex X to a topological space 
Y is continuous if and only if the restriction 
f\e is continuous for each cell e of X. (iv) If 
X and Y are CW complexes and f: XY is 
any continuous mapping then there exists a 
cellular mapping of X to Y that is homotopic 
to f (cellular approximation theorem). (v) A 
pair (X, A) consisting of a CW complex X and 
its subcomplex A has the thomotopy extension 
property for any topological space. (vi) A CW 
complex has the tcovering homotopy property 
for any ‘fiber space. (vii) The product complex 
X x Y of two CW complexes X and Y is not 
necessarily a CW complex, but it is thomo- 
topy equivalent to a CW complex. (viii) If 
either X or Y is locally finite, or if both X and 
Y are locally countable, then the product 
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complex X x Y is a CW complex. (ix) For CW 
complexes X and Y, the mapping space Y* 

is homotopy equivalent to a CW complex. 

(x) The tcovering space of a CW complex has a 
CW decomposition. 

If K is a simplicial complex, the polyhedron 
|K| is a regular CW complex whose cells are 
all open simplexes in |K]. A simplicial complex 
K is (locally) finite if and only if the CW com- 
plex |K] is (locally) finite. In particular, the 
Euclidean polyhedron of a Euclidean sim- 
plicial (or cell) complex is a locally finite CW 
complex. A polyhedron |K] generally admits a 
CW decomposition whose cells are far smaller 
in number than the simplexes constituting a 
simplicial decomposition of K. For any CW 
complex X, there exists a polyhedron | K| 
that is homotopy equivalent to X. In partic- 
ular, if X is an n-dimensional finite (coun- 
table) CW complex, we can choose as K an 
n-dimensional finite (locally finite and coun- 
table) simplicial complex. 


E. Semisimplicial Complexes 


By an ordered simplex in a simplicial complex 
K we mean a finite sequence (v9, 0,,...,0,) 
(n>0) of vertices in K, contained in the set of 
vertices of a simplex in K. Let O(K),, be the 
set of all ordered simplexes of K of length n+ 
1, and define mappings 0;:0(K),,>O(K),_, 
and s;:O(K),>0O(K),4, for i=0, 1,..., by 
O;(U9, «++ Vn) = (Vos -< Vj—15 Vitt «+5 Dn) and 
Si(Vos +++ 5 Un) = (Vos «++ 5 Ving Vis Vis Vig gs e-s Un) 
Then the following relations hold: 


6,00;=0,,00; (i<j), 


J J 
SOSSE 541 OS; (i<j), 
0,0 S= S-10 0; (i<j), 
0,0 5,= 5,0 Ôi- (Gi>j+1), 
0,05; = 06,4, O S; = identity. (1) 


Let A” be the n-dimensional simplex in R” 
with vertices eo =(0,0, ...,0)}, e, =(1,0,...,0), 
..-»€„=(0, ..., 1). By a singular n-simplex in a 
topological space X we mean a continuous 
mapping T:A"—X. Let S(X), be the set of all 
singular n-simplexes in X, and define map- 
pings 0;:S(X),>S(X),-1 and s;:S(X),7S(X)n41 
for i=0, 1,...,n by 0;T(Ao, ...,4,)= T(Ag, - 
4,-1.9, Ais -> An) and s,T(Ag, ..-5 Ana) = T(Ao, 
weg hing Ait Åit1s Åi+ts «+> An+1) Where (Ao, 
...,4,) is the point 27.9 A;e;, 4,20, D8. 94,;=1. 
Then relation (1) holds between 0; and s,. 

Because of the importance of relation (1), 
which is basic in defining thomology of sim- 
plicial complexes and topological spaces (— 
201 Homology Theory), S. Eilenberg and J. A. 
Zilber gave the following definition: A semi- 
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simplicial complex K consists of a sequence of 
sets K, (n=0,1,...) together with mappings 
0;:K,2K,-1, 5::K,7K,4; (i=0, 1,...,n) satis- 
fying relations (1). An element of K, is called 
an n-simplex in K, and @,, s; are called the ith 
face operator and the ith degeneracy operator, 
respectively. A simplex is said to be degenerate 
if it is the image of a simplex under some s;. A 
semisimplicial complex is abbreviated as s.s. 
complex. The s.s. complexes O(K)= {O(K),, 
i, Si} and S(X)= {S(X),, 6;, S3} are called the 
ordered complex of K and the singular complex 
of X, respectively. 

Let K be an s.s. complex, and let L, be 
a subset of K,, for n=0, 1,.... If 6(L,)cL,_, 
and s{L,)<L,4, for each i, then L= {L,, 
6;|L,, Si} Ln} is an s.s. complex, and L is called 
a subcomplex of the s.s. complex K. If A is a 
subspace of a topological space X, S(A) is 
a subcomplex of S(X). If K is an ordered sim- 
plicial complex, a subcomplex O'(K) of O(K) is 
obtained by considering the set of all ordered 
simplexes (Vvo, v;,...,v,) such that vo Sv < 
... Sv, (n=0,1,...). If K and L are s.s. com- 
plexes, a sequence f= { f,} of mappings f: 
K„—> L, defined for each n is called an s.s. map- 
ping if ô; o f,=f,-,00; and so f,=f,4,05; 
(O0<i<n). If f:X—Y is a continuous mapping 
of topological spaces, then f determines an s.s. 
mapping S(f):S(X)S(Y) by S(f)(T)=fo T. 
Two s.s. complexes K and L are said to be 
isomorphic if there is a bijective s.s. mapping of 
K to L. For two s.s. complexes K and L, we 
define the Cartesian product K x L to be the 
s.s. complex given by (K x L) =K, x Ln, 6,(¢, 7) 
=(0,0, iT), ,(0, t) =(s;0, 5,7) (0€ Kp, tEL,). If K 
and L are ordered simplicial complexes, the s.s. 
complexes O'(K) x O'(L) and O'(K x L) are 
isomorphic. If X and Y are topological spaces, 
the s.s. complexes S(X) x S(Y) and S(X x Y) 
are isomorphic. 

Given an s.s. complex K, we construct a 
topological space |K] as follows: First, we 
provide K,, with the fdiscrete topology and 
consider the topological space K = JnvoKnx 
A". Next we consider simplicial mappings 
e':A"”! +A" and ':A"'! +A" defined by '(p,) 
=p; (j<i), e(p)=Pjr1 G21) and n'(p) =p; 
(j <i), n'(p;) =p; (j>i), where po, ...,p, are 
the vertices of A”. The topological space |K] is 
defined to be the tquotient space of K with 
respect to an equivalence relation ~ that is 
defined by the following: (0,0, y}~ (0, '(y)) 
(ceK,, yeA" *), (5,5, y)~(o,n'(y)) (EK, 
yeA"*'), where i=0, 1,...,n. The space |K| is 
called the (geometric) realization of the s.s. 
complex K. Given an s.s. mapping f:K>L, 
we obtain a continuous mapping |f|:|K|>|L| 
defined by | f|(|o, vl)=|f(0), yl, where |ø, y| is 
the point in |K| represented by (a, y)e K. We 
call |f| the realization of the s.s. mapping f. 
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The realization | K| of an s.s. complex K is a 
CW complex whose cells are in one-to-one 
correspondence with the nondegenerate sim- 
plexes in K. For a topological space X, a 
tweak homotopy equivalence p:|S(X)|—X is 
defined by p(|T, yl)=T(y) (TES(X),, ved"). 
This mapping gives rise to a homotopy equiv- 
alence when X is a CW complex. 

The singular complex S(X) of a topological 
space X has the following property: Given 
simplexes 6, -.., Oj —15 Ok+1> -<-> n41 E Kp with 
0,0; = 0;_, 0; (i<j, i,j#k), there exists a simplex 
o€K,., with 6,0 =c; (i#k). An s.s. complex K 
with this property is called a Kan complex. If, 
in addition, 0,0 =0,0' (o,0'EK,,i#k) imply 
0,6 =0,0', we call K a minimal complex. For 
every Kan complex K, there are minimal sub- 
complexes M of K that are isomorphic to each 
other. Moreover, |M| is a fdeformation retract 
of |K|. For a Kan complex K, the thomotopy 
group can be defined combinatorially. 

Two CW complexes X and Y are homotopy 
equivalent if and only if the minimal sub- 
complexes of S(X) and S(Y) are isomorphic. 


F. Eilenberg-MacLane Complex 


Given an integer n>1 and a group z (Abelian 
if n> 2), there exists an tarcwise connected 
topological space X for which the thomotopy 
groups 7,({X) are trivial for i#n and 2,(X)&z. 
Such a space is called an Eilenberg-MacLane 
space of type (z, n). Let Q(X; X, +) be the tpath 
space over X, and let py: Q(X; X, *)>X be 
the natural projection. If X is an Eilenberg- 
MacLane space of type (z,n), then (Q(X; X, +), 
Po, X) gives rise to a standard tcontractible 
‘fiber space whose fiber is an Eilenberg- 
MacLane space of type (z,n—1). Assume 

that X is an Eilenberg MacLane space of type 
(n,n) with Abelian group z. Then X is (n— 1)- 
connected; hence the tHurewicz theorem can 
be utilized to show the existence of an isomor- 
phism h:2,(X)= H,(X), while the universal 
coefficient theorem can be utilized to show 
that H"(X;2)=Hom(H,(X), z). Since in this 
case we have ,,(X)=7, the element h`! e 
Hom(H,(X), z,(X)) can be regarded as an 
element of Hom(H,(X), n). Now the funda- 
mental class of X is defined to be the coho- 
mology class ue H"(X; n) corresponding to 
h`. Let Y be a tCW complex and x(Y; X) be 
the set of thomotopy classes of continuous 
mappings from Y to X. Then there exists a 
one-to-one correspondence n(Y; X) > H"(Y; n) 
given by the assignment [f ]>f*u(— 305 
Obstructions). Let S(X) be the singular com- 
plex of X, and let M(X) be a minimal complex 
of S(X). If X is an Eilenberg-MacLane space 
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` of type (z,n), M(X) is isomorphic to a certain 


complex determined uniquely by z and n. This 
complex is called the Eilenberg-MacLane 
complex of type (7, n) and is denoted by 
K(z,n). The notation K(z,n) is also used to 
mean the space X itself. 

Let A(q) be a simplicial complex whose sim- 
plexes are all nonvacuous subsets of {0, 1, 
..., qt. Let &;: A(q— 1)>A(q) be the simplicial 
mapping defined by &;(j)=j O<j<i—), e()) 
=j+1(i<j<q-—1), and let 7;:A(q+ 1)>A(q) 
be the mapping defined by 4,;(j)=j (0<j <i), 
n{jy=j—1(itl<j<qt+l). Now K(z,n)is a 
Kan complex defined by K (z, n), = Z"(A(q); 7), 
0,0 =0 0€;, $0 =G0N;, where Z°(A(q); 2) =7, 
Z'(A(q); n) is the set of z-valued functions 
defined on the set of pairs (i,j) such that 0< 
i<j<q and satisfying the equality o(j, k)- 
o(i,k)'-o(i,j)=1 for 0<i<j<k<q, and 
Z"(A(q); 2) (n> 2) is the group of toriented 
cocycles of the simplicial complex A(q). If z is 
Abelian, the structure of the Abelian group 
Z"(A(q); 2) gives K(z,n) the structure of an 
Abelian group in the s.s. category. This struc- 
ture yields a one-to-one correspondence 
K(x, n) >K(m,n— 1-1 x... x K(a,n— 1) for 
n>1 and leads to the expression of te K(z,n), 
in the form (o,_, ...,69)> with o,¢ K(x, n — 1),. 
The W-construction of K(z,n—1) for n>1 
is a Kan complex W(z,n—1) defined by W(z, 
n—1),=K(z,n—1), x K(z,n), and 6,(a, x Ta) = 
(0004) Gg-1 X OoTq, O,(6, X T4) = 06, X O;T, for 
1 <i<q, 8(a, X T,)=5;0, X 5,74, where oE K(r, 
n—1), and t= 04-1, -002 E K(x, n) Let p: 
W(x, n—1)>K(r,n) be a natural projection. 
Then (W(z, n—1), p, K(z, n)) plays the role of 
the tuniversal bundle for K(z,n—1) in the s.s. 
category in the following sense: Let L be an 
s.s. complex, and let f: L>K(z,n) be an s.s. 
mapping. We define f* W(x,n—1) to be the 
subcomplex of W(z,n—1)x L generated by 
simplexes (a, X T4) X p, such that t, = f(p,), 
where a, x tE W(x,n—1), and p,éL,. Let 
p:f * W(x,n—1)-L be the natural projection. 
Then (f* W(x, n—1), p, L) is called the prin- 
cipal fiber bundle induced from W(z,n— 1) by 
f. Any principal bundle over L with group 
K(x,n— 1) can be expressed as an induced 
bundle. This property means that (W(x, n— 1), 
p, K(z,n)) is universal. On the other hand, 
we have an algebraic analog of the univer- 
sal bundle for the chain group of K(z,n— 1), 
called the bar construction [12]. Both these 
concepts were defined by Eilenberg and Mac- 
Lane in order to determine the structure of the 
(co)homology of K(z,n), which is denoted by 
(H*(x,n))H,(x, n). This object was later 
achieved by H. Cartan, who introduced an 
improved notion called Cartan construction 
(— Appendix A, Table 6.1). Let z(L, K(x, n)) 
be the set of s.s. homotopy classes of s.s. map- 
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pings from L to K(z,n). If z is Abelian, there 
exists a one-to-one correspondence z(L, K (nr, 
n))>H"(L; n) given by the assignment [ f ]> 
f *u, where ue H"(zx, n; 7) is the fundamental 
class of K(z,n). By virtue of this correspon- 
dence, k= f *ue H"(L; n) determines an in- 
duced bundle f* W(x, n—1) uniquely up to 
equivalence, which is denoted by K(z, 
n—1)x,L. 

Let X be an arcwise connected topological 
space. Let X” be the Cartesian product of n 
copies of X. Clearly, the symmetric group &,, 
of degree n operates on X”. The n-fold sym- 
metric product SP" X of X is defined to be the 
quotient space of X” under the action of &,. If 
we specify a reference point of X, we have a 
natural inclusion SP""! X <SP"X and can 
consider the inductive limit space | Jı <,SP"X, 
denoted by SP” X. Then the Dold-Thom 
theorem [15] shows that M(SP”.X)= 

2, K(H,(X), i). In particular, we have 
M(SP”S")= K(Z,n) for n> 1. This result can 
be applied to obtain a direct relationship be- 
tween the axiomatic definition of tcohomology 
operations using K(z,n) due to Eilenberg and 
Serre and the constructive definition using the 
symmetric groups due to Steenrod (A. Dold 
[16], T. Nakamura [17]). For a detailed study 
of the (co)homology of SP"X — [18]. 


G. Postnikoy Complex 


Let X be an arcwise connected topological 
space. For the sake of simplicity, throughout 
this section we assume that X is tsimple (— 
202 Homotopy Theory). Then the tPostnikov 
system of X can be defined as an inverse sys- 
tem (X,,, Pa) (n=0, 1,2, ...) consisting of topo- 
logical spaces X„, continuous mappings p,: 
X,—7X,,-1, and a system (X, q,) (n=9, 1, 2, ...) 
consisting of continuous mappings q,:X >X, 
such that p,Oq,=4,-, and satisfying the fol- 
lowing three properties: (1) Xo is one point. (2) 
(Xn Pn» Xn—-1) is a tfiber space induced from a 
standard contractible fiber space over an 
Eilenberg-MacLane space of type (z,(X), n+ 1) 
by a mapping corresponding to a cohomology 
class k”*! €H""*(X,_ 13 Mal X)). (3) quai {(X)> 
n(X,) gives an isomorphism for 0<i<n. 
These cohomology classes k"*! are called 
Eilenberg-Postnikov invariants (or simply k- 
invariants). Corresponding to the above facts, 
the minimal complex M(X) can be obtained 
as the inverse limit of a certain inverse sys- 
tem (K(n), p(n)) consisting of Kan com- 

plexes K(n) and s.s. mappings p(n): K(n)> 
K(n—1) defined by K(0)= K(0,0) and K(n)= 
K(x, n) X yn+1K(n—1) for n> 1 with natural 
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projections p(n), where z,=7,(X) and k"t! e 
H"*'(K(n—1);z,). This system is determined 
uniquely up to s.s. homotopy equivalence by 
its limit, called the Postnikov complex and 
denoted by K(z,,k3,75,...,k""1,7,, ...). As 
yet we are ignorant of an effective method of 
computing the cohomology of a Postnikov 
complex from z, and k"*?. 
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Complexity of Computations 


A. Measures for Complexity of Computation 


Intuitively, complexity of computation means 
the amount of computing efforts measured on 
some suitable scale. When a problem can be 
solved by means of any one of several algo- 
rithms, it is highly desirable to compare the 
complexities of those algorithms. For example, 
the power x'° can be evaluated with a pocket 
calculator by using either of the algorithms 


x19 = xx xxXXXXXXXXXXXXXXX, (A) 


uU=XXX, V=UXU, W=0XÙ, 


x? =uxw. (B) 


The complexities of these algorithms can be 
measured by various quantities: (T1) time in 
seconds, (T2) the number of times keys and 
buttons are touched, (T3) the number of basic 
operations (here, multiplications), and (S) the 
number of values to be stored in the calcu- 
lator. The quantities (T 1), (T2), and (T3) are 
called time complexities, and (S) is called the 
space complexity. Obviously, algorithm (B) is 
preferable with respect to time complexity, 
although (A) is better with respect to space 
complexity. (If the calculator has only one 
memory register, then only (A) can be executed 
without recording numbers by hand.) For 
many calculators, the following algorithm 

is best with respect to both time and space 
complexities: 


x19 = ((x?)? x x}. (C) 


In the general theory of complexity, the num- 
ber of basic operations (73) is often taken as 
the basis of time complexity, since it represents 
the intrinsic complexity of the algorithm, 
rather than of extrinsic factors such as human 
skill or mechanical performance. 

For the evaluation of a power x” in general, 
the complexity depends on the value of n: such 
a parameter dominating the complexity is 
called the size of the problem. Let Ty(n) be 
the number of multiplications required for 
evaluating x” by an algorithm X. Let Z be the 
set of all algorithms for evaluating x”. The 
time complexity T(n) for evaluation of the 
power is then defined by 


T(n)= Min, ¢ T,(n). 
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This is satisfied when [6] 
logan < T(n) <2log,n 
and 


im O 


=; 
n>% log, n 


In a problem concerning a tgraph, the num- 
ber m of nodes in the graph, the number n of 
its edges, or their sum m +n are chosen as 
the size of the graph. However, the complex- 
ity is not determined by such a size, because 
there are many different instances (particular 
graphs) of the same size. So the complexity of 
this problem is defined in the following man- 
ner. Let T(I) be the complexity of solving an 
instance I of the problem and L({n) be the set 
of all possible instances of size n. Then 


W(n) = Maxye rin T(I) 
and 
A(n)=} pT), 


where p(I) denotes the relative frequency of 
the instance J. The value W{n) is called the 
worst-case complexity, and A(n) is called the 
average complexity of the problem. 

In some cases the complexity of a circuit is 
measured by the number of its building blocks 
[1], and the complexity of a program is mea- 
sured by its length [2, 3]. 


B. Complexity of a Decision Problem 


A decision problem and its complexity are 
rigorously formulated in terms of tTuring 
machines. Let X be a finite set of symbols and 
Z+ be the whole set of nonempty strings of 
symbols in 2. Every instance of the problem is 
assumed to be represented by a string in 2*. A 
decision problem is a triple (2, L, P) of the set £ 
of symbols, a subset L of 3*, and a mapping P 
from L to {0 (false), 1 (true)}. The set L repre- 
sents the set of all possible instances of the 
problem. The problem is said to be solvable if 
and only if there exists a Turing machine M 
satisfying the following condition for every 
string « in the set L: When a tape containing 
the string «& is given to the machine M whose 
head is initially put on the leftmost symbol of 
a, the machine writes the value of P(a) on the 
tape and stops after a finite number of steps. 
Such a Turing machine is said to compute the 
function P. The number 7,,(«) of steps is the 
time complexity of solving the instance « by 
the machine M, and the length S,,(«) of the 
used area of the tape is the space complexity. 
The worst-case complexity T,,(n) is defined by 


Ty(n) = MaX,cL,jaj=n Ty (0). 
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The space complexity S,,(n) for the size n is 
defined in a similar way. The complexities 
T(n) and S(n) of the problem are not defined, 
because improvement by a constant factor is 
always possible for any Turing machine [5]. 

A problem is said to be solved in linear or 
polynomial time if its time complexity Ty,(n) is 
bounded by a linear or polynomial function of 
the size n. 


C. Objectives of Complexity Theory 


The objectives of complexity theory are as 
follows: 

(i) Analyze an algorithm X for a problem P 
and evaluate its complexity. When exact evalu- 
ation is hard, the order of magnitude O(T,(n)) 
is investigated. 

(ii) Construct better algorithms for the prob- 
lem P. This gives a better upper bound to the 
complexity T(n) of the problem. 

(iii) Make clear the limitation of improving 
algorithms for the problem P. This is done by 
establishing a lower bound to the complexity 
T(n) of the problem. 

A common approach to constructing better 
algorithms is to partition the problem into 
smaller parts, apply algorithms to the parts, 
and then combine the solutions for the parts 
into a solution for the whole (divide and con- 
quer, [5]). This approach often yields an effi- 
cient algorithm for the problem, especially 
when the partitioning can be repeated recur- 
sively. For demonstrating the limitations of 
algorithms of a certain class, diagonalization 
[5], determination of information-theoretic 
lower bounds [6], or the oracle method [6] 
are often utilized. 


D. Elementary Results on Time Complexity 


(1) Number of arithmetic operations. A poly- 
nomial of degree n with one variable can be 
evaluated in about 3n/2 arithmetic operations 
if preconditioning on coefficients is allowed. 
Otherwise, 2n— 1 operations are necessary and 
sufficient for evaluating a polynomial of degree 
n. For computing the product of two square 
matrices of degree n, O(n”) operations are 
necessary and O(n*-*) operations are sufficient. 
The inverse matrix and the value of the deter- 
minant of a square matrix of degree n are 
computed with the same order of operations 
as the product. Discrete Fourier transforma- 
tion of n points is executed in O(nlogn) arith- 
metic operations (fast Fourier transformation). 
(2) Number of comparisons and data trans- 
fers. Rearrangement of n items in increasing 
order is realized in O(nlogn) comparisons and 


268 


data transfers. Selection of the kth largest item 
among n items is done in O(n) operations. The 
position of an item in a list containing n items 
is found in O(n) comparisons by linear search, 
in O(logn) comparisons by binary chopping, 
and in O(1) comparisons on average by the 
thashing method. 

Many other results can be found in [5]-[7]. 


E. NP-Completeness 


A problem is easily solved if its complexity is 
O(n) or less. The problem is very hard if its 
complexity is O(2”), unless the size n is small. 
Between these types of problems there is a 
class of problems, each of which can be solved 
in polynomial time by a Turing machine. 

A tnondeterministic Turing machine is said 
to solve a decision problem (X, L, P) if it can 
detect the case when P(x)= 1 by exercising 
good choices; more precisely, if it can stop 
after a finite number of steps and write the 
value 1 of P(x) on the tape, starting from an 
initial state with the head on the leftmost sym- 
bol of a string « on the tape, provided that 
P(a)=1. If P(a)=0, then the machine may 
never stop. For a string « such that P(«)=1, 
NTy(a) and NS,,(«) represent the minimum 
number of steps and the minimum length of 
tape used in computing P(«). Nondeterminis- 
tic complexities NT,,(n) and NS,,(n) for the 
size n are defined in a similar way as before. A 
problem is said to be solvable in NP-time or 
NP-space if it can be solved by a nondeter- 
ministic Turing machine M whose time com- 
plexity N Ty(n) or space complexity NS,,(n) is 
bounded by a polynomial function of size 
n. The class of all problems solvable in NP- 
time is denoted by NP, while that of the prob- 
lems solvable in polynomial time by ordinary 
(deterministic) Turing machines is denoted by 
P. Obviously, the class NP contains the class 
P. Nevertheless, whether NP=P or not re- 
mains one of the biggest unsolved problems. 

A decision problem (2, L, P) is polynomially 
transformable to a problem (Z, L’, P’) if there 
exists a mapping h from 2* to X”* satis- 
fying the following conditions: (1) h(L) SL’; (2) 
for every string « in L, P(a)= P’(h(a)); (3) the 
mapping h is computed by a Turing machine 
in polynomial time. If a problem C is poly- 
nomially transformable to another problem C' 
in P or NP, then the problem C is also in P 
or NP. A problem C is said to be NP-hard if 
every problem in NP is polynomially transfor- 
mable to C. It is called NP-complete if it is in 
NP and NP-hard. Many problems that have 
been known empirically to be very hard have 
recently been shown to be NP-complete [8]. 
For instance, integer linear programming is 
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NP-complete. Linear programming has re- 
cently been shown to be in P. The satisfiability 
problem of a conjunctive normal form was 
proven to be NP-complete in 1971 [9]. 

The complement of a decision problem 
(2, L, P) is the problem (2, L, P’), where P’(a)= 
1— P(a). If a problem C is in P, then its com- 
plement C’ is also in P. When C is in NP, it is 
not guaranteed that C’ is in NP. The class 
of all problems whose complements are in NP 
is denoted by co-NP. The intersection of NP 
and co-NP contains the class P, but it is an 
open problem whether this containment is 
proper or not. The problem PRIME, which 
asks whether a number N is prime or not, the 
size n being the number of digits for represent- 
ing the number N, belongs to both NP and co- 
NP [10]. However, it is not known whether it 
is in P. 


F. Other Topics 


The notion of completeness is introduced in 
many other classes of problems solvable in 
linear space, polynomial space, exponential 
time, etc. Comparisons among these classes 
have been done, although many problems still 
remain open. The fundamental conjecture in 
this field is the nonequality NP #P, the prov- 
ing of which seems to be extremely difficult. 
This conjecture can be relativized in ways such 
that it can be either true or false, so that it is 
not provable in some formal system. 

Some tricks may be used for hard problems. 
For decision problems, some algorithms have 
been proposed for guessing the correct answer 
with high probability [11]. Some algorithms 
give nearly optimal solutions efficiently, in- 
stead of constructing optimal ones over too 
long a time [7]. 

A general theory of computational com- 
plexity would include investigations on rela- 
tions among various complexity measures, 
the complexity hierarchy, and an axiomatic 
approach independent of any machine models 


[12]. 
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A. Definitions 


A Hausdorff topological space X is called a 
complex manifold (or complex analytic mani- 
fold) of complex dimension n if there are given 
an open covering {U;},., and a family {9,;},-, of 
homeomorphisms of U; onto open sets in the 
n-dimensional complex affine space C” such 
that in case UU, # Ø, the mapping 9,09; ': 
e(U,N Uj) o(U;0 U;) is biholomorphic (i.e., 
9,09; | and its inverse are both tholomorphic 
functions when expressed in terms of coordi- 
nate functions in C”). We call X the underlying 
topological space of this complex manifold, 
and we say that an open covering {U;};er and 
a family {¢;\,., define a complex analytic 
structure (or simply complex structure) on X. 
A complex-valued function f defined on an 
open set U in X is called a holomorphic func- 
tion on U if for any i the function fog; on 
g(U N U;) is holomorphic. When we express 
the mapping 9; as 9;(p)=(z'(p), ....2"(p)) on 
U, in terms of the coordinates in C”, each z* 
is a holomorphic function on U;. We call 
(z',...,2") a holomorphic local coordinate sys- 
tem on U;. Given two complex manifolds 
Y, X, a mapping p: YX is said to be holo- 
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morphic if for any open set U in X and any 
holomorphic function f on U, fog is holo- 
morphic on g '(U)c Y. When a mapping 9: 
Y—X is bijective and both g and g™ are 
holomorphic, we say Y and X are isomorphic 
by ọ as complex manifolds. 

As in the case of tdifferentiable manifolds of 
class C”, we can define concepts such as com- 
plex analytic submanifolds, holomorphic tan- 
gent vectors, holomorphic vector fields, and 
holomorphic differential forms of degree k (or 
simply holomorphic k-forms). Meromorphic 
functions on complex manifolds can also be 
defined as in the theory of analytic functions of 
several complex variables (— 23 Analytic 
Spaces D). 

Let X be a complex manifold and p a point 
of X. Take a holomorphic local coordinate 
system (z',...,2") with center p (i.e., z*(p)=0 
for all x). A holomorphic function defined on 
a neighborhood of p can be expressed as a 
holomorphic function in (z’,...,2"), hence as a 
power series in (z+, ...,2”) absolutely conver- 
gent in a neighborhood of p. If we denote by 
© =O, the tsheaf of germs of holomorphic 
functions on X, the fstalk ©, of © at p is iso- 
morphic to the tlocal ring of convergent power 
series in n variables z!,...,2". At a point p, 
(6/éz'),,, ...,(6/0z"), form a basis of the holo- 
morphic tangent vector space at p. A holomor- 
phic k-form œ defined on a neighborhood of 
p can be expressed as =D), <... <i, fi. dZ" A 
... A dz*, where f; ... ;, is a holomorphic func- 


t 


tion for each (i,,..., ip). 


B. Almost Complex Structures 


Let X be a complex manifold, and let 

{Ui Pitie be its complex analytic structure, i.e., 
a covering of X by holomorphic local coordi- 
nate systems with g;=(z},...,2”). Express zf in 
the form z#=x7+./—1 yř, where x7 and yj 
are the real and imaginary parts of z}, respec- 
tively. Then x? and y? real-valued functions 
on the open set U; of X, and the mapping 

y: UR?" defined by y;(p)=(xi (p), Yi (P), -~> 
x?(p), y?(p)) is a homeomorphism of U; onto 
an open set of R?”. This {U;, W;};-, defines on 
X a tdifferentiable structure of class C® (in 
fact, a treal analytic structure). Thus a complex 
manifold of complex dimension n admits 
canonically a C®-structure of real dimension 
2n. For every point p of X there is a real co- 
ordinate system on a neighborhood of p, such 
as (x1, y!, ..., x", y”), where (z', ... , Z") (2° =x" 
+ el y*, <=1,...,n) forms a holomorphic 
coordinate system in X. The real tangent 
vector space at a point p of X has {(0/éx'),, 
(0/dy*),, ---, (8/Ex"),, (0/ðy")p} as its basis. 
Define a linear endomorphism J, by (6/0x*), 
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—(8/Ay\p, (8/8y"}y> —(6/2x*), (a= 1, ... 1); 
then JZ = —1, and J, does not depend on the 
choice of holomorphic coordinate system at p. 
Considering J, as a tensor of type (1, 1), we 
thus obtain a tensor field J of type (1, 1) of 
class C” on X, which is called the tensor field 
of almost complex structure induced by the 
complex structure of X. 

More generally, when a real differentiable 
manifold X is provided with a tensor field J of 
type (1, 1) of class C® such that J? = —1 (con- 
sidering J as a linear transformation of vector 
fields), we say that X admits an almost com- 
plex structure or that X is an almost complex 
manifold. In this case, for contravariant vector 
fields x and y on X, we define a tensor field S 
of type (1,2) by S(x, y) = —Lx, y] + [J (x), J(y)] 
— J([J(x), vy) —J (Lx, J(y)]). S is called a Nijen- 
huis tensor. An almost complex structure J 
is induced by a complex analytic structure if 
and only if its Nijenhuis tensor S vanishes 
identically [49]. A differentiable manifold X 
of dimension 2n admits an almost complex 
structure if and only if the structure group 
GL(2n, R) of the bundle of ttangent 2n-frames 
of X can be treduced to GL(n, C). Almost 
complex manifolds are torientable. 


C. Types of Differential Forms 


Let X be a complex manifold, and let (z?, 
...,2") be a holomorphic local coordinate 
system in a neighborhood of a point p with z* 
=xt4 ./-1 y* (a=1,...,). On the complexi- 
fied real tangent vector space 7,(X) ®@ C at p, 
we define 0/dz%, 0/0Z* by 


(8/0z"), =(1/2) {(6/6x"), —/ —1 (/dy")p}, 
(6/62*), =(1/2) {(0/0x), + / —1 (6/ey*)p}. 


It is easy to see that the operation 0/0z* on 
holomorphic functions coincides with that of 
the holomorphic tangent vector 6/0z* defined 
in Section A. A function f is holomorphic at 
p if and only if (6/0z%) f =0 (a= 1,2, ...,n). 
T,(X) @ C is the direct sum of the subspace 
spanned by {0/dz',...,6/0z"} and the subspace 
spanned by {0/0z', ...,0/6z"}. Moreover, this 
decomposition is independent of the choice of 
holomorphic coordinate system. Elements of 
the two subspaces are respectively called tan- 
gent vectors of type (1,0) and tangent vectors of 
type (0, 1). Similarly, the complexified space of 
real differentiable 1-forms can be decomposed 
into the direct sum of two subspaces spanned 
by {dz',...,dz”} and {dz',...,dz"}, where dz“ 
=dx*+./—1 dy* and dz*=dx*—,/—1 dy”. 
We say that the elements of the former sub- 
space are of type (1,0) and those of the latter 
are of type (0, 1). Thus the space of differential 
forms of arbitrary degree can be written as 
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the direct sum of subspaces of type (r, s). Here 
the subspace of differential forms of type (r, s) 
has as a basis {dz 0... A dz% A dz*1 ^ ... a dz#s 
(l<a,<...<a,<n, 1<f,<...<B,<n)}. This 
decomposition is independent of the choice of 
a local coordinate system, hence the concept of 
type can be defined globally on X. 


D. 6-Cohomology 


In the rest of this article we consider only 
complex differential forms. For every differen- 
tial form w of type (r,s) on X, its fexterior 
derivative dw decomposes into a sum of dif- 
ferential forms of types (r+ 1, s) and (r,s+ 1), 
which we denote by @w (or d'œ) and 6a (or 
d'w), respectively. We have d=6+ ð, (d’)? =0, 
(a)? =0, and 66 +38 =Q. In terms of a local 
coordinate system, we write 


fil 
o= F ode" nde" A.. 


Y 


Adz% A dz®1 0... A dz*s, 


ae 0 
fo =( IS dz" 0... 


Y 
A dz* a dZ? a dz®1 a... a dZ®, 


for w= fdz™ a... A dz% a dZ’! a... a dZ®s. A 
differential k-form œ is holomorphic if and 
only if œ is of type (k, 0) and ĝw=0. 

For the operator ô Dolbeault’s lemma holds: 
Let w be a differential form on a neighborhood 
U of a point p. If dw =0, there is a neighbor- 
hood V of p contained in U and a differen- 
tial form 6 on V such that w= 66 on V. 

Let A” and Q” be the tsheaf of germs of 
differential forms of type (r, s) and the sheaf of 
germs of holomorphic p-forms on X, respec- 
tively, and let [(X, A“) be the set of tsections 
of A™® on X. T(X, A™®) is the set of differen- 
tial forms of type (r,s) on X, and ©, P(X, A®?) 
forms a tcochain complex with respect to ĝ. 
This complex is called the 6-complex or the 
Dolbeault complex, and its cohomology 
groups are called the 6-cohomology groups or 
the Dolbeault cohomology groups. The qth 
cohomology group is denoted by H?’4(A, 6). It 
follows easily from Dolbeault’s lemma that 0> 
QP APS APD.. is an texact sequence of 
sheaves. From this we get Dolbeault’s theorem: 


H(X, QP) = H?-4(A, ô), 
where the left-hand side is the cohomology 
group with coefficient sheaf Q”. 

More generally, for any tcomplex analytic 
(holomorphic) vector bundle E on X, we can 
define the ô cohomology groups of the dif- 
ferential forms on X with values in E, and they 
can be shown to be isomorphic to the coho- 
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mology groups with coefficient sheaves of 
germs of holomorphic forms with values in E 
(— 194 Harmonic Integrals E). 


E. Analytic Coherent Sheaves 


The structure of a complex manifold X is 
determined by the tsheaf © of germs of holo- 
morphic functions on X, and @ is a tcoherent 
sheaf (of rings) (Oka’s theorem). Sheaves of 0- 
modules are called analytic sheaves, and coher- 
ent sheaves of @-modules are called coherent 
analytic sheaves. Many properties of X can be 
expressed in terms of coherent analytic sheaves 
and their cohomology groups; some exam- 
ples appear later in this article (also — 366 
Riemann-Roch Theorems B). 

It is important to know whether an analytic 
sheaf on a complex manifold is coherent. To 
this question, not only does Oka’s theorem 
apply but so does Cartan’s theorem: The tsheaf 
of ideals defined by an analytic subset of a 
complex manifold is coherent. (We say that a 
subset Y of X is an analytic subset if it is a 
closed subset and each point of Y has a neigh- 
borhood U such that UN Y is the set of com- 
mon zeros of a finite number of holomorphic 
functions on U .) Also relevant is Grauert’s 
theorem: If z: X — Y is a tproper holomorphic 
mapping of complex manifolds (i.¢., the inverse 
image of any compact subset of Y for a holo- 
morphic mapping r is also compact), then for 
any coherent analytic sheaf F on X its ‘direct 
images R*z,.(F) (q=0, 1,2,...) are also coher- 
ent [16]. (In fact, this theorem holds for ana- 
lytic spaces; — 23 Analytic Spaces.) 

For an analytic coherent sheaf F on a tStein 
manifold X, we have the following fundamen- 
tal theorems of the Stein manifold. Theorem 
A: H°(X, F) generates the stalk F, (as an O,- | 
module) at every point x of X. Theorem B: 
H4(X, F)=0 for all g>0. Conversely, a Stein 
manifold X is characterized by the following 
property: For any coherent analytic sheaf F of 
ideals of ©, H'(X, F)=0. If a complex manifold 
X is compact and F is a coherent analytic 
sheaf on X, then H(X, F) is a complex vector 
space of finite dimension. If X is an open sub- 
manifold of another complex manifold and its 
closure is compact, then H%(X, F) is finite- 
dimensional for some q that depends on vari- 
ous properties (convexity or concavity) of 
the boundary of X [3]. 

Let E be a tcomplex analytic (holomor- 
phic) vector bundle on a complex manifold 
X of dimension n, and let E* be the tdual 
vector bundle of E. Then H(X, 0?(E)) and 
HY (X, ©" °(E*)) (where H, denotes the 
cohomology group with compact support) are 
dual as topological vector spaces under suit- 
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able conditions. The duality is given by the 
integration on X of the exterior products of 
the differential forms representing the re- 
spective elements of the cohomology groups. 
The duality holds, for example, when dim 
H(X, OP(E))< œ. If X is compact, we need 
not distinguish H, from H (Serre’s duality 
theorem) [50]. 


F. Compact Complex Manifolds 


On a compact and connected complex mani- 
fold X, there are no holomorphic functions 
except constants (by the tmaximum principle 
of holomorphic functions). The field K(X) of 
meromorphic functions on X is finitely gen- 
erated over the complex number field, and its 
ttranscendence degree d does not exceed the 
complex dimension n of X. d is said to be the 
algebraic dimension of X and is denoted by 
a(X). For elements of K(X), functional inde- 
pendence and algebraic independence are 
equivalent [52]. When n=1, X is a compact 
tRiemann surface and the classical theory of 
algebraic functions shows that K(X) is an 
talgebraic function field of one variable and X 
is a ‘projective algebraic variety. When n=2, 
a(X)=2, 1, and 0 can all occur. If a(X)=2, 
X is a projective talgebraic surface (Chow- 
Kodaira theorem). If a(X)= 1, there exist an 
talgebraic curve A and a surjective holo- 
morphic mapping go: X >A such that K(A) is 
isomorphic to K(X) under ¢*, and  '(x) is 
an telliptic curve for all but a finite number of 
xeA. K. Kodaira investigated the structure of 
compact complex surfaces in detail [34, IH]. 
On a compact complex manifold X, the free 
Abelian group generated by the set of irre- 
ducible analytic subsets of codimension 1 is 
called the divisor group of X, and an element of 
it is called a divisor of X. For an analytic sub- 
set Y of codimension 1, the sheaf of ideals 3(Y) 
defined by Y is a sheaf of locally principal 
ideals of 0. For a divisor D =} a, Y,, the sheaf 
of locally principal fractional ideals 3(D)= 
TI, (Y) is called the sheaf of ideals of D. 
The set of nonzero coherent sheaves of locally 
principal fractional ideals corresponds bijec- 
tively to the set of divisors. An element f 40 of 
K(X) generates a sheaf of principal fractional 
ideals and therefore defines a divisor, which 
is denoted by (f). The divisor group has 
an ordering defined by D = È a, Y,> 0 if and 
only if all a, >0, under which it becomes an 
tordered group. For a divisor D, let 


L(D) ={ fe K (X)|f#0, (f) + D>0} U{O}. 


Then L(D) is a C-module of finite dimension. 
This submodule of K(X) is easy to handle and 
exhibits various analytic properties of D. The 
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*Riemann-Roch theorem is used to calculate 
the dimension of L(D) in terms of other factors 
(— 366 Riemann-Roch Theorems C). Here is 
an example of how L(D) exhibits a property of 
D: We call two divisors D and D’ linearly 
equivalent if there is a 0# Fe K(X) such that 
(F)=D—D". This is an equivalence relation 
finer than thomological equivalence. If D and 
D' are linearly equivalent, then L(D) and L(D’) 
are isomorphic by the mapping L(D)3 f> 
fF €L(D’) therefore dim L(D)= dim L(D’). 

(The latter equation does not follow from the 
homological equivalence.) A holomorphic 
vector bundle with fiber C and structure group 
C* is called a complex line bundle. For the 
sheaf of ideals %(D) of a divisor D, we can take 
a suitable open covering {U;} of X such that 
for each U, there is an RjeT(U;, 3(D)) which 
generates Y(D), for any xe Uj. Also gy,=Rj/R, 
is a holomorphic function nowhere vanishing 
on UN U,. With {gy as the system of tcoordi- 
nate transformations, we define the complex 
line bundle determined by D and denote it by 
[D]. It is easy to see that [D] is independent 
of the choice of {U,} or {R,}. Moreover, [D] 

is determined only by the linear equivalence 
class of D. If we denote by @([D]) the sheaf of 
germs of holomorphic sections of [D], the 
mapping H°(X, O([D]))39={9}-f= 
;/R;=,/R,€L(D) is an isomorphism of these 
modules. On an algebraic variety in a projec- 
tive space, any complex line bundle comes 
from a divisor (i.e., it can be expressed tn the 
form [D] for some divisor D) [35,51], but this 
is not necessarily true on general compact 
complex manifolds. However, the importance 
of complex line bundles in the theory of com- 
plex manifolds lies in the relation L(D)= 
H°(X, O([D])), which replaces “things with 
poles” with “things holomorphic.” 

Any analytic subvariety X of the projec- 
tive space P” is an algebraic variety (Chow’s 
theorem) [51]. Suppose that X is an analytic 
submanifold of P”, and let Y be a general 
thyperplane section of X. Then Y is a divisor 
on X, and the tChern class of the complex line 
bundle [Y] corresponds to the canonical 
Hodge metric on X (— 232 Kahler Manifolds 
D). When [Y] is represented by the system of 
coordinate transformations {g,} with respect 
to an open covering {U;}, we can associate 
with a coherent analytic sheaf F sheaves F(n) 
(n=0, +1, +2,...) as follows. Denoting by F, 
the restriction of F to U;, we glue F; and F, 
together on U,N U, with the relation johe fj 
= gi f, (where fje F;c F, fie Fy < F) and obtain 
a sheaf (denoted by F(n)) that is locally iso- 
morphic to F. The following theorems for F(n) 
hold. For each coherent analytic sheaf F there 
exists an integer no such that for any n >ne the 
following fundamental theorems A, B of projec- 
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tive algebraic varieties hold [17]. Theorem A: 
I'(X, F(n)) generates F, (as an @,-module) for 
every xe X. Theorem B: H(X, F(n))=0 for all 
q>0. This means that if we permit “poles” on 
Y of sufficiently high order, then F has suffi- 
ciently many sections and the higher coho- 
mology groups vanish. 

On a (nonsingular) algebraic variety X in 
PY we have the sheaf © of germs of holomor- 
phic functions (the tstructure sheaf as a com- 
plex manifold) and the sheaf ©,,, of germs of 
holomorphic rational functions (the structure 
sheaf as an algebraic variety). Therefore we 
have two kinds of coherent sheaves, coher- 
ent analytic sheaves and coherent algebraic 
sheaves. In fact, the cohomology theories de- 
rived from them are isomorphic. More pre- 
cisely, for any coherent algebraic sheaf F, F 
=F ®o,, © is a coherent analytic sheaf. The 
correspondence FF gives an equivalence 
between the tcategory of coherent algebraic 
sheaves and that of coherent analytic sheaves, 
thus giving an isomorphism of their cohomol- 
ogy groups. In other words, as far as the prop- 
erties that can be expressed by cohomology- 
theoretic terms of coherent sheaves are con- 
cerned, there is no difference between the 
analytic and algebraic theories of projective 
algebraic varieties [51]. 


G. Deformations of Complex Structures 


The deformation theory of complex structures 
was initiated by Kodaira and Spencer [36] in 
order to explain various phenomena in the 
theory of (compact) complex manifolds. A 
triple (X, 7, S) is called a family of compact 
complex manifolds if X and S are connected 
tanalytic spaces and z is a tproper holomor- 
phic mapping of X onto S such that (i) it is 
smooth, i.e., is locally identified with the pro- 
jection S’ x US’, where S’ and U are open in 
S and C”, respectively, and (ii) every fiber V, = 
m~'(s) of z is connected. V, is then a compact 
complex manifold. We sometimes write {V,} 
instead of (X, z, S). S is called the parameter 
space of the family. Take a point oe S and fix 
it. We say that V, is a deformation of V, for any 
seS. There are a neighborhood S’ of o in S and 
a diffeomorphism 2 1(S’}>S’ x V (Kuranishi 
[39]). Thus every V, seS’, determines a tcom- 
plex structure on a fixed differentiable mani- 
fold. We say that the complex structure of V,, 
seS’, is a deformation of that of V}. 

Now (X, z, S)={V,},¢s is said to be complete 
at o if for any family (Y, u, T)={W},.7 with a 
point o'e T and a holomorphic isomorphism 
1: W; > V, there is a neighborhood T’ of o’ in 
T and holomorphic mappings f: T’—>S and 
h:~*(T')+X such that (i) fu = zh, (ii) f(o’)=0, 
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and (ii) h=: on W,. In this case, if T’ is suffi- 
ciently small, then h induces a holomorphic 
isomorphism h,: W,—>V;., for any te T’. Hence 
{V} ses contains all small deformations of Vo. 
On the other hand, {V,},.5 is said to be effec- 
tively parametrized at o if the Kodaira-Spencer 
mapping (Kodaira-Spencer map) p,: T,S— 
H'(V,,@) is injective, where T,S is the Zar- 
iski tangent space to S at o and @ is the sheaf 
of germs of holomorphic vector fields on Vy. 
Here, the linear mapping p, is defined by 
Po(G/0s) = {(E9}./0S)z,,0)}, Where ĉ/ôse T,S and 
gix are the coordinate transformations (z;, s)= 
(Gi(Z,, 5), $), using the smoothness of 7. p9(@/és) 
is called the infinitesimal deformation to the 
direction ¢/és. 

Kuranishi’s fundamental theorem [39, 40] 
states: For any compact complex manifold V, 
there exists a family {V,},-5 with a point oe S 
such that (i) it is complete at every point of S, 
(it) is effectively parametrized at o, and (iii) 
Vi=V. 

The parameter space S in the theorem is 
called the Kuranishi space or the local moduli 
space of V. It is given as the zeros of a holo- 
morphic mapping f: U—> H?(V, ©) with f(0)=0, 
where U is a neighborhood of 0 in H! (V, ©). 
Hence (1) if H?(V, ©) =0, then S= U is nonsin- 
gular (Kodaira, Nirenberg, and Spencer [37]; 
(2) if H'(V,®)=0, then S = {0}, i.e., one point 
(e.g., V=P"(C), the complex projective space). 

Kuranisht’s theorem was generalized to a 
compact analytic space V by Grauert [18] and 
A. Douady [8]. 

There are Kuranishi-type theorems in the 
deformation theory of other objects: (1) com- 
pact analytic subvarieties of an analytic space 
(Douady [9]; — 23 Analytic Spaces G), (2) 
holomorphic mappings (Y. Miyajima [41]), 
(3) germs of analytic spaces with tisolated 
singularities [7], etc. 

So far, only the local theory of deformations 
has been developed. The global theory is not 
yet in a satisfactory state. Its final purpose is 
to construct moduli spaces and to understand 
them. For a compact differentiable manifold V, 
we denote by M (V) the set of all isomorphism 
classes of complex structures on V. It is diffi- 
cult in general to determine the set M(V). As 
yet unsolved problems are: (1) Is M(S°) non- 
empty? (2) Does M(P"(C)) (n> 3) consist of one 
point? (M(P?(C)) is known to consist of one 
point (S. T. Yau; — 232 Kahler Manifolds C). 

If M(V) has a reasonable structure (e.g., an 
analytic space structure) and a universal prop- 
erty, then we call it the moduli space (— 11 
Algebraic Functions F, 16 Algebraic Varieties 
W). Kodaira [34, III] constructed the moduli 
space of Hopf surfaces. Only a few examples of 
moduli spaces are known. It is to be noted 
that moduli spaces cannot in general exist. 
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One of the reasons for this is that there may 
exist jumpings of structures (Kodaira and 
Spencer [36]). 

As for talgebraic manifolds (varieties), there 
are two known methods for the moduli prob- 
lem: (1) Griffiths’ period mapping (— 16 Alge- 
braic Varieties V) and (2) Mumford’s geomet- 
ric invariant theory (— 16 Algebraic Varieties 
V, W). 


H. Monoidal Transformations 


Let Y be an analytic subspace of a complex 
manifold X, which is defined by a nonzero 
coherent sheaf of ideals 3. Then X has an 
open covering such that for each member U of 
the covering, there are elements @,..., @n€ 
T(U, 3) that generate the stalks 3, at all xe U. 
Let W’ be the graph of the holomorphic map- 
ping xr>(@, (x): @2(X):...: @a(x)) from U ~ UN 
Y to P”™"', and denote by W the closure of W’ 
in U x P”-1, Then W is an analytic space, pos- 
sibly with singularities, which is independent of 
the choice of the generators {@,} and is deter- 
mined uniquely by U and 3. Therefore all the 
W’s can be glued together to form an analytic 
space X and to determine a holomorphic map- 
ping p:X > X. We call p the monoidal transfor- 
mation (or blowing-up) of X with center 3 or 
with center Y. When Y is a point, p is also 
called a locally quadratic transformation or a 
o-process. When Y is an analytic submanifold, 
X also is a manifold, Y= p 1(Y) is a nonsin- 
gular divisor on X, and p-'(y)=p*"' for all 
ye Y, where k is the codimension of Y. More- 
over, the line bundle [F] restricted to each 
fiber p~'(y) is isomorphic to the Hopf bundle 
L on P*-!, i.e., the line bundle associated to 
the natural C* bundle C*P*"!. Conversely, 
suppose that we are given YS X and a holo- 
morphic mapping py: Y> Y with the property 
as above so that, in particular, pp'(y)=P*"' 
and [F] = L on pp'(y) for all ye Y. Then there 
exists a complex manifold containing Y as a 
submanifold such that X is obtained by the 
monoidal transformation of X with center Y 
as above (S. Nakano [47]). Let f:X —Y bea 
proper modification of complex manifolds (or 
analytic spaces). Then there exists a proper 
holomorphic mapping h: Y’ Y such that 

f ‘oh: Y’—X is holomorphic, where over 
any relatively compact subdomain of Y, h is 
obtained by a finite succession of monoidal 
transformations with nonsingular centers 
(Hironaka). The result is called Chow’s lemma. 


I. Fiber Spaces 


A triple of compact complex manifolds V, W, 
and a surjective holomorphic mapping f: 
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VW is called a fiber space if general fibers 
f~'(w) are irreducible. In addition, if both V 
and W are algebraic, it is called an algebraic 
fiber space. 

For a compact complex manifold V, letting 
K, denote the canonical line bundle of V, 
define a subfield 8 of K(V) to be {w,/@., 
where w,,@,€ H°(V,Q) for some m>0}. The 
transcendental degree of 8 over C is denoted 
by x(V), which we call the Kodaira dimension 
of V. 

If R=0, then define c(V)= —oo. If K(V) = 
dim V, then V is said to be of general type, 
and there exist a projective manifold X and a 
bimeromorphic holomorphic mapping u: X > 
V. If x(V)>0, there exists a fiber space f: V*—> 
W such that (1) V* is bimeromorphically 
equivalent to V, (2) dim W = x(V), and (3) some 
general fibers f~'(w) satisfy x( f '(w))=0. In 
general, for a fiber space f: VW, we have 
K(V)<xK(f‘(w))+dim W, where f~'(w) is a 
general fiber. Moreover, if it is algebraic, an 
inequality of the form K(V)>K(f~‘(w))+K(W) 
is called the conjecture C,, n being dim V. 

C,, has been verified by K. Ueno, E. Viehweg, 
T. Fujita, and Y. Kawamata [13, 14, 32, 33, 54- 
57] in the following cases: (1) n <3, (2) when 
general fibers are curves, (3) dimw=1, x(w)=1, 
(4) x(V) 20, and W is of general types, etc. By 
using the results of case (4), Kawamata proves 
that an algebraic compact complex manifold V 
is birationally equivalent to an Abelian variety 
if and only if «(V)=0 and the irregularity of V 
equals the dimension of V [32]. 


J. Analytic Surfaces 


In what follows, an analytic surface means a 2- 
dimensional compact complex manifold. For 
an analytic surface S, an exceptional curve on 
S and a (relatively) minimal model, etc., are 
defined with respect to bimeromorphic map- 
pings in analogy with the corresponding con- 
cepts for an algebraic surface (— 15 Algebraic 
Surfaces). Let C c S be an irreducible curve on 
S. Then there exists a holomorphic mapping @ 
from S onto another surface S’ such that @(C) 
is a point and such that @ induces the isomor- 
phism S—C3S'—(C) if and only if C? = —1 
and C is a nonsingular rational curve (Grauert 
{17]). S has a minimal model if and only if S is 
not a ruled surface (Kodaira [34, III]). The 
irregularity g=h®', the geometric genus p,, i- 
genus P,, etc., are also defined in the same way 
as in the case of algebraic surfaces. Note that, 
in general, h>! #h'-°. The Riemann-Roch 
theorem and M. Noether’s formula are valid 
also for an analytic surface (Atiyah and Singer; 
— 366 Riemann-Roch Theorems C). 
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K. Classification of Surfaces 


The classification of analytic surfaces by the 
aid of their numerical invariants was com- 
pleted by Kodaira and includes as a special 
case Enriques’s classification of algebraic 
surfaces [34, INI]. By an elliptic surface we 
mean a surface from which there exists a sur- 
jective holomorphic mapping ¢ onto an alge- 
braic curve A, such that for a general point p 
on A, œ~! (p) is an irreducible nonsingular 
elliptic curve. If a(S)=1 or x(S)=1, S has a 
unique structure as an elliptic surface. The 
image by ¢ of the points on S at which ¢ is 
not of maximal rank is a finite subset {a,, 
...,@,} of A. Let t; be a local coordinate on 

A around a; with t,(a) =0. We call a singular 
fiber of ọ the divisor on S defined by {t;0@ 
=0}. The structure and the construction of 
singular fibers of elliptic surfaces have been 
completely determined by Kodaira. By an 
elliptic surface of general type we mean a 
surface with Kodaira dimension 1. If «(S)= 2, 
S is projective algebraic and is called a surface 
of general type. If a(S) =0, then there exists 
only a finite number of irreducible curves on S. 
By a Hopf surface we mean a surface whose 
universal covering is C? — (0,0). If an analytic 
surface S is homeomorphic to S! x $°, S is a 
Hopf surface. Let b,(S) be the vth Betti num- 
ber of S. If a(S)=0, b, (S)=1, b (S)=0, and S 
contains a curve, then S is a Hopf surface 
(Kodaira). By a surface of class VII, we mean 
a minimal surface S with b,(S)=1. M. Inoue 
[26] constructed three families of surfaces of 
class VII, with b, =0 which contain no curves. 
These are Sy) (M e SL(3, Z)) and SẸ} q,r, and 
SV ar (N ESL(2, Z), p, q, r€ Z, teC) which 
have H x C as their universal covering sur- 
faces, H being the complex upper half-plane. 
These have a line bundle L such that H°(Q' © 
((L))=0. This property characterizes these 
Inoue surfaces among VII, surfaces with b, = 
0 which contain no curves. A couple of new 
surfaces of class VII, with b, >0 were con- 
structed and studied by Inoue, Ma. Kato, 

T. Oda, I. Nakamura, and I. Enoki [10, 27—29, 
46]. Some of these have close connections 
with cusp singularities of Hilbert modular sur- 
faces, torus embeddings, and global spherical 
shells. Enoki’s surface is denoted by Sp, a,r 
(n>0,0<|a|<1,teC’) and has the following 
properties: (1) class VII, and b, =n, (2) there 
exists a connected curve D with D? =0, (3) 
S,.a,t—~ D is an affine bundle over an elliptic 
curve. Properties (1) and (2) characterize Sp, , ; 
(Enoki). 

Let K, denote the canonical line bundle of 
an analytic surface S. An analytic surface S is 
said to be a K3 surface, if K; is trivial, i.e., if 
there exists a nonvanishing holomorphic 2- 
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form œw and if g=0. K3 surfaces are simply 
connected and are deformations of a nonsin- 
gular quartic surface in P? (Kodaira). We shall 
define the period mapping of K3 surfaces. Let 
L be a free Abelian group of rank 22 with the 
pairing < > which is the direct sum of the two 
copies of — Eg and three copies of U = Ze, + 
Ze, with <e,,€,> = ez, e2) =0, <e1, e2) = 
<€5,€,>=1, where — Eg denotes the lattice with 
the pairing corresponding to the Dynkin 
diagram Eg with the opposite signs. H*(S, Z) 
with the intersection form ts such a pair, con- 
sisting of Sand < >. A marked K3 surface is 
defined to be a pair (S, Y), where S is a K3 
surface and w: H?(S, Z)—> L is an isomorphism 
preserving < >. The class [œw] of œ is a base of 
H”? and satisfies ([w], [w]> =0 and <[a|, [o] 
>0. Let P(L,) denote the 21-dimensional 
projective space associated with L-=L @7C. 
D={(aeP(Lo)| <a, a> =0, <a, a > 0} is an 
open set of the quadric 9, = {(a)E P(L,)| <a, a> 
=0}. 2 is the set of all Hodge structures on 
Lc. The period of a marked K3 surface (S, ys) is 
defined to be (W-[w]). Every point of 2 is the 
period of some marked Kahler K3 surface. If 
the periods of two marked Kahler K3 surfaces 
(S, Y) and (S’, Y’), coincide then S is isomorphic 
to f. 

An analytic surface with p,=q=0 and 
K$? ~0 is called an Enriques surface, which 
has a K3 surface as its universal covering. An 
Enriques surface is an algebraic elliptic surface. 
An analytic surface with g=1 and K2!? 0 is 
said to be a hyperelliptic surface, which has an 
Abelian surface as an unramified covering. It is 
an algebraic surface and is an elliptic bundle 
over an elliptic curve. The classification of 
minimal surfaces is given in Table 1. The fol- 
lowing relations hold among these invariants: 
Let b* (b`) be the number of positive (nega- 
tive) eigenvalues (counted with multiplicities) 
of the intersection matrix on H?(S,R) and c; be 
the ith Chern class of S. Then 

(1) b* —b~ =4(cj — 2c) (Hirzebruch signa- 
ture theorem); 

(2) if b, is even, g=h'°=4b, and b* = 
4p 1, 

(3) if b; is odd, g=h'° + 1=3(b, +1) and 
bt =2p,. 

Let c? and c, denote the Chern numbers of 
an analytic surface S, i.e., c? =(K§) and c, is 
the Euler number of S. Then 3c, >cj [43], and 
if equality holds, then Ks is ample (Y. Miyaoka) 
(— 232 Kahler Manifolds C). If S is a surface 
of general type which is minimal, then c? > 
0, the bigenus P, >2 and the m-genus Pp is 
m(m— 1)cj/2+1—q+p, for m>2. For each 
m2 5, the mth canonical mapping of S is a bi- 
rational holomorphic mapping onto its image 
(Kodaira, E. Bombieri [5]). In general, c} >2p, 
— 4, and if the canonical mapping, i.e., the 


72 Ref. 
Complex Manifolds 


rational mapping associated with Ky, is bira- 
tional, then cf > 3p,—7. For certain kinds of 
surfaces of general type, E. Horikawa suc- 
ceeded in determining completely the structure 
of surfaces obtained as deformations of a given 
surface. Among others, every minimal surface 
with p,=4, q=0, and cj =5 is a deformation of 
a nonsingular quintic surface [22, 23]. 


Table 1. 


Classification of Minimal Surfaces 


Structure 


algebraic surface 
of general type 


elliptic surface 
of general type 


T fe Ta femeror 


epe elliptic surface 
3 


with a trivial 
ORRA hyperelliptic surface 


canonical bundle 
elliptic surface 


belonging to 


K3 surface 


Enriques surface 


rational surface 


of genus q 
surface of class 
VII 


For a, b>0, let Z, be the set of isomor- 
phism classes of all minimal surfaces with c? =a 
and 1—q+p,=b. D. Gieseker [15] proved the 
existence of N and constructed the mapping 
h:X „—>P™ so that the following hold: (1) h is 
injective; (2) if f: X — Y is a smooth holomor- 
phic mapping such that all h '(y) represent 
some classes € X,,, then the set-theoretic map- 
ping f: Y>P induced from h is a morphism 
of schemes; (3) h(Z,,) is a locally closed sub- 
variety of P’. Hence, h(Z,,) is a moduli variety 
of surfaces of general type with c? =a and 1— 
q+p,=b. 

For each m with 1<m<9, there exists a 
minimal surface of general type with p,=q=0 
and c? =m. 

If there exist a compact analytic surface S$ 
and a curve C on S such that S—C is biholo- 
morphic to a complex manifold M, then we 
say that S is a compactification of M with 
boundary C. Every compactification of C? is a 
rational surface (Kodaira, J. Morrow [45]). 
Every compactification of C x (C—{0}) is also 
rational (T. Ueda [53]). However, all compac- 
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tifications of (C— {0})? are rational surfaces, 
certain kinds of Hopf surfaces, or P'-bundles 
over elliptic curves as constructed by Serre 
(Ueda). 
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A. Classical Theory 


If the ratio w,/m, of two periods w,, œ, of an 
telliptic function f belongs to an imaginary 
tquadratic field K, then there exists an alge- 
braic relation between f(z) and f(Az) for any 
Ain K, and such an f is said to have complex 
multiplication. This phenomenon for the tsn 
function with modulus JA was discovered 
by C. F. Gauss and was applied to the prob- 
lem of dividing a tlemniscate into five arcs of 
equal length. More generally, N. H. Abel 
showed that the special dividing equation of 
an sn function with complex multiplication 

is algebraically solvable. From a number- 
theoretic point of view, L. Kronecker conjec- 
tured that every tAbelian extension of an 
imaginary quadratic number field K is deter- 
mined by a transform equation of an elliptic 
function with complex multiplication by a 
number of K (1880). This is an analog of the 
fact, announced by Kronecker and proved by 
H. Weber, that every Abelian extension of the 
rational number field is a subfield of a tcy- 
clotomic field. Kronecker’s work was con- 
tinued by Weber [2], and his conjecture was 
proved by T. Takagi (1903) for K =Q(./—1), 
by T. Takenouchi (1916) for K = Q(e?*9), and 
by Takagi (1920) for the general case using 
tclass field theory. H. Hasse [5] and M. Suga- 
wara simplified the theory of complex multi- 
plication, and Hasse noticed a relationship 
between complex multiplication and tcon- 
gruence zeta functions. Working from Hasse’s 
idea, M. Deuring constructed the theory of 
complex multiplication purely algebraically 
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and determined tHasse’s zeta function of an 
elliptic curve with complex multiplication. 
For the rest of this article, K always stands 
for an imaginary quadratic field. Let L be a 
*lattice group on the complex plane C gen- 
erated by w,, @, and let z be a complex vari- 
able. Define functions 9, g2, g3 as follows: 
plz L)= @(2,@,0))=2-? +E ((z—@) 2 
w~’), g2(L) = 92(@1,@2)=60 Low *, g,(L)= 
g3(W,,@>)=140Xq@ °, where the sum È is 
over the elements w of L except 0. Let g’ be 
the derivative of ø; then z—(1, (z), @'(z)) is a 
one-to-one correspondence between the points 
on the complex torus C/L and those on the 
telliptic curve E; X,X3=4X}—g,X2X,— 
g3X3 in the projective plane. If the quotient 
w/w generates K, then the ring of analytic 
endomorphisms of C/L (i.e., the ring of endo- 
morphisms of E) is isomorphic to a subring of 
the *principal order o of K. In particular, if the 
lattice group L is an fideal of K (for K c C; > 
347 Quadratic Fields), then the ring of endo- 
morphisms coincides with o. The function 
J(t)=J(E) = J(L) = 2° 3° g,(L)?/A(L) (A(L) = 
g2(L)? —279,(L)’) of t=@,/w,, Imt>0, is 
a tmodular function of level 1, and J(E) is 
called the invariant of the elliptic curve E. 
If E has a complex multiplication, then J(E) 
is an algebraic integer. Then the three main 
theorems of the classical theory of complex 
multiplication can be stated. 


Theorem 1. Let be the tclass number of K, 
and let a,,...,a, be a set of representatives of 
ideal classes of K. Then J(a,),...,J(a,) are 
exactly the conjugates of J(a,) over K, and 
K(J(a,)) is the maximal unramified Abelian 
extension (the tabsolute class field) of K (— 59 
Class Field Theory). 

Next we define the function f by 


f(z L)=9293A *: p(z L), 


K4Q(./-1), Q(e?”), 
=g} t p(z LP, K=Q(./—1), 


=g;,A p(z, L}, K=Q(e?™). 
Theorem 2. Let o be the tprincipal order of K, 
m be an integral ideal of K, and a be an arbi- 
trary ideal of K. Choose a number é of K such 
that m={2e0|4ćea}. Then K(J(a), f(&;a)) is 
the fclass field for the tray modulo m. 

The number č in this theorem is obtained by 
a _'m(é)=b, where b is an integral ideal be- 
longing to the ideal class of a 'm. Thus f(é; a) 
is a “special value” of an elliptic function as 
well as of a modular function. 

The tgeneral law of reciprocity, the tprin- 
cipal ideal theorem, and the ramification in the 
class field in theorem 2 can also be described 
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in terms of elliptic functions or elliptic curves. 
In general, the ring of endomorphisms of an 
elliptic curve defined over a field k of charac- 
teristic 0 ts either Z or an order of an imagi- 
nary quadratic field. On the other hand, if the 
characteristic of k is not zero, then the ring of 
endomorphisms may be an order in a definite 
*quaternion algebra. 


B. Complex Multiplication of an Abelian 
Variety 


Following Kronecker’s idea, D. Hilbert posed 
in his lecture at Paris (1900) the so-called 12th 
problem: to find an analytic function whose 
special values generate Abelian extensions 
over a given algebraic number field (— 196 
Hilbert). E. Hecke constructed unramified 
Abelian extensions of an imaginary biquadra- 
tic field using tHilbert modular functions 
[10]. After this, there was no notable devel- 
opment concerning the problem until the 
theory of complex multiplication was gen- 
eralized to the case of tAbelian varieties, which 
was made possible by progress in algebraic 
geometry, in particular A. Weil’s geometric 
theory of Abelian varieties (G. Shimura and 
Y. Taniyama [11]). The following results 
have been obtained: Consider a triple (A, X, t), 
consisting of an Abelian variety A defined over 
C, a fpolarization ¥ of A, and a point t of A. 
Call two such triples (A, ¥, t) and (A’, ¥’, t") 
isomorphic if an isomorphism of A onto A’ 
maps ¥ onto X’ and t onto t’. Then there exists 
one and only one subfield ko of C with the 
following property: In order for (A, X, t) and 
(A7, X7, t7) to be isomorphic for an automor- 
phism ø of C, it is necessary and sufficient that 
o fix all elements of ky. We call kọ the field of 
moduli of (A, X, t). If A is an elliptic curve E 
and if t=0, then kọ = Q(J(E)). In the higher- 
dimensional case, the field of moduli is gen- 
erated by special values of a tSiegel modular 
function. 

If F is a totally imaginary number field 
that is a quadratic extension of a totally real 
field Fy of degree n, then there exists a set 
{Pis -s Ony of n different isomorphisms of F 
into C such that @;=@, never occurs for i Æj, 
where the bar denotes complex conjugation. 
We call (F; {@,}) a CM-type. 

If a is an ideal of F, then L= {(ọ, (œ), ..., 
¢,(a))€C"|«ea} is a lattice group in the n- 
dimensional complex linear space C”, and 
C’/L is analytically isomorphic to an Abelian 
variety A of dimension n in a complex pro- 
jective space. The tendomorphism ring QI(A) 
of A contains a ring that is isomorphic to 
the principal order o of F. Conversely, every 
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Abelian variety of dimension n such that (A) 
contains a ring isomorphic to o can be con- 
structed in this way. 

Let M be a normal extension of Q contain- 
ing F. Denote the Galois group of M by G 
and the subgroup of G corresponding to F 
by H, and put S=(},,H, where ~, stands 
for a prolongation of the previous Q, to G. 
Then the following three conditions are equiv- 
alent: (i) 2{(A) = 0; (ii) A is simple; (iii) H = 
{yeG|Sy=S}. If H* ={5eG|5S=S}, then we 
can choose w,€G such that S= |), H*y,. 

If F* is the subfield of M corresponding to H*, 
then (F*; {w,,}) is also a CM-type, and F* = 
Q(x, ¢,(«)|ae F). Moreover, for an ideal r of 
F*, TI,,w,(t) is an ideal of F. 

Now let H,, be the group of ideals r of F* 
that are relatively prime to the norm N(m) of 
an integral ideal m of F* and for which there 
exists a number é of F with 


[O= Ne=], €=1 (mod* m). 


Then H,, is an tideal group modulo m in F*. 


Theorem 3. Assume that (A), and let ¥ be 
an arbitrary polarization of A. Denote a point 
of A with m={/e€0|At=0} by t, and let k,, be 
the field of moduli of (A, X, t). Then k,, F* is the 
‘class field over F* corresponding to Hp- 

The point t in theorem 3 always exists. In 
particular, if m =v, then t=0. When A is an 
elliptic curve E, F is an imaginary quadratic 
field and we have F* = K, and theorem 3 coin- 
cides essentially with the content of theorems 1 
and 2. Ifn>1, F=F* holds only in special 
cases. 

The theory of complex multiplication of A is 
closely related to the tHasse zeta function of A 
(— 450 Zeta Functions). 
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A. Algebraic Properties of Complex Numbers 


A complex number is an expression of the form 
a+ib with arbitrary real numbers a and b and 
the imaginary unit i. Writing «=a + ib, B= 
c+id, we define «=f if and only if a=c and 
b=d. As regards algebraic operations with 
complex numbers, we define a+ B=(a+c)+ 
i(b+d), a—fB=(a—c)+i(b—d), xB =(ac—bd) 
+i(ad+bc), and for f £0, i.e., c? +d? £0, 

a/ PB =(ac + bd)/(c? +d?) + i((bc —ad)((c? + d”)). 
Then the addition and multiplication thus 
defined obey commutative, associative, and 
distributive laws, and complex numbers form 
a fcommutative field with O=0+i0 and 1= 
1+ 0 as its zero element of addition and iden- 
tity element of multiplication, respectively. The 
set of all complex numbers is usually denoted 
by C. 
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By assigning to each real number a a com- 
plex number a+ i0, algebraic operations on 
real numbers are carried into those of the 
corresponding complex numbers. That is to 
say, the field R of all real numbers is mapped 
isomorphically into the field C of all complex 
numbers. By identifying a with a+i0, we are 
taking R as a subfield of C. Also, 0+i1 will 
be denoted simply by i. From the previous 
definition of algebraic operations, it follows 
that i? = —1. Furthermore, since « =a + ib 
=(a+i0)+(b+i0)(0+i1), a+ib is not a mere 
symbolic expression but can also be regarded 
as the outcome of algebraic operations on 
a, b, iin C. The real and imaginary parts of a 
complex number «=a + ib are, by definition, 
a and b, denoted by Rea and Img, respec- 
tively. A complex number that is not a real 
number is sometimes called an imaginary 
number; in particular, a complex number « 
with Rea=0 is called a purely imaginary 
number. For each complex number «=a +ib, 
we define its conjugate complex number as 
a—ib, and denote it by % We then have 2+ B 
=%+ f and xf =@ß. The mapping «>Z is an 
tautomorphism of C which leaves each ele- 
ment of R invariant. Also, the following rel- 
ations hold: Re «= (« +&)/2 and Ima=(a— 
%)/(2i). 

Regarded as an overfield of R, C is an 
textension field of R of degree 2 obtained by 
the adjunction of i, which is a root of an irre- 
ducible equation x? + 1 =0. The important 
algebraic property of C is that it is talgebra- 
ically closed. Namely, for any polynomial f(x) 
with coefficients in C, the equation {(x)=0 
possesses at least one root in C (tGauss’s 
fundamental theorem of algebra). 


B. Topology of C 


The absolute value (or modulus) of a complex 
number «=a +ib, denoted by |a|, is by defi- 
nition |a|=./a?+b?= Ja. If æ is real, then 
the absolute value of « in the sense of complex 
numbers is identical to the one in the sense of 
real numbers. It always holds that |x|>0 and 
|~|=O<>a=0. It follows further that |æ + £| < 
la] +|8l, |oB|=|o| |B], and |a| = |x. 

For each pair of complex numbers a and $, 
define p(a, B)=|a—f|. Then with p(a, f) as the 
tdistance function, C satisfies the axioms for a 
tmetric space, and in particular, lim,..,, %,= 
dy S lim p(«,, %) =0<>lim|a,, — a) |=0<— 

(lim a, =a, and limb, = bọ) (where «, =a, + ibp, 
dy =a, +ib,). From this it is easily seen, as in 
the case of the set R of all real numbers, that C 
also becomes a ‘locally compact and tcom- 
plete metric space. 

With respect to this topology, the four oper- 


281 


ations (except for division by zero) are con- 
tinuous: %,>%, and a> Bo imply a, + B,> 

Ao + Bos Xn — Buxo — Bos &n By > %o fo; and 
Okn/P,—>%o/Po (where in the last case we assume 
Ba #0 and Bo #0). Thus C becomes a topo- 
logical field. Furthermore, the assignment 

x — Q gives a continuous mapping C >C, a 
homeomorphic automorphism of C. 


C. The Complex Plane 


If in a plane to which is assigned rectangular 
coordinate axes a complex number & =a + ib 
is represented by a point (a, b), then the plane 
is called the complex (number) plane (Gauss- 
Argand plane or Gaussian plane) (Fig. 1), and 
the point representing « is called simply the 
point x. The abscissa and ordinate axes are 
called the real and imaginary axes, respec- 
tively. A point x =a +ib can be represented by 
tpolar coordinates r, 0 with the origin and the 
real axis as the pole and the generating line, 
respectively, where r=./a? +b? is the absolute 
value |a| of « and 0 is the argument (or ampli- 
tude), denoted by arga, of x. The argument 

of æ is uniquely determined mod 27 if a40 
and is an arbitrary real number if «=0. 


imaginary 
axis 


real axis 


Fig. 1 


The absolute value |«| of a complex number 
x, regarded as a vector from the origin to the 
point a, is the length of this vector. For com- 
plex numbers « and f, to the sum of the vec- 
tors a and f corresponds the sum a + £ of the 
complex numbers. A complex number «, in 
terms of its absolute value r and argument 0, 
is expressed as «=r(cos@+ isin 0), which is 
called the polar form of x. For polar forms 
the following hold: ¢=r(cos 0— isin 0)= 
r(cos(—0)+ isin(—@)); a~! =a/Jo|? = 
r~'(cos(—0)+isin(—6@)) («40); and a,0,= 
r,r,(cos(0, +0,)+isin(0, + @,)), where |a;|=r; 
and arga,;=0; for j=1, 2. This last relation 
when r; =r, =| is called De Moivre’s formula. 
The nth roots of unity in C are given by p;= 
cos 2nj/n+isin 2nj/n (j=0, 1, ...,n— 1) (Fig. 2). 

In the complex plane, the mapping « > a 
corresponds to the freflection of the plane in 
the real axis, x > «+ to the parallel ttrans- 
lation along a vector $, « > aß (B #0) to the 
trotation through the angle arg f followed by 
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a thomothetic transformation with center 
0 and constant ratio |f|, and «> a7! to 
the tinversion with respect to the unit circle 
{a|Jo|=1}. 

The distance p(x, 8)=|«—f| between a and 
f in C coincides with their Euclidean distance, 
provided that « and f# are regarded as points 
in the Euclidean plane, so that the complex 
plane is tisometric, and accordingly homeo- 
morphic, to the Euclidean plane. 


Fig. 2 


D. The Complex Sphere 


In the rest of this article, P denotes the com- 
plex plane and X denotes the sphere of radius 
1, with 0, the origin of P, as its center. The 
points N(0,0, 1) and S(0,0, —1) of X will be 
called the north and south pole, respectively 
(Fig. 3), where the Ist and 2nd coordinate axes 
are the real and imaginary axes of P, respec- 
tively, and the 3rd coordinate axis is ortho- 
gonal to P. A straight line from N through a 
point z (a complex number) in P intersects 

+ at a point Z =(x1, X2, X3) different from 

N, where z=(x, +ix,)/(1 — x3), x, =(z+2)/ 

(1 +[z|?), x. =(z ~Z)/(i(1 +|z1°)), and x3 = 

(|z|? —1)/(|z|? + 1). The mapping z > Z is called 
a Stereographic projection from N, by means of 
which P and X—{N} become tconformally 
equivalent to each other. Consequently z can 
be represented by a point Z of Y—{N}, and 
thus used is called a complex sphere (or 
Riemann sphere). Let us adjoin to the complex 
plane P a new element, denoted by ©, called 
the point at infinity, which corresponds to the 
only exceptional point N of X. The topology 
of the complex plane with oo can be intro- 
duced by the corresponding topology of the 
Riemann sphere. Indeed, the family of all 

the sets {z||z|>M}U {œ} for M>0 forms a 
tlocal base around oo. By introducing local 
complex coordinates ¢ = 1/z into the neigh- 
borhoods of œ, each element of this local base 
is represented as {¢||{|<M ‘3, in which the 
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convention ¢ =0 is adopted for z = œ. The 
complex sphere thus defined can be regarded 
as a ‘Riemann surface (1.e., a 1-dimensional 
tcomplex manifold). 

The complex plane (complex sphere) whose 
points are represented by a variable z or w is 
called a z-plane or a w-plane (a z-sphere or a w- 
sphere). 


Fig. 3 


E. Linear Fractional Functions 


Given complex numbers a, b, c, and d with 
ad — bc £0, we define a linear fractional func- 
tion (or simply linear function) by 

az+b 
w= . 

cz+d 


(1) 


As a mapping from the z-sphere into the w- 
sphere, this linear function ts called a Möbius 
transformation (linear fractional or simply 
linear transformation). The usual linear trans- 
formation, i.e., the one with c=0 in the pre- 
sent case, is sometimes distinguished as an 
entire linear transformation. Since (1) depends 
only on the proportion a:b:c:d, we can 
assume ad—bc=1 without loss of generality. 
The transformation (1) is fholomorphic and 
tunivalent on the whole z-sphere with only 
one exceptional tpole at —d/c (co if c=0) of 
order 1, and the inverse of (1) is also a linear 
fractional function. The set of all linear trans- 
formations forms a ‘group with composition 
of transformations as the group operation. 
One of its subgroups is the tmodular group. 
Linear transformations carry any circle 
of the complex plane (or of the Riemann 
sphere) into a circle of the same plane (or of the 
same sphere) if we adopt the convention that 
straight lines are a special kind of circle. (In 
the case of a Riemann sphere, no such conven- 
tion is necessary.) Given on a plane a circle 
with center o and radius r and two points p 
and p’ on a half-line issuing from o satisfying 
op:op' =r*, the points p and p’ are called 
symmetric points (or reflection points) with 
respect to the circle. The transformation pp’ 
is called the inversion with respect to this cir- 
cle. In the complex plane, let z and z’ be sym- 
metric points with respect to a circle C. Sup- 
pose that by a linear transformation, z, z’ and 


C are carried to points w, w’ and a circle D, 
respectively; then w and w’ become symmetric 
with respect to the circle D (principle of reflec- 
tion). Thus symmetricity is invariant under 
linear transformations. Also the tanharmonic 
ratio of any four points z,, Z2, Z3, and Z4, 

(Z1, Z2; Z3, 24)= (Z1 —23)(21 —24):(22 —23)/ 
(z,—2,), is invariant under a linear transfor- 
mation; i.€., (Z1, Z2; Z3, 24) =(Wy, W2; W3, W4) 
holds, where w; is the image of z; under a linear 
transformation (j= 1,2, 3, 4). 


F. Normal Forms of Linear Transformations 


There exist fixed points of the transformation 
(1) on the Riemann sphere, i.e., points satisfy- 
ing z =(az + b)/(cz + d). The number of fixed 
points is 2 or 1, except when w =z. If the trans- 
formation has two fixed points, they will be 
denoted here by p and q. The natural conven- 
tion p=q is adopted if the transformation has 
one fixed point. If c=0, then p or q is œ, and 
furthermore if c=a—d=0, then p and q are 
both oo. 

For unequal finite p and q, (1) can be re- 
written in the following normal form: 
W-p Zp a—cp 


——-=4 , @ #1, 
W-p z-q a—cq 


in which, according as arga=0, |«|=1, or 
otherwise, (1) is called a hyperbolic (Fig. 4), 
elliptic (Fig. 5), or loxodromic transformation, 
respectively. This classification can be applied 
also for finite p and infinite q, i.e., to the trans- 
formation w—p=a(z—p). Furthermore, for 
p=q#, (1) is rewritten in the following 
form: 


7G 


Fig. 4 


Fig. 5 
Elliptic transformation. 
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In this case (1) is called a parabolic transfor- 
mation (Fig. 6). For p=q= œ, i.e., if w=z+B, 
(1) is also called parabolic. We can easily de- 
termine to which class (1) belongs from the 
discriminant D =(a +d} —4 of the quadratic 
equation cz? —(a—d)z—b=0 obtained from 
z=(az+b)/(cz +d) with ad— bc =1 by multi- 
plying both sides by cz +d. If a+ d is real, 
then according as D>0, <0, or =O, (1) is 
hyperbolic, elliptic, or parabolic, respectively, 
and if a+d is not real, then the transforma- 
tion is loxodromic. 


<i} 
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Fig. 6 
Parabolic transformation. 


Let D and D’ be two arbitrary circular disks. 
Then there always exists a linear transforma- 
tion which gives a one-to-one tconformal 
mapping from D onto D’. Conversely, any 
mapping with this property is given only by 
linear transformations (provided that the half- 
plane having a straight line together with the 
point at infinity as its boundary is regarded as 
a closed disk), which are uniquely determined 
by giving three points a, b, c from the bound- 
ary of D and as their corresponding points, 
three arbitrary points a’, b’, c from the bound- 
ary of D’. 


G. The Poincaré Metric 


Since conformal mappings from the domain 
|z|<1 onto |w|<1 are given by the trans- 
formations w=«(z —2Z9)/(1 —Zgz) (lel=1, 
|Zol< 1) (— Appendix A, Table 13), for corre- 
sponding z and w it holds that 


ldw|  |dz] 5 
jw? Foe ta 


|dz|/(1 — |z|?) is called Poincaré’s differential 
invariant. With a metric having ds =|dz|/(1— 
|z|?) as its tline element, the unit disk |z| < 1 
becomes a tnon-Euclidean space in the sense 
of Lobachevskii, and the metric is called the 
Poincaré metric. Furthermore, since the trans- 
formations (2) leave the length of curves invar- 
iant, they can be regarded as tmotions in this 
space, where the geodesic through two points 
zı and z, is the circular arc orthogonal to the 
unit circle. If we denote the intersections of the 
arc with the unit circle by z, and z4, then the 
tnon-Euclidean distance between two points 
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z, and z, along the geodesic is given by 
(1/2)log(z,,23; 23,24), provided that the points 
Z4» Z1, Z2, Z3 are arranged on the arc in this 
order (— 285 Non-Euclidean Geometry). 
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A. History 


Since the beginning of civilization, people 
have utilized tools for aiding computation. In 
Japan, bamboo computing rods were used in 
the 7th century; before the close of the 16th 
century, the abacus was imported from China 
(— 230 Japanese Mathematics (Wasan)). The 
first calculator capable of performing the four 
arithmetic operations automatically was de- 
signed by W. Schickard of Tübingen Univer- 
sity (1623). After that, B. Pascal independently 
made his famous adding machine; this was 
improved by G. W. Leibniz so that it was able 
to execute multiplications and divisions. 

Most calculators manufactored today are 
electronic; however, many of these machines 
are not “programmable” and hence require at 
every step of computation a manual operation 
for specifying the machine operations to be 
executed. On the other hand, modern auto- 
matic computers can execute automatically a 
sequence of operations according to a given 
program without any manual intervention. 

The automatic computer was conceived by 
the English mathematician C. Babbage in the 
19th century, but mechanical engineering at 
that time was not advanced enough to allow 
the construction of such a computer. His idea 
was first realized by the relay computers Z3 of 
K. Zuse (1941) and Mark I of Harvard Uni- 
versity (1944). In 1947, the first electronic 
computer, ENIAC, appeared, in which vacuum 
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tubes were utilized instead of mechanical com- 
ponents. Since then, the capabilities of com- 
puters have increased rapidly and it can be 
said that we now live in the “computer age.” 


B. Principles of Modern Computers 


Information processing in a computer is based 
on communication among its constituents by 
electric signals. In digital computers, informa- 
tion is encoded as a sequence of binary num- 
bers 0, 1, whereas continuous values are al- 
lowed in tanalog computers. The minimum 
quantity of information in digital computers 
is therefore a binary digit, called a bit. Since 
continuous values fluctuate on account of 
electric noise in the circuits, digital computers 
have the advantage of maintaining high preci- 
sion during computation. 

The binary numbers 0, 1 are represented in 
practice by two distinct electric signals: two 
distinct voltages, two distinct phases of alter- 
nating current, the existence or nonexistence 
of a pulse, etc. A system of circuits is called 
synchronous when it contains a clock, a gener- 
ator of periodic pulses, which synchronizes 
the transmission and transformation of in- 
formation. In an asynchronous system, circuits 
execute each step of information processing 
independently and advance to the next step 
after verifying the termination of the preceding 
step. 

In ENIAC, a program was inserted into the 
computer by carefully connecting many con- 
trol lines on plugboards. Since this caused 
much trouble in inserting the different pro- 
grams needed for different jobs, J. von Neu- 
mann proposed encoding programs in se- 
quences of binary digits and storing these 
programs in the memory unit of the computer. 
His principle, the stored program principle, was 
realized by EDSAC (1949) and has since been 
widely used. 

An automatic computer in general con- 
sists of the following five units: the arithme- 
tic unit, memory, control, input, and output 
(Fig. 1). These units are interconnected by 
wires that exchange information in the course 
of computation. 
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Construction of electronic computers. 
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The arithmetic unit consists mainly of sev- 
eral memory registers and operational circuits 
associated with them. Each register stores a 
binary number of n bits (usually 16<n< 64). 
An important building block of the opera- 
tional circuit is the basic adder of 1-bit num- 
bers. A parallel adder of n-bit numbers can 
be obtained by connecting n copies of the 
basic adder. Alternatively, a sequential adder 
can be composed of a single basic adder which 
is utilized repeatedly to sum up bit by bit two 
binary numbers from the lowest bit, Subtrac- 
tion is usually carried out by adding the com- 
plement of the subtrahend. Multiplication is 
realized by shifting the multiplicand to the 
left and adding it to the intermediate sum; 
this addition may be omitted, depending on 
the relevent bit of the multiplier. In division, 
we shift the dividend to the left and if possible 
subtract the divisor from it. By recording at 
each step whether or not the subtraction is 
possible, we obtain the quotient. 

The memory unit stores the instructions 
and given data as well as the necessary data 
obtained in the course of computation. It is 
divided into many registers, each of which 
contains a number and is referred to by means 
of a serial number called an address. At present, 
high-speed memory units are usually made 
up of magnetic cores or integrated circuits. As 
auxiliary large-scale memory, there are also 
magnetic drums, disks, and magnetic tapes, 
etc. Other elements under investigation are 
extreme-low-temperature elements, chemical 
elements, optical elements, etc. 

The control unit repeats the following oper- 
ations consecutively: (1) takes an instruction 
from the memory location indicated by the 
sequential control counter, (2) gets a data 
word from the memory according to the ad- 
dress part of the instruction, (3) decodes the 
function part of the instruction and sends 
control signals to appropriate circuits, and (4) 
increases the content of the sequential control 
counter by 1 and, after receiving end signals 
from the arithmetic unit, returns to step (1). 
For these purposes the control unit contains a 
counter, a decoder, an encoder to send control 
signals, and a register to store the instruction 
to be executed. 

The arithmetic, control, and memory units 
form the central processor of the computer. In 
contrast to the central processor, the input and 
output units are called peripheral devices. An 
input device receives the necessary information 
(a program and data); a card reader reads 
punched cards, and a teletypewriter sends 
signals directly into the computer. An output 
device presents the results obtained by the 
computer. The results are usually printed by a 
teletypewriter or line printer. Magnetic tape 
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units can be considered as auxiliary input- 
output devices, since tape reels are removable. 
The tape units can accept prepared input data 
as well as record the results, the output. 

These devices are operated according to 
given input-output instructions under the 
control of the central processor. However, 
since every input-output device contains me- 
chanical components and is extremely slow in 
comparison with the central processor, a large- 
scale computer is often accompanied by sat- 
ellite computers which undertake the control 
of input-output devices. 


C. Instructions and Programming 


In stored-program operation, a computer 
performs a sequence of calculations according 
to given instructions. There are various types 
of instructions, but those most frequently used 
are single-address instructions, each of which 
contains a single-address part designating an 
operand. 

A program is a finite sequence of instruc- 
tions arranged suitably for the required com- 
putation. Programming, or making a program, 
is therefore the task of decomposing the re- 
quired computation into elementary steps each 
of which corresponds to an instruction. 

Every instruction is represented in a com- 
puter by a number, a numeric code, which is 
determined in a definite way by the construc- 
tion of the control unit. Before starting com- 
putation, instructions thus encoded are stored 
in the memory. In this sense, a program is a 
sequence of numbers. This sequence is called a 
machine-language program. 

A program is usually divided into several 
blocks, called subprograms or subroutines. 
Some subroutines are made in advance, espe- 
cially those for frequently required jobs such as 
evaluating elementary functions and manipulat- 
ing input-output devices, etc. The system of 
these ready-made programs is called software, 
in contrast to the hardware (1.e., mechanico- 
electronic equipment) of the computer. Quite 
often basic routines are microprogrammed, i.e., 
written in a simple code and stored in a fast 
read-only memory (ROM). In this case the set 
of built-in programs is called firmware. The 
handliness of a computer depends mainly on 
close matching of the software and hardware. 

Programs are usually written in certain 
forms called external languages which are easy 
to master. A problem-oriented language is an 
advanced external language in which ordinary 
arithmetic expressions are available with slight 
modifications. Programs written in these lan- 
guages are translated into machine languages 
by program input routines. The translator for 
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a problem-oriented language is called the 
compiler. 

A compiler accepts several macroinstruc- 
tions. Moreover, it is equipped with the fol- 
lowing facilities: (i) the ability to translate 
arithmetic expressions into machine language; 
(ii) the ability to generate linkages to various 
ready-made subroutines according to certain 
simple indications; (iii) the ability to auto- 
matically allocate programs, subroutines, and 
data in the memory; (iv) the ability to check 
automatically the syntactic correctness of 
programs. Thus it accepts an external form 
such as 


if x >0 then printreal (SQRT(x)) 
else printstring (‘negative’). 


Another important translator is the as- 
sembler, which translates mnemonic codes of 
instructions (add for addition, etc.) into their 
numeric codes according to a given table. It 
allows us to utilize symbols for specifying 
addresses. It also converts decimal numbers 
into binary and generates certain segments of 
the program from rather simple indications. 
Compilers and assemblers are important con- 
stituents of software. 

The large-scale high-speed computers that 
have recently appeared have made software 
systems inevitably more complex. There is 
now software, so-called monitors or operating 
systems, that supervises the uninterrupted 
processing of many programs. Some operating 
systems coordinate several assemblers and 
compilers so that several languages can be 
mixed in writing a program. Examples of 
other important software are mathematical 
(numerical) software and database manage- 
ment systems. 


D. Mathematical Models of Computers 


An operational circuit in the arithmetic unit is 
usually constructed from basic elements (log- 
ical gates), each of which performs a certain 
operation on the binary signals. Thus the 
construction of a circuit from basic elements 
is represented by a composition of a logical 
function defined over the set {0,1} from a 
given set of basic functions. Post [3] consid- 
ered functional composition without feed- 
back loops and established a general crite- 
rion for a given set of logical functions to be 
complete, i.e., to be capable of generating the 
whole set of logical functions. Strictly, how- 
ever, a logical gate takes a definite time delay 
to perform its operation. Therefore Kudryav- 
tsev [4] proposed as a model of a logical gate 
a pair (f,d) of a logical function f and a non- 
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negative integer d: f represents the operation 
of the gate and d represents its delay. He de- 
fined the feedback-free composition of such 
functions with delays and gave a completeness 
criterion for a set of logical functions with 
delays. His study has been extended in various 
directions by Loomis [5], Nozaki [6], and 
Rosenberg [7]. 

When a circuit contains memory elements 
or feedback loops, its function is suitably 
represented by an tautomaton. Hence the 
design and analysis of circuits with memory 
elements, state-minimization, and equivalence 
checking [8], decomposition into simpler 
components [9, 10], verification of complete- 
ness [11], etc., can be studied in terms of auto- 
mata (— 31 Automata). A computer itself can 
be considered to be a finite automaton, since 
it has a finite number of memory elements and 
its behavior is completely determined by the 
content of the memory (its internal state) and 
the inputs. However, since a modern computer 
has an enormous memory and contains re- 
placeable parts, such as magnetic tapes, it is 
more adequately represented by an infinite 
model, such as a tTuring machine. 

Turing machines are capable of simulating 
many intellectual activities governed by for- 
mal rules. It is believed that any well-defined 
finite algorithm can be simulated by a Turing 
machine (Church’s thesis.) Thus any compu- 
ter can be simulated by a Turing machine, 
and hence it is unable to resolve those deci- 
sion problems which are unsolvable for Tur- 
ing machines. For instance, no program can 
decide in finite steps whether a given pro- 
gram written in a computer language, say 
FORTRAN or PASCAL, eventually stops or 
not. On the other hand, a modern computer 
can simulate a universal Turing machine pro- 
vided that its memory can be extended un- 
boundedly by supplying magnetic tapes. 
Hence any well-defined finite algorithm can 
be simulated by a computer unless the limi- 
tation of memory capacity hampers its 
accomplishment. 


E. Mathematical Theory of Programming 
Languages 


Along with the development of software, pro- 
gramming techniques have gradually accumu- 
lated. For instance, we now have an almost 
satisfactory method of translating arithmetic 
expressions into machine language. However, 
we still lack a general theory to cover effec- 
tively a wide array of programming problems. 
The design of an adequate metalanguage 
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is still an important problem, if the word 
“adequate” implies complete description of 
syntax and semantics of problem-oriented 
languages. An interesting problem is to define 
and suitably classify grammars with respect to 
their capabilities of forming languages. Such 
research is an important branch of mathema- 
tical linguistics. N. Chomsky [12] has inves- 
tigated this problem, starting from research on 
natural languages, and has given formal defini- 
tions of the grammars (— 31 Automata D). 

tContext-free grammars (type-2 languages) 
play an important role in the theory of soft- 
ware as well as in Chomsky’s syntax, since 
they are powerful enough to describe the pa- 
renthesis structure and simple and easy to 
manipulate. Some variants of context-free 
grammar have been proposed for attaining 
high efficiency in the automatic appraisal of 
programs [13,14] or for increasing the pro- 
grams’ generative power [15]. 


F. Branches of Information Science 


Many fields related to computers now make 
up the information sciences or informatiques. 
Important mathematical theories born or 
developed in the information sciences are: 

(1) Design and analysis of hardware devices: 
tBoolean algebra [16], switching theory [17], 
and theory of tautomata [18, 19]. 

(2) Design and analysis of programming lan- 
guage; theory of formal languages [14, 21]. 

(3) Design and analysis of algorithms; tnumer- 
ical analysis, theory of complexity of compu- 
tation, theory of tdata processing. 

(4) Mathematical foundation of programming; 
logical verification of correctness and equiva- 
lence of programs [22], frecursive function 
theory, tdecision problems. 

Applications of the information sciences are 
found in diversified fields, such as statistics, 
operations research, mathematical psychology, 
econometrics, jurimetrics, and behaviorimetrics. 
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A. Möbius Geometry 


We represent an n-dimensional sphere S” as 
the tquadric hypersurface S":x7+x3+...4+ 
x? —2x9x,,=0 in an (n+ 1)-dimensional real 
*projective space P"*', where the (x,) are 
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thomogeneous coordinates in P"*!. We denote 
by M(n) the group of all tprojective transfor- 
mations of P”*! that leave S” invariant. Then 
the transformation group M(n) acts on S". The 
pair (S", M(n)) is called the conformal geom- 
etry or Möbius geometry. We call S” an n- 
dimensional conformal space, a transformation 
belonging to M(n) a Mobius transformation, and 
M(n) the Möbius transformation group. Every 
point of the projective space P”*! corresponds 
to a hypersphere on S”. For example, if a point 
A lies outside of S”, then the intersection of S” 
and the tpolar hyperplane of A with respect to 
S" is an (n — 1)-dimensional sphere S""', and 
the point A corresponds to this real hyper- 
sphere S”~}. Similarly, if a point A lies on S$”, it 
corresponds to a point hypersphere, and if a 
point A lies inside of S”, then A corresponds 
to an imaginary hypersphere. We sometimes 
identify the point A with the corresponding 
hypersphere. For any two points A =(a,) and 
B=(b,), we put AB=a,b, +a,b,+...+4,b, 
—(dgb,, +a,,b,) and call it the inner product of 
the two hyperspheres A and B. The angle 6 
between two intersecting real hyperspheres A 
and B is defined by cos 0 = AB/(/ A? - ,/B?). 
This angle is invariant under the Möbius 
transformation. 

In the projective space P”*!, we take a 
tframe (Ap, A,,...,A,, Aœ) that satisfies the 
conditions 


Ag = Ay A;= A;A,, = AZ, =9, AjA, = —l, 


A;|A;=9; Lj=1,2,...,n, 
where (g;;) is a positive definite matrix. We see 
that Ao and A,, are points on S” and each 4A; is 
a real hypersphere passing through these two 
points. Every hypersphere X of S” can be 
written as a linear combination of the 4;: X = 
UgAg+u,A,+...+uU,A,+U,,A,,. That is, X 
is represented by tprojective coordinates (u,) 
with respect to the above frame. We call these 
homogeneous coordinates (u,) (n + 2)- 
hyperspherical coordinates of the hypersphere 
X. Mf we use these coordinates, the inner prod- 
uct of two hyperspheres X =(u,) and Y=(v,) is 
given by X Y= DL? <1 giv — (Ug Uo + Uw Yo). 
The Mobius transformation group M(n) is a 
topological group with two tconnected com- 
ponents. If we denote by M)(n) the maximal 
connected subgroup of M(n) and by H the 
subgroup of M(n) that leaves invariant a real 
hypersphere of S”, then the set E of all real 
hyperspheres of S” can be identified with the 
thomogeneous space M,(n)/H. The group H 
also consists of two connected components. 
If we denote by H, the maximal connected 
subgroup of H, the homogeneous space E = 
M,(n)/ Ho is a two-fold tcovering space of E. 
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For each real hypersphere Ac E, an element 
AcE over A is called an oriented real hyper- 
sphere. The Möbius transformation group con- 
tains as its subgroups ones that are isomor- 
phic to the group of fcongruent transforma- 
tions of a Euclidean space and ones that are 
isomorphic to the group of congruent trans- 
formations of a non-Euclidean space. That is, 
the subgroup of M (n) that leaves a point 
hypersphere invariant is isomorphic to the 
group generated by congruent transformations 
and thomotheties of the Euclidean space £”. 
The subgroup of tindex 2 (the factor group by 
a tcyclic subgroup of torder 2) of the subgroup 
of M(n) that leaves invariant a real (imaginary) 
hypersphere is isomorphic to the group of 
congruent transformation of n-dimensional] 
hyperbolic (elliptic) fnon-Euclidean space. 

In the n-dimensional Euclidean space E”, 
consider a hypersphere of radius r with center 
O. For each point P of E”, mark a point Q on 
the tray OP such that OP - OQ =r’. We call 
the point transformation that sends P to Q an 
inversion with respect to the hypersphere. A 
tsymmetry with respect to a hyperplane, con- 
sidered as an extreme case of inversions, is 
also called an inversion. We adjoin a point at 
infinity to the space E” to construct an n- 
dimensional sphere S”. Each inversion can be 
extended to a transformation of S", which we 
also call an inversion. Then each Mobius 
transformation ts generated by a finite number 
of inversions. By a Möbius transformation 
of S", each hypersphere is transformed to a 
hypersphere. Any angle between two curves 
that intersect at a point of E” is invariant 
under Möbius transformations. Conversely, 
if n> 3, each local transformation of E” that 
leaves invariant the angle of each pair of inter- 
secting curves is a trestriction of a Möbius 
transformation. However, for n=2 this is not 
true in general; any transformation that leaves 
angles invariant is called a tconformal map- 
ping. Any Mobius transformation z—>w on 
the tcomplex sphere S* = CU {00} can be ex- 
pressed by an equation of the form w=(az + f)/ 
(yz +0) or w=(a7 + B)/(yZ + ô), where «, B, y, 
and ô are complex numbers such that «ô — py 4 
0 and z denotes the tcomplex conjugate of z. 


B. Laguerre Geometry 


Let T be an oriented smooth curve in a Euclid- 
ean plane E°. The tangent line at a point p 

of T is supplied with an orientation that is 
induced by the orientation of the curve F in 
an obvious manner. The oriented line / thus 
obtained is called the oriented tangent line 

of I at p. 
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Let S be the set of oriented lines in E? and T 
be a given set of oriented smooth curves in E?. 
Consider a bijection y of the direct product S 
x T to itself satisfying the following condition: 
If / is an oriented tangent line of a curve I" 
belonging to T and y sends (l, T) to (F, T^, then 
ľ is an oriented tangent of the curve I’. The 
set of such bijections forms a group G. Sup- 
pose that we have leS for which there exist 
two curves I’, and T, in T such that lis a 
common tangent line of T} and T, at p, and 
p2, respectively. Further suppose that the 
element yeG sends (/, T;) to (I, I) (i= 1, 2). 
Then the element y of G is called an equilong 
transformation if the following conditions (i) 
and (ii) are satisfied: (i) I’ is tangent to Tj at 
points p; (i= 1, 2), (it) the distance between p, 
and p, is equal to the distance between p} and 
p2. The set of equilong transformations forms 
a subgroup H of G. In particular, if T is the set 
of oriented circles (including point circles), the 
elements of H are called Laguerre transfor- 
mations. In an obvious manner, we can divide 
the set of oriented circles into two classes, 
those with “positive” and those with “nega- 
tive” orientations. With an oriented circle F 
we associate the pair p(I)=(P,r), where P is 
the origin of the circle and r is a real number 
whose absolute value is equal to the radius of 
the circle and whose signature coincides with 
that of the orientation of T. The mapping 
go: TE? x E! thus defined is called Lie’s 
minimal projection. An example of a Laguerre 
transformation, called a dilatation, is given by 
a bijection y of S x T to itself satisfying the 
following condition: Let y(l, r)= (l, T^); then l 
is parallel to }, the distance between / and I’ is 
a given number, and g(I’)=(P,r+c), where 
o(T)=(P,r) and c is a given constant. We note 
here that the action of a Laguerre transfor- 
mation y:(1,)(l, T’) is determined by its 
action on S (y acts on S by y(/)=!’). Another 
example of a Laguerre transformation y, called 
a Laguerre inversion, is determined by means 
of a given oriented circle O and a line p that is 
not tangent to O; given an oriented line l, its 
image !’ under the action of y is determined 
as follows (here we describe the case where | 
is not parallel to p). There exists a uniquely 
determined oriented tangent line g of the circle 
O parallel to l. Now we have a uniquely deter- 
mined oriented tangent line g’ of O that passes 
through the point of intersection of the lines g 
and p such that g'#g. The image l is the line 
parallel to g’ and passing through the point of 
intersection of the lines / and p (Fig. 1). Each 
Laguerre transformation can be written as a 
product of a finite number of Laguerre inver- 
sions. We denote the group of Laguerre trans- 
formations by L. The pair (L, S) is, by defini- 
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tion, a model of Laguerre geometry. Notions 
such as Laguerre inversions, dilatations, and 
transformations can be generalized to cases of 
higher dimension by utilizing oriented hyper- 
spheres and oriented hyperplanes. 


Fig. 1 


C. Sphere Geometry 


Let S be the set of oriented circles (including 
point circles and oriented lines) in the Eucli- 
dean plane E?. Two oriented circles C, and 

C, are said to be in contact with each other 

if they have a point p and an oriented tan- 
gent line passing through p in common. (An 
oriented circle C and an oriented line / are in 
contact with each other if and only if l is an 
oriented tangent line of C.) In this case, we call 
the pair (C,,C,) a contact pair. A bijection y of 
S to itself is called a Lie transformation if it 
sends any contact pair to another. (The Lie 
transformation is a special case of the tcontact 
transformations.) An inversion with respect to 
a circle determines in an obvious manner a Lie 
transformation, which is also called an inver- 
sion. We denote the group of Lie transforma- 
tions by G. Any element y of G can be written 
as the product of a finite number of inversions 
and Laguerre inversions, and G contains the 
group of Möbius transformations and the 
group of Laguerre transformations as sub- 
groups. The pair (G, S) is called a model of 
circle geometry. The notion of circle geometry 
can be generalized to that of hypersphere geom- 
etry for the case of higher dimensions. Speci- 
fically, when we replace E? by E? and circles 
by spheres, we have sphere geometry. 

Let V be a complex 3-dimensional space, 
and let M, N be the sets of oriented lines and 
oriented spheres in V, respectively. Then M, N 
have the structure of 4-dimensional complex 
manifolds that are homeomorphic to each 
other. The homeomorphism is given by the 
Lie line-sphere transformation that induces a 
bijection from the set of pairs of intersecting 
oriented lines onto the set of pairs of ortented 
spheres that are in contact with each other. 
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D. Group-Theoretic Considerations 


Here we discuss the preceding three kinds of 
geometries from the group-theoretic point of 
view (— 137 Erlangen Program). Let us con- 
sider the quadratic form Q defined by Q(x)= 
—xp+xj+x3+x3—xj ina real projective 
space P*, where x=(xo, X1, X2, X3, X4) are 
homogeneous coordinates. We denote by G 
the set of all projective transformations of P* 
that leave Q invariant. The group G consists of 
the set of matrices A of order 5 such that det A 
= 1 and Q(Ax)= Q(x) holds for all x in P*; we 
denote by L? the set of all points x in P* that 
satisfy O(x)=0. Then G acts transitively on L?. 
Hence if we denote by H, the set of all ele- 
ments of G that fix a point a in L°, for example 
a=(—1,1,0,0,0), we may assume that L? = 
G/H,. The circle geometry that belongs to 

the group of Lie transformations of circles is 
exactly the geometry of the homogeneous 
space G/H,. The group G, of all transforma- 
tions of G that leave the hyperplane x,=0 
invariant acts transitively on L°N{x,=0} (G, 
is isomorphic to M(2)). The geometry of the 
homogeneous space G,/H,1 G is plane con- 
formal geometry. Next, the group G, of all 
transformations of G that leave invariant Xo 
+x,=0 coincides with H, and acts transitively 
on L3N {xo +x; =0}. The geometry of the 
homogeneous space L? N {xo +x; =0} (on 
which G, acts transitively) is plane Laguerre 
geometry. In this sense, the circle geometry 
that belongs to the group of Lie transforma- 
tions contains the other circle geometries as 
subgeometries. 

We now describe how plane Laguerre geom- 
etry can be realized as the geometry of the 
space L? N {xo +x; =0}. Let E? be a plane in a 
Euclidean space E?. We fix a Cartesian co- 
ordinate system (yo, y,, y2) in E> so that E? is 
given by y)=0. To each point y of E? we can 
associate an oriented circle in E? with center 
(0, 1, Y2), radius | yo], and positive (negative) 
orientation if yo is positive (negative). If y lies 
on E?, the corresponding circle is the point 
circle y itself. Now let us consider the group G, 
of all affine transformations of E? whose rota- 
tion parts leave the quadratic form Q’(y)= 
— yk + y? + y} invariant. (G; is an isometry 
with respect to the tmetric defined by Q’.) Each 
element of G; induces a transformation of the 
set of oriented circles in E? (including the 
oriented lines and point circles) onto itself. 

The mappings of E> into L? defined by x» = 

(1+ Q'(y))/2, xı =(1—Q'(y))/2, X2 =V1, X3= Va, 
X4 = Yo is a one-to-one correspondence of E’? 
onto the subset of L? such that x9 +x, #0. 
This correspondence induces an isomorphism 
of G; onto G,. In Laguerre geometry, there are 
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no essential distinctions between points and 

oriented circles on E?; and the group G, acts 
on a 3-dimensional space of oriented circles 

(including point circles). 


Reference 
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A. General Remarks 


Let a function w= f(z) that maps a domain D 
on the tcomplex z-sphere homeomorphically 
onto a domain A on the complex w-sphere 
satisfy the following two conditions: (1) Every 
curve C,:z(t)(O0<t< 1) that starts at any point 
Zo in D and possesses a tangent there has an 
image curve C,,:w=w/(t) = f(z(t)) (0<t <1) that 
also possesses a tangent at the image point wo 
= f (Zo). (2) The angle between any two curves 
C!” and C possessing tangents at zg is equal 
to the angle between their image curves CQ 
and CP, where the direction of the angle is 
also taken into account. Then the mapping 
from D onto A is said to be conformal, and A 
is called conformally equivalent to D. It has 
been proved that w= f(z) is necessarily a func- 
tion tanalytic in D (D. Men’shov, 1931). 

Consequently, the theory of conformal 
mapping is a branch of the theory of analytic 
functions. That a function w= f(z) maps a 
domain D conformally onto a domain A 
means that it is a tmeromorphic function 
tunivalent in D and its range is A. Then f'(z)# 
0 holds at every (finite) point in D, and the 
ratio of the lengths of the segments between 
two points wọ = w(0), w(t) on C,, and between 
two points zo = z(0), z(t) on C, tends to a fixed 
nonvanishing limit | f'(z))| as t—>0 indepen- 
dently of the choice of C,. Hence if z; and z, 
lie on Co” and C$, respectively, and w; and 
w, are their image points lying on the image 
curves C{P and C), respectively, then the 
two triangles Az,z )z, and Aw, wow, are nearly 
similar in the positive sense, provided that z, 
and z, are near enough to Zo. This justifies 
the word “conformal,” which means “of the 
same form” (— Appendix A, Table 13). 


B. Conformal Mapping onto the Unit Disk 


A fundamental theorem in the theory of con- 
formal mapping is Riemann’s mapping theo- 
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rem, which states that any tsimply connected 
domain D with at least two boundary points 
can be mapped conformally onto the interior 
A of the unit circle. This theorem is equivalent 
to the assertion of the existence of tGreen’s 
function of D and can be proved in various 
ways. B. Riemann (1851) gave a proof, based 
on an idea of C. F. Gauss, by assuming the 
existence of a solution for a variational prob- 
lem minimizing the tDirichlet integral. The 
logical incompleteness implied by this as- 
sumption was later removed by D. Hilbert 
and others. The proof that is now regarded 
as simplest is due to L. Fejér and F. Riesz’s 
method (T. Rado, 1922, 1923), which applies 
‘normal family theory. On the other hand, 
the osculating process due to P. Koebe (1912) 
is a purely constructive method of proving 
existence that is also applicable to the case 

of tmultiply connected domains. The map- 
ping function w= f(z) in Riemann’s mapping 
theorem is uniquely determined under the 
normalization condition f(z 9)=0, arg f'(zo) = 
at a point Zo in D, where 6, is a given angle. 

The other types of simply connected do- 
mains are the Riemann sphere less one point 
and the sphere itself. In both cases every con- 
formal mapping of these domains is a linear 
transformation. A simply connected domain 
is called hyperbolic, parabolic, or elliptic if 
it is conformally equivalent to the unit disk, 
the complex plane, or the Riemann sphere, 
respectively. 

Let w= f(z) map a simply connected domain 
D on the z-sphere conformally onto a simply 
connected domain A on the w-sphere. If a 
sequence {z,} in D tends to a boundary point 
¢ of D, then the corresponding sequence {w,} 
(w, = f(z,)) has no taccumulation point in A 
and does not necessarily tend to a boundary 
point of A. 

If for any sequence {z,} tending to ¢, { f(z,)} 
tends to a unique point w on the boundary of 
A, it is said that f(z) possesses a boundary value 
œ at ¢. To investigate the behavior of {w,' on 
a hyperbolic domain D, we can assume, by 
using a suitable mapping if necessary, that D is 
bounded. Then the problem is reduced, in view 
of Riemann’s mapping theorem, to the case 
where D is a bounded simply connected 
domain and A is the unit disk |w| <1. 


C. Correspondence between Boundaries 


Concerning the correspondence between 
boundaries under the conformal mapping w= 


J(z) of a bounded simply connected domain 


D onto the unit disk |w|< 1, we have the fol- 
lowing three theorems: 
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(1) To any taccessible boundary point zc of 
D there corresponds a unique point on |w| = 1, 
and to any distinct accessible boundary points 
Zc, and Zc, of D there correspond distinct 
points on the unit circumference. Furthermore, 
the set of all points on |w|=1 that correspond 
to accessible boundary points of D has tangu- 
lar measure equal to 27. 

(2) There is a one-to-one correspondence 
between tboundary elements of D and points 
on |w|=1 (C. Carathéodory). 

(3) Let w= f(z) map the interior D of a fJor- 
dan curve C conformally onto the unit disk 
A:|w|<1. Then it possesses a boundary value, 
say {(¢), at every point ¢ on C that satisfies 
| {(Q)|=1. Hence f(z) is continuous on the 
closed domain D= DUC and maps D bijec- 
tively onto the closed disk A:|w| < 1. Similarly, 
the inverse function z= (w) has an analogous 
property and maps A bijectively and continu- 
ously onto D; that is, a conformal mapping of 
the interior D of a Jordan curve onto the unit 
disk A can be extended into a homeomor- 
phism of the closure D to A (Carathéodory). 

In this case, if the Jordan curve C contains 
a tregular analytic arc I’, then the mapping 
function w= f(z) can be prolonged analytically 
beyond F (except at the endpoints of I). Hence 
the mapping w= f(z) is conformal at interior 
points of T. 

Problems on the correspondence of angles 
at the boundary are closely related to tangular 
derivatives. These problems have been at- 
tacked by Carathéodory, S. Warschawski, J. 
Wolff, and others. 


D. Schwarz-Christoffel Transformation 


The problem of determining the form of an 
analytic function that maps the interior of a 
circle or a half-plane conformally onto the 
interior of a polygon was first dealt with by 
H. A. Schwarz and E. B. Christoffel (1869) 
(— Appendix A, Table 13). 

Let P be a polygon in the complex w-plane 
with vertices b, (= 1,...,m) and interior 
angles a, at b,. Then a function w= f(z) that 
maps a circular disk or a half-plane in the z- 
plane conformally onto the interior of P is 
given by 


f(z= cf (I (a, — 2}! dz+ C. 
wal 


Here b, = f(a,), and C, C’ are constants de- 
pending on the position and magnitude of the 
polygon P. If instead of a disk we consider a 
half-plane in the z-plane, and if one of the a, is 
the point at infinity, then, modifying this for- 
mula by deleting the factor of the integrand 
corresponding to a, = 00, we obtain a formula 
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for w= f(z). This representation is called the 
Schwarz-Christoffel transformation formula. 
This formula was originally derived by Chris- 
toffel to solve a problem of 2-dimensional 
distribution of stationary temperature. It 
then found extensive application to several 
problems of conformal mapping concerning 
polygonal domains and to problems of deter- 
mining force lines or stream lines and equi- 
potential lines in 2-dimensional electrostatics 
or hydromechanics. 

An analogous formula is also derived for a 
function that maps the interior of a circle or a 
half-plane conformally onto the exterior of a 
polygon. In connection with these formulas, a 
third-order differential equation that is satis- 
fied by a function mapping the interior of a 
circle onto a domain bounded by a circular 
polygon is found useful in the theory of auto- 
morphic functions. Moreover, a representation 
analogous to the Schwarz-Christoffel trans- 
formation formula is obtained for a function 
that maps the interior of a circle onto a curvi- 
linear polygonal domain bounded by arcs of 
logarithmic spirals with the origin as asymp- 
totic point. 


E. Conformal Mapping of Multiply Connected 
Domains 


It is also important to consider problems 
concerning one-to-one conformal mapping of | 
a multiply connected domain on the z-sphere 
onto a suitable multiply connected domain D 
on the w-sphere. The two domains D and D 
are then homeomorphic, but the converse ts 
not true; i.e., there does not necessarily exist a 
one-to-one conformal mapping between D and 
D even when they are homeomorphic. Now let 
D and D be multiply connected domains on 
the z- and w-planes, respectively, both possess- 
ing at least three boundary points. The tuni- 
versal covering surfaces D and D of D and D 
are hyperbolic and they can be mapped onto 
the unit disks. The groups of their ‘covering 
transformations form the fFuchsian groups G 
and ©. Then in order for D to be mapped 
one-to-one and conformally onto Ð, it is neces- 
sary and sufficient that the group © be trans- 
formed into the group G by a suitable linear 
transformation. 

To a domain of finite tconnectivity having 
only continua for its boundary components, 
we can associate conformal invariants (namely, 
moduli) expressed by one real parameter in the 
doubly connected case and by 3n—6 real 
parameters in the n(>2)-connected case. A 
one-to-one conformal mapping is possible 
only within a class of domains having the 
same invariants (— 416 Teichmüller Spaces). 
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While a circular disk is taken as a canonical 
domain in the simply connected case, an tan- 
nulus is often taken as a canonical domain in 
the doubly connected case. In the latter case, 
the logarithm of the ratio (> 1) of the radii of 
two concentric boundary circles is usually 
called the modulus. There are various types of 
n( > 2)-connected canonical domains, for in- 
stance, the whole plane, a circular disk or 
annulus slit along concentric circular arcs or 
radial segments, a parallel slit plane, etc. The 
possibility of a one-to-one conformal mapping 
of a given domain onto a canonical domain of 
such a type was proved by Hilbert, Koebe, and 
others in a tpotential-theoretic way and by E. 
Rengel, R. de Possel, H. Grunsky, and others 
in a purely function-theoretic way. Such ca- 
nonical domains are characterized by some 
extremal properties. For example, the horizon- 
tal parallel slit mapping function po(Z, Zo) = 
(z—Zo) | +a[pol(Z—Zo) +... ZED, has 
the extremal property that it is the unique 
function maximizing Rea[ f] within the 
family of univalent functions f(z)=(z— Zo) ! + 
a[f](z— Zo) +...in D. For a general domain 
D, Po(Z, Zo), called the extremal horizontal slit 
mapping, is defined as the limit function of the 
sequence of horizontal slit mappings p%$” (z, Zo) 
of D,, Zo€ Dp. Here {D,}., is a canonical ex- 
haustion of D. It has the same extremal prop- 
erty as in the case of finite connectivity. The 
boundary components of the image domain of 
D under p,(z, Zo) consist of a horizontal slit 
and a point. The parallel slit mapping p,(z, Zo) 
in the direction of 0 is defined similarly to the 
one that maximizes Ree 7"a[ f] [8]. 

For a domain of infinite connectivity, by 
accumulation of boundary components, a 
pointlike boundary component can be mapped 
onto a continuum. A boundary component y 
of a domain D is called weak if its image f(y) is 
a point under every conformal mapping f on 
D. y is called strong if f(y) always consists of 
more than one point. A boundary component 
y that is neither weak nor strong is called 
unstable (L. Sario, J. Analyse Math., 5 (1956)). 
Criteria of weakness, etc., by means of ex- 
tremal length are given in [8]. Moreover, in 
terms of extremal length, a generalization of a 
boundary element is given by Oikawa and 
Suita [9]. On the other hand, the existence 
of one-to-one and conformal mapping of a 
domain bounded by a finite number of curves 
onto the whole plane with mutually disjoint 
circular disks removed was proved by Koebe 
and later derived by J. Douglas and R. Cou- 
rant as a particular case of the existence of a 
solution of tPlateau’s problem [10]. M. Schif- 
fer showed that the mapping function is a 
solution of an extremal problem involving 
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Fredholm eigenvalues [11]. L. Bieberbach and 
H. Grunsky showed the possibility of mapping 
an n-connected domain onto an n-sheeted 
disk. Concerning doubly connected domains, 
detailed investigations were made by O. Teich- 
müller, Y. Komatu, and others. For a domain 
D bounded by n analytic Jordan curves, as 

an extension of the Schwarz lemma, Ahlfors 
showed that there exists a unique extremal 
function folz) maximizing Re f'(z), Zo€ D, 
within the family of analytic functions f(z) 
satisfying | f(z)|<1in D and that f(z) maps D 
onto the n-sheeted unit disk (1947). fo(z) is 
called the Ahlfors function at z,. For a general 
domain the uniqueness of the extremal func- 
tion in nontrivial cases was established first by 
S. Havinson (1961) and later by L. Carleson 
(1967) and S. Fisher (1969). 

Most fkernel functions of a plane domain D 
have connections with conformal mappings. 
For example, the Bergman kernel U(z, ¢) of 
exact differentials is equal to (2z)~'(p5(z, C) 

— P2(Z, ¢)). The adjoint kernel V(z, ¢) is de- 
fined by (22)! (po(z, E) + pi j2(Z, ¢)). M. Schiffer 
showed that the integral of V(z, %), S(z, = 

fz V(z, ¢)dz, is univalent and maps D onto a 
domain bounded by n analytic convex curves 
if D is bounded by n Jordan curves [13]. 

For a general domain D, this is true in the 
sense that each boundary component of the 
image domain under S(z, £) is a convex set (K. 
Oikawa and N. Suita, Kédai Math. Sem. Rep., 
16 (1964)). It is known that S(z, č“) maximizes 
the area of the complementary set of the image 
domain under f(z) within the family of univa- 
lent functions f(z)=(z(z—Q)7'+6,(z—O+.... 
The maximum value multiplied by 47 is called 
the span of the domain D; it is equal to a[po] 
—a[ P2]. For the *Szeg6 kernel function 
k(z,¢) of a domain D bounded by n analytic 
curves, the adjoint kernel /(z, ¢) is defined, and 
the Ahlfors function f(z) at ¢ is expressed by 
k(z, €)/l(z, 2) (P. Garabedian, Trans. Amer. 
Math. Soc., 67 (1949)). 


F. Universal Constants 


Among various universal constants appearing 
in the theory of conformal mapping, Bloch’s 
constant ts especially famous. A. Bloch (1924) 
showed that a covering surface over the w- 
plane obtained from a mapping w= F(z)= 
z+... that is one-to-one, conformal, and 
holomorphic in |z| <1 always contains a tuni- 
valent (schlicht) disk whose radius B is a posi- 
tive number independent of the function F 
(Bloch’s theorem). The supremum 8% of such 
constants B is called Bloch’s constant. The true 
value of 8 is yet unknown, but estimates have 
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been given by Ahlfors (Trans. Amer. Math. 
Soc., 43 (1938)) and by Ahlfors and Grunsky 
(Math. Z., 42 (1937)) in the form 


V essi parC (TEN 


r(1/4\ T/12) 
=0.4719.... 


M. Heins showed that the equality sign can be 
deleted from the left-hand side (Nagoya Math. 
J., 21 (1962)) and C. Pommerenke gave a 
simpler proof for the same fact (J. London 
Math. Soc., 2 (1970)). It is conjectured that the 
correct value of $ is equal to the upper bound 
of Ahlfors and Grunsky. Landau’s constant 2 
corresponding to the case where the image 
disks are not necessarily univalent satisfies 
0.5<2<0.55. The lower and upper bounds 
are due to Ahlfors (cited above) and Landau 
(Math. Z., 30 (1929)). Pommerenke showed 
0.5 < £ (1970, cited above). For the family of 
univalent functions these constants coincide, 
and the value is called the schlicht Bloch 
constant, denoted by A. Its lower and upper 
bounds are known to be 0.5705 (J. Jenkins, J. 
Math. Mech., 10 (1961)) and 0.6565 (R. Robin- 
son, Bull. Amer. Math. Soc., 41 (1935)). (For 
distortion theorems and coefficient problems 
— 438 Univalent and Multivalent Functions.) 
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A. General Remarks 


Suppose that we are given two straight lines / 
and m intersecting at V (but not orthogonally) 
in the 3-dimensional Euclidean space E°. By 
rotating the line m around l, we obtain a sur- 
face %. We call this surface Ẹ a circular cone 
with vertex V and axis l; a straight line on the 
surface passing through V is called a generat- 
ing line of &. 

A section C of & by a plane z not passing 
through V (C=21 %) is called a conic section 
(or simply a conic). This C is a plane curve on 
the plane z. The point set Ẹ— V consists of 
two fconnected components %, and %,. Let 7; 
(i= 1,2, 3) be planes not passing through V. If 
the conic section C; = 74 N & is tbounded, then © 
C, is contained either in %, or in %, and is 
teonnected. We call such a C, an ellipse. When 
C =m, N ğ is not bounded but is connected, 
then z, is parallel to one of the generating 
lines of %, and C, is contained either in §, or 
in 2. We call such a C, a parabola. When z, 
intersects both of %, and 2, then C3 =r, N F 
has two connected components and is not 
bounded. We call such a C, a hyperbola. These 
three types exhaust all possible types of conic 
sections. In particular, if the plane z is per- 
pendicular to the axis /, then C=211% be- 
comes a circle. Thus a circle is a special kind 
of ellipse. 


B. Foci and Directrices 


Let C=21%, be an ellipse. The Euclidean 
space E’ is divided by zx into two thalf-spaces 
E}, E} (two “sides” of x). If we put %, N E? 
=61.. 6,9 E3 =8 2, we can construct a 
sphere S that is contained in E}, tangent to 
#1, along a circle K, and tangent to z ata 
point F. Similarly, we can construct a sphere 
S’ that is in E3, tangent to %,, along a circle 
K’, and tangent to z at a point F’. We call F, 
F' the foci of the ellipse (Fig. 1). 
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Let K, x’ be the planes containing K, K’. 
Straight lines d=xk Nn, d' =x Nr are called 
directrices of C. Unless C is a circle, we have 
F#F', and x, x (and x’, z) actually intersect; 
hence d, d’ exist. When C=21% is a parabola 
or a hyperbola (Figs. 2 and 3), we can similarly 
define foct (a parabola has only one focus, F, 
while a hyperbola has two foci, F, F’) and 
directrices (a parabola has only one directrix, 
d, while a hyperbola has two directrices, d, d’). 


Fig. 3 


Let X be a point on the plane z, let D(X) 
be the distance between the point X and a 
focus F, and let D,(X) be the distance between 
X and a directrix d. Then the curve C is the 
locus of the points X satisfying the condition 
D,(X)=e- D,(X), where e is a constant. We call 
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e the eccentricity of the conic section C. Ac- 
cording as C is an ellipse, a parabola, or a 
hyperbola, we have e<1!,e=1,ore>1.A 
circle is an ellipse whose eccentricity ts zero. 
An ellipse is also characterized as the locus of 
X such that FX + F'X =2a; a hyperbola is the 
locus of X such that |FX — F'X |= 2a, where a 
is a positive constant. When there are two foci, 
the straight line FF’ is perpendicular to direc- 
trices d and d’. 


C. Canonical Forms of Equations 


When C is a hyperbola or an ellipse that is not 
a circle, C has two foci, F and F’. In this case, 
the midpoint O of the segment FF’ ts the cen- 
ter of symmetry of C (when C is a circle, its 
center O is, of course, the center of symmetry 
of C). We call O the center of C; an ellipse or 
a hyperbola ts called a central conic. If we 
choose a rectangular coordinate system (x, y) 
having O as the origin and FF’ as x-axis, then 
the equation of C can be expressed in the 
form 


x?/a*+y7/b?=1, a,b>0. (1) 


According as C is an ellipse or a hyperbola, 
we take the + or — of the double sign. If 

C is an ellipse, we have a >b. Furthermore, 
e=,/a*—b?/a if C is an ellipse and e = 

</a? +b?/a if C is a hyperbola. We also have 
F =(ae,0) and F’ =(—ae,0); the equations of 
directrices are x = +a/e. 

On the other hand, if C is a parabola, the 
straight line that is perpendicular to the direc- 
trix d and passes through F becomes the axis 
of symmetry of C. We call this straight line the 
axis of C; the intersection O of the axis and C 
is called the vertex of C. If we choose a rectan- 
gular coordinate system (x, y) having O as the 
origin and having the axis of C as the x-axis, 
the equation of C can be expressed in the form 


y*=4ax, a>0. (2) 


We call (1) and (2) the canonical (or stan- 
dard) forms of the equation of C. We call the 
associated coordinate system the canonical 
coordinate system. Suppose that C is an ellipse 
(hence a> b). Let A, A’ be points of intersec- 
tion of the x-axis and the ellipse and B, B’ be 
the points of intersection of the y-axis and the 
ellipse. We call AA’ the major axis of C and 
BB’ the minor axis of C. If C is a hyperbola 
and (x, y) is the canonical coordinate system, 
we call the x-axis the transverse axis and the y- 
axis the conjugate axis. If C is a central conic, 
the x- and y-axes of the canonical coordinate 
system are called the principal axes; if C is a 
parabola, the x-axis is sometimes called the 
principal axis. 
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D. Properties of Ellipses 


An ellipse may be considered the image of a 
circle under a tparallel projection. Conse- 
quently, the section of a circular cylinder by a 
plane is an ellipse. Also, if we are given a circle 
C and a fixed diameter D of C, an ellipse is 
obtained as the locus of the points X lying on 
lines PM, which are perpendicular to D, with 
PeC, M eD, satisfying the condition that the 
ratio PM: XM is constant. 

Suppose that we are given two concentric 
circles having the center at the origin O and 
with radii a, b. Let P, Q be points of intersec- 
tion of moving half-lines through O and the 
two circles. Then the locus of points X of 
intersection of the ordinates (lines parallel to 
the y-axis) passing through P and the ab- 
scissae (lines parallel to the x-axis) passing 
through Q is an ellipse (Fig. 4). Suppose that 
the equation of an ellipse is given by x?/a? + 
y*/b? =1, with a >b. Then the lengths of its 
major axis and minor axis are 2a and 2b, 
respectively. 


Fig. 4 


Given an ellipse and its center O, the circle 
with center O and diameter equal to the major 
axis of the ellipse is called the auxiliary circle 
of the ellipse. Given an ellipse C and its focus 
F, the auxiliary circle of C is the locus of the 
points X satisfying the condition that the line 
FX is perpendicular to a tangent line to C 
passing through X. Suppose that X is a point 
on an ellipse with foci F, F’. Let TT’ be the 
line tangent to the ellipse at X (X lies between 
T and T’). Then the angle 2 TXF’ is equal to 
L T'XF (Fig. 5). Consequently, the rays start- 
ing from one focus of an ellipse and “reflected” 
by the ellipse converge on the other focus of 
the ellipse. Also, the product of the distances 
from two foci of an ellipse to an arbitrary tan- 
gent is constant and is equal to b>. 
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The ellipse C:x?/a? + y*/b? =1 is expressed 
parametrically in the form 


x=acos@, y=bsiné. (3) 


We call the parameter 0 the eccentric angle of 
a point (x, y) on C. Consequently, C is a tJor- 
dan curve and divides the plane into two parts, 
the inside and the outside. The inside is the set 
of points (x, y) satisfying x?/a? + y*/b? <1, and 
the outside is the set of points (x, y) satisfying 
x?/a? + y?/b? > 1. The inside is a tconvex set. 
From a point Q outside C, two tangents to C 
can be drawn. The locus of points Q such that 
these two tangents are orthogonal is the circle 
x? + y? =a? +b?. We call this circle the direct 
circle. The area of the “sector” OAX formed by 
two points A(a, 0), X (acos 9, b sin 0) (@>0) and 
the origin O is ab0/2 =(ab/2) Arccos(x/a); the 
length of the arc AX of the ellipse is repre- 
sented by the value of the felliptic integral 


8 
af ./1~—e* cos? 0 d0 =aE(n/2—0, e). 
o 


In particular, the area inside an ellipse is equal 
to zab, and the whole length of the ellipse is 
4aE(0, e). 

With respect to a polar coordinate system 
(r, 0) having the focus F(ae, 0) as the origin and 
the ray directed positively along the x-axis as 
the initial line, the equation of the ellipse C is 


l b? 
l=— 


r= —— . (4) 
1+/cos@ a 


Here l is equal to half of the length of the 
chord that is perpendicular to the major axis 
and passes through the focus. (This chord is 
called the latus rectum of the ellipse.) Suppose 
that F is a fixed point and that X is a moving 
particle attracted toward F by a tcentral force 
inversely proportional to the square of the 
length of FX. Suppose further that X begins 
with an initial velocity whose direction is 
tangent to the ellipse C with focus F. Then X 
always moves on C, and the areal velocity 
described by the radius FX is constant 
(Kepler’s second law). 


E. Properties of Hyperbolas 


Two straight lines x*/a? — y?/b? =0, that is, 
y/x = +b/a, are tasymptotes of the hyperbola 
C:x*/a? — y?/b* =1. The hyperbola C': x?/a? — 
y?/b? = —1 is called the conjugate hyperbola 

of C. When a=b, the asymptotes are ortho- 
gonal to each other, and C, C’ are congruent. 
In this case, we call C a rectangular hyperbola 
(or equilateral hyperbola), When we draw 
parallels to asymptotes from a point X on C, 
the area of the parallelogram formed by these 
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lines and two asymptotes is constant. (In par- 
ticular, when C is a rectangular hyperbola, the 
equation of C becomes xy = k?/2 if we take 
two asymptotes as coordinate axes. The seg- 
ment cut off by the asymptotes on the tangent 
to C at X is divided equally at X. In the case 
of the hyperbola as well, the product of the 
distances from two foci to an arbitrary tangent 
is constant and is equal to b?, and the angle 
between two straight lines joining two foci to a 
point X on C is divided equally by the tangent 
at X. 

A hyperbola C is represented parametrically 
by 


x=asecO, y=btané. (3’) 


In this case also, we call O the eccentric angle 
of (x, y). If we use the thyperbolic functions 
and the parameter u, the equation of a hyper- 
bola can be written as 


x=acoshu, y=bsinhu (3”) 


instead of (3’). The area of a “sector” OAX 
formed by two points A(a, 0), X(x, y) on the 
hyperbola and the origin O is, in this case, 


abu ab ab) x+./x*-a? 


x 
—-=—A h-=—1 
5 g recos- = log : 


The length of the arc AX of the hyperbola is 
given by the elliptic integral 


x e? x? — a? 
dy, 
o X Sad 


With respect to a polar coordinate system 
(r, o) having the focus F (ae, 0) as origin and the 
ray directed positively along the x-axis as the 
initial line, the equation of the hyperbola C 
becomes 


l=—, (4) 


where / is equal to half the length of the chord 
passing through the focus and perpendicular 
to the principal axis. (This chord, too, is called 
the latus rectum.) 


F. Properties of Parabolas 


The curve described by a particle attracted by 
“gravitation” in a fixed direction and affected 
by no other force is a parabola (G. Galilei). 
The tangent at a point X on a parabola makes 
equal angles with the straight line joining F 
and X and the direction of the principal axis 
(Fig. 6). Consequently, if the rays starting from 
the focus of a parabola are “reflected” by the 
parabola, they all become rays parallel to the 
principal axis. Let X’ be 
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Fig. 6 


the point of intersection of the tangent at 

X (Xo, Yo) and the x-axis, X” the point of inter- 
section of the normal at X and the x-axis, and 
Xo the foot of the perpendicular from X to the 
x-axis. Then FX = FX’, AF XX’ is an isosceles 
triangle, and XX’ is divided equally by the y- 
axis. Consequently, the locus of the foot of a 
perpendicular from F to a tangent is the y- 
axis. Also, the length of subtangent X’X)= 
2x», and the length of subnormal X"X)=2a= 
20F. Conversely, a curve whose length of 
subnormal is constant is a parabola. The locus 
of the midpoints of parallel chords of a para- 
bola is a straight line parallel to the principal 
axis. 

With respect to a polar coordinate system 
having the focus as origin and the ray directed 
positively along the x-axis as the initial line, 
the equation of a parabola is 


l 


r=———_.,__ l=2a. (4”) 
1—cos@ 


The area bounded by a chord BC and an arc 
BC ofa parabola (Fig. 7) is equal to 4/3 the 
area of AABC, where A is the point of contact 
on the tangent of the parabola parallel to BC 
(Archimedes). Also, the length of the arc OX of 
parabola (2) is 


+y +a? 
Yo ta taiog Ntt A 


4a 2a , 


X having the coordinates (xo, yo). 


Fig. 7 


G. Conjugate Diameters 


The diameter is a straight line passing through 
the center of a central conic. The locus of the 
midpoints of the chords parallel to a diameter 
d is another diameter d’, called conjugate to d. 


297 


Then the diameter conjugate to d’ is d. The x- 
axis and y-axis of a canonical coordinate sys- 
tem form a set of conjugate diameters. Let 

2a’, 2b’ be the lengths of the segments (some- 
times called conjugate diameters) cut off from 
d, d by the curve or by the curve and the con- 
jugate one for hyperbolas and by œ, the angle 
between d and d’. Then the following relations 
hold (as to the double signs +, we take + in 
the case of an ellipse and — in the case of a 
hyperbola): a’? +b’ =a? +b’, a'b' sin œ =ab. 
The product of the slopes of d, d’ is equal to 
+b?/a?. With respect to an oblique coordinate 
system having d and d’ as axes, the equation of 
the curve is x?/a’? + y?/b? =1. 


H. Confocal Conic Sections 


The set of ellipses and hyperbolas having two 
fixed points F, F' as foci is called the family of 
confocal central conics with foci F and F' (Fig. 
8). The family of confocal central conics con- 
taining the ellipse x*/a? + y’/b? =1 is repre- 
sented parametrically by 


x? y? 


ee ee 
eA bed 


There exist only one ellipse and only one 
hyperbola that pass through a point inside 
each quadrant (for example, a point (xo, Yo), Xo 
>0, vo > 0, inside the first quadrant) and be- 
long to the family of curves. The ellipses and 
hyperbolas of the same family cut each other 
orthogonally. Thus parameters corresponding 
to ellipses and hyperbolas belonging to a 
family of confocal central conics define an 
orthogonal curvilinear coordinate system, 
called an ‘elliptic coordinate system (— 90 
Coordinates). 


Fig. 8 


The set of parabolas having a fixed point F 
as focus and a straight line passing through F 
as axis is called a family of confocal parabolas 
(Fig. 9). The family of confocal parabolas 
containing y? =4ax is the set of curves 


y? =4(a+A)(xt+A). 


Such families also give rise to orthogonal 
curvilinear coordinate systems. 
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Fig. 9 


I. Curves of the Second Order 


With respect to a rectangular coordinate sys- 
tem, a curve represented by an equation with 
real coefficients of the second degree with two 
variables x, y, 


ax? + 2hxy + by? +29gx+2fy+c=0, (5) 


where (a, h, b) #(0, 0,0), is called a curve of the 
second order. A curve of the second order is 
either an empty set, one point, one or two 
straight lines, or a conic section. For equation 
(5), we put 


ah 


=y b 


ah g 
3 D=|h b f (6) 
gf c 


and call D the discriminant of the curve of the 
second order. If D, #0, D #0, and the curve is 
not an empty set, then the curve is a central 
conic. If Dy >0, then the curve is an ellipse or 
an empty set. If Dy <0, then the curve is a 
hyperbola. If D) =0, D #0, then the curve is a 
parabola. If D=0, Dy >0, then the curve con- 
sists of one point. If D=0, Do <0, then the 
curve is two intersecting straight lines. If D= 
D, =0, then the curve is an empty set, one 
straight line, or two parallel straight lines. 


J. Poles and Polars 


Let F(x, y)=ax? + 2hxy + by? + 2gx+2fyt+e 
=0 be the equation of a conic C and (xo, Yo) 
the coordinates of a point P on the plane. A 
straight line P* having the equation 


aXxoX +h(xoy +XxXYyo)+byoy + G(X + Xo) 
+ flyt+yo)+c=0 


is called the polar of P with respect to C (Fig. 
10). When the polar of a point P is J, we call P 
the pole of | and denote it by /*. In general, [* is 
uniquely determined by /, and P** = P, [** = 

l. If Qe P*, then PeQ*. If P’el, then [*eP’*. 
When a straight line passing through P inter- 
sects C at X, Y and intersects P* at P’, then P, 
P’ are tharmonic conjugate with respect to 

X, Y. In particular, if Pe P*, then PeC, and 
P* becomes the tangent of C at P. Givena 
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triangle APQR on the plane of C, we call 

the triangle with sides P*, Q*, R* the polar 
triangle of APQOR. Let Q* N R* =P’, R* N P* 
=Q, and P*(\Q* = R’. Then the three straight 
lines PU P’, QUQ’, RUR’ meet at a point (M. 
Chasles). When the polar triangle of APQR 
coincides with itself, then APOQR is called a 
self-polar triangle. The polar of a focus is a 
directrix. 


Fig. 10 


K. Curves of the Second Class 


When the coefficients u, v, w of a straight line 
ux +vy+w=0 satisfy an equation with real 
coefficients of the second order, 


Au? + 2Huv + Bv? + 2Guw4+ 2Fow + Cw? =0, 
(5) 

where (A, H, B)# (0, 0,0), the curve enveloped 
by these straight lines is called a curve of the 
second class. Let A be the discriminant defined 
analogously to D in (6) by using A, H, B,... 
instead of a, h, b,.... A curve of the second class 
(with A #0) is essentially the same as a curve of 
the second order with D #0. In order for the 
curve (5’) with A 0 to coincide with the curve 
(5) with D #0, it is necessary and sufficient that 
A, B, C, F, G, H be proportional to the tco- 
factors of a, b, c, f, g, h in the determinant D 
given by (6). If A =0, then (5’) represents either 
the empty set, or a point (regarded as the set of 
straight lines passing through the point), or 
two points. 

From a projective point of view, a curve 
of the second order is defined as a locus of 
the point of intersection IN l =X of corre- 
sponding lines l and I’ when two tpencils of 
lines A(i,m,...), A‘, m’,...) passing through 
two different centers A and A’ are in corre- 
spondence under a ‘projective mapping f (J. 
Steiner) (Fig. 11). From this it can be proved 
that three points of intersection of three pairs 
of opposite sides (AB, DE), (BC, EF), (CD, FA) 
of a hexagon inscribed in a curve of the second 
order are on the same straight line (Pascal’s 
theorem, Fig. 12). In particular, if the curve 
of the second order in this theorem consists 
of two straight lines, the theorem coincides 
with Pappus’s theorem (Fig. 13). We call this 
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straight line / the Pascal line of ABCDEF. 
Given a set of six points A, B, C, D, E, F ona 
curve of the second order, by considering all 
possible combinations of the points, we get 60 
Pascal lines. A configuration consisting of 
these 60 lines is called Pascal’s configuration, 
and has been studied by Steiner, Kirkman, 
and others. As a tdual to Pascal’s theorem, 
Brianchon’s theorem holds: Three diagonals of 
a hexagon with a curve of the second class 
inscribed meet at a point (Fig. 14). 


Fig. 13 


Fig. 14 
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A. General Remarks 


A ttopological space X is said to be connected 
if there are no proper closed subsets A and B 
of X such that AN B= @ and AUB=X (C. 
Jordan, Cours d’Analyse I, 1983). A subset S of 
X is connected if S considered as a tsubspace of 
X is connected. 

If a subset S of X is connected, then the 
tclosure S is also connected. Let {A,} be a 
family of connected subsets of X such that 
either A, A,#@ or AN Ag AØ for any 
pair A, and Aș. Then the union |), A, is con- 
nected. The continuous image of a connected 
set is connected. The tproduct space [], X, of 
a family of connected spaces { X,} is also con- 


299 


nected. Let {A,} be a family of connected 
subsets of X and A, be a connected subset of 
X. If AgNA,# Ø for every A,, then the union 
A,U\ ), A, is also connected. For a point p of 
a topological space X, the union of all connec- 
ted subsets containing p is connected and is 
called the connected component of p (F. Haus- 
dorff, 1927). 

The n-dimensional Euclidean space E” 
(n> 0) and the n-dimensional unit sphere S” 
(n> 1) are connected, whereas S°, consisting 
of two points, is not connected. E"\{0} is 
connected for n> 2, whereas E'\{0} is not 
connected. This fact implies that £! is not 
homeomorphic to E” (n>2). A connected 
open subset of a topological space X is called 
a domain (or region) in X. 

A topological space X is said to be locally 
connected at a point p if for every open set U 
containing p, there is an open set V containing 
p and contained in the connected component 
of pin U. X is said to be locally connected if it 
is locally connected at each point of X. A 
space X is locally connected if and only if 
every connected component of every open 
subset is open in X. 

There are connected spaces that are not 
locally connected. For example, the comb 
space {(x, y)€R?|x=1/n and 0<y<1 (n= 
1,2,3,...)or0<x<1 and y=0} is not locally 
connected at (0, 1) (see Fig. 1). 


Wea t | 
il! | 
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ty i 
Wi 


HH 


Fig. 1 
Comb space. 


B. Arcwise Connectedness 


Two points a and b of a topological space X 
are said to be joined by an arc in X if there is a 
continuous map f(t) of the closed interval J = 
[0,1] into X such that f(0)=a and f(1)=b. 

A topological space X is said to be arewise 
connected (or path-connected) if every two 
points of X are joined by an arc in X. For 

a point p of X, the union of all arcwise- 
connected subsets in X containing p is arcwise 
connected and is called the arewise-connected 
component (or path-component) of p. A topo- 
logical space X is said to be locally arcwise 
connected at a point p if for every open subset 
U containing p, there is an open subset V 
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containing p such that every two points in V 
are joined by an arc in U. X is said to be lo- 
cally arcwise connected if it is locally arcwise 
connected at each point of X. 

An arcwise-connected space is connected, 
but the converse is not true. The tsinusoid 
{(x, y)eR?| y=sin 1/x and0<x<1orx=0 
and —1<y<1} is connected but not arcwise 
connected. A complete metric space is arc- 
wise connected if it is connected and locally 
connected. 


C. Simple Connectedness and n-Connectedness 


Denote by S” the n-dimensional unit sphere 
and by D"*! the (n+ 1)-dimensional unit disk. 
A topological space X is said to be n-connected 
if every continuous map f from S” to X is 
extendable over D”*! (m=O, 1,2, ...,n). 0- 
Connectedness is equivalent to arcwise con- 
nectedness. A 1-connected space is also called 
simply connected. (For simple connectedness 
and n-ply connectedness of plane domains — 
333 Plane Domains) A topological space X ts 
said to be locally n-connected at a point p if for 
every open subset U containing p, there is an 
open subset V containing p and contained in 
U such that any continuous map f:S”"-> V is 
extendable to a continuous map f:D"*!=> 
U (m=0, 1, 2,...,n). X is said to be locally 
n-connected if it is locally n-connected at 
each point of X. A space X is said to be w- 
connected (or locally «-connected) if X is n- 
connected (or locally n-connected) for every n. 
S” is (n—1)-connected but not n-connected. 
Similarly for E"*'\{0}. If X is n-connected 
(n> 1), then the tsuspension SX is (n+ 1)- 
connected and the tloop space QX is (n— 1)- 
connected. The Hawaiian earring {(x, y)e 
R?|(x—1/n)? + y? =1/n? (n=1, 2,3, ...)} is 
arcwise connected but not locally simply con- 
nected. The ‘cone over the Hawaiian earring 
is simply connected, but not locally simply 
connected (Fig. 2). 


Fig. 2 
Hawaiian earring and its cone. 


A topological space X is said to be contrac- 
tible if the identity map 1, is thomotopic to a 
constant map c,, to a point xo of X (K. Bor- 
suk, Fund. Math., 24, 1935). A topological 
space X is said to be locally contractible at a 


79D 
Connectedness 


point p of X, if for every open subset U con- 
taining p, there is an open subset V containing 
p and contained in U such that the inclusion 
i: V c U is homotopic to a constant map toa 
point in U. X is said to be locally contractible 
if it is locally contractible at each point of X. 

A contractible or locally contractible space 
is w-connected or locally w-connected, res- 
pectively. The n-dimensional simplex, the n- 
dimensional disk D", and the n-dimensional 
Euclidean space E” are contractible. More 
generally, a tconvex set in E” is contractible. 
tTopological manifolds and tpolyhedra are 
locally contractible. 


D. Continua and Discontinua 


A topological space is said to be totally dis- 
connected if each connected component con- 
sists of one point. Divide the closed interval I 
=[0, 1] into three equal parts, and let J,, and 
I,, be the closed intervals obtained from I by 
removing the middle open interval. As the next 
step, divide I,, and I, into three equal parts, 
respectively, and remove the middle open 
intervals. Inductively we obtain 2"*! closed 
intervals I,,, ;(i=1,...,2"*') from 2” closed 
intervals I, ; by removing open intervals lying 
in their middles. Let C® =| JZ, 1, ; and C 
=(\2, C™. Then C is called the Cantor dis- 
continuum or simply the Cantor set or the 
ternary set (G. Cantor, Math. Ann., 21 (1883)) 
(Fig. 3). 


by Six” In log 
Fig. 3 
Cantor discontinuum. 


C is a subset of J consisting of points with 
coordinates t=(n,/3)+(n,/37)+ ... +(n,/3') 
+..., where n,=0 or 2. As a topological space 
C is homeomorphic to the Cartesian product 
of countably many copies of the discrete space 
D = {0,1}. C has the power of the continuum ¢ 
and is a compact, totally disconnected, tperfect 
set. The Cartesian product of infinitely many 
(countably or not) copies of D is called the 
general Cantor set. The continuous image of a 
general Cantor set is called a dyadic com- 
pactum. Compact metric spaces and compact 
groups are examples of dyadic compacta. 

A continuum is by definition a connected 
compact metric space consisting of more than 
one point. A metric space X is said to be well- 
chained if for every two points a, b and ¢>0 
there are points X,, X2,..., Xp- such that 
A(X;, X;41) <€ (Xp =a, X, =b). A well-chained 
compact metric space is a continum. 

Let K be a continuum containing two 


300 


points a and b. Then K is said to be irreducible 
between a and b if there is no proper subcon- 
tinuum of K containing a and b (L. Zoretti, 
Ann. Sci. Ecole Norm. Sup., 26 (1909)) (— 93 
Curves). 

A continuum K is said to be indecomposable 
if there are no proper subcontinua K,, K, 
such that K=K,UK, (L. E. J. Brouwer, Math. 
Ann., 66 (1910)). Simple examples of inde- 
composable continua have been given by 
A. Denjoy, C. R. Acad. Sci. Paris, 151 (1910); 
K. Yoneyama, Tohoku Math. J., 12 (1917); and 
B. Knaster, Fund. Math., 3 (1922). 

C. Kuratowski conjectured the following: 
If a plane continuum K ts homogeneous (that 
is, for any two points a, be K there exists a 
homeomorphism h: K >K with h(a)=b) then 
K is homeomorphic to the circle. A counter- 
example for this conjecture has been found by 
R. H. Bing and E. E. Moise. It is an indecom- 
posable, homogeneous, plane continuum and 
is called the pseudo-arc [8]. 
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A. History 


The geometric notion of connections orig- 
inated with T. Levi-Civita’s parallelism (Rend. 
Circ. Mat. Palermo, 42 (1917)) and was later 
generalized to the notion of connections of 
differentiable fiber bundles. Notions such as 
affine connections, Riemannian connections, 
projective connections, and conformal connec- 
tions can be described in terms of bundles 
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constructed from the tangent bundles of dif- 
ferentiable manifolds. They are also standard 
examples of the Cartan connections formu- 
lated by E. Cartan and C. Ehresmann. 


B. Connections in Principal Bundles 


Let P=(P, n, M, G) be a differentiable *prin- 
cipal fiber bundle. (For the sake of conve- 
nience, we assume that differentiability always 
means that of class C”.) The total space P and 
the base space M are tdifferentiable manifolds, 
and the projection z is a differentiable map- 
ping. The tstructure group G is a Lie group 
and acts on P from the right as a transforma- 
tion group. On each fiber, G acts transitively 
without fixed points. For elements a, x in G, P, 
we write R,(x)= xa. The mappings induced on 
*tangent vector spaces by R, and z will be 
denoted by the same letters, namely R,: T.(P) 
> T,,(P), n: T (P)> T,y)(M). The tangent vec- 
tor space T,(P) at each point x of P is mapped 
by the projection z onto the tangent vector 
space T,(M) at the point p=2(x) of M. The 
kernel of this mapping is denoted by V,(P), 
and each vector in V,(P) is said to be vertical. 
The kernel V,(P) is the totality of elements of 
T,(P) that are tangent to the fiber. 


C. Connections 


We say that a connection is given in P if for 
each point xe P, a subspace Q, of the tangent 
space 7,(P) is given in such a way that the 
following three conditions are satisfied: (i) 
T.(P)=V,(P)+ Q, (direct sum); (ii) R,(Q,)= 
Q,,(Q is invariant under G); and (iii) the map- 
ping x Q, is differentiable. A vector in Q, 

is said to be horizontal. 

Now suppose that X is an arbitrary tvector 
field on P. By condition (i), the value X, of X 
at each point x of P can be expressed uniquely 
as X,= Y, + Z,, where Ye V,(P) and Z €Q.. 
The vector fields Y and Z defined by Y, and Z, 
(xe P) are called the vertical and horizontal 
components of X, respectively. Condition (iti) 
implies that if X is a differentiable vector field, 
then its horizontal and vertical components 
are also differentiable vector fields. Let X be a 
vector field on the base space M. Since z de- 
fines an isomorphism of Q, and T,(M) (p= 
7m(x)), we have a unique vector field X* on P 
such that (a) x(X*)=X and (b) X*¥€Q,. We 
call X* the lift of X, and it is invariant under 
G by condition (ii). 

Suppose that a connection is given in P. If C 
is a piecewise differentiable curve in the base 
space M, we can define a mapping ¢ that 
maps the fiber over the initial point p of C 
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onto the fiber over the endpoint q of C as 
follows: Take an arbitrary point x on the fiber 
at p. Then we have a unique curve C* in P 
starting at x such that (a) 2(C*)=C, and (b) 
each tangent vector to C* is horizontal. (C¥ is 
called a lift of C that starts at x.) The endpoint 
y of the curve C* belongs to the fiber over q. 
We set ø(x)= y. Because C¥ = R,(C*), the 
mapping ~ commutes with transformations of 
G. We call this mapping ¢ the parallel dis- 
placement or parallel translation along the 
curve C. 


D. Holonomy Groups 


Fix a point p in the base space. If C is a closed 
curve in M starting from p, the parallel dis- 
placement along C maps the fiber over p onto 
itself. So if we fix a point x on the fiber over p, 
x 1s transformed by the parallel displacement 
to a point xa (ae G). Thus each closed curve C 
starting from p determines an element a(x, C) 
of G. If C varies over the set of closed curves 
that start from p, the totality of such elements 
of G forms a subgroup of G. This subgroup is 
called the holonomy group of the connection 
defined over P with the reference point x. If M 
is connected, holonomy groups with different 
reference points are conjugate. In the above, if 
we choose as the closed curves C starting from 
p only those curves that are null-homotopic, 
the elements a(x, C) form a subgroup of the 
holonomy group. This is called the restricted 
holonomy group. The holonomy group is a tLie 
subgroup of the structure group, and its con- 
nected component containing the identity 
coincides with the restricted holonomy group. 
Holonomy groups are useful in the study of 
the behavior of connections. 


E. Connection Forms 


Let g be the Lie algebra (— 249 Lie Groups) of 
the structure group of G of a principal fiber 


-bundle P=(P, 2, M, G). For each A in g, the 1- 


parameter subgroup exptA (—2 <t<o)ofG 
defines a tone-parameter group Rexpra of trans- 
formations on P, and it determines a vector 
field A* on P (— 105 Differentiable Mani- 
folds). Each element of the vector field A* 
is vertical at each point x on P, and the A* 
(Aeg)at x generate V,(P). Moreover, for each 
element a of G we have R,(A*)=(ad{a~')A)*. 
For a connection in P, we define the connec- 
tion form @ on P with values in g by the fol- 
lowing: (i) @,(A*)=A (A €g), and (i) w,{X)=0 
(X €Q,). The connection form w thus defined 
satisfies (iii) R*(@)=ad(a~')w (ae G), where 
Rž (œ) is the ‘differential form induced by the 
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transformation R, from the differential form 
w. Conversely, given a 1-form @ with values in 
g that satisfies conditions (i) and (it), we can 
define a connection in P by defining vectors X 
such that w(X)=0 as the horizontal, and its 
connection form coincides with w. Thus giving 
a connection in P is equivalent to giving a 
connection form in P. 

In particular, when a principal fiber bundle 
P is trivial, i.e., when P =M x G, we can iden- 
tify the tangent vector space T,(P) at a point x 
=(p,g) of P with the direct sum of T,(M) and 
T,(G). If we set Q, = T,(M), then Q defines a 
connection in P= M x G. Such a connection is 
called flat. When a connection can always be 
expressed as above locally, it is called locally 
flat. Since each principal fiber bundle is locally 
a product fiber bundle, we see that locally 
there exists a connection. If the base space M 
is paracompact, we can show the existence of 
connections on P. 


F. Extension and Restriction of Connections 


When a principal fiber bundle P =(P, n, M, G) 
has a treduced fiber bundle P’, we shall con- 
sider the relation between the connections of P 
and of P’. Let G’ be a Lie subgroup of G and g' 
its Lie algebra. We shall denote by j both the 
injection of G’ into G and also the injection of 
g' into g. If there exist a differentiable principal 
fiber bundle P’=(P’, x’, M, G’) and a differenti- 
able embedding f of P’ into P such that zo f 
=n and foR,=Ryao f (aeG’) are satisfied, 
then (P’, f) is said to be a reduced fiber bundle 
of P. Then we have f,(A*)=j(A)F,,. for each 
Aég and xeP’. 

Suppose that a connection is given in P’; we 
denote the horizontal space at the point x of 
P’ by Q}. At the point f(x) of P, we take f(Q) 
as the horizontal space and transform it by 
right translations of G. Thus we obtain a con- 
nection on P. Let œ and œ be the correspond- 
ing connection forms. Then we have jo’ = 
f*(@) on P’. Conversely, suppose that we 
are given a connection in P with the connec- 
tion form æ. If the induced form f*(w) on 
P’ has values always in j(q’), we can write 
{*(w)=joa’, and œw defines a connection in 
P’. In this case the connection in P is called 
an extension of the connection in P’, and the 
connection in P’ is called the restriction of the 
connection in P. 


G. Curvature Forms 


Suppose that a principal fiber bundle P= 

(P, nz, M, G) has a connection. Let F be a finite- 
dimensional vector space and « be a differen- 
tial form of degree k on P with values in F. We 
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define the covariant differential Da of a by 


(Da)(X,, --- Xa 1) = (da) (hx, w+, AX, 41), 


where the X; are vector fields on P and h de- 
notes the projection to the horizontal compo- 
nent. Dx is a differential form of degree k + 1 
on P with values in F. 

Let p:G-—GL(F) be a trepresentation of a 
Lie group G on F. A differential form « on P 
with values in F is called a pseudotensorial 
form of type p if satisfies R*(a)=p(a')a 
(aéG). In particular, if a pseudotensorial form 
a satisfies 1(A*)a=0 for any Aeg (— 105 Dif- 
ferentiable Manifolds Q), it is called a tensorial 
form of type p. For each representation p of G, 
we can construct an associated vector bundle 
E over M with fiber F. A tensorial form of 
type p is identified with a differential form on 
M with values in E. If « is a pseudotensorial 
form of type p, then Da is a tensorial form of 
type p. 

For a connection form œ on P, the co- 
variant differential Dw =Q of w is called the 
curvature form of the connection. Since @ is a 
pseudotensorial form of type ad, Q is a ten- 
sorial form of type ad. For the connection 
form we have the structure equation dw = 
—[@,w]+Q [4,6]. Let X and Y be vector 
fields on M, and let X* and Y* be their lifts, 
respectively. Then we have w([X*, Y*])= 
Q(X*, Y*), which shows that the curvature 
form Q for X*, Y* gives the vertical compo- 
nent of [X*, Y*]. 

It is known that the following three con- 
ditions for a connection are equivalent: (i) The 
connection is locally flat. (it) The curvature 
form vanishes. (iii) The restricted holonomy 
group is trivial (i.e., the identity group). 

The following two theorems are fundamental: 

(1) Suppose that a connection is given in a 
principal fiber bundle P =(P, n, M, G). Then 
the structure group of P can be reduced to the 
holonomy group [4, 6]. In fact, for xe P, let 
P(x) be the set of points y in P that can be 
connected to x by a piecewise horizontal curve 
in P. Then P(x) gives a reduced fiber bundle of 
P, and the connection in P is an extension of a 
connection in P(x) [4,6]. 

(2) The Lie algebra of the holonomy group 
with a reference point x in P coincides with the 
vector subspace of g spanned by {Q,(X, Y)| 
ye P(x), X, Ye T,(P)} [4,6]. 

The curvature form Q is used to express the 
tcharacteristic classes of the bundle P [1,2] 
(— 56 Characteristic Classes). 

In some cases, a connection in the principal 
fiber bundle induces a connection in an tas- 
sociated fiber bundle. In particular, when G is 
GL(n, R) or GL(n, C), we can define a connec- 
tion in any assoctated vector bundle. The 
notion of connections in vector bundles can be 
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defined more algebraically (M. F. Atiyah, 
Trans. Amer. Math. Soc., 85 (1957)) and can 
also be defined as a kind of differential oper- 
ator on vector bundles [8]. 


H. Affine Connections 


Let M be a differentiable manifold of dimen- 
sion n and P be the tbundle of tangent n- 
frames over M. Then P has the structure 
group GL(n, R), and it is the principal bundle 
associated with the tangent vector bundle of 
M, which consists of all tangent vectors of M. 
A connection in the bundle of tangent n- 
frames is called an affine connection (or linear 
connection) on M. An affine connection on M 
defines (as well as the curvature form Q) a new 
form © called the torsion form on P, which is 
given as follows: Let F be an n-dimensional 
vector space with a fixed basis (€,,€,,...,¢,). 
Since the bundle of tangent n-frames P is 

the set of all bases (i.e., n-frames) (e,,...,€,) 

in T,(M) at each point p of M, every point 
x=(e,,...,@,) of P is given as a mapping x 

of F onto T,(M) (p =n{x)) defined by x: č;>e;. 
We define differential form 0 of degree 1 with 
values in F on P by 6,(X)=x '(2,(X)) (Xe 
T,(P)). @ is called a canonical 1-form of the 
bundle of tangent n-frames of the manifold M 
and has the following property: Any diffeo- 
morphism ¢ of M onto itself induces a bundle 
automorphism @ of P onto itself, and @ pre- 
serves 0, that is, 6*(0)=6. Conversely, we can 
show that any bundle automorphism of P that 
preserves 9 is induced by a diffeomorphism of 
the base space M. 

For an affine connection on M, we define 
the torsion form © by © = D9. © is a differen- 
tial form of degree 2 on P with values in F and 
satisfies R,© =a -O (ae GL(n, R)). Further- 
more, we have the structure equation for 0, 
d0=[m,0]+ © [2,4,6]. 

For each element é in F, there exists a 
unique horizontal vector field B(é) on P such 
that 0(B())=€. B(é) is called the basic vector 
field corresponding to č. At each point xe P, 
B(é,),,---, B(é,),. form a basis of Q,. Let 
{A,,.--; Am} (m=n?) be a basis of g=gl(n, R). 
Then at each point xe P, {(A#),, ...,(A*),, 
B(é,).,..-, B(E,),} is a basis of the tangent 
vector space T(P). Thus the bundle P of 
frames is a tparallelizable manifold. The pro- 
jection to M of any tintegral curve of a basic 
vector field is a geodesic, which is defined in 
Section I [4]. 

An affine connection on M gives a parallel 
displacement of the tangent vector space of M 
as follows: Let C=p, (0<t<1) be a curve in M 
and C* =x, be a lift of C to P. The parallel 
displacement of the tangent n-frame x, at Po 
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along the curve C is x,, and the mapping x,o 
Xo: T,,(M)— T,,(M) is called the parallel dis- 
placement of the tangent space T, (M) onto 

T,,(M) along C. It is easily seen that the map- 


ping is independent of the choice of lifts. 


I. Covariant Differentials 


Let C={p,} (0<t<1) be a differentiable curve 
in M. If we have a vector Y, in T,,(M) for each 
t and the correspondence t— Y, is differenti- 
able, then { Y,} is called a vector field along the 
curve C. For {Y,} we set 


Y = Lim (1/1)( 9:4 Kean) Y), 


where ¢, „is the parallel displacement of 
T,,(M) onto T, ,,(M) along the curve C. The 
vector field {Y;} along C thus obtained is 
called the covariant derivative of { Y,}. {Y,} is 
parallel along C; that is, ¥,=@po_,,( Yo) if and 
only if Y,’=0. In particular, if the tangent 
vectors to a curve C are parallel along C itself, 
then C is said to be a geodesic. 

Let X and Y be vector fields on a manifold 
M with an affine connection. The covariant 
derivative V, Y of the vector field Y in the 
direction of the vector field X is defined as 
follows: Let pọ be a point in M, C={p,} 
(—é<t<e) be an integral curve of X through 
Po, and {@,} be the parallel displacement along 
C. We set 


(Vx ¥)p,=lim (1/1)(9,°"(¥,)— Ypo). 


Then Vj, Y is also a vector field on M. 

The mapping (X, Y)->V, Y satisfies the 
following three conditions: (i) Vy Y is linear 
with respect to X and Y; (ii) Vy Y= f- Vx Y; 
and (iii) Vx(fY)=f- Vy Y +(Xf): Y, where f is 
a differentiable function on M. Conversely, if a 
mapping satisfying conditions (i)—(iii) above is 
given, then there exists a unique affine connec- 
tion on M whose covariant derivative coin- 
cides with the given mapping [4, 6]. 

Fix a vector field Y. Then the mapping X > 
Vy Y defines a ttensor field of type (1, 1). This 
tensor field is called the covariant differential 
of Y and is denoted by VY. Now fix a vector 
field X. Then the mapping Y>V, Y can be 
naturally extended to tensor fields of arbitrary 
type, and it commutes with the tcontraction of 
the tensors. For a tensor field K this is denoted 
by K>V,K. Furthermore, the mapping X > 
V,K is called the covariant differential of K 
and is denoted by VK. We call V,K the co- 
variant derivative of K in the direction of X. A 
tensor field K is invariant under parallel dis- 
placements if and only if VK =0 (— 417 Ten- 
sor Calculus). 
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J. Curvature Tensors and Torsion Tensors 


For an affine connection on M, the curvature 
tensor R and the torsion tensor T are defined 
by 


R(X, Y)(Z) = Vx(Vy Z) or Vy(VxZ) = Vix, y}(Z), 
T(X- Y)=VxY—V;X —[X, Y], 


where X, Y, and Z are vector fields on M, and 
R and T are tensors of types (1, 3) and (1, 2), 
respectively. Also, in terms of the curvature 
form Q and the torsion form © on the bundle 
of tangent n-frames P over M, they can be 
defined by 


R(X, Y)(Z)=x!-O,(X*, Y*)- X(Z), 
D(X, Y)=x"(O(X*, ¥*)), 


where x(x)=p, X, Ye T,(M), and X*, Y* are 
lifts of X, Y, respectively. The curvature tensor 
and the torsion tensor satisfy the relations 
R(X, Y)= —R(Y, X), T(X, Y)= —T(Y, X). 
Moreover, Bianchi’s identities hold: 


S(R(X, Y)(Z))=S(T(T(X, Y), Z) 
+(Vx T)(Y, Z)) 

and 

S((V_ R)(¥Y, Z)+ R(T(X, Y), Z))=0, 


where © denotes the sum of terms that are 
obtained by cyclic permutations of X, Y, Z 
[4]. For instance, in the case of a Riemannian 
connection (Section K), we have T=0, and 
Bianchi’s identities reduce to 


R(X, Y)(Z)+ R(Y, Z)(X) + R(Z, X)(Y)=0, 
(Vy R)(Y, Z)+(VyR)(Z, X) +(V,R)(X, Y)=0. 


We now consider a system of coordinates 
(x',...,x") in an n-dimensional linear space M 
=R". The vector fields (X,,..., X„) (X;=6/0x') 
form a basis for vector fields on M. If we set 
Vx,X;=0, we get an affine connection qn R”. 
For such a connection we have R=0, T=0, 
and any straight line in R” is a geodesic with 
respect to this connection. The connection is 
called the canonical affine connection on R”. 
An affine connection on a manifold M satisfies 
R=0 and T=0 if and only if the connection 
on M is locally isomorphic to the canonical 
affine connection of R’. 

Let ọ be a diffeomorphism of a manifold M 
with an affine connection onto itself. We call @ 
an affine transformation of M if the induced 
automorphism @ on the bundle of tangent n- 
frames preserves the connection. In terms of 
covariant differentials, this condition is equiva- 
lent to the condition V,,,,9(Y) = (Vx Y) for 
any vector fields X, Y. An affine transforma- 
tion of the canonical affine connection on R” 
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is an ordinary affine transformation. For an 
affine connection on a manifold M, the set of 
all affine transformations forms a Lie group 
and acts on M as a Lie transformation group 
[4, 5]. 

Let M be a manifold with an affine connec- 
tion. M is called an affine locally symmetric 
space if VR=0 and VT=0 are satisfied. These 
conditions are satisfied if and only if at each 
point p of M, there exist a neighborhood U of 
p and an affine transformation g of U such 
that g?=1 and p is the isolated fixed point of 
o. If for each point p of M there exists an 
affine transformation of M such that g?=1 
and p is an isolated fixed point, then M is 
called an affine symmetric space. A symmetric 
Riemannian space is a special case of this type 
(— 413 Symmetric Spaces). 

At each point p in a manifold M with an 
affine connection, we can choose local coordi- 
nates (xt, ..., x") such that x‘(p)=0 and the 
curve x'=a't (—6<t<6) is a geodesic for each 
(a’,...,a") with D(a‘)? = 1. Such local coordi- 
nates are called geodesic coordinates at p [4]. 
With respect to geodesic coordinates, we have 
(Yx (Xp =0 (0/dx'= X;). 


K. Riemannian Connections 


When a Riemannian metric g (— 364 Rieman- 
nian Manifolds) is given on a manifold M, it 
defines a metric on the tangent space T,(M) at 
each point p of M, and we can take ortho- 
normal bases in T,(M). The set P’ of all ortho- 
normal bases of tangent spaces is a subset of 
the bundle P of tangent n-frames of M and 
forms a subbundle of P; its structure group is 
the torthogonal group O(n), and P’ gives a 
reduction of the bundle of tangent n-frames. 
Conversely, when a reduction of the bundle 
of frames to O(n) is given, we can define a 
Riemannian metric on M such that the re- 
duced bundle consists of all orthonormal 
frames. 

For a Riemannian metric g on M, there 
exists a unique affine connection on M such 
that (i) Vg =0, and (ii) the torsion tensor T 
vanishes [4]. This connection is called the 
Riemannian connection corresponding to g. 
The first condition is equivalent to the invar- 
iance of the Riemannian metric g under paral- 
lel displacement. Thus the affine connection 
transforms orthonormal bases on M to ortho- 
normal bases and induces a connection in the 
bundle P’. It is known that the restricted holo- 
nomy group of any Riemannian connection is 
a closed subgroup of O(n) [4]. An affine con- 


nection on a manifold M is called a metric 
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connection if it preserves a Riemannian metric 
g on M, i.e., if it satisfies the condition (i). 


L. Representations in Local Coordinates 


(1) Let (x!,...,x") be a local coordinate system 
in a manifold M and consider vector fields 
X,= 0/6x,. For an affine connection on M, 

the covariant derivative can be expressed as 


Vx (X4 =) TX 


The Ij, are called coefficients of the affine 
connection with respect to the local coordinate 
system (xt, ..., x"). We denote by I}, the coeffi- 
cients of connection with respect to another 
local coordinate system (y!,..., y”). Then on 
the intersection of their coordinate neighbor- 
hoods, we have 


B 2 
T= ys éy' 25 weds re + “x a) 
ap dy! dy dyloy" 

Conversely, if the F} are given in each local 
coordinate system of M and satisfy this rela- 
tion on each intersection of their coordinate 
neighborhoods, then there exists a unique 
affine connection such that the coefficients of 
the connection are given by Ij, 

(2) The coefficients of the Riemannian con- 
nection corresponding to a Riemannian metric 
g=%g,4x'dx/ on a manifold M are given by 


.= 509 il ae Of OD jx 
La= a Dat 


and are called the Christoffel symbols. 

(3) With respect to each local coordinate 
system (x',...,x"), we express the torsion ten- 
sor T and the curvature tensor R of an affine 
connection by 


T=} Tj, dx @ dx* @ X; 


ijk 


R= X Rig dx! ® dx* Q dx! ® Xi. 


ijkl 
The components T} and R} are given by 
Ti = he a Ti. 
na = (O1};/Ox* — OT f,/Ox') 
+) Tm Ti T im). 
(4) Let K =(K;'""7) be a tensor field of type 


(r,s). Then the covariant differential VK = 
(Kiir) is given by 


iyi,  agirei/A k 
Kt Z= OK /0x 


Jicejs 


(grexit) 


a 


g=1 


+ 


R 
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(5) A curve x'=x'(t) is a geodesic if and only 


d? x' dxidx* 


=0 > ee 
dt? ae an ne 


(— 178 Geodesics, 417 Tensor Calculus). 


M. Cartan Connections 


Let M be a differentiable manifold of dimen- 
sion n. Consider a homogeneous space F = 
G/G’' of the same dimension n, where G is a Lie 
group and G’ is a closed subgroup of G (— 199 
Homogeneous Spaces). Let B=(B, M, F, G) be 
a fiber bundle over M with fiber F and struc- 
ture group G, and P=(P, M, G) be the prin- 
cipal fiber bundle associated with B. Suppose 
that there exists a cross section f over M to B. 
Then the structure group of P can be reduced 
to G’. We denote this reduced fiber bundle by 
P' =(P’, M, G’) and the injection of P’ into P 
by j (— 147 Fiber Bundles). 

Suppose that a connection is given in P. Its 
connection form w is a differential form of 
degree 1 on P with values in g, and the in- 
duced form a’ = j* (œ) is also a differential form 
of degree 1 on P’ with values in g. We call the 
connection in P a Cartan connection on M 
with the fiber F =G/G' if at each point x of 
P’, œw, gives an isomorphism of T;,(P’) onto g 
as linear spaces. Such a connection in P is 
equivalently defined as a 1-form œw on P’ with 
values in g satisfying the following three con- 
ditions: (i) w’(A*)= A (Aéq’ (Lie algebra of 
G’)); (ti) R*(w’)=ad(a~')o' (ae G’); and (iii) œ 
gives an isomorphism of T(P’) onto g at each 
point xe P'. For such w’, we can take a con- 
nection form w in P such that w =j* (w); œ 
defines a Cartan connection. 


N. Soudures 


A cross section f over M to B gives a vector 
bundle T'(B) on M defined as follows: For 
each point p of M, the projection BM de- 
fines a mapping T;,,)(B)— T,(M). The kernel of 
this mapping is denoted by V,,,)(B). Then 
T'(B)=(), Vp (B) forms a vector bundle over 
M, and the dimension of its fibers is equal to 
n=dim F. 

A Cartan connection in P gives a bundle 
isomorphism between T'(B) and the tangent 
vector bundle T(M) of M as follows: Let x be 
an arbitrary point in P’, and put p=7x(x). The 
projection 2: P’>M induces an isomorphism 
of T,(P’)/V,(P’) onto T,(M). On the other hand, 
w, gives an isomorphism of T,(P’)/V,(P’) onto 
g/g’. As a point in P’, x gives a mapping of F 
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=G/G' onto the fiber in B over p and sends 
the point {G’} in F to f(p). By this map- 

ping, T)(F)=9/q' is mapped isomorphically 
onto V;,,)(B). Combining these isomorphisms, 
we get an isomorphism between T,(M) and 
Vrp (B) that is independent of the choice of 
xe P over p. The set of such isomorphisms for 
peM defines a bundle isomorphism of T(M) 
and T’(B). If a fiber bundle B over M has an 
isomorphism such as above through a cross 
section, then B is said to have a soudure [3]. 

Conversely, if a fiber bundle B over M has a 
soudure with respect to a cross section f, then 
there exists a Cartan connection in P such that 
the soudure given by the connection coincides 
with the original one [3]. There are many 
Cartan connections in P with the given sou- 
dure. However, when F = G/G’ is a symmetric 
space of compact type such that G is noncom- 
pact and contains the identity component of 
the group of isometries, we can determine 
uniquely the so-called normal Cartan connec- 
tion among the Cartan connections which 
gives rise to the given soudure [10]. 

For the tangent vector bundle T(M) of M, 
the fiber F is an n-dimensional linear space 
and can be expressed as F = G/G’, where G is 
the taffine transformation group of F and G’ 
= GL(n, R). Then T(M) has the 0-section over 
M, and there exists a natural soudure. Fur- 
thermore, an affine connection on M canoni- 
cally induces a Cartan connection on M with 
the fiber F = G/G’ [3]. 

For a Cartan connection on M, we can 
introduce the notion of development of a curve 
in M into the fiber and also the notion of 
completeness [3]. 


O. Projective Connections 


Let F, =G,/G, and F, =G,/G, be homoge- 
neous spaces with dim F, =dim F, =n. Sup- 
pose that G, ts a Lie subgroup of G, and G; is 
contained in G; by the injection. Then we have 
a canonical injection F, >F, (F; is an open 
subset of F, by the assumption). 

Suppose that a fiber bundle B, with fiber F, 
over M has a cross section f,. Using f,, we 
can construct a bundle B, with fiber F, over M 
which also has a cross section f}. The prin- 
cipal bundle of B, is given by extending the 
structure groups from the principal bundle 
of B,. We can show that if B; has a soudure 
with respect to the cross section f}, then B, 
also has a soudure with respect to the cross sec- 
tion f,. A Cartan connection in the principal 
fiber bundle P, associated with B, that is com- 
patible with a given soudure on B, induces 
a Cartan connection in the principal fiber 
bundle P, associated with B,, which then 
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induces a soudure on B,. The latter is called a 
Cartan connection induced from the former. 

Let F, be an n-dimensional linear space and 
F, be the real *projective space of dimension n. 
Then the affine transformation group of F, 
can be embedded into the projective transfor- 
mation group of F,. Thus the tangent vector 
bundle of a manifold M induces a fiber bundle 
over M with the n-dimensional projective 
space as its fiber. A Cartan connection in this 
fiber bundle is called a projective connection on 
M. By the argument in this section, we see that 
every affine connection on M induces a projec- 
tive connection on M. 

Given two affine connections on M, we 
denote by V and V’ their corresponding covar- 
iant differentials. The two affine connections 
on M induce the same projective connection 
on M if and only if there exists a differential 
form ọ of degree 1 on M such that Vy Y— 

VY =e(X)Y¥+ —(Y)X for any vector fields X, 
Y{7]. If a diffeomorphism ~ of M preserves 
the projective connection induced by an affine 
connection in M, then œ maps geodesics of M 
into geodesics. 


P. Conformal Connections 


Let F, be an n-dimensional Euclidean space 
and F, an n-dimensional sphere (a ‘conformal 
space). We can embed the group of tisometries 
of F, canonically into the group of tconformal 
transformations of F,. A Riemannian metric of 
the tangent vector bundle of a manifold M of 
dimension n gives a fiber bundle over M with 
fiber F,. A Cartan connection in this fiber 
bundle is called a conformal connection on M; 
a Riemannian connection on M induces a 
conformal connection on M. 

Two Riemannian metrics g}, ga on M in- 
duce the same conformal connection on M if 
and only if there exists a positive function f 
on M such that g, = fg,. Thus for a Rieman- 
nian manifold M with metric tensor g,, a dif- 
feomorphism ọ of M such that y*(g)= fg, 
leaves invariant the conformal connection 
induced by g,. Such a ọ is called a conformal 
transformation of M with respect to the given 
Riemannian metric g,. 

For a Riemannian manifold M with metric 
tensor g, we define Weyl’s conformal curvature 
tensor W by 


1 : 
Wia= Ria +o 5 (Rat — Rak + Gj Ri — Gr Ri) 


R 


z (n—Din—d (Giz 6; = G04)» 


where the Rj,, and R; are components of the 
curvature tensor and Ricci tensor, respectively, 
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and R is the scalar curvature (-> 364 Rieman- 
nian Manifolds, 417 Tensor Calculus). When 
dim M > 3, the conformal connection induced 
by g on M is locally flat if and only if the 
conformal curvature tensor vanishes [7]. 


Q. Yang-Mills G-Connection 


Let P =(P, n, M,G) be a differentiable principal 
fiber bundle over a compact oriented Rieman- 
nian manifold M with group G. Then fiber 
bundles Gp =P x ,G and gp=P x 44g asso- 
ciated with P are induced naturally from the 
group conjugation c:G— Aut(G) and the ad- 
joint representation Ad: G—> Aut(g), respec- 
tively. A (local) section of Gp is called a (local) 
gauge transformation of P. The set of all global 
gauge transformations, which is denoted by 
G,, has a group structure. 

A locally faithful representation p of G to 
an n-dimensional complex vector space F with 
a fixed basis (&,,..., €,) defines a differenti- 
able complex vector bundle E =P x „F asso- 
ciated with P. Every point x of P is identified 
with a linear mapping x of F onto the fiber 
Exx defined by X:¢;-e;, where e; denotes the 
equivalence class of {x, čp eP x F, 1 <i<n. 

In a manner similar to the case of an affine 
connection, a connection in P with connection 
form w gives a notion of parallel displacement 
of E as follows: Let c =p, (O<t<1) be a curve 
in M and c* =x, bea lift of c to P. The map- 
ping X,0X9':E, —E,, is called the parallel 
displacement of E,, onto E, along c. 

Let X be a vector field on M and ọ bea 
differentiable section of E. The covariant de- 
rivative V,@ of ọ in the direction of X is de- 
fined as follows: Let pọ be a point of M, c=p, 
(—e<t< —e) be an integral curve of X through 
Po and c* =x, be a lift of c to P. We set 


a oo 
(Vy P)p, = lim Pac O Xo ; (Pp) B Pp,)- 


Then Vy is also a differentiable section of E. 
The mapping (X, 9) > Vy¢ satisfies the fol- 
lowing conditions: (i) Vy @ is linear with respect 
to X and 9; (ii) Vx = fVxọ; and (iii) Vx(fo) = 
(Xf)o+fVxo@, where f is a differentiable func- 

tion on M. From these conditions it is seen 
that the mapping X > Vy¢q for a fixed section 
ọ of E defines a differential form of degree 
one with values in E, denoted by Vø. V is 
called a G-connection on E (induced from the 
connection in P). A linear operator dY: (A? ® 
E)—>T(A?*! Q E), defined by d («Q g)=da ® 
ot+(—1)? aA Vo for a differential form « of 
degree p and a differentiable section ọ of E is 
called a covariant exterior differentiation. 
Here T (A? Q E) denotes the set of all differ- 
ential forms of degree p with values in E. 
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The curvature form RY of V is defined by 
R(X, YQ=Vx(Vy 9) — V; (Vxo)— Vix. ne 
where X and Y are vector fields on M and 
ọ is a differentiable section of E. In terms of 
the curvature form Q of the connection in P, 
it can be defined also by 


RY(X, Yyp=xXo,(X*, Y*)-x (0), 


where X and Y are in T,M and xen ‘(p) and 
X*, Y* are lifts of X, Y to P, respectively, and 
ọ is an element of E,. Since Q is a tensorial 
form of type ad and the differential p, of 

p induces a faithful representation of g to 
ql(F), RY can be regarded as a differential 
form of degree two with values in the bundle 
Gp=PX aa. 

The curvature form RY satisfies Bianchi’s 
identity: dYRY =0, where d” is a covariant 
exterior differentiation with respect to the G- 
connection on gp canonically induced from 
the connection in P. 

We denote by @ the set of all G-connections 
on E. Now let G be a compact semisimple Lie 
group. A functional S :€—>R defined by V 
+ S(V)= —43fytr(RY A * RY) is called the 
action integral (Yang-Mills functional) of V, 
where * is Hodge’s star operator, given by the 
fixed orientation of M. 

The group fp acts on @ by Vo fl oVof 
for fo p and then the curvature form is trans- 
formed by this action as RY Ad(f~')RY. 
Thus F is Gp-invariant. 

A connection V is called a Yang-Mills G- 
connection if V is a critical point of Z. The 
Euler-Lagrange equation of Z is given by 
dY* RY =0 by the aid of the formal adjoint 
operator d”* of dY. This equation, called the 
Yang-Mills equation, is a system of non- 
linear second-order elliptic partial differential 
equations. 

When M is a 4-dimensional vector space R* 
with Minkowskian metric and G is the Abelian 
group U(1), the Yang-Mills equation coincides 
with Maxwell’s equations for an electromag- 
netic field. Thus the Yang-Mills equation for a 
non-Abelian group G is a natural extension of 
Maxwell’s equations. In fact, the theory of 
Yang-Mills connections has its origin in the 
field theory of physics [11]. 

In the case of dim M =4, special Yang- 
Mills G-connections occur. A G-connection 
V satisfying the condition * RY = RY (resp. * 
RY = — RY) as a differential form of degree 
two is called a self-dual (resp. anti-self-dual) 
G-connection. From Bianchi’s identity and 
the expression of d’":d”" = —*od’ o «, it fol- 
lows that every self-dual (anti-self-dual) G- 
connection gives a solution to the Yang-Mills 
equation. Since the first Pontryagin number 
p,(E) is given by p, (E)= — (1/47?) Ím tr(RY A 
RY) by virtue of the Chern-Weil theorem, the 
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action integral satisfies 4 (V) > 2x7|p,(E)| for 
every V in @ and the equality holds if and 
only if V is self-dual or anti-self-dual. 

Explicit forms have been obtained for (anti-) 
self-dual connections over the 4-sphere S* by 
many interesting methods [12-15]. And 
it has been shown that moduli space of self- 
dual G-connections (i.e., the set of all solu- 
tions to the Yang-Mills equation modulo Gp) 
over S* has the structure of a Hausdorff mani- 
fold of dimension p, (gp)—dim G for every 
principal bundle P with group G [16, 17]. It is 
not yet known whether there exists a Yang- 
Mills G-connection over S4 whose holonomy 
group is an open subgroup of G and which 
is neither self-dual nor anti-self-dual [18]. 

The following is one of the few known facts 
concerning the properties of Yang-Mills G- 
connections: If a Yang-Mills G-connection, 
G=SU(2), SU(3), or O(3), over S* is weakly 
stable, i.e., if the second variation of Z is posi- 
tive semidefinite, then it is self-dual or anti- 
self-dual [19]. 
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81 (1.11) 
Constructive Ordinal 
Numbers 


A. General Remarks 


To extend the theory of frecursive functions to 
transfinite ordinal numbers, A. Church and S. 
C. Kleene [1] considered the set of effectively 
accessible ordinal numbers and defined the 
concept of constructive ordinal numbers as 
explained later in this article. Their work 
became the basis of fruitful research by Kleene 
W. Markwald, C. Spector, and others [2-5]. 
A constructive ordinal number was origin- 
ally introduced as an “expression” in a tformal 
system utilizing the A-notation. Since such a 
system is “effective,” we can arithmetize it 
utilizing tGédel numbers and assume from the 
outset that each ordinal number is represent- 
able by a natural number. The notations, 
terminology, and theorems mentioned in this 
article are mainly those for constructive 
ordinal numbers of the second number class. 


kd 


B. Definition and Fundamental Properties 


We call a set of natural numbers satisfying 
conditions (I) and (II) a system of notations for 
ordinal numbers, and an ordinal number a 
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constructive ordinal number when it is represent- 
able by a natural number belonging to such 

a system of notations: (I) No natural number 
represents two distinct ordinal numbers. (IT) 
There are three tpartial recursive functions 
K(x), P(x), and Q(x, n) defined as follows: (i) for 
any natural number x representing X, K(x) 
takes the value 0, 1, or 2 according as X is 
zero, an tisolated ordinal number, or a tlimit 
ordinal number, respectively; (ii) when X is the 
ordinal number timmediately after an ordinal 
number Y, P(x) represents Y for any natural 
number x representing X; (iii) when X is a 
limit ordinal number, for any natural number 
x representing X there exists an increasing 
sequence {Y,} of ordinal numbers such that 

X =lim, Y, and Q(x,n) represents Y, for each 
natural number n. 

The system called S, by Kleene is the most 
useful and convenient among systems of nota- 
tions for ordinal numbers. Let ng be a tprimi- 
tive recursive function of the variable n defined 
by 09=1, (n+ 1)o = 2". The fundamental 
notion acO and relation a < gb of the system 
S, are introduced by the following inductive 
definition: (1) 1€0O; (2) if ye O, then 2”€O and 
Y< o2; (3) if a sequence {y,} of natural num- 
bers has the property that for each n, y,¢O 
and y,<oVn+1, and if y is a Godel number 
that defines y, recursively as a function of 
Ng (1.€., Yn = {y} (no) (— 356 Recursive 
Functions)), then 3-5” € QO, and for each n, 
Vn<o3' P; (4) if x, y, z€0, x< oy, and 
y<oz, then x < gz; (5) a€ O, a < gb hold only 
when they follow from (1)-(4). 

Now, for brevity, we write a < gb for 
(a<gb)v(a=b). The following propositions 
hold for S3: (1) If a < gb, then b¥ 1; (2) If 
a< ob, then a, beO; (3) If a < 2”, then a < py); 
(4) If a< o3: 5, then there is a natural number 
n such that a<gy,, where y, = {y} (no); (5) If 
ae O, then 1 < ga; (6) If acO, then for any 
*number-theoretic function a such that a«(0)= 
a and Yn(a(n) 4 1—>a(n+ 1)<,a(n)), there 
is a k such that «(k)= 1; (7) For each a, 9 
(a< ga); (8) If ceO, a < oc, and b < oc, then 
a<ob or a=b or b<oa. 

Each member a of O represents an ordinal 
number |a] as follows: |1|=0; |2”|=|y|+1 for 
yeO; |3-5*|=lim,|y,| for 3-5”e0O, where y, = 
{y} (no). Let b be a member of O. Then |a| < 
|b| when a < ob; and conversely, for each «< 
|b], there is a number a such that Ja|=« 
and a < gb. Hence the set {a|a<gb} isa 
twell-ordered set with respect to <Q, and its 
torder type is |b|. The least number € greater 
than |a| for every member a of O is the least 
ordinal number that is not constructive. It is 
denoted by wS" (Church and Kleene denoted 
it by œ). There is a subsystem of S, that is well 
ordered with respect to <o and contains a 
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unique notation for each constructive ordinal 
number «. For such a subsystem we can take 
a IT} set K such that K is trecursive in O (S. 
Feferman and Spector, R. O. Gandy) (— 356 
Recursive Functions H). 


C. Constructive Ordinals and the Kleene 
Hierarchy 


Let R(x, y) be a tpredicate on natural numbers. 
We write x< zy for any natural numbers x, y 
for which R(x, y) holds. We consider only the 
case where <p is a ‘linear ordering on the set 
Dp={x|Ay(R(x, y)v R(y, x))}. If Dz is a well- 
ordered set with respect to <p, we denote its 
torder type by |R|. (1) For each (constructive) 
ordinal number «<«{*, there is a tgeneral 
recursive (more strictly, tprimitive recursive) 
predicate R such that |R|=« (Markwald, 
Spector, Kleene [3]). (2) Conversely, if R is a 
thyperarithmetic predicate (e.g., R is general 
recursive), then |R| < w$ (Markwald, Spec- 
tor). The following theorems are the most 
fruitful ones in the theory of constructive 
ordinal numbers, and they fully support the 
validity of Kleene’s idea of tanalytic hierarchy. 
(3) The set O is 17} (— 356 Recursive Func- 
tions H), and so is the predicate a < gb. Name- 
ly, for O, there is a primitive recursive predi- 
cate R(a, x, a) such that 


ace O <> VaIxR(a, x, &) 


(Kleene [3]). (4) For each ordinal number a < 
oS, the set {a| «> lal} is a hyperarithmetic 
set (Spector). (5) O is a tcomplete set for 77}. 
That is, for any JMi set E, there is a primitive 
recursive function @ such that aeE<+(a)eO 
(Kleene [3]). Accordingly, O is not a Z! set 
(— 356 Recursive Functions H). 


D. Relativization and Extension 


Given a (number-theoretic) predicate Q of one 
variable (or a set of natural numbers), we can 
trelativize to Q the notion of constructive 
ordinal numbers. The least ordinal that is not 
constructive relative to Q is denoted by œ. 
The relativization to Q of the fundamental 
notion aéO and relation a < gb of the system 
S, of notations are denoted by ace O2 and a< 
o2b, respectively. Then we can relativize the 
results of the preceding paragraphs to Q. For 
example, as the relativization of (3), we have 
the following: There is a predicate R (a, x, œ) 
which is primitive recursive tuniformly in Q 
such that 


ac O02 <> VaIxR2(a, x, a). 


When Q is hyperarithmetic, we have no gen- 
eralization of the constructive ordinal numbers 
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by relativizing them to Q, that is, w = œf" 
holds (Spector). Now by relativizing to O the 
concept of constructive ordinal numbers, we 
obtain a proper extension of it (wf* <9), and 
then, performing such extensions successively, 
we have a (transfinite) sequence O, 0°, 
0°°.....On the other hand, we can extend the 
constructive ordinal numbers to those corre- 
sponding to any like higher number class, 
beyond the second, in which partial recursive 
functions are used at limit levels to provide an 
“accessibility” mapping from previously de- 
fined number classes. There are several exten- 
sions done by Church and Kleene, H. C. 
Wang, D. L. Kreider and H. Rogers, Jr., H. 
Putnam, W. Richter, A. Kino and G. Takeuti, 
and others. Richter [9] has shown that these 
two ways of extending the constructive 
ordinals are equivalent, provided the sets of 
notations for the higher number classes satisfy 
certain natural conditions. Specifically, the 
ordinals of the Addison and Kleene [6] con- 
structive third number class are exactly the 
ordinals less than w? and the set O,, of nota- 
tions for those is recursively isomorphic to 

O° (Richter). 


E. Constructive Ordinals in Higher Number 
Classes 


Putnum [8] has defined a system C of nota- 
tions for constructive ordinals of the Cantor 
higher number classes, improving that of 
Kreider and Rogers [7], as follows: N, is the 
set of notations for the ordinal number g; if 
for some a, xe N, let |x| = ué [xe N], which 

is called a hyperconstructive ordinal; and 

let C,=U{N.|¢ <a}. Define i to be a Gödel 
number of the tidentity function, and n to be 
an index in C, if 3”: 5'e C, for some t. There 
are four cases: 

Case 1. a=0. Then N, = {1}. 

Case 2. «= ß + 1, where N; is already defined. 
Then N, = {2*|xeEN,}. 

Case 3. « is a limit ordinal such that N; is 
already defined for all €<«a, and there exists 
an ordinal $ <« such that 3°: 5'e N; for some 
a, and a partial recursive function f giving an 
order-preserving tcofinal mapping (0.p.c.m.) 
from C; into C,. Then N, is taken to be the set 
of all numbers 3°: 5" such that 3%: S'e N, (where 
ß is any ordinal with the above property) and 
{n} is an o.p.c.m. from C; into C,. 

Case 4. « is a limit ordinal such that N; is 
already defined for all č < a, there is no f as in 
Case 3, but there is a number ae C, which is 
not an index in C,. Let fi, be the least f for 
which such a’s belong to N,. Then N,= 

{3% 5"]ae Ng, and {n} is an o.p.c.m. from C, 
into C,}. 
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We have that xe N, only as required by Cases 
1-4. Finally, let wç be the least ordinal num- 
ber which is not hyperconstructive, and let 
C = Coc 

Extending Kleene’s H, for ye O (— 356 
Recursive Functions H), Putnum has defined a 
hierarchy H, for xe C, and shown that hyper- 
constructive ordinal numbers are uniqueness 
ordinals, i.e., for x, ye C, if |x| = |y], then H, 
and H, are of the same tdegree of (recursive) 
unsolvability. E, is the type-2 object (— 356 
Recursive Functions F) introduced by T 
Tugué such that E, («)=0 if VBAx[a(B(x))=0]; 
otherwise, E,(a)=1. It is known that wc = 
œ": , where w*' is the least ordinal which is 
not the order type of any well-ordering recur- 
sive in E, (Richter [10]). 
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A. General Remarks 


A transformation of 2n + 1 variables z, x;, p; 
(j=1,2,...,n), 
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LA LZ Xs ses Mas Dis P2 -<-s Pnb 
X; = XZ, X1; X2; -5 Xps Pas P2 ++» Pn) 
JE he ai 
s Xn» Pis P2 -++ Pnb 
F= aan, 


P, = P(z, X4, X3; 


is called a contact transformation in the (n + 1)- 
dimensional space R”*! with the coordinate 
system (z,x,,...,X,) if the ttotal differential 
equation 

dz—p,dx,—p,dx,—...—p,dx,=0 (2) 
is invariant under the transformation, 1.e., if 
the equality 

= p(dz— p, dx, —p,dx.—...—p,dXx,) (3) 


holds identically for a suitable nonzero func- 
tion p of z, x;, p; Here we assume that (1) has 
an inverse transformation. Using Lagrange’s 
bracket 


a= (42) (1), 


fN S of 
dx) dx, oz 


we see that (1) is a contact transformation if 
and only if [X;, X,]=LX;, Z] =[P,, P,J=9, 
[P;, Xi] =p, LP;,Z]=pP,, where 6, is *Kro- 
necker’s delta. 

From this fact it follows that the composite 
of two contact transformations and the inverse 
transformation of a contact transformation are 
also contact transformations. Since the iden- 
tity transformation Z =z, X;,=x,, P =p;isa 
contact transformation, the set of all contact 
transformations forms an infinite-dimensional 
topological group. Given a set of scalars 
P1» +++» Pn» a pair consisting of a point (z, x;) 
and an n-dimensional hyperplane z* —z= 
Zr- p(x} — x;) in an (n+ 1)-dimensional space 
is called a hypersurface element, and the set 
of hypersurface elements satisfying (2) is called 
a union of hypersurface elements. Using these 
concepts, we can state that a transformation 
(1) of coordinates z, x; p; (j=1,2,...,n) isa 
contact transformation if it transforms each 
union of hypersurface elements into another 
one. Consequently, if two n-dimensional 
hypersurfaces are tangent at a point (z, x,) in 
the (n+ 1)-dimensional space, their images 
under a contact transformation, which are 
again two n-dimensional hypersurfaces, are 
tangent at the image point of (z, x,). The name 
“contact transformation” is derived from this 
fact. 

For instance, the tcorrelation with respect 
to a hypersurface of the second order gives a 
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contact transformation. In fact, from the re- 
lation between ‘poles and tpolar lines with 
respect to the parabola x* +2y=0 in a plane, 
we have Legendre’s transformation X = — p, 
Y=xp—y, P= —x(p=—x). 

In general, an invertible transformation 
defined by the three relations Q(x, y, X, Y)=0, 
0Q/éX + PEQ/0Y =0, 6Q/éx + peQ/dy=0 
derived from a function Q(x, y, X, Y) is a con- 
tact transformation. The function Q is called 
the generating function of this transformation. 
In this transformation, to each point (Xo, Vo) 
there corresponds a curve Q(X9, Yo, X, Y)= 
0. These results are valid also in the case of 
several variables. For instance, in an (n+ 1)- 
dimensional space, a transformation Z =z — 
X1Py— ++» —XyPys Xi = Py, Xy = Py Kya = 
Xiri testy = Xp tr = — hiss PS ky 
Pi4,=Pyi1,---,P, =p, represents a contact 
transformation. Here v is an integer between 
1 and n. In the case n=2, v=2, this transfor- 
mation reduces to a Legendre transforma- 
tion; and in the case n=2, v=1, it is called 
Ampère’s transformation (— Appendix A, 
Table 15.IV). 


B. Canonical Transformations 


A transformation of 2n variables x,, p; (j= 
1,2,...,n) 


X;=X (x1; X2, Hig Xn Pio P2 ++- Pn) 
P= P(x1, X25 -0-3 Xm P1 P2 -<-s Pn) 

jJ=1,2,...,n, (4) 
is called a canonical transformation if the tdif- 
ferential form Xr- (P,dX;— p;dx;) is texact in 


Xj, Pj, Le., if there exists a function U of x, p 
such that 


2 (PdX;—pdx)=4dU. (5) 
£ 


Let (1) denote a contact transformation, and 
let 2 denote a new variable which is different 
from zero. Set Xo =z, Po =A, X= X; Dj= — AD; 
(j=1,2,...,n) and define a transformation of 
(2n+ 2) variables by 


Xo(X, P)=Z(z, x, p), 
Po(X, P) = Po/p(z, x, p), XX, P) = Xj(x, p), 
P(X, P) = — Po Pilz, x, p)/p(z, x, p), 
j=1,2,...,n, (6) 


where x, p, X, and p denote points (x4, X2, 


3 Xn) (Pis P25 <--> Pn) (Xos X1- Xn) and (Po, 
Pis -s Pa), respectively. Then the identity (3) 
when multiplied by 4 on both sides becomes 


P dX +P, dX, +...+P,dX, 


=PodXot+p,dx,+...+p,dx,. (7) 
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Thus the transformation (6) represents a 
canonical transformation (U = constant). 
Moreover, from the definition the relation 
(6) is homogeneous, i.e., for all 440, 
X(%,up)=X,(X,p), P(X, up) = P(X, p), 
j=9,1,...,n. 


Conversely, let a transformation of (2n + 2) 
variables 


X,=X,(X,p), P,= PAX, B), 


j=0,1,...,n, (8) 


be a homogeneous canonical transformation. 
Define a transformation of (2n+ 1) variables 
Z, Xj, Pi (j=1,2, ey) by 


Z(z, x, p)= Xo(Z,X 5-0-3 Xns A —AP 15 «+> ~APn)s 


1/p(z, x, p) 


Xp As —Ap,, eae 


=P, Z, Xise’ =A > 
i o(Z, X1 Pn) 


X j(Z, x, P) = Xj(Z, X15 ++ Xns A — AD +++ —ADn)s 


P,(z, x, p) 


= -Ê Bez, x,, sae 
A 


2 


, Xn A —ÀPis o — ÀPn), 


j=1,2,...,n. (9) 


Since the necessary and sufficient condition for 
(8) to be a homogeneous canonical transfor- 
mation is that (7) hold, it follows that 


A 
—(dZ —P,dX,—...—P,dX,) 
p 


=A(dz—p, dx, J eee — p, aX,). 


Thus the transformation (9) represents a con- 
tact transformation. Therefore the most general 
contact transformation of (2n + 1) variables 
and the most general homogeneous canonical 
transformation are identical concepts, differing 
only in the choice of notation. 

The necessary and sufficient condition for 
(4) to be a canonical transformation is ex- 
pressed by the relations 


(X;, X,)=9, (X; P) = ôr, (P, P,)=0, 


j,k=1,2,...,n, (10) 


where (-, +) is Poisson’s bracket, i.e., for a pair 
of functions f(x, p), g(x, p), 

n (Of dg of ch 
G=} (za i . 

jai \Ox; Op; 

Denote Poisson’s bracket in the coordinate 
system X,,X>,.-.,Xp_; Pi, Pas c00 Pa by Co N 
and for f(x, p) and g(x, p) denote their trans- 
formations by f(X, P) and g'(X, P). Then the 
relation (10) is equivalent to 


(f.g9)=(f',gY forall f and g. 
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From this fact it follows that the set of all 
canonical transformations forms an infinite- 
dimensional topological group. 

Suppose that we can take x4, X3, ++, Xn, X1> 
Xz, ..-, X„ as 2n independent variables. Denote 
by Q (x, X) the function U of (5) represented 
by x, X. By means of this Q,, 2n functions 
which give a canonical transformation (4) are 
derived from the relations 


A TA 
= — = ’ i) Oy eee . 
BE page IE Qe a i 


2 
Conversely, for Q(x, X) such that det ( 3x, aa 
#0, a transformation determined by the 
relations 


oa ,m _, 
D;= Ôx; = Ox; J= et) 1, 


represents a canonical transformation. For 
a general canonical transformation, x4, X2, 
a, Xp X1, X2, --. Xn are not necessarily 2n 
independent variables. But for a canonical 
transformation there exists n new variables 
Xip Xise Xi P P, „such that x4, 


t Ji? Ja 
E E PE Mp vee kis By R 


jo 
are 2n independent variables, where {i,,i,, 
wees dy), Gijs ---Jn-p) is a partition of the set 
(1,2, ...,n) to two disjoint subsets. Denote 
by Q, the function U— Li} X,,P,,. Then the 
transformation (4) is derived from relations 


gee =] 2 n 
j ax,” gS ere hy 
AQ, 
iAP, l=1,2, ,n—k, 
29, 
Pes. Jaiak 
" AX 


These functions Q,, Q, are called the generat- 

ing functions of a canonical transformation. 
Consider a transformation of 2n variables 

depending smoothly on a parameter t: 

X= X,(t, X1, X2, -3 Xns P19 P25 < Pah 

*2Xm Pio P25 <--> Pab 


j=1,2,...,m. (11) 


P= P(t, X1, X2,- 


If (11) represents a canonical transformation 
of 2n variables for each t, then X; and P,, 
j=1,2,...,n, satisfy, tHamilton’s canonical 
equations 


dX, ôH dP, ôH 
er E AA. X, P), = —-— ,X, P), 
r A a a 


j=1,2,...,n, (12) 


for some *Hamiltonian function H(t, x, p). 
Conversely, for any solution X;, P, of Hamil- 
ton’s canonical equations (12) depending on 
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parameters X4, X2, +- Xas PisP2> +++» Pns if the 
transformation (11) is a canonical transforma- 
tion for t=0, (11) represents a canonical trans- 
formation for each t. By a canonical trans- 
formation (4), the solutions of Hamilton’s 
canonical equations 

cad a ia 
TE N ; 


j=1,2,...,n, (13) 


are transformed to the solutions of 


aX) OK iy p MK ay py 
dt 3B O? dt 06X,°” ” 


j=1,2,...,n, (14) 


where the new Hamiltonian function is given 
by K(t, X, P)= H(t, x, p). 


C. Applications to the Integration of 
Differential Equations 


Contact transformations have applications to 
the integration of differential equations since 
they transform each union of surface elements 
into another one. 

As an example, we shall describe an outline 
of their application to a partial differential 
equation of the first order 


F(x, y, z, p,q) =9; 
p=0z/0x, q=0z/éy. (15) 


For this purpose, we regard (15) as an equation 
defining unions of surface elements and trans- 
form it into a simpler equation by means of a 
contact transformation. If the transformed 
equation can be solved, then the solution of 
the original equation can be obtained by 
means of the inverse transformation. Now, let 
z= (x, y, a,b) be a tcomplete solution of (15). 
Then (15) is reduced to Z — c =0 by the trans- 
formation generated by the function 


Q=Z—z+a(x, y, X, Y)—c=0, 


where c is a constant. In this equation the 
solution X =a, Y=b, Z =c, aP + BQ =0 (a,b, 

c, a, B are constants) plays an important role, 
and this line element will be called a charac- 
teristic line element. The characteristic line 
element satisfies equations that can be trans- 
formed by means of the inverse transformation 
into Charpit’s subsidiary equations for (15): 


dx dy _ dz 
ôF/ôp ôF/ðq peF/dp+qoF/dq 
—dp —dg 


=—— n 16 
OF/dx+poF/éz OF /0y+qeF/dz (18) 


Consequently, if we have p= p(x, y, z; a), 4 = 
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q(x, y, z; a) from the solution G(x, y, z, p,4)=a 
of (16) and F =0, then the total differential 
equation dz = pdx + qdy is tcompletely inte- 
grable, and the tgeneral solution of this equa- 
tion is a complete solution of (15). Also, if we 
know two independent solutions G(x, y, z, p,q) 
=a, H(x, y, Z, p,q)=b of (16) such that [G, H] 
=0, we can obtain a complete integral of (15) 
by eliminating p, q among the three equa- 
tions F=0, G=a, H=b. This method is called 
the tLagrange-Charpit method, which ts ap- 
plicable also to the equation F(z, x,,...,%p3 
Pi>-++sPn) = 0 (> 322 Partial Differential Equa- 
tions (Methods of Integration)). 


D. Applications to Analytical Dynamics 


Consider Hamilton’s canonical equations (13). 
An example of system (13) is the equation of 
motion for a dynamical system which is de- 
rived from ôf L(x;, x;)dt =0. Here the x; denote 
generalized coordinates, p;=0L/éx,; and H = 
2j- jXj— L. 

Let S{(xX4, X25 -003 Xn Xio X25- Xn) be a 
solution depending on parameters X,, X3, 
..., X, Of the tHamilton-Jacobi equation 


os os os 
Hi ot, Xis Xea Xis ` nh 
OX, OX, OX, 
= K(X, X>,...,Xp) (17) 
2 
such that det ( ss =) #0. Then by a canon- 


ical transformation of a generating function S, 
i.e., by a transformation determined by the 
relations 


és os 
Pi ~ Bx," a” ax: j= 1,2... oN, 


the system (13) is transformed to (14). Here 


Ga 
K(t,X, P)=H| t,x, —— )}=K(X). 
ôx 


Then (14) becomes 

dx, dP, OK 

— =0, ae 5 JEL; 2a yf, 
dt dt Ox; 


and the solutions are 


OK 
P(t) = P(0)—t— 


X;(t)= X,(0), OX;\x=x0) 
jlx= 


In particular, a transformation that makes 
K =0 is called a transformation to an equilib- 
rium system. Thus solving Hamilton’s canon- 
ical equations reduces to finding a tgeneral 
solution of the Hamilton-Jacobi equation (17). 
When the variables of the Hamilton-Jacobi 
equation (17) are totally separable we can 
obtain a general solution of (17) by tquadra- 
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tures. Then one can integrate by quadratures 
the system of equations (13). For many im- 
portant problems exact solutions are obtained 
by this method (— 271 Mechanics). 

Furthermore, every mapping in an optical 
system is a canonical transformation; in optics, 
the quantity corresponding to S is called an 
eikonal (— 180 Geometric Optics). 
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A. The Notion of Continued Fractions 
Let {b,} (n=0,...,m) and {c,} (n=1,...,m) be 


finite sequences of elements in a ffield F. A 
fraction of the form 


C2 


Cm~1 
wot 


c 
bm- +#— 
ae 


m 


is called a finite continued fraction. It expresses 
an element in the field F unless devision by 0 
occurs in the process of reduction. Symboli- 
cally, it is also written in the forms 


bed ore Cm-1 Cm 
"i by by bb, +B, 
Cy cal Cm-1l Cm 
$+ fe += 
0 > 
lb; |b, [Bm—1 LD 
c Crete ies 
b+ 424 LHZ, 


etc., or more briefly, 


bo + fail 
: b, ae 
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If {b,} (n=0,1,...) and {c,} (n=1,2,...) are 
infinite sequences, the expression 


Cy 


Ca 


boven 


is called an infinite continued fraction. By 
analogy with the finite case, it is expressed by 


gii C; Cn 
hrt by te +b, tes: 
C wo 
or by by + e] . 
b, n=1 
For an infinite continued fraction, the quantity 
Ci Ch 
k, = bo +— —, kjy=b 
n 0 b; af at +b, 0 0 


is called its nth convergent, by is called the 
initial term, and c, and b, (n> 1) are called the 
partial numerator and partial denominator, 
respectively. If F is a ttopological field (e.g., 
the real or complex number field) and the se- 
quence {k,} of its elements converges, then the 
infinite continued fraction is said to converge, 
and the limit is called its value. 

A finite or infinite continued fraction in 
which c, (n> 1) are all equal to 1, bọ is a ra- 
tional integer, and b, (n> 1) are all positive 
rational integers is called a simple continued 
fraction. It is expressed by [bp,b,,... ]. bn is 
often called the nth partial quotient. In the 
following paragraphs we shall mostly discuss 
simple continued fractions. 

For a real number x we mean by [x] the 
greatest integer not exceeding x. [ ] is called 
the Gauss symbol. Let œw be any given real 
number, and put 


1 
w=bob+—, ba=Llo]; 
on 


l 
O,=b,+ i 
On+1 


bb=lo,]) n=1,2,.... 
Then an expansion of w into a simple con- 
tinued fraction 


1 l 


i 0 bite tb, + 


is obtained. If œ is irrational, this expansion is 
determined uniquely. Conversely, any infinite 
simple continued fraction converges to an 
irrational number. If œ is rational, the process 
is interrupted at a finite step (w,,=5,,), result- 
ing in 

b 1 1 
w= F pede 
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An alternative representation of a rational 
number by a simple continued fraction is given 
by replacing b,, above by (6,,—1)+ 1/1. 

Examples of infinite simple continued frac- 
tions are 


ee ae P 1 1 
3p+ e +(2na+1)p+ 


er] 
where p is a natural number (J. H. Lambert), 
and 

1 1 1 1 1 1 


e=2+- z = - — > 
1+2+1+-+1+2n+1+- 


(L. Euler). 


B. Convergents 


Let the nth convergent of a simple continued 
fraction be expressed in the form of an irreduc- 
ible fraction 


P, b ip >0 
SS os — 7,3 na > 
QO, ° byte +b, 


and for convenience put P_,=0, P_,=1, 
Q_,=1, Q_,=0. Then we have the recurrence 
relations 


P = bpa Pa-1 t+ Pa—2s Qn=PnQn-1 + On-a5 
n>0, 

whence follows 

P,Qn-1— Py-1On=(—1)", n> —1. 


Any simple continued fraction represents a 
real number œw which satisfies 


DOnt Q, + Qn-1) 


in terms of the notation defined in this and the 
previous section. 

In particular, let w be an irrational number. 
Then each of the fractions 


o=(a,4,P, + P,- 


PP PaatkPa k=1,2,... 
OF Q,-2 + kQn-1 

which are inserted between two conver- 

gents Pa-2/Qn-2 and P,/Q, =(P,-2+5,P,-1)/ 
(Q,-.+,Q,-,), is called an intermediate con- 
vergent, while the original convergent P,/Q,, is 
called a principal convergent. 

If a fraction P/Q approximating an irra- 
tional number w satisfies |œ — P/Q| <|@— p/q| 
for any other fraction p/q with q <Q, then it is 
said to give the best approximation. The frac- 
tion giving the best approximation of w is 
always a principal or intermediate convergent 
P®©/Q of œw with k> b,/2 or k=b,/2, O,> 
On-1 Oy. 

The convergents satisfy the relation P,/Q, — 
P,-1/Q,-1=(—1)""!Q,Q,-15 hence the se- 
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quence {P>,/Q>,} (resp. {P2n+1/Qon+1}) is 
monotonically increasing (decreasing). Ap- 
proximation by convergents is shown in the 
relations 


| P, 1 | n P,- 
n < ry -— |< > 
Qn Qn+1Qn QO, Q,-1 
P, o (—1)" 
o=lim—=b)+ } ) 
n> Qn n= oQn+1 Qn 


In particular, |œ — P,/Q,|<1/Q2. On the other 
hand, if P, Q are relatively prime positive 
integers with |œ — P/Q| < 1/20°, then P/Q is a 
convergent to œw (A. M. Legendre). There are 
several results concerning the measure of 
approximation by convergents. Any œw satisfies 

—P,/Q,\< 14/502) for infinitely many n, 
while there exists an œ which satisfies |œ — 
P./Q,„|< 1/402) for only a finite number of n 
provided that A> J5 (e.g, @= go A. Hur- 
witz); at least one of two adjacent convergents 
satisfies the inequality |œ — P,/Q,|<1/(2Q2) (K. 
T. Vahlen); at least one of three consecutive 
convergents satisfies the inequality |œ — P,/Q,| 
< 1/(\/3Q3) (E. Borel); |o — P/Q| <1/,/8Q? 
has infinitely many rational solutions P/Q 
whenever w is not tequivalent to RE +1)/2 
(Hurwitz). One calls an irrational number œ 
badly approximable if there is a constant c= 
c(@)>0 such that |œ — P/O|>c/Q? for every 
rational P/Q. œ is badly approximable pre- 
cisely if the partial quotients of the continued 
fraction expansion of œ are bounded. In partic- 
ular, real quadratic irrationals have bounded 
partial quotients (— Section C; 182 Geometry 
of Numbers F). 


C. Quadratic Irrationals 


If an infinite simple continued fraction 

[bo, 5, ... ] satisfies bmk = bme, V= 

0, 1,2,...), it is called a periodic continued 
fraction and is denoted by the symbol 

[bo bis <-s Bms -+s Om+k_1)- According as m=0 
or m> 1, the periodic continued fraction is said 
to be pure or mixed, and the sequence bp, 
Din+1>+++>0m+y%-1 18 called a period. In order for 
the continued fraction of œ to be periodic, it is 
necessary and sufficient that œ be a quadratic 
irrational number, i.e., a root of ax? +bx+c 
=Q with rational integral coefficients a, b, c 
and nonsquare discriminant b? —4ac (J. L. 
Lagrange). In order for œw to be represented by 
a purely periodic continued fraction, it is neces- 
sary and sufficient that w be an irreducible 
quadratic irrational number, i.e., that œ satisfy 
œ> 1 and 0>% > —1, where w’ is the conju- 
gate root of œ (E. Galois). In order for œw to be 
equal to the square root of a nonsquare ra- 
tional number, it is necessary and sufficient 
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that its continued fraction be of the form 
[bo, by, ---, bg-1, 209 ] (Legendre). 


D. Application to Pell’s Equation 


Let ax —by=1 ((a,b)=1) be a tDiophan- 

tine equation of the first degree, and a/b= 

[bo, bis «++ 5 bm] = Pan/Qm- Since PnQm-1— 

Pa-1 Om=(—1)""', a solution of the equation 
is given by xo =(—1)" 'Qms Vo=(— D"! Pry: 
The general solution is then represented in the 
form xọ + bt, yo tat (te Z). This method of 
obtaining a solution is essentially the same 

as the method which uses the tEuclidean 
algorithm. 

Pell’s equation x? — Dy? = 1 (D is a non- 
square integer > 1) was solved by Lagrange in 
terms of continued fractions. If the length of 
the period of D is k, all positive solutions of 
Pell’s equation are given by x= P3,,_,, y= 
Ooy,-1 If k is odd, and by x=Py-1, Y= Qvr-1 
if k is even (v=1,2,...), where P,/Q, denotes 
the nth convergent of the continued fraction 
expansion of JD. Incidentally, x = Poy—1yk-15 
y=Qeoy-1ye-1 V= 1, 2,...) are the positive 
solutions of x? — Dy? = —1 provided that k 
is odd. There are no solutions of x? — Dy? = 
+1 other than x,,y,(v=1,2,...) given by 
(x, +D y =x, + Dy, where x,, y, 1S 
the least positive solution. For instance, the 
least positive solution of x? —21ly?=1 is x= 
278,354,373,650, y = 19,162,705,353. 

Lagrange made further use of continued 
fractions in order to obtain approximate 
values of roots of algebraic equations. The 
method is especially useful for precise compu- 
tation of neighboring roots. 

The theory of continued fractions may be 
investigated geometrically making use of lat- 
tices ([2,3]; — 182 Geometry of Numbers) (F. 
Klein, G. Humbert). For instance, a measure 
of approximation of P,/Q,, to œw in Diophantine 
approximation is represented by the closeness 
of a lattice point (P,, Q,,) to the straight line y 
=x on the plane. 


E. Continued Fractions with Variable Terms 


There are few results on continued fractions 
with variable terms. It is noteworthy that 
from the expansion of tanz into a continued 
fraction 

Z 2 = A 


tanz=— 
eee ee ae ee T 


(Lambert), the irrationality of x and of tanz 
for rational z (40) can be deduced (A. 
Pringsheim). 

Among continued fractions with variable 
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terms, those of the form 


Ay Z AyZ 


TERI ead {cans 


[ag,4,2]}° = 


are called normal continued fractions. Let the 
convergent of such a continued fraction be 
P,(z)/Q,(z), and for convenience put P_,(z)= 
0, O_,(z)=1. Then we have the recurrence 
formulas 


P,(Z) = P,- (z)+4,2P,—2(2), 
O,,(z) = Q,-1(2) +4,Z0,,-2(2), ne l. 


There are the further relations 


P0,- 8) Py (2) Qn (2) =(— 17" 4. 


[a a z]?= 5 (—1)"2" I ; 
0> LS a aay a v 
" n=0 Q,-1(Z)Qn(Z) v=0 
where the latter is formal. Let the tpower series 
expansion of the nth convergent of [ao, anz] i 
be 


Then b,,, =bn, (0<v<m<n). If [ao,a,z]? has a 
power series expansion about the origin, then 


eo) 
[ao, anz] 7 X banz”. 
n=0 


If the supremum g of {|a,|}{° is finite, then 
[ao,4,Z] converges uniformly for |z| < (1/4) 9, 
and hence it represents an tanalytic function 
which is holomorphic in |z| <(1/4)g. 
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84 (X.2) 
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A. General Remarks 


The notion of continuity is defined for a map- 
ping or a function f: XY from a topolog- 
ical space X to a topological space Y (— 425 
Topological Spaces G). In the present article, 
however, we are concerned mainly with the 
case where both X and Y are tmetric spaces 
with the distances py and py, respectively. The 
most usual case is X =R” (Euclidean space), 
Y=R (real numbers). 

A function f: X — Y is said to be continuous 
at a point x,¢X if for every positive number e, 
we can select a suitable positive number 6 
(depending on € and also on xo) such that 
Px(X, Xo) <ô implies py( f(x), f(xo)) <¢. This is 
equivalent to the condition that xx, implies 
f(x) f (Xo) (— 87 Convergence). We call f 
continuous (on X) if it is continuous at every 
point x, of X. If for every positive number g, 
we can select a suitable positive number 6 
independent of x and y such that p,(x, y)< ô 
implies py( f(x), f(y))<e for all x, ye X, we call 
f uniformly continuous on X. 

The tsupremum (6) of py( f(x), f(y)) for 
x, ye X satisfying py(x, y)<6 is called the 
modulus of continuity of the function f in X. 
Uniform continuity means that w(d) 90 for 
60. 

If w(6)< Mô" for suitable constants M, «> 
0, that is, if the inequality py(f(x), f(y) < 
M(px(x, y))* holds for x, ye X, then f is said 
to satisfy the Hélder condition of order «x, also 
known as the Lipschitz condition of order «. 

If «= 1, this condition is called simply the 
Lipschitz condition. A function satisfying one 
of these conditions is uniformly continuous. 
The family of functions satisfying the Lip- 
schitz condition of order « is sometimes de- 
noted by Lipa. 

In general, the composite function go f: 

X >Z is continuous if both functions f: X >Y 
and g: YZ are continuous. If the ranges of f, 
g are both the real field R (or the complex field 
C, or more generally a ttopological field), then 
fxg, fg are continuous if f and g are contin- 
uous; and f/g is continuous provided that 
g(x) #0. If R is the range of both f and g, then 
min( f, g) and max(f, g) are continuous when f 
and g are continuous. If X is tconnected (for 
example, an interval J in R) and if f is contin- 
uous, the image f(X) is also connected. 


B. Continuity from One Side 


In this section, we always assume that the 
domain X is an interval J in R and f is a func- 


84C 
Continuous Functions 


tion from I to a metric space Y. A point xo 

of X is called a discontinuity (point) of the 

first kind of f if both limits lim,,,, f(x) and 
lim,,,,, f(x) exist in Y and are different. Then 
we say also that f has a jump (or gap) at Xo. If 
these two limits exist and have the same value, 
then f is continuous at xo. 

We say that f has a discontinuity of at most 
the first kind at x, if f is continuous at x or if 
Xo is a discontinuity of the first kind for f. 
(Sometimes the phrase “discontinuity of the 
first kind” is used to mean “discontinuity of at 
most the first kind.”) A discontinuity point of f 
(i.e., a point at which f is not continuous) that 
is not of the first kind is called a discontinu- 
ity point of the second kind.When lim, x, f(x) 
= f (Xo), we call f right continuous (or con- 
tinuous from the right) at xo. In this case, 
lim, 1x, f(x) need not exist. Replacing x| xo by 


~ xftxX 9, we can similarly define the concept of 


being left continuous (or continuous from the 
left). If a function f has a finite number of 
discontinuity points of the first kind in the 
interval [a,b] and is continuous at all other 
points, we call f a piecewise continuous func- 
tion in [a,b]. 


C. Semicontinuous Functions 


In this section, we assume that the domain of 
the functions is a subset E of a metric space X, 
and that the range is the set of real numbers 
extended to include +00. Let x be a point in 
the closure of E. We denote by M(x, ô) and 
m(x, 6), respectively, the supremum and the 
infimum of the values of a given function f in 
the 6-neighborhood of x. We put 


M(x) =lim M(x,6), m(x) =lim m(x, 0), 

and call them the upper limit function and 
lower limit function, respectively. We have 

~o <m(x)< M(x)< +00. If M(x9)=f(Xo) at 
Xo€ E, then f is called upper semicontinuous at 
Xo. If m(xo)=f(Xo) at XoEE (Le., if —f is upper 
semicontinuous at xo), then f is called lower 
semicontinuous at x). The function with one 
of these two properties is said to be semicon- 
tinuous at Xo. 

Either of the following two conditions is 
necessary and sufficient for the function f to 
be upper semicontinuous at X9éE: (1) f(xo)= 
+, or for every constant 4 such that f(xo)< 
2, there exists a 6-neighborhood such that 
M (xo, 6) <2. (2) For every sequence x, of E 
converging to xo we have limsup,,,., J (Xn) = 
F(X). 

A function f is called upper (lower) semi- 
continuous in £ if it is upper (lower) semicon- 
tinuous at every point xe E. A necessary and 
sufficient condition for the upper semicontinu- 
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ity of the function f(x) in E is that {x| f(x) <a} 
be a frelatively open set in E for every real 
number «. We can define semicontinuity for 
functions on a topological space by using this 
latter property. 

A real-valued function f(x) is continuous at 
Xo €E if and only if it is upper and lower semi- 
continuous at x and f(x) is finite. A function 
J(x) ts continuous on E if and only if it takes 
finite real values on E and both {x|f(x)<a!} 
and {x|f(x)>qa} are relatively open in E for 
any real number a. When E is tcompact, an 
upper (lower) semicontinuous function on E 
attains its (supremum) infimum at a point in 
E. In particular, a continuous function on a 
compact set E is bounded and assumes its 
maximum and minimum on E (Weierstrass’s 
theorem). Furthermore, if E is connected (e.g., 
the interval J in R), it follows from the con- 
nectedness of the image f(E) that if «, Be 
f(E) and y lies between « and $, then ye f(E) 
(intermediate-value theorem). 

A real-valued function f(x) ona set E of R 
satisfies the Lipschitz condition if it is tdiffer- 
entiable and the derivative is bounded. Such a 
function is also tabsolutely continuous, con- 
tinuous, and of tbounded variation. (For the 
polynomial approximation of real continuous 
functions — 336 Polynomial Approximation.) 

The limit function f(x) of a monotone de- 
creasing sequence of upper semicontinuous 
functions f,,(x) is also upper semicontinuous. 
The limit function f(x) of a uniformly converg- 
ing sequence of continuous functions is con- 
tinuous. (Regarding the tequicontinuous family 
of functions — 435 Uniform Convergence.) 


D. Baire Functions 


The limit function of a pointwise converging 
sequence of continuous functions defined on a 
metric space X is not necessarily continuous. 
R. Baire (Ann. Mat. Pura Appl., (1899)) intro- 
duced the notion of Baire functions as follows: 
He named continuous functions the functions 
of class 0. Then he called a function that is a 
pointwise limit of a sequence of continuous 
functions a function of at most class 1. A func- 
tion is said to be of class 1 if it is of at most 
class 1 and is not of class 0. He similarly de- 
fined the notion of class n for arbitrary natural 
number n. 

Further, a function is said to be of at most 
class œ if it is a pointwise limit of a sequence of 
functions of class n, for a sequence of natural 
numbers n,. A function is said to be of class 
œ if it is of at most class œw and is not of class 
n for any finite number n. In general, using 
ttransfinite induction, we can define the notion 
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of functions of class é for an arbitrary tordinal 
number é [4]. 

All these functions are called Baire functions. 
Actually, there is no function of class & for an 
uncountable ordinal number €. If X is a tper- 
fect set in Euclidean space, then there is actu- 
ally a function defined on X of class č for an 
arbitrary countable ordinal number €. Here- 
after, we shall be concerned with this case only. 

If X has the cardinality of the fcontinuum, 
then the set of all Baire functions on X has the 
cardinality of the continuum. On the other 
hand, the cardinality of all functions on X is 
actually greater than that of the continuum. 
Hence there exist functions that are not Baire 
functions on X. A function is a Baire function 
if and only if it is Borel measurable (H. Le- 
besgue). Therefore a necessary and sufficient 
condition for a function f to be a Baire func- 
tion is that the set (x| f(x)>a,xeX} be a Borel 
set for any real number « (— 270 Measure 
Theory J). The limit of a countable sequence of 
Baire functions is also a Baire function. If f(x) 
and g(x) are of at most class « on X, then the 
following functions are also of at most class 
a [Fd FO) +900), £00) g), and f(x0/g(x) 
(provided that g(x) 40 on X). 

The condition that a function f is of at most 
class 1 on X is equivalent to either of the 
following two conditions: (1) For any closed 
subset F of X, the restriction f* of f to F has 
a continuity point in F. (2) For every real 
number a, the set {x|f(x)<a,xeX} is an tF, 
set (Baire). In a fcomplete metric space, a 
necessary and sufficient condition for a func- 
tion f to be of at most class 1 is that the set of 
continuity points be dense in X. 

For example, the Dirichlet function, which 
takes the value 1 at rational points and 0 at 
irrational points, is expressed as 


lim (im (cosy! my"), 
you \ koa 

which is of class 2. A function f(x, y) of two 
real variables that is continuous in each vari- 
able x and y separately is a function of at most 
class 1. 
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85 (VI.19) 
Continuous Geometry 


A. General Remarks 


The structure of a tprojective geometry is 
determined by the tlattice (lattice-ordered set) 
of subspaces of the projective space. For this 
reason, this lattice itself is sometimes called 

a projective geometry. The concept of con- 
tinuous geometry was introduced by J. von 
Neumann as an abstraction of lattice-theoretic 
properties from a special class of tlattices 
(lattice-ordered sets) which he encountered in 
his research on foperator rings in Hilbert 
spaces [1]. Continuous geometry contains 
projective geometry as a special case when the 
dimension is discrete; but more usually the 
lattices are of continuous dimension. 

A continuous geometry is a tcomplete and 
tcomplemented tmodular lattice L (— 243 
Lattices F) that satisfies the following prop- 
erty and its dual (both called properties of 
continuity): For any element a of L and any 
subset W of L which is twell ordered with 
respect to the ordering in L, we have aN supw 
=sup(a N w) (we W). The tcenter Z of the 
lattice L is called the center of the continuous 
geometry L, which is said to be irreducible 
when Z has no elements other than the tleast 
element 0 and the fgreatest element J; other- 
wise L is said to be reducible. A reducible 
continuous geometry is isomorphic to a sub- 
lattice of a tdirect product of irreducible con- 
tinuous geometries. 

On any continuous geometry L, there can 
be defined a function d(x) whose values belong 
to a complete lattice-ordered linear space M 
and which satisfies the following four con- 
ditions: (1) d(x) > 0; (2) d(x)=d(y) implies the 
existence of a common complement of x and y; 
(3) d(xU y)+d(xl y) = d(x) + d(y); (4) supd(w) = 
d(sup w) (we W) for any subset W of L which 
is well ordered with respect to the ordering in 
L. Such a function d(x) is called a dimension 
function on L. Irreducibility of L is equivalent 
to the property that a real-valued dimension 
function can be introduced; in this case, if d(w) 
takes only a finite number of values, then L is 
a finite-dimensional projective geometry; on 
the other hand, if d(w) takes every number in 
the closed interval [0,1], then L is called a 
continuous geometry in the strict sense. An 
example of the latter can be constructed as a 
limit of a sequence of projective geometries of 
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increasing dimensions; another example is the 
lattice of projection operators of the tvon 
Neumann algebra of type II,. If a group G of 
tautomorphisms of L is given, there can be 
introduced a generalized dimension function 
which is invariant under G satisfying slightly 
weaker conditions (T. Iwamura [3]). 


B. Representation of Continuous Geometry 


A ring R is called a regular ring if it has a unity 
element and if, for any element a of R, there 
exists an element x in R such that axa=a. A 
continuous geometry L is isomorphic to the 
lattice (with > as its ordering) of tprincipal left 
ideals of a regular ring R provided that d(I) 
=n: d(x) (n>4) for some natural number n 
and some element x of L. The decomposition 
of R into a tdirect sum of ideals corresponds 
to the decomposition of L into a direct prod- 
uct of lattices. The condition that L is irreduc- 
ible and finite-dimensional is equivalent to 

the condition that R is a matrix ring over a 
tskew field; when these conditions hold and L 
is considered as a projective geometry, then 
the coordinates are given by this skew field. In 
continuous geometries, join and meet are often 
denoted by the symbols for sum and product, 
respectively, and sometimes a direct product of 
continuous geometries is called their direct 
sum. Sometimes, the requirement that a con- 
tinuous geometry be complete is weakened to 
the requirement that it be tconditionally ø- 
complete. 
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Control Theory 


A. General Remarks 


The classical theory of automatic control 
mostly deals with linear feedback contro! 
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systems with single input and single output. 
Mathematical structures of such systems must 
be, in principle, described in terms of ordinary 
linear differential equations with constant 
coefficients. Hence control engineers use block 
diagrams to describe systems, and operational 
calculus based on tLaplace transforms to ob- 
tain response characteristics (= 240 Laplace 
Transform). Thus the input/output relation of 
a system is described in terms of transfer func- 
tions. The main objectives of control theory 
are to ensure system stability, to maximize 
closed-loop response characteristics by choos- 
ing the best feedback, and to obtain a desir- 
able transient response to an impulse or step 
input. One of the remarkable contributions to 
classical control theory is Nyquist’s criterion 
for stability testing of linear feedback systems 
[1]. The test consists of plotting the Nyquist 
diagram of a transfer function in the frequency 
domain (complex plane), and differs essentially 
from the tRouth-Hurwitz stability test for 
linear differential equations with constant 
coefficient. Classical control theory was almost 
complete by the end of World War II. 

Revolutionary technological innovations in 
electronics and computers and the invention of 
new control instruments and systems devel- 
oped after World War II have opened the way 
to modern control theory. Around 1960, three 
remarkable contributions were made con- 
currently; they are dynamic programming 
(proposed by R. E. Bellman [2]), Pontryagin’s 
maximum principle (L. S. Pontryagin et al. 
[3]), and linear system theory (R. E. Kalman 
[4]). The first two give rise to mathematical 
tools to solve optimal control problems and to 
design optimal controllers or regulators. In 
contrast to the classical theory of control, 
optimal control problems are formulated in 
terms of the system of linear or nonlinear 
multivariable differential equations with multi- 
input forcing terms called control variables. 
This leads to the state-space approach, which 
has become ubiquitous in modern control 
theory. Linear system theory derives from the 
concepts of controllability and observability. 
The former was introduced by Pontryagin et 
al. [3] and later modified by Kalman [5], and 
the latter was introduced by Kalman [5]. 
Controllability and observability are con- 
cerned with the interrelation between internal 
states of a system and its inputs and outputs. 
Hence linear system theory, profoundly related 
to the state-space approach, remains a prin- 
cipal theme of modern control theory. More 
advanced control theories, such as adaptive 
control or learning control, are mostly de- 
scribed and formulated in the framework of 
linear system theory. 

Modern control theory has stimulated 
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the development of cybernetics, the science 
of control and communication, and con- 
trol theory is important in the information 
sciences. In fact, current control theory has 
many features that might be considered to 
belong as well to other areas, such as mathe- 
matical programming (— 264 Mathematical 
Programming), operations research (— 307 
Operations Research), game theory (— 173 
Game Theory), prediction and filtering theory, 
digital signal processing, circuit theory, and 
computer or microprocessor technology. 
Current control theory embodied in modern 
instrumentation and computer technology has 
a vast range of applications. We do not con- 
sider these applications here; we confine our- 
selves to some important theories that con- 
tinue to be major topics in control theory. 


B. Linear Dynamical System 


Let the state of a physical process to be con- 
trolled be represented by a real n-dimensional 
column vector x(t)=(x,(t),...,x,(t))'€R", 
where T denotes the transpose of a vector or 
matrix. If the state is determined by a differen- 
tial system with input u(t)= (u; (t), ...,u,(t))" € 
R’ and output y(t)=(y,(0), ..., Ya (t) ER”, 


© x(t)= A(t)x(t) + B(t)u(t), 


y(t)= C()x(t), (1) 


then the process is called a linear dynamical 
system or simply a linear system. The dimen- 
sions of the coefficient matrices are given by 


A(t)eR™™", B(t)eR™*", C(HeR”””, 


If A(t), B(t), and C(t) are constant matrices, the 
system described by 


d 
X=—x=Ax+4+ Bu, 
dt 


y=Cx (2) 


is called a linear time-invariant system. Corre- 
spondingly, the system described by equation 
(1) is called a linear time-varying system. 
Given an initial state x(t 9) and input func- 

tion u(t) for tÈ tọ, the state of system (1) is 
represented by 

t 
x(t}=O(t, tosto | D(t, t)B(t) u(t) dr, 

to 
where ®(t, t) is a ffundamental solution to the 
matrix differential equation 


<(t,1)=AOE9, 


O(t,t1)=! (nxn identity matrix). 
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Hence the output of the system is described by 


Y(t) = C(A) PL, to) x(to) 


t 
ef C(t) P(t, t)B(t)u(z) dt. 
to 

It is seen from this equation that the input- 
output relation of the system is governed by 


Wit, t) = C(t)@(t, 2) B(t)ER™”", 


which is called a weighting matrix. As a spe- 
cial case, the weighting matrix for the time- 
invariant system (2) becomes 


W(t, t)= W(t —1)= Cet" 9B, 
where e“ is a matrix exponential function 
defined by 
t? t? 
Aat ER > a eee 
e =I+tAt+s A +74 +... 


The Laplace transform of the weighting matrix 
is 


G(s)= N e *W(t)dt =C(sI — A)“ B, 
(U 


which is called a transfer function matrix of the 
system (2). First we note that the weighting 
matrix W(t, t) for the system (1) is invariant 
under any linear transformation X(t)= P(t)x(t) 
of the state, and also that the transfer func- 
tion matrix G(s) for the system (2) is invariant 
under X= Px. For example, the transformation 
X= Px yields 


$= PAP!4+ PBu, 
y=CP'. (3) 


In fact, the transfer function matrix for the 
system (3) becomes equivalent to that for the 
system (2), that is, 


G(s)=CP7'(sI — PAP“)! PB=C(sI —A)'B 
= G(s). 


As a converse to this, there arises the problem 
of whether any linear time-invariant systems 
with the same transfer function matrix are 
similar to each other. To solve this problem, it 
is necessary to introduce the concept of con- 
trollability and observability for the system (1) 
or (2). 


C. Controllability and Observability 


The concept of controllability and observa- 
bility plays a fundamental role in linear system 
theory. Roughly, controllability tmplies the 
possibility of steering the state from the input. 
Definition of controllability: The linear sys- 
tem (1) is said to be controllable at time tọ if 
for any state x(tọ)e R" and xt eR" there exists a 
control input u(t), té[t,t,], that transfers the 
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state x(tg) to the state x! at a finite time t, >to. 
Otherwise, the system (1) is said to be un- 
controllable at time fo. 

The controllability theorem states: The 
system (1) is controllable at time tọ if and only 
if there exists a finite t, (> tọ) such that the 
matrix 


D(to,tı)= i Pty, t)B(E)B AD (to dt (4) 


is nonsingular. Similarly, the linear time- 
invariant system (2) is controllable if and only 
if the n x (nr) matrix 


D =[B, AB, ..., A"1B] (5) 


has rank n. The matrix D is called the con- 
trollability matrix of the system (2). 

The concept of observability is dual to that 
of controllability. It implies the possibility of 
estimating the state from the output. 

Definition of observability: The linear system 
(1) is said to be observable at tọ if for any state 
x(to)e R” there exists a finite t, (>to) such that 
the knowledge of the input u(t) and the output 
y(t) over the time interval [t,,t, ] suffices to 
determine the state x(t,). Otherwise, the sys- 
tem (1) is said to be unobservable at tọ. 

The observability theorem states: The sys- 
tem (1) is observable at time tọ if and only 
if there exists a finite t, (>to) such that the 
matrix 


M(to, t= f D(t, to) CACAH, todt (6) 


is nonsingular. Similarly, the linear time- 
invariant system (2) is observable if and only if 
the n x (nm) matrix 


M =[C7, ATC", ... (4T CT] (7) 


has rank n. The matrix M is called the ob- 
servability matrix of the system (2). 

We deal only with linear time-invariant 
systems for simplicity, although some of the 
main results to be stated here can be extended 
to linear time-varying systems. If the controlla- 
bility matrix D has rank n, (<n), there exists a 
linear transformation X = Px with nonsingular 
matrix P which transforms the system (2) into 


X, ba Ae Aiz Xe + B, 
S| ke Ae eel to” 


(8) 
y= KO Ce] Xe 
Xe 
so that the n,-dimensional subsystem 
X,=A,x,+ B.u, 
(9) 


y=C,x, 


is controllable. Clearly this subsystem has the 
same transfer function matrix as the system (2). 
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If the observability matrix M has rank n, (<n), 
there exists a linear transformation X = Qx 
that transforms the system (2) into 


X A, 0 Ix, B, 
= -H u, 
H P E i 
y=[C, O]} x, |, 
Xo’ 


so that the n,-dimensional subsystem 


X = AXo t Bu, 

(11) 
y=C,Xp 
is observable. This subsystem also has the 
same transfer function matrix as the system (2). 
Combination of these two properties yields the 
canonical decomposition theorem (found by 
Kalman [6]), which states: There is a nonsin- 
gular transformation x = Px by which the 
system (2) can be transformed into the canon- 
ical form 


Xes Aco 0 A;3 0 Xco 
New ue Az, Az2 Az3 Ara J| Xe 
Xeo 0 0 A33 0 Xorg 
Xero! 0 0 A43 Asa Xero 
Dey 
B 
+j? | u, (12) 
0 
y=[Co 0 G OJfxo |, 
Xeo 
Xeo 
x 


where the subvector x., of X is controllable 
and observable, x, is controllable but not 
observable, x., is not controllable but obser- 
vable, and x», is neither controllable nor 
observable. Furthermore, the transfer function 
matrix of the system (2) is equal to that of the 
system (12), which is described by 


G(s) m C.o(S1 os Aco) E B.o» 


that is, the transfer function matrix of the 
system (2) depends solely on the controllable 
and observable part of the state. 

A linear system (2) is said to be reducible if 
and only if there exists a linear time-invariant 
system of smaller dimension that has the same 
transfer function matrix. Otherwise, the system 
is said to be irreducible. The canonical decom- 
position theorem implies that a linear time- 
invariant system is irreducible if and only if it 
is controllable and observable. 


D. Realization Theory 


Realization theory is concerned with determin- 
ing (1) a linear time-invariant dynamical sys- 
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tem that has a prescribed rational function 
matrix or (2) a linear time-varying system that 
has a prescribed weighting matrix. Here we 
consider only the former, although there are 
many interesting results for the latter as well. 

The rational function matrix G(s) is said to 
be strictly proper if and only if G(«o)=0. A 
linear time-invariant system with a prescribed 
G(s) as a transfer function matrix is called a 
realization of G(s). A realization of G(s) with 
the least possible dimension is said to be mini- 
mal. A minimal realization should be irreduc- 
ible by definition. The fundamental theorem 
on minimal realization, which answers the 
question raised at the end of Section B, states: 
For a given strictly proper rational matrix 
G(s), any two minimal realizations of G(s) are 
mutually similar in the sense that there exists a 
linear transformation X= Px that transforms 
one of the systems into the other. 

A minimal realization of a scalar transfer 
function 

bs" 1 +...+b,-,;8+, 


i(s) = a 13 
ats) s"+a,s" '+...44, ;s+4a, co 


is given by 
0 1 0 0 
0 0 1 0 
x= x 
0 0 0 1 
“an An- an-2 ~a] 
w (14) 
0 
+]: Ju, 
0 
1 


y=Lb, By-1 by -2 b,]x. 


It is easy to see that this system is controllable. 
Hence system (14) is called a controllable 
canonical realization. Kalman [4] showed that 
this realization is observable and therefore 
minimal if and only if there is no common 
factor between the denominator and the nu- 
merator of ĝ(s). An observable canonical re- 
alization of ĝ(s) can be similarly written down 
as a dual of the system (14). 

For a given strictly proper matrix G(s), the 
degree of the least common denominator of 
all entries of G(s) is called the degree of G(s), 
provided that there is no common factor be- 
tween the denominator and the numerator 
of each entry. It was shown by Kalman [7], 
together with a realization procedure, that 
the dimension of minimal realization of G(s) 
is equal to the degree of G(s). Other impor- 
tant realization procedures for rational func- 
tion matrices were proposed by B. L. Ho and 
Kalman [8], W. A. Wolovich and P. L. Falb 
[9, 10], J. Rissanen and T. Kailath [11], and 
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B. Dickinson et al. [12]. Procedures for time- 
varying system realization for prescribed 
weighting matrices were presented by Kalman 
[4], D. C. Youla [15], and L. M. Silverman 
[16]. Realization problems for a given impulse 
response matrix were first discussed by C. A. 
Desoer and P. P. Varaiya [17], and a fast 
algorithm for discrete-time system realiza- 
tion was proposed by C. T. Mullis and R. A. 
Roberts [18]. 


E. State Estimation 


When a linear system is observable, it is pos- 
sible to construct a device, called a state esti- 
mator, that approximates the state vector. For 
the given system (2), define a linear dynamical 
system 


ż=Dz+ Ey + Gu, 
X=L,z+Ly3y (15) 


with imputs ye R” and ueR’, state ze R’, and 
output $e R”. If the system (2) is observable, 
rank (C)=m, and q2n— m, there exist matrices 
D, E, G, L4, La, and M that satisfy 


MA-DM=EC, 


L,M+L,C=I, (nxn identity matrix), 


G= MB. 
Then it follows that 
X(t)— x(t) = L,e?*[z(0)— Mx(0)]. 


Furthermore, it is possible to choose a matrix 
D which is asymptotically stable, that is, every 
eigenvalue of D has negative real part. Thus 
X(t) x(t) as t— œ, and system (15) yields a 
state estimator. This result was first shown by 
D. G. Luenberger [19, 20], and thus the system 
(15) is called a Luenberger observer. 

The Kalman filter proposed by Kalman [21] 
for discrete-time processes and by Kalman and 
R. S. Bucy [22] for continuous-time processes 
is an optimal state estimator for a stochastic 
system. Let 


X(t)= A(t)x(t)+ B(t)u(t), 
y(t) = C(t) x(t) + vlt), (16) 


in which u(t) is a white Gaussian noise (tBrown- 
ian motion) (— 45 Brownian Motion) such 


that 
Eu(t)=0, ~— Eu(t)u"(s) = U(t)d(t —), 


and v(t) is also a white Gaussian noise such 


that 
Ev(t)=0, — Ev(t)v'(s)= V(t)d(t—s), 


where E and ' denote the texpectation and the 
transpose, respectively. In addition, it is as- 
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sumed that 

Ex(to)=0,  Ex(to)x"(to)= Xo, 


and that x(to), u(t), and v(s) are uncorrelated. 
Under these conditions, the process x(t) be- 
comes a wide-sense Gauss-Markov process 
(J. L. Doob [23]). The Kalman-Bucy filter is a 
linear dynamical system described by 


(D =[A()— LA CHJA + LOW), 
where 
L(t)=P(t)}C™(t)V (e) 


and P(t) is a solution to the matrix Riccati 
differential equation 


P(t)= A(t) P(t) + P(O AT (t) + BAU (0) B1(t) 
-PMC (QV *(t)C() P(t), 
P(to) = Xo. 


The Wiener filter proposed in 1949 [24] is 
a special case of the Kalman filter in which 
the signal process is stationary and of single 
input and single output. Extensions to non- 
linear filtering were investigated first by 
H. J. Kushner [25] and subsequently by 
W. M. Wonham [26], R. S. Liptzer and A. N. 
Shiryaev [27], and M. Fuzisaki et al. [28]. 


F. Optimal Control 


Consider a nonlinear dynamical system de- 
scribed by a system of differential equations 


X= f(x,u), (17) 


where xe R" and weR’. For a given set U in 
R’, a piecewise continuous function u(t) de- 
fined over [to, t; ] is said to be an admissible 
control function if u(t)e U for every te[to, t]. 
When a functional 


= | ” folx(thule)ydt 


is given as a performance index of control 
and two points x? and x! are given in R”, the 
optimal control problem consists of finding an 
admissible function u(t) that minimizes J and 
simultaneously transforms the state from x(f9) 
=x° to x(t,)=x!'. To solve this, a necessary 
condition for optimality was developed by 

L. S. Pontryagin and his colleagues [3] and 
termed the maximum principle. It states: In 
order for an admissible control u(t) and a 
corresponding solution trajectory of equation 
(17) to be optimal, it is necessary that there 
exist a nonzero vector w(t)=(W, (0), .-..W,(0)" 
such that (1) y(t) satisfies 


6H (y(t), x(t), u(t) 


w(t)h= axlo : 
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where 


H(, x,u)= — fo(x,u) + "f(x, u), 


(ii) the function H (y(t), x(t), u) of the variable 
ue U attains its maximum at the point u=u(t) 
almost everywhere in [to, t, ], and (iii) at the 
terminal time ft, it holds that 


H(W(t,), x(t), u(t,))=0. 


When f(x, u) is linear in x and u, fo(x,wW=1, 
and t, is free, we have the time-optimal control 
problem investigated by Bellman et al. [29] 
and J. P. LaSalle [30]. When f(x, u) is instead 
a quadratic function of x and u, we have the 
optimal regulator problem solved by Kalman 
[31]. The relation between the maximum 
principle and the fcalculus of variations (— 46 
Calculus of Variations) has been pointed out 
by L. D. Berkovitz [32]. Generalizations of 
the maximum principle and other results 
on optimal control problems are found in 
[33-35]. 


G. Miscellany 


There are many other areas of control theory. 
For details, we refer the reader to [36] for 
adaptive control, [37] for stability theory, and 
[38] for system identification. 
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A. Introduction 


The notion of convergence was first intro- 
duced in the real number system to deal with 
sequences of numbers, functions, series, or 
definite integrals (— 379 Series; 216 Integral 
Calculus). The notion was then extended to 
the case of generalized sequences where the 
index moves over a directed set, and the terms 
are in a topological space. 


B. Convergence of Sequences of Numbers 


A sequence {a,} of numbers is said to be con- 
vergent to a number a or to converge to a, 

written lim,_,,,@,=4@ Or a,>a as n> œ, if for 
any positive number g we can choose a (suffi- 
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ciently large) natural number no such that for 
every n larger than no the inequality |a,—a| 
<e holds. Then a is called the limit (or limit 
point) of the sequence {a,}. If {a,' has a limit, 
it is unique. A sequence which is not conver- 
gent is said to be divergent or to diverge. 

A set A of real numbers is said to be 
bounded from above if there is a real number b 
such that a <b for all ae A, bounded from 
below if there is a real number c such that 
a>c for all ae A, and bounded if it is bounded 
from above and below. A sequence {a,} of real 
numbers is said to be monotonically increasing 
(monotonically decreasing), written a, ft (a, |), 
if a, SA, S... $A, Sans, S... (4, 2a,2...2 
A, > A,+,=2...). A monotonically increasing or 
decreasing sequence is called a monotone 
sequence. 


C. Criteria for the Convergence of Sequences 
of Numbers 


Every bounded monotone sequence of real 
numbers is convergent; its limit is sup{a,} 
(inf {a,,}) (— 355 Real Numbers B) if it is 
monotonically increasing (decreasing). For 
any bounded sequence {a,' of real num- 
bers, setting a, =inf{a,,a,,,,-..} and B,= 
sup{a,,4,41,.-.}, we have xf, Bad, and 
a, <a, S fh Hence liM, o %,=« (=sup{a,}) 
and lim,,,.. B, = B (= inf{ B,}) exist. æ is called 
the inferior limit (or limit inferior) of {a,}, 
written lim inf, ,., a, or lim,_,,. ân, while f 
is called the superior limit (or limit supe- 
rior), written lim supp» a, or lim,_,,, a, If 
lim sup,,_,.o 4, =liminf,_,,, @,=4%, then lim, ,,. a, 
exists and equals a. The limit of a conver- 
gent subsequence of a sequence {a,} of num- 
bers is called an accumulation point of the 
sequence. Here we should distinguish an ac- 
cumulation point of a sequence {a,} from 
an accumulation point of {a,} viewed as 
a set (— 425 Topological Spaces O); for ex- 
ample, if a, =1 for every n, then the former 
is 1 and the latter does not exist. For any 
bounded sequence of real numbers, its supe- 
rior (inferior) limit is the maximum (minimum) 
of its accumulation points. Moreover, if $ is 
the superior limit of a sequence {a,}, then for 
any positive number g, there exist only a finite 
number of n’s for which a, is greater than f +e, 
while there may exist an infinite number of n’s 
for which a, is less than B —¢. The inferior limit 
of the sequence has a similar property. 
Suppose that we are given a sequence {a,} 
of real numbers and that there exist two se- 
quences {u,} and {v,} such that u, < a, S Vy, 
lim(u, —v,} =0, {u,} is monotonically increas- 
ing, and {v,} is monotonically decreasing. 
Then lim a, exists and is equal to lim u, = 
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lim v, (principle of nested intervals). In partic- 
ular, if lim sup a, =lim infa,, then lim a,, exists. 
The converse also holds. 

If {a,} is convergent, then |a, —a,,|>0 as n, 
m-— oo, and vice versa; that is, {a,,} is conver- 
gent if and only if for any positive number e 
there exists a positive integer no such that 
|an —Gm|<é for all n, m > no (Cauchy’s 
criterion). 


D. Infinity 


For a set A of real numbers, the expression 
sup A= +% means that A is not bounded 
from above; inf A= —oo means that A is not 
bounded from below. For a sequence {a,} of 
real numbers, lim a, = +00 means that for any 
real number b there exists a positive integer no 
such that a, >b for all nèng; the notation 

lim a, = —œ has a similar meaning. The sym- 
bols +œ and —o are called positive (or plus) 
infinity and negative (or minus) infinity, res- 
pectively. We say that the limit of {a,} is 

+00 (—00) if lima, = +œ (—0o). In these 
cases, we customarily say that {a,} diverges (or 
is divergent) to +00 (—00), or that a, becomes 
positively (negatively) infinite as noo. We 
also define lim sup a, = +00 (liminfa, = —o) 
to mean sup{a,} = +00 (inf{a,} = — 0). A se- 
quence {a,} is said to oscillate if lim supa, > 
lim infa,,. 

We now have the following propositions 
concerning sequences of numbers: If lim a, =a 
and lim b, = b, then lim(aa, + Bb,) = «a+ Bb, 
lim(a,,b,) =ab, and lim(a,/b,) = a/b (provided 
that b, #0, b #0). For sequences of real num- 
bers, these formulas also hold when a or b is 
infinity. In those cases we set «:(+00)= +00 
(a>0), «-(+00)= Fæ (a<0), ct+o=+0, 
a/(+00)=0 for a real number «. The cases 
0-(+00), + 00 +(~-00), +00/(+00) are 
excluded. 


E. Convergence of Sequences of Points in a 
Topological Space 


A sequence {a,} of points in a topological 
space (— 425 Topological Spaces) is said to 
converge to a point a if for any tneighborhood 
U of a there exists a positive integer ny such 
that a,éU for all n> nọ. The point a is called 
a limit (or limit point) of {a,} and we write 
liM, oa, = 4 OF A, a as n> 00. A sequence 
is said to diverge if it does not converge to any 
point. 

In particular, the set R of all real numbers is 
a topological space in which the set of inter- 
vals (a—é,a+ 6) for some e >Q is a tbase for 
the neighborhood system of a point a, so that 
the notion of limit in R explained previously ts 
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a special case of the same notion in a topolog- 
ical space. By adding the symbols +00 and 
—o to R, we obtain the topological space R, 
in which any set containing {x|x>a,xeR} 
({x|x<a,xeR}) for some «eR is a neighbor- 
hood of +œ (—œ), where the ordering is 
defined as —œ <a< +œ («e R). Then lima, = 
+æ (—oo) is interpreted as convergence in 
the topological space R. The elements of R are 
called extended real numbers. 

In the case where the topological space is a 
‘metric space (— 273 Metric Spaces) with 
metric p, a, >a is equivalent to p(a,, a). 

For convergence of sequences of points in a 
topological space, the following properties (S) 
hold: (S) (1). If a, =a for all n, then lima, =a; 
(2) if a,—a, then a, >A for any subsequence 
{an 53 (3) if there is a point a such that any 
subsequence {an} of {a,} has a suitable sub- 
sequence converging to a, then a,—a. Ina 
*Hausdorff space (e.g., a metric space), the 
additional property (S*) holds: If a sequence 
{a,} has a limit, this limit is unique, and can be 
denoted lim a. 


F. Limits of Functions 


Let a real-valued function f(x) of a real vari- 
able x be defined for x#a belonging to a 
neighborhood of the point a. We say that the 
limit of f(x) is b as x tends to a, and write 

lim -aJ (x)=b or f(x)>b as xa, if for any 
positive number e there exists a positive num- 
ber 6 such that 04|x—a|<6 implies | f(x)— 
b|<e«. Replacing 04|x—a|<6 bya<x< 
a+6(a—6d<x <a), we define f(x)—>b as 
x7a+0(x—a—O) and say that b is the 

limit on the right (left) of f(x) as x tends to a. 
We define f(x) +œ or f(x) —œ as xa 
analogously to the case of sequences. The 
expression f(x)—>b as x > +œ means that for 
any positive number e there is a real number k 
such that | f(x)—b|<e for any x >k. There 

are similar definitions for x— —0oo and b= 
+oo. When f(x) +0 as xa, we often say 
that f diverges definitely at a. 

In general, for a mapping f from a subset D 
of a topological space X into a topological 
space Y, with a point a in the closure of D and 
a point b in Y, lim,_, f(x)=b or f(x)>b as 
x—a means that any neighborhood V of b 
contains f(U N D — {a}) for some neighbor- 
hood U ofa. If Y is a Hausdorff space, b is 
unique (if it exists) for given f and a. This 
point b is called the limit (or limit value) of f(x) 
as Xa. 

It is easy to see that this definition of 
lim, ,, f(x) =b is a generalization of the cases 
where the topological spaces are R or R. Let N 
be the set of all natural numbers, and let N= 
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NU {+00} be supplied with the trelative 
topology as a subspace of R. A sequence {a,} 
of real numbers or points can be identified 
with a mapping f from N into R (N>R) 
defined by f(n)=a,. It converges to a if and 
only if lim, ,,, f(a)=a. 

Suppose, in particular, that f is a mapping 
from a metric space (X, p) into a metric space 
(Y, 0). Then, f(x)-+b as xa means that 
for any ¢>0 there exists a ô >0 such that 
a( f(x), b)<e for all xe D such that 0< p(x, a) 
<ò. Thus f(x)>b as xa if and only if f(x,) 
—b for any {x,} in D with x,—a. If we set 
P(Z1,2Z2)=|Z; —Z| for complex numbers z,, 
Z2, the function p supplies the set of all com- 
plex numbers C with a metric and C becomes 
isometric to the plane R? (— 74 Complex 
Numbers). Thus the cases X =C or Y=C are 
particular cases of the above generalization. 
Furthermore, we introduce the tRiemann 
sphere C=CU {co} by adding the tpoint at 
infinity œ to C. We can define a topology 
on Č such that any set containing {co} U 
{z||z|>r} for some positive number r is a 
neighborhood of œ. Thus we can define the 
notions {(z)->b as z= œ, f(x) > œ as xa, 
etc., for a complex-valued function f by con- 
sidering f as a mapping from the topological 
space Č into itself. Then f(x)> 00 is equiva- 
lent to 1/f(x)-0. 


G. Orders of Infinities and Infinitesimals 


Let f be a complex-valued function defined 
on a topological space X and aa point of X. 
Then f is called an infinity (at a) or an in- 
finitesimal (at a) if f(x) 00 as xa or f(x)> 
0 as xa, respectively. Suppose that f and 


g are infinities and f/g is an infinitesimal. Then 


f is said to be of lower order than g, and g is 
said to be of higher order than f. If both f/g 


and g/f are bounded, then f is said to be of the 


same order as g. This last relation, written f~ 
g, is an equivalence relation. An infinity f is 

said to be of the nth order with g if f~g". For 
two infinitesimals f and g, f is called of higher 
order than g and g of lower order than f if f/g 


is an infinitesimal. For infinitesimals, the terms 


of the same order and of the nth order are 
defined similarly as above. In particular, when 
X =C and a= œ, we usually omit the phrase 
“at oo.” Also, for such a function f, we cus- 
tomarily say that the order of an infinity (in- 
finitesimal) is n if f ~z" (z~"). 

To describe the order of an infinity or an 


infinitesimal simply, the following notions, due 


to E. Landau [10], are in common use. Let 
f and g be two functions. If | f(x)/g(x)| is 
bounded as xa, then f is called at most of 
the order of g as xa, and we write f(x)= 
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O(g(x)) as xa. Second, if f(x)/g(x) is an in- 
finitesimal at a, then f is said to be of lower 
order than g as xa, and we write f(x)= 
0(g(x)) as x->a. The symbols O, o, indicat- 
ing the word “order,” are called Landau’s 
symbols. The notation {(x)=h(x)+O(g(x)) 
means f(x) —h(x)=O(g(x)). When we use the 
symbols O, o we should indicate clearly the 
phrase “as xa,” which is sometimes omitted 
when no confusion is to be feared (e.g., for the 
case of a complex variable with a= oo). These 
symbols are employed for sequences as well, to 
describe their behavior as n> 00. 


H. Convergence of Nets (Moore-Smith 
Convergence) [7,8] 


Let A be a (preordered) tdirected set. A family 
of points in a set X with index set 2 (namely, a 
mapping from YF to X) is called a net in X. A 
net is denoted by {x,},¢u ({X,}m or {x,}). A net 
{¥g}m in X is called a subnet of {x,},, if there 
exists a mapping p:B—Y such that (1) y; = 
Xp) and (2) for any a) €% there exists a Bye B 
such that f > Bp implies o(p) 2 &o. In particu- 
lar, if B is a cofinal directed subset of W, {xg} 
is called a cofinal subnet of {x,},,. A net is 
called a universal net if either {«|x,¢ Y} or 

{a| x E X — Y} is residual in A for any subset 
Y of X. For any net there is a universal subnet. 

For a net {x,},, in a topological space X, 
{X,} is said to converge to a point x in X if for 
any neighborhood U of x there is an æo such 
that {x,|& > &o} c U. Then a is called a limit 
of the net {x,}. We then write x,>x (ae A) 
(or simply x,—x). The convergence of se- 
quences of points is the special case where 
Y=N. The notion of convergence using nets 
was introduced by E. H. Moore and H. E. 
Smith. 

Concerning this convergence, we have the 
following propositions (D): (D) (1) If x,= x for 
all æ, then x,—x. (2) If x,>x and {y,} is a 
subnet of {x}, then yg—x. (3) If for a net {x,} 
there is a point x such that any subnet {y,} of 
{x,} has a suitable subnet converging to x, 
then x,— x. (4) Suppose that x,>x (xe W) 
and y,,x, (BE 8,) for each «. Then let C= 
A x FI B, be the direct product of directed 
sets with projections p:€ + and p,:C>%,, 
and for ye € define z,=y,,, where «= p(y) and 
B=p,(y). Then z,—x (ye @). Furthermore, the 
space X is a tHausdorff space if and only if 
we have the condition (D*): Any net in X has 
at most one limit. 

A limit of {x,} is denoted by lim x, or 
lim,eyX,- Then x,—.x if and only if x is con- 
tained in the closure of any subnet {y| Pe B} 
of {x,}. (We may consider this to be a defini- 
tion of x, x.) 
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I. Convergence of Filters [9] 


Let X be a set. A set ® of subsets of X is called 
a filter if the following conditions are satisfied: 
(i) OED (A is the empty set); (it) AC BCX 
and Ae® imply Be; (iii) A, BE ® imply 

AN Be®. Let B be a set of subsets of X and 

® be the collection of subsets of X such that 
each element A of ® contains a subset belong- 
ing to B. If is a filter, then B is called a filter 
base which generates ®. $ is a filter base if and 
only if (i) Ø ¢B; (ii) A, BEB implies that there 
isa Ce% with AN B>C. A filter ® is called 

an ultrafilter (or maximal filter) if there exists 
no filter which contains ® properly. For any 
filter there exists an ultrafilter containing it. If 
{®,}, is a family of filters, then the intersection 
(\®, is a filter. If % is a filter base in the index 
set A, then (|) wreg (( \aem ®,) is also a filter. 

We denote by U(x) the tneighborhood sys- 
tem of a point x in a topological space X. A 
filter ® in X is said to converge to a point a, 
written Pa, if U(x) ®. A filter base B is 
said to converge to a if the filter generated by 
B converges to a. 

The convergence of filters just defined has 
the following fundamental properties (L): (1) 
for a point ain X the filter O,={A|aeAc 
X} converges to a; (2) for two filters ® and 
YF, ®>a and OcY imply Y —>a; (3) if ®, > 
a for all members in a family {®,} of filters, 
then (\®,=@®-a; (4) suppose that filters 
®,— y are assigned for all points y in a subset 
Y of X, and that we are given a filter Y in X 
which converges to a, generated by a filter 
base B in Y; then | ses ({\yex®,) >a. Fur- 
thermore, the space X is Hausdorff if and only 
if we have the condition (L*): Each filter in X 
has at most one limit. 


J. Relations among Various Definitions of 
Convergence 


Convergence of sequences of points is a special 
case of that of nets. Properties (1), (2), and (3) 
of (D) imply (1), (2), and (3) of (S), respectively, 
and (D*) implies (S*). Consider a net {x,},, in 
X. Then the set {{x,|ceU,a2a9}|a 6%} of 
subsets of X is a filter base in X which gener- 
ates a filter ®, and ®—>x if and only if x,>x. 
In this situation, (L) implies (D), and (L*) 
implies (D*). Suppose that we are given a 
function f: X > Y with the domain D and a 
point ae X. Let U(a) be the neighborhood 
system of a and assume that, for any Ue l(a), 
UND—{a}#@. Then the set { f(U N D — 
{a})|UeU(a)} is 4 filter base. Let © be the 
filter generated by it. Then f(x)—>b as xa if 
and only if ®-+b. Consequently, the various 
types of convergence described previously can 
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be expressed by means of convergence of 
filters. 


K. Convergence and Topology 


In a topological space X, the concept of con- 
vergence of nets and that of filters can be 
defined. Conversely, convergence of nets in X 
defines a topology of X. In fact, let us assume 
that we are given a set X and a definition of 
convergence of filters which satisfies the prop- 
erties (L). Then convergence of nets that satis- 
fies (D) can be introduced as above. If A is 
defined as the set of limits of all nets contained 
in a subset A of X, then A satisfies the axiom 
of closures (— 425 Topological Spaces), and a 
topology can be defined on X. Then we have 
the following propositions: (i) ae A if and only 
if there is a net {x,} with x,¢ A converging to 
a; (ii) U is a neighborhood of a if and only if 
X, >a implies that there exists an &o such that 
x,€U for all a>a,. Thus, if X is a topological 
space, it carries a “new” topology defined by 
way of convergence of nets. But this “new” 
topology coincides with the original one. 
Similarly, starting from convergence of filters 
(or nets), we can obtain a “new” definition of 
convergence of filters (or nets), which coincides 
with the initial one. In conclusion, defining a 
topology on a space X is the same thing as 
defining convergence of filters in X or of nets 
in X. 

We shall describe here a few notions on 
topological spaces in terms of convergence. 
The fact that a topological space X is compact 
is equivalent to the fact that every universal 
net in X converges, and to the fact that every 
ultrafilter in X converges. Also equivalent is 
the fact that every net has a convergent subnet. 
A mapping f from a topological space X into 
a topological space Y is continuous at a if and 
only if one of the following conditions is satis- 
fied: (1) for any net {x,} in X converging to 
ae X, we have f(x,)— f(a) in Y; (2) for any filter 
® in X converging to ae X, we have f(®)= 
{f(M)|Me®}- f(a) in Y; (3) f(x) f(a) in Y as 
x—a in X (in the sense of the limit of a func- 
tion at a). 

M. Fréchet [6] gave a definition of a topol- 
ogy on a space using the notion of conver- 
gence as a foundation. A set is called an L- 
space (or Fréchet L-space) if convergence of 
sequences of points in it is defined so as to 
satisfy conditions (1) and (2) of (S) and (S*) 
(1906). Such convergence is called star conver- 
gence if it also satisfies (3) of (S), and in that 
case the space is called an L*-space. For any 
subset A of an L-space X, define A as the set 
of all points a such that x, — a for some se- 
quence {x,} contained in A. Then the axioms 
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A >A, AUB =AUB, and Ø = Ø are satis- 
fied, so that X is a tgeneralized topological 
space (the axiom A = A is not necessarily 
satisfied). For a Hausdorff space X with the 
‘first countability axiom and convergence of 
sequences defined by means of its topology, 
the closure operation defined above gives the 
same topology as the initial one. 


L. (o)-Convergence 


A sequence {a,‘ of elements of an ordered set 
S is said to be (0)-convergent to an element a of 
S if there exist two sequences {u,} and {vp} 
such that u,<a,<v,, U,<U,4,, and v, 20,41; 
and a=supu, =infv,. When we write this 
a,— 4, properties (1) and (2) of (S) and (S*) con- 
cerning the convergence of sequences hold. 
Next, a sequence {a,} is said to be (0)-star 
convergent to a if any subsequence of {a,} has 
a suitable subsequence which converges to a. 
Then (o0)-star convergence satisfies the prop- 
erties (S) and (S*). 

For any set X the set P(X) of all subsets of 
X 1s an ordered set under the inclusion rela- 
tion. The fact that a sequence {A,} of subsets 
is (o)-convergent to a subset A is equivalent to: 


The set A is also equal to lim A,, which is the 
tlimit of a sequence {A,} of subsets. 
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A. Convex Functions 


A real-valued function f(x) defined on a tcon- 
vex set D in a linear space over R is called a 
convex function if for every x, ye D and 0< 
A<1 we have 


F(AX+(L—A)y) SAf(x) + (1 — AS (y). (1) 


The function f(x) is called a strictly convex 
function if the sign < in (1) is replaced by 
< for x #y except when J is either 0 or 1. If 
— w(x) is convex (strictly convex), the function 
w(x) is called a concave function (strictly con- 
cave function). The notion of convex function 
was introduced by J. L. W. V. Jensen [1] for 
the case where the domain D is an interval on 
the real line R. 

Sometimes the condition for a convex func- 
tion is weakened in such a way that (1) is as- 
sumed only for 4=1/2. However, if D is a 
*topological linear space and f is continu- 
ous, then the weakened condition implies the 
original one. Hereafter, we mainly consider the 
case where D is an interval on the real line. In 
this case, a convex function f(x) (in the weaker 
sense) is continuous in the interior of the inter- 
val if f(x) is tmeasurable or bounded from 
above on a set of positive measure (the latter 
was proved by A. Ostrowski [2]). In particu- 
lar, suppose that f(x) is defined in the interval 
I and is bounded from below. Then either f(x) 
is continuous or its graph is dense in the set 
{(x, y) xel, y>g(x)}, where g(x) is a suitable 
convex continuous function (Hukuhara [3]). 
We note here that the original definition of a 
convex function f(x) implies the continuity of 
f(x) in the interior of the interval. In such a 
case, f(x) always has fright and tleft deriva- 
tives and satisfies f(x) < f(x) < fL) 
f(y) for x< y. Hence it is differentiable ex- 
cept for at most countably many points. 

A function f(x) is a continuous convex 
function in a<x <b if and only if it is expre- 
ssible in the form 


x 


fosas | p(t)dt, 


a 
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where g(t) is a monotone increasing function. 
If f(x) is twice differentiable, then f’(x)>0 
(a<x <b) is a necessary and sufficient condi- 
tion for f(x) to be convex in (a, b). 


B. Convex Functions and Inequalities 


If f(x) is convex (in the original sense), we 
have, for a, >0, 


(Eas Za )< Eaa) Say D 


Similarly, we have, for p>0, 


s( | ova] |oax)< f ofwa] | oax (3) 


The functions x* (a>1 or a<0), — x" (0 <a < 
1), — log x, xlog x are strictly convex for x>0, 
and the functions x?" (n> 1), exp x, log(1 +e”), 
./a* +x? (a€0) are strictly convex in —% x 
x< +00. Applying the inequalities (2) or (3) to 
these functions, we obtain various inequalities, 
including the inequalities on means (— 211 
Inequalities). 

A continuous convex function f(x) over a 
topological linear space satisfying the relation 
f(ax)=af(x) for an arbitrary positive number g 
is called a subadditive functional and is often 
utilized in functional analysis. 


C. M. Riesz’s Convexity Theorem 


Let x =(€,,...,¢,) be an n-tuple of com- 
plex numbers, and let v>0. We put N,(x)= 
(X71 č; for v>0 and No(x)=sup |é)I. 
Let («;;) be an m x n complex matrix, x = 
(čis <5 Gs Z= (Cis exes bm) VO, and u20. 
We put 


n 


3 » a ij0iCj 


i=1 j=1 


M(v,w)= sup 


N (x) <1, Ny(z) <1 


Then log M (v, u) is a convex function of (v, 1) 
in the following sense: Let 0< y< 1,0<y;<1, 
and v; +; > 1 (i=1,2). Then log M((1—t)v, + 
tvz, (1—t)u, +tu,) is a convex function with 
respect to t forO0<t<1 [4,5]. These results are 
called M. Riesz’s convexity theorem. Famous 
inequalities such as the tHdlder inequality or 
the Minkowski inequality follow from this 
theorem. For example, let T be an tadditive 
operator from the ‘function space L,(Q) into 
L,(Q) for all 1 <p< œ. If T is a continuous 
operator for p=1 and p= œ, and the norm 
of Tis <C for p=1 and p= œ, then T is con- 
tinuous for all p (1 < p< œ), and its norm is 
always <C. 
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D. Subdifferentials of Convex Functions 


Currently, convex analysis is playing an im- 
portant role in the study of nonlinear evolu- 
tion equations. This analysis treats convex 
functions on infinite-dimensional spaces [9]. 

In convex analysis, it is sometimes conve- 
nient to consider a proper convex function @ 
on a Hilbert space X. A mapping ọ from X 
into (—0oo, +00] is called a proper convex 
function if ọ is not identically equal to +00 
and if 


p(Ax +(1—A)y) <AG(x) + (1 — A p(y) 
for every x, ye X and 0</A<1. The convex set 
D(p)={xEX | Q(x) <wpoX 


is called the effective domain of œ. The subdif- 
ferential Cg of ọ is defined to be a multivalued 
function which assigns to each x in D(ọ) all 
elements ye X satisfying o(€)> e(x)+<y,é— 
x» for any če X. Then Gg is monotone in 

X in the following sense: If y; €@@(x,) and 
y,€0(x,), then 


[Yi — Y2: X1 —- Xn) 20. 


Under some conditions on o, 6g becomes a 
single-valued function, as is seen in the follow- 
ing example: Let A be a nonnegative self- 
adjoint operator in a Hilbert space X. For the 
fractional power JA of A, define ọ by 


1 
sl Axl?, xe D(A), 


+00, otherwise. 


Then D(g)= D(./A) and ĉĝọ = A. 


ọ(x)= 


E. Convex Functions and Nonlinear 
Semigroups 


Let ọ be a lower semicontinuous proper con- 
vex function on a Hilbert space X. Since A= 

—ĉọ is a maximal tdissipative operator in X, 
it generates a nonlinear semigroup {T;|t>0} 

on the closed convex set D(A): 


T,x= lim (I—tA) "x 


(— 286 Nonlinear Functional Analysis, 378 
Semigroups of Operators and Evolution Equa- 
tions). It can be shown that for any ae D(A), 
T,ae D(A) whenever t>0. Thus the tabstract 
Cauchy problem 


du 
X +ôg(u)}ə20 (t>0), 


u(+0)=a, 


is considered to be “parabolic.” Furthermore, 


331 


we have 


d 
Paa 


for all ae D(A) (H. Brézis [10]). 


<'la-Tal (¢>0) 
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A. General Remarks 


A nonempty subset X of the n-dimensional 
Euclidean space R” is called a convex set if 
for any elements x, y in X and any number a 
such that 0<a<1, the element ax+(1—a)y 
of R” is also contained in X. The tinterior 
and the *closure of a convex set are also con- 
vex. A point x of a convex set X is called an 
extreme point of X if x cannot be expressed as 
(x, +x,)/2 in terms of a pair of distinct points 
x4, X2 in X. A bounded closed convex set is 
called a convex body if it has tinterior points. 


89 B 
Convex Sets 


Given an arbitrary nonempty subset X of R’, 
the minimum convex set containing X exists, 
called the convex hull of X and denoted by 
[X]. Each point x of [X] can be expressed as 
x="! a,x, where x; belongs to X and the a; 
are nonnegative numbers such that Erti a;=1. 
When X is a finite set, [X] is called a convex 
polyhedron. If X denotes the set of extreme 
points (also called vertices) of a convex poly- 
hedron X, then X =[X]]. 

For elements x, y of R”, denote the inner 
product by (x, y). Given a nonzero element v of 
R” and a fixed number a, the thyperplane H = 
{x|(v,x)=a} divides the space R” into two 
thalf-spaces {x|(v, x)<a} and {x|(v, x) >a}, 
each of which is a closed convex set. If a con- 
vex set X is contained in one of the half-spaces 
S determined by the hyperplane H and the 
boundaries of X and H intersect, then we say 
that H is a supporting hyperplane of X and S is 
a supporting half-space of X. A closed convex 
set X is the intersection of its supporting half- 
spaces. A boundary point of a convex set X is 
contained in a supporting hyperplane of X. 
Given mutually disjoint convex sets X and 
Y, the separation theorems (1) and (2) hold. 

(1) If X has inner points, then there exist a 
nonzero element v of R” and a number a such 
that X is contained in the set {x|(x,v)>a} and 
Y is contained in the set {x|(x,v) <a}. 

(2) If X and Y are closed and X is bounded, 
then we can replace the signs < and > in (1) 
by < and >, respectively (when the separa- 
tion of convex sets X and Y is described by 
strict inequalities, we say that X and Y are 
strongly separated). 

As an immediate consequence of the separa- 
tion theorems, we obtain the following pro- 
position: Suppose that A is an m x n matrix 
with real entries. For an element z in a Eu- 
clidean space, we write z>0(>0) if each 
component of z is >0 (>0). Now if ‘Ay>0 
never holds for an m-dimensional vector y >0, 
then there exists a nonzero n-dimensional 
vector x >0 such that Ax <0 (— 173 Game 
Theory). 

The definitions given previously for subsets 
of R” can be naturally extended to the case of 
treal topological linear spaces (— Section G). 
Also, in the theory of analytic functions of 
several complex variables, various notions of 
convexity of the subsets of C” are considered 
(— 21 Analytic Functions of Several Complex 
Variables). 


B. Helly’s Theorem 


Suppose that we are given an index set A of 
cardinality greater than n+ 1, and bounded 
closed convex sets C, (A€ A) in R”. If any n+1 
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sets of the C, have nonempty intersection, then 
the intersection of all the C, is nonempty 
(Helly’s theorem). 

This theorem has a wide range of applica- 
tions. For example, we have propositions (1)— 
(4). 

(1) If a convex set X of R” is covered by a 
finite number of half-spaces, then X can be 
covered by no more than n+ 1 half-spaces 
among them. (2) Let X and Y be finite subsets 
of R". X and Y are strongly separated by a 
hyperplane if for an arbitrarily chosen subset § 
of X UY consisting of at most n +2 points, the 
sets SN X and SN Y are strongly separated by 
a hyperplane. (3) If the tdiameter of a subset X 
of R” is not greater than 2, then X is contained 
in a tball of radius (2n/(n+ 1))'”. (4) Let X be 
a convex body in R”. There exists a point x 
in X such that ||x—ul|/||v»—ul| <n/(n+ 1), 
where u, v are points of intersection of an arbi- 
trary straight line passing through x with the 
boundary of X, and ||x|| denotes the length 
(x, x)? of x. 

Helly’s theorem can also be applied to 
problems of approximation of functions. 


C. Ovals 


The boundaries of convex bodies in R? and R? 
are called ovals and ovaloids, respectively. An 
oval E is a tJordan curve C(t) which admits at 
every point Py = C(to) left and right “tangents” 
I,, and 1, , where I% is a straight line expressed 
as the set of points P(A), AER such that P(A) 

— Py=Aa*, with a* =lim,., , 9(C(t)— Po)/|C(t) 
— Pol). There may exist exceptional points P 
on E for which left and right tangents do not 
coincide, but the set of such points is at most 
countable. Each tangent I> shares a point or a 
segment with E. A line satisfying this condition 
is called a supporting line of the oval. It is also 
a supporting hyperplane in R? of the convex 
body [E] in the sense of Section A. If we fix an 
interior point O of a convex body X and take 
an arbitrary point P different from O, then the 
boundary E of X admits one and only one 
supporting line /(P) which is perpendicular to 
the line OP and meets the half-line OP. Take a 
rectangular coordinate system with the origin 
O, and let (x, y) denote the coordinates of P. 
Then the points (¢, n) on I(P) satisfy the equa- 
tion ¢x+ny=H(x, y), where H(x, y) is a func- 
tion determined for all (x, y) in R? and satisfy- 
ing the following conditions: (i) H(0, 0)=0; (ii) 
A(tx, ty)=tH(x, y), for t>0; (ili) A(x, + x,y, 
+ y2)< A(x, ¥,)+ H(x>, y2). The function 

H (x, y) is called the supporting line function of 
E. The magnitude and shape of E are deter- 
mined by H, and any function satisfying con- 
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ditions (i)-(iii) is a supporting line function of 
an oval. An oval E has a finite length L = L(E), 
and the convex body [E] has a finite area F = 
F(E). If OP’ denotes the half-line with direc- 
tion opposite to that of OP and I’ denotes the 
supporting line [(P’), the distance between the 
parallel lines / and I’ is called the breadth of E 
in the direction PP’. Let D= D(E) and A=A(E) 
be the maximum and minimum of the breadth 
of E, respectively. D is the tdiameter of E (or 
[E]), and A is called the thickness of F (or 
[E]). In particular, if D =A, then the oval E is 
called a curve of constant breadth. In the fol- 
lowing inequalities, equality holds only when 
E is one of the figures mentioned in paren- 
theses: (1) L? > 4xF (circles, J. Steiner (1838)) 
(— 228 Isoperimetric Problems); (2) nD? >4F 
(circles, L. Bieberbach (1915)); (3) L< nD 
(curves of constant breadth, W. Blaschke 
(1916)); (4) F> A?/,/3 (regular triangles, J. Pal 
(1921)). See T. Kubota, Téhoku Sci. Bull., I, 

12, 13; Téhoku Math. J., 24, 49. 


D. Linear Combinations of Ovals 


Let H, and H, be supporting line functions of 
ovals E, and E,, and let t, and t, be positive 
numbers. Since the function t, H, +t, H, satis- 
fies conditions (i)—{iii), given before, it is a 
supporting line function of an oval E(t,,¢,). In 
this case, we can also write E(t,,t,)=t, E; + 
t E, and call it a linear combination of £, 

and F,. In particular, the oval (E, + E,)/2 is 
called the mean oval of E, and E,. In general, 
there exists a quantity M, called the mixed 
area of E, and E,, such that F(E(t,,t,))= 
F(E,)t{+2Mt,t,+F(E,)t3. M does not 
depend on the choice of t; and t,, and M? > 
F(E,)F(E,). Here, the equality holds if and 
only if E, and E, are homothetic and situated 
in a position of homothety. Furthermore, if 
O<z<1, then the square root of F(E(t, 1 —t)) 
is a tconvex function of t (H. Minkowski). 


E. Specific Ovals 


Suppose that we are given an equilateral 
triangle ABC. Draw three circles C,, C,, and 
C, with centers A, B, and C and radii equal to 
the length of the sides of ABC. The minor arcs 
AB, BC, and CA of the circles form an oval 
which is called a Reuleaux triangle. This oval 
is of constant breadth. Furthermore, given a 
fixed breadth D, the area F(E) of an oval E of 
constant breadth D attains its minimum when 
E is a Reuleaux triangle. A Reuleaux triangle 
obtained from a triangle ABC revolves freely 
within the square of side AB and touches each 
side. In general, an oval which revolves touch- 
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ing the sides of a convex polygon from the 
inside is called an inrevolvable oval. Any such 
oval revolves inside some regular polygon (M. 
Fujiwara, S. Kakeya). 

Various properties of an oval already de- 
scribed can be generalized to the case of a 
boundary of a convex body in R”. For the 
volumes of subsets A, B in R” and A+ B={x+ 
y|xe A, ye B} in R", the Brunn-Minkowski 
inequality 


[vol(A + B)]!" > [vol(A)]”" + [vol(B)]" 
holds. 


F. Convex Cones 


A nonempty subset X of R” is called a convex 
cone if for any elements x, y of X and a non- 
negative number a, ax and x + y are contained 
in X. A convex cone is a convex set. Given any 
nonempty subset X of R”, the minimum con- 
vex cone K(X) containing X exists. Given two 
convex cones X and Y, a convex cone X + Y, 
called the sum of X and Y, is defined as the set 
of elements x + y, where x, y are elements of X, 
Y. The intersection of convex cones X and Y is 
also a convex cone. Given a convex cone X, 
the subset of R” consisting of the elements y 
such that (x, y)<0 for any element x in X is a 
convex cone which is called the dual convex 
cone (or conjugate convex cone) of X, denoted 
by X*. If X is a finite set, K(X) is called a 
convex polyhedral cone. For example, if v 

is a nonzero vector, then the half-line (v) = 

{x |x =av,a>0} or the half-space (v)* = 

{x|(v, x) <0} is a convex polyhedral cone. A 
convex polyhedral cone is closed. A convex 
cone X is a convex polyhedral cone if and only 
if X is the sum of a finite number of half-lines. 
Given convex cones X and Y, we have pro- 
positions (1)—(3): (1) If X, c X,, then X¥c 

XPS (2) (Ky + XQ)¥ = XF OXF; (3) XF + AF 
(X, X,)*. If X, and X, are convex poly- 
hedral cones, then Xf + X¥=(X, N X,)*. Gen- 
erally, X <(X*)* = X** for a convex cone X. 
If X is a closed convex cone, then X = X**. 
Namely, the duality principle holds for closed 
convex cones. A linear subspace of R” is a 
convex polyhedral cone. Also, if A is an m x n 
real matrix, the subsets {x| Ax =0, x >0} and 
{x| Ax 20} are convex polyhedral cones. Since 
the duality principle holds for convex poly- 
hedral cones, we obtain the tMinkowski- 
Farkas theorem (i.e., if A is an m x n real 
matrix and v is an element of R”, then the 
equation Ay=v has a solution y>0 in R” if 
and only if (v, x) >0 for all xe R” such that 
‘Ax >0). (For linear inequalities — 255 Linear 
Programming.) 
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G. Convex Sets in Function Spaces 


The definitions of convex sets and convex 
cones in R” can be naturally extended to the 
case of any real linear space. Some of their 
properties can be generalized and applied to 
the case of function spaces. 

(1) Let E be a flocally convex real topolog- 
ical linear space satisfying Hausdorff’s tsepara- 
tion axiom. Let A and B be convex sets in E, 
and assume that B has interior points and 
A! B is empty. Then A and B are separated by 
a hyperplane. Namely, there exists a nonzero 
tcontinuous linear functional f on E such that 
sup f(A) <inf f(B). 

(2) Let E be as in (1), and let C be a convex 
set in E. If a boundary point x of C admits a 
nonzero continuous linear functional f such 
that f(x)=sup f(C), we call such a point x a 
supporting point of C, and f a supporting func- 
tional of C. If C has interior points, then any 
boundary point x of C is a supporting point of 
C. 

(3) Let C be a closed convex set of a 
*Banach space E. The set of supporting points 
of C is dense in its boundary. 

A convex set C contained in the dual space 
E* of a real topological linear space E is called 
a regularly convex set if for any fọ in E* not 
contained in C, there exists a point x, in E 
such that sup{ f(xo)| f EC} <fo(Xo). 

Let E be a real topological linear space 
satisfying Hausdorff’s separation axiom, and 
let C be a closed convex cone having 0 as its 
extreme point. Furthermore, assume that CN 
(—C)={0}. If we set x< y when y—xeEC, 

a partial ordering < is defined in E. For 
example, if E is R”, then the positive orthant 
C={x=(x;)|x;20, i=1,...,n} satisfies these 
requirements, and the partial ordering x < y 
defined by means of C is equivalent to the 
relation x;< y; for all i. 

Some of the properties of matrices of posi- 
tive entries or tintegral operators whose tker- 
nel functions are positive-valued can be gen- 
eralized to properties of mappings f: E>E 
such that f(C)< C (— 255 Linear Program- 
ming; for the fKrein-Milman theorem — 424 
Topological Linear Spaces). 
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A. General Remarks 


Suppose that we are given a Euclidean plane 
E? and two lines X’X and Y’Y in E? per- 
pendicular to each other. Let O be the point of 
intersection of X'X and Y’Y. We identify each 
of the straight lines X'X and Y’Y with the set 
of real numbers R; the point O on each line is 
identified with zero. Let P be an arbitrary 
point in E*. We draw lines PQ, PR parallel to 
Y'Y, X'X, where Q, R are on X’X, Y’Y, respec- 
tively. Let x and y be the real numbers corre- 
sponding to Q and R. Thus we obtain a map- 
ping sending the point P to the ordered pair 
(x, y) of real numbers. This mapping gives a 
one-to-one correspondence between the points 
P of E? and the ordered pairs (x, y) of real 
numbers in R?. The numbers x and y are 
called the coordinates of P. 

In general, given a set of mathematical 
objects, if we have a mechanism that assigns 
quantities to each element of the set, then such 
a mechanism is called a coordinate system on 
the set, and the quantities corresponding to 
each element are called its coordinates. In the 
previous example, the mechanism is called a 
rectangular coordinate system. Coordinate 
systems are also useful in expressing quantita- 
tive concepts by geometric ones which are 
intuitively easier to grasp, c.g., diagrams of 
train schedules and tnomograms. tMap projec- 
tion, tgraphical calculation, tdescriptive geom- 
etry, etc., may be viewed as applications of the 
concept of coordinate systems. 

In many cases, when we introduce a coor- 
dinate system in a space, it is determined 
uniquely by fixing a basic figure in the space. 
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In the case of a rectangular coordinate system 
on a plane E?, the basic figure consists of X’X 
and Y’Y, which are called coordinate axes (the 
point O is called the origin). Sometimes it is 
convenient to consider real-valued functions f 
and g on R and a coordinate system on the 
plane E? determined by the function that 
sends an arbitrary point P to (f(x), g(y)), 
where (x, y) are the coordinates of P in the 
rectangular coordinate system. Logarithmic 
papers, tprobability papers, and tstochastic 
papers (binomial probability papers), etc., are 
constructed in this way to fit their respective 
purposes. 

In various branches of mathematics there 
are many varieties of coordinate systems. In 
this article we deal with frames and coordi- 
nates, curvilinear coordinates, and local 
coordinates. 


B. Frames and Coordinates 


Suppose that we are given a space M anda 
*transformation group G acting on M. It is 
desirable to introduce a coordinate system 
that best represents the geometric structure of 
M. Let G, be a set of figures in M such that G 
acts tsimply transitively on G,. Each element 
of G, is called a frame. Utilizing each frame as 
basic figure, we introduce a coordinate system 
that is “G-invariant” in the following sense: 
Let ReG,, XEM, and Cp(X) be the coordi- 
nates of X with R as basic figure. Then the 
coordinate system is G-invariant if C,(X)= 
C,r(gX) for any element g in G. If we have 
such a coordinate system for each ReG,,, then 
the expressions of geometric properties of M in 
terms of the coordinates are independent of 
the choice of frames. 


(1) Projective Coordinates. Let M be an n- 
dimensional tprojective space P” over a field 
K, and let G be the tprojective transformation 
group of P”. As a frame we can take the system 
of n+1 points (Ag, A,,...,A,,) in general po- 
sition. The thomogeneous coordinates of an 
arbitrary point X e P” are given by the (n+ 1)- 
tuple (x9, X,,...,X,) Satisfying the equation X 
= Lj=0 X;A;, xjE€ K. They are called tprojec- 
tive coordinates. In fact, if (x9,x,,...,X,)# 
(0,0, ...,0), then (x9, x,,...,X,) and (Ax, Ax,, 
...,AX,) (A #0) represent the same point in P”. 
A thyperplane z of P” is expressed as the set 
of points whose coordinates (x9, X1, -<3 Xp) 
satisfy a linear homogeneous equation 

Li Xju;=0, u;e K. Therefore, the hyperplane 
n is represented by the homogeneous coor- 
dinates (uo, U,,... 
coordinates of z. 


,u,), called thyperplane 
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(2) Affine Coordinates. Let M be an n- 
dimensional taffine space E”, and let G be the 
tgroup of affine transformations of E”. As a 
frame we can take the system (O;e,,€), ...,€,); 
where O, called the origin, is a point in E”, and 
the set of vectors {e;} is a basis of the tstan- 
dard vector space of E”. Then tinhomogeneous 
coordinates of an arbitrary point X e E” are 
given by the n-tuple (x1, X2, ---, Xp) where X = 
O + È; x,e;. They are also called taffine 
coordinates of X in E”. Sometimes we replace 
G above by the group of tequivalent affinities 
and consider the frames (O;e,,e>,...,¢e,) such 
that the volume of [e,,...,e,]=1. 
Furthermore, if E” has the structure of a 
tEuclidean space, we sometimes replace G 
by the tgroup of motions and consider a sys- 
tem of rectangular coordinates determined 
by an torthogonal frame, that is, a frame 
(O;e,,€,...,e,) such that the inner product 
(e;,€)) =6,, where ô is the Kronecker delta. By 
contrast, the general affine coordinate system 
of a Euclidean space is called a system of 
oblique coordinates. In this case the inner prod- 
ucts (e;, e;)=g,; are invariants of Euclidean 
geometry, and the distance p between two 
points (x,) and (y;) is given by p=(2? jar Gili 
—x,)(yj—x;,))’”. We sometimes consider an 
oblique coordinate system satisfying (e;,e;) = 1 
(i=1,...,n). In such cases the angle 6,; between 
two basis vectors e; and e; is determined by 
gj = COs b;j. 


(3) Barycentric Coordinates. In an n- 
dimensional affine space E”, we take n+ 1 
linearly independent points Ap, A,,..., A, and 
denote the position vectors from a point O to 
these points by ap, a,,...,a,, respectively. 
Then for any point X e E” there exists a unique 
set of numbers (Ay, 4,,...,4,) such that X =O 
+ Dye 4jaj;, L-o4;= 1. We call these numbers 
barycentric coordinates of X in E”. They are 
independent of the choice of the point O. 


(4) Pliicker Coordinates. Let V(n, m) be the 

set of all m-dimensional subspaces in an n- 
dimensional projective space P”. Then V(n, m) 
has the structure of a tGrassmann manifold. In 
order to introduce a coordinate system on 
V(n,m), we fix a projective coordinate system 
on P”. An m-dimensional subspace ze V(n, m) 
in P” is spanned by m+ 1 independent points 
Bo, B,,...,B,€P”". We denote projective co- 


ordinates of these points by (b9,), (b1;), «-» (bm) 
and construct the determinants 
bo;,oj, wee boj, 
| de ade E CE TEE AE TE sist US ; O<jo, w+ 5 Jp SN. 
bmj Pmj, ein bmj, 


Then the subspace z can be represented by 
homogeneous coordinates (..., Pjioj, jn e} 
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These coordinates are independent of the 
choice of m+ 1 points that span z and are 
called Plücker coordinates (or Grassmann 
coordinates) of z in V(n, m). In these coordi- 
nates, the p; j,...j„ are alternating and satisfy 
the Pliicker relations 


m+1 


where j, means that j, is removed. In partic- 
ular, when n=3 and m=1, we have only one 
Plucker relation Q: po; P23 — Po2P13 + Po3P12= 
0, which is a homogeneous equation of the 
second degree. In other words, the set V(3, 1) 
of all lines in a 3-dimensional projective space 
P? is realized as a quadric surface Q in a 5- 
dimensional projective space P* that has 
(Po1+Po2>Po3sP12> P13» P23) as projective co- 
ordinates. Moreover, when P? is a complex 
projective space, we put 
Porz=Sitics, Pos =čČ2+İčs, 


Pi3 = — 6, +ič4, 


Poi = Go + ids, 


P23 =o — ies, Piı2=62—İő5 


and obtain a relation 
Ea tE + 4 F462 +62 =0 


corresponding to the Pliicker relation. Thus 
every line in P? can be represented by homo- 
geneous coordinates (€,¢,,...,€.), which we 
call Klein’s line coordinates. 


(5) (n + 2)-Hyperspherical Coordinates. Let 

(Xos Xis -<3 Xn Xo) be projective coordinates in 
an (n + 1)-dimensional real projective space 
P”*!. An n-dimensional tconformal space S" 

is realized as a quadric hypersurface S” in 

Prt E? j=1 GyjXiXj— 2XoXq =0, where (g;) is a 
positive definite symmetric matrix. A general 
point in P"*! represents a thypersphere of S". 
That is, a hypersphere represented by a point 
X eP”! is realized as the intersection of S" 
with the tpolar hyperplane of X with respect 
to S"; according as X lies outside of S", on S”, 
or inside of S”, it represents a real hypersphere, 
a point hypersphere, or an imaginary hyper- 
sphere. Therefore any hypersphere of S” in 
P”*! is expressed by homogeneous coordinates 
(Xos X15 -+3 Xns Xœ) called (n + 2)-hyperspherical 
coordinates. When n =2, they are called tetra- 
cyclic coordinates, and when n= 3, pentaspher- 
ical coordinates. Therefore, if we restrict (n + 2)- 
hyperspherical coordinates for points on S”, 
then they satisfy the quadratic relation stated 
before. In the frame (Ay, A;,...,A,,A,,) of P"*! 
which defines the (n + 2)-hyperspherical co- 
ordinates, Ay and A,, are points on S”, and the 
other A, are real hyperspheres passing through 
the points A and A,,. It is possible to choose 
a frame (Ap, A1, ---, An, Aœ) such that the equa- 
tion for S” becomes X}; x? —2xox,, =0 (i.e., 
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gi = 9,;). Among hypersurfaces in S”, one that 
is expressed by a homogeneous equation of 
the second degree with respect to (n+ 2)- 
hyperspherical coordinates is called a cyclide. 
It is an algebraic surface of the fourth order 
and is an enveloping surface of the family of 
hyperspheres that are tangent to n fixed 
hyperspheres. 


(6) Moving Coordinates. When we study the 
differential geometry of an m-dimensional 
surface W in a space M on which a transfor- 
mation group G acts, it is often preferable to 
take a frame or frames at each point of W and 
consider a tconnection among them. These 
frames are called moving frames, and the set of 
coordinate systems with respect to moving 
frames is called a moving coordinate system 
(— 111 Differential Geometry of Curves and 
Surfaces). 


C. Curvilinear Coordinates 


Let (x,,X2,...,X,) be a rectangular coordinate 
system on an n-dimensional Euclidean space 
E”. If x;=x,(u,,U2,...,u,), = 1, ..., n, are func- 
tions of n variables (u,,u>,...,u,,) of class 

C’ (r> 1) and the tfunctional determinant 
D(x,,...,X,)/D(uy, ...,u,) is not equal to zero 
in some open domain, then (u,,u5,...,u,) are 
considered local coordinates in E”. We call 
them curvilinear coordinates of E£". A hypersur- 
face u;=constant (obtained by fixing the value 
of one of the variables u,) is called a coordinate 
hypersurface, and a curve u;=constant (j +i) is 
called a coordinate curve. The line element ds 
of a Euclidean space E” is given by 


Thus E” is equipped with a tRiemannian 
metric. However, as E” is tflat, its tcurva- 
ture tensor satisfies Ri,,=0. If the metric is 
diagonal, namely, if ds? = D?_, gidu}, the co- 
ordinates are called orthogonal curvilinear 
coordinates. Moreover, if g, =...=g,, the 
coordinates are called isothermal coordinates. 
The metric is diagonal if and only if coordi- 
nate curves are mutually perpendicular at the 
points of intersection. Actually, curvilinear 
coordinate systems that are often used practi- 
cally are diagonal. The concept of curvilinear 
coordinates has been generalized to the case of 
tdifferentiable manifolds and is utilized to 
determine their local coordinates. 

On any 2-dimensional Riemannian manifold 
there always exist isothermal coordinates in a 
neighborhood of any point [7]. 
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(1) Curvilinear Coordinates on Planes or Spaces 
(— Appendix A, Table 3.V). Let (x, y) be rect- 
angular coordinates of a point in a Euclidean 
plane E?. We have the following coordinate 
systems on E?: 

Polar coordinates (r, 0), where 
x=rcos@, y=rsiné. 

Elliptic coordinates (A, 4), where 
x? =(A+a’)(u+a*)/(a’ —b’), 
y?=(A+b?)(utb?\/(b? —a’), 
a>b>0, s>—b?>py>—a?. 
Parabolic coordinates (x, 8), where 


x=—(a+f), y=./—4a0f, a>O0>F8. 


Equilateral (or rectangular) hyperbolic co- 
ordinates (u, v), where 
x=uv, y=(u? —v?)/2. 

Bipolar coordinates (¢, 7), where 
x=asiné/(cosh ë + cosy), 
y=asinn/(cosh č + cosy), 
—0o<€<0, O<n<2xz. 


Next we consider the case of a 3- 
dimensional Euclidean space E? and let (x, y, z) 
be rectangular coordinates on E*. The follow- 
ing systems of coordinates on E> are some- 
times useful. 

Cylindrical coordinates (r, 0, z), where 
x=rcos@, y=rsin@, z=z. 

Spherical coordinates (r, 8, œ), where 
xX=rsinécosg, y=rsinésing, z=rcosé. 
Ellipsoidal coordinates (A, u, v), where 
(4 +a’) (u +a’) (v +a")/(a? —b*)(a? —c’), 
y? =(A+b*)(u+b?)(v + b?)/(b? — c?) (b? —a?), 


22 =(A+c?)(u+c7)(v +c7)(c? — a”) (c? b’), 


x? 


a>b>c>0, A>-—c?>y>—b*>v> —g?. 


These coordinate systems are all systems of 
orthogonal curvilinear coordinates. Suppose 
that we are given two rectangular coordinate 
systems (&, y, ¢) and (x, y, z) sharing the same 
origin. The correlation of the two is given by 
Euler’s angles (0, o, Y), where 0, pọ, and y are 
the angles between the z-axis and ¢-axis, zx- 
plane and z¢-plane, and ¢č-plane and ¢z-plane, 
respectively. The Euler angles 0, p, and w are 
subject to the inequalities 0O<0 <7% and 0< ọ, 
yw <2n. They are often utilized in the dynamics 
of rigid bodies. 


(2) Multipolar Coordinates. Let P,, P,,...,P,, 
be m points in general position in an n- 
dimensional Euclidean space E", m <n. If we 
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denote by p; (20) the distance between a point 
X of E" and P,, then (p,, P2, --- Pm) can be re- 
garded as coordinates of a point X contained 
in a suitable domain of E”. They are called 
multipolar coordinates. In particular, if m=2 
they are called bipolar coordinates, and if m=3, 
tripolar coordinates. When m >n, these coor- 
dinates satisfy m—n relations. Next let «,, 
Q5,.+.,%_, be m hyperplanes in general position 
in E", m<n. For an arbitrary point X of E", we 
denote by č; the directed distance of X from 
each hyperplane «;. The m-tuple (€,, 3, ..., En) 
provides coordinates of X that are called 
multiplanar coordinates in E”. When m>n, 
these coordinates satisfy m—n relations. In 
particular, when n=2 and m=3, they are 
called trilinear coordinates. In this case, if we 
denote by S the area of the triangle defined by 
three lines «,, %3, %3, and by a,, a,, a, the 
lengths of the three sides of the triangle, then 
the trilinear coordinates (¢,, €,, €) satisfy a 
linear relation a, č; +a,€, +4 ,¢;=2S. 


(3) Tangential Polar Coordinates. In a Eu- 
clidean plane E?, we take a directed line lo 
passing through a point O. For an arbitrary 
directed line g, let p be the directed distance 
between O and g, and let 0 be the angle be- 
tween lo and g. Then (p, 0) are called tangential 
polar coordinates (Fig. 1). They are useful for 
representing tangent lines to curves in E?. Let 
C be an foval in E?. A line is called a tsupport- 
ing line of C if its intersection with C consists 
of a point or a line segment. In this case we 
take the origin O inside C and consider the 
coordinates (p, 0) of the supporting lines of C. 
Then the equation of C can be represented as 
p= p(0), where p(0) is a periodic function of 
period 22. The coordinates (p, 0) are especially 
useful when the function p(@) can be expanded 
in a tFourier series. In the case of Euclidean 
space E’, the notion of tangential polar co- 
ordinate system can also be defined by using 
tangent planes. 


Fig. 1 
Tangential polar coordinates. 


(4) Normal Coordinates. Let M be an n- 
dimensional tRiemannian manifold, and let 
T,(M) be the tangent space to M at a point A. 
For each tangent vector ve Ti(M), we draw a 
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tgeodesic through A with the initial direction v 
and take a point P on the geodesic such that 
the distance from A to P is equal to the length 
of v. Then the correspondence that sends v to 
P is a diffeomorphism of a neighborhood of 
the zero vector 0 of T,(M) with a neighbor- 
hood of A in M. Therefore the components 
(v',v?,...,v") of v with respect to a basis of 
T,(M) give the coordinates of the points P 
contained in a suitable neighborhood of A. We 
call them normal coordinates about the point 
A of M. In these coordinates, each geodesic 
passing through A is given by equations v‘ 
=a'r (i=1,2,...,n), where the («') are compo- 
nents of the unit vector in the direction of v 
and r is the parameter that represents the arc 
length from A to the point (v’,...,v”). In partic- 
ular, when n= 2, we fix a tangent vector vg at 
A and denote the angle between v and vy by 0. 
Then (r, 0) are coordinates of P called geodesic 
polar coordinates. The notion of normal co- 
ordinates can also be defined for tLie groups 
or differentiable manifolds with taffine 
connections. 


D: Local Coordinates 


Suppose that we have a space M that has a 
covering by a family of open neighborhoods 
with coordinate systems. If, for each pair of 
neighborhoods with nonempty intersection, 
the coordinate transformation in the intersec- 
tion satisfies certain specified conditions, then 
a mathematical structure on M can be defined. 

Let E be a topological space. Suppose that 
® is a family of open sets in E such that the 
union of any number of open sets in ® and the 
intersection of any finite number of open sets 
in ® also belong to ®. A set I of thomeomor- 
phisms is called a pseudogroup of transforma- 
tions on E ifT satisfies the following three con- 
ditions: (i) Any homeomorphism feT is de- 
fined on an open set Uc®, and f(U)e®. (ii) 
When an open set Ue® is expressed as the 
union of a family {U;} of open sets U;e®, a 
homeomorphism f defined on U belongs to T 
if and only if its restriction to each U; belongs 
to F. (iii) For any open set Ue, the identity 
mapping on U belongs to I, and if f, gel, 
then the inverse f ' and the composition go f, 
if it exists, belong to T. 

Let E and M be topological spaces. A 
homeomorphism ¢:U >V of an open set U in 
E to an open set V in M is called a local co- 
ordinate system of M with respect to E. For 
two local coordinate systems ọ,: U; —> V, and 
,:U,>V,, the homeomorphism 


p, OMG (V, NV)> p: (V NV) 


is called a transformation of local coordinates. 
Let I be a pseudogroup of transformations 
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on E. A set & of local coordinate systems of M 
with respect to E is said to define a T -structure 
on M if & satisfies the following two con- 
ditions: (1) the totality of the images of local 
coordinate systems belonging to X covers M; 
(2) if two local coordinate systems ~, and ọ, of 
£ have a transformation of local coordinates, 
it belongs to I. Now suppose that two sets X 
and 2” of local coordinate systems define T- 
structures on M. If the union of X and Z” 
defines a I’-structure, then we say that the first 
two I -structures are equivalent. Let T be a 
pseudogroup of tdiffeomorphisms, each de- 
fined from an open subset of the n-dimensional 
space R” onto another open set. If a T- 
structure is defined on a space M, then M is 

an n-dimensional ‘differentiable manifold. 

On the other hand, let be a pseudogroup 
of complex analytic homeomorphisms in an n- 
dimensional complex number space C”. If a T- 
structure is defined on a space M, then M is an 
n-dimensional tcomplex analytic manifold. 
Locally homogeneous spaces, tfoliated mani- 
folds, the tfiber bundles are all equipped with 
local coordinate systems with suitable T- 
structures. 
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A. General Remarks 


A continuous mapping p: YY of an tarewise 
connected topological space Ý onto a con- 
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nected topological space Y is called a covering 
mapping (covering map) if the following con- 
dition (C) is satisfied: (C) Each point of Y has 
an open neighborhood V such that every 
tconnected component of p~'(V) is mapped 
homeomorphically onto V by p. Here we note 
that Y is in fact arcwise connected. 

If there is a covering mapping p: > Y, we 
call f a covering space of Y and (Y,p, Y) a 
covering. In particular, for a tdifferentiable 
manifold Y, if Ÿ is also a differentiable mani- 
fold and p is differentiable, then F is called a 
covering (differentiable) manifold of Y. (In the 
theory of tRiemann surfaces, a tcovering sur- 
face may have some tbranch points violating 
condition (C). Upon removing such points, we 
obtain a covering space as defined above.) 

For each tpath w:I> Y (J=[0,1]) of Y,a 
path w:I— Y with po w=w is uniquely deter- 
mined by the point #(1)ep~'(w(1)), and a 
bijection wy: p”'(w(1))->p *(w(0)) is deter- 
mined by w,(w(1))=w(0). Thus there exists a 
one-to-one correspondence between p~! (y) 
and p ‘(y’) for every pair of points y, y’ of Y, 
and (F, p, Y, p~'(yo)) is a flocally trivial fiber 
space with discrete fiber p~!(y,). When the 
cardinal number of p~! (y) is a finite number n, 
we call (Y, p, Y) an n-fold covering. In this case, 
for a tloop w(I, Ì)—>(Y, yo) with base point yo, 
w,:p (Vo) >p (yo) is a permutation of the n 
elements in p~! (yo), and we obtain a homo- 
morphism of the tf'undamental group z,(Y)= 
n(Y, Yo) of Y into the tsymmetric group G,, 
given by the correspondence w>w,. The 
permutation group Wt, which is the image of 
this homomorphism, is called the monodromy 
group of the n-fold covering. 

Two coverings (Ë, p;, Y) (i= 1,2) are said to 
be equivalent if there is a homeomorphism 
@:Y,-Y, with p,og=p,; sucha o is called 
an equivalence. In particular, a self-equivalence 
op: YY of a covering (Ë p, Y) is called a cover- 
ing transformation. The set z of all covering 
transformations forms a group by the compo- 
sition of mappings, which is called the covering 
transformation group of Y. We call (F, p, Y) 

a regular covering if for each ye Y and f,, 
j,Ep '(y), there exists a unique covering 
transformation that maps j, to ĵ,. In this 
case, the forbit space Y/z is homeomorphic 
to Y,(Y,p, Y, x) is a tprincipal bundle, and 
the monodromy group Mi is isomorphic 

to m. 

For a covering (Y, p, Y), we call Ÿ a covering 
group of Y if f and Y are topological groups 
and p is a homomorphism. Then (F, p, Y) is a 
regular covering, and its covering transfor- 
mation group is isomorphic to p~'(e) (e is the 
identity element of Y), which is a discrete 
subgroup lying in the center of ¥(— 423 Topo- 
logical Groups O). 
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B. Universal Covering Spaces 


For a covering (Y, p, Y), we have the following 
relations of the thomotopy groups: Pym (Y)> 
m,(Y) 1s isomorphic (monomorphic) for i> 2 
(i=1), and 2,(¥)/p,(7,(Y)) is in one-to-one 
correspondence with p~'(y9) (yo€ Y). If ¥ 

is tsimply connected, Ý is called a universal 
covering space of Y; if in addition f is a cover- 
ing group, Y is called a universal covering 
group of Y. 

For a flocally arcwise connected space Y, 

a covering (Y, p, Y) is regular if and only if 
p,(,(¥)) is a normal subgroup of 7,(Y¥), and a 
universal covering space of Y is a covering 
space of any covering space of Y. Moreover, if 
Y is a topological group, any covering space Y 
of Y can be given a unique topological group 
structure with which Y is a covering group of 
Y. 

Let Y be an arcwise connected, locally arc- 
wise connected, and tlocally simply connected 
space. Then the following classification theo- 
rem of coverings holds: The set of equivalence 
classes of coverings of Y is in one-to-one cor- 
respondence with the set of conjugate classes 
of subgroups of the fundamental group 7 (Y); 
in particular, the equivalence class of a cover- 
ing (Y, p, Y) corresponds to the conjugate 
class of the subgroup p,((Y)). Also, there is a 
unique universal covering space Y of Y up to 
homeomorphism. If in addition Y is a topolog- 
ical group, then F is a unique universal cover- 
ing group of Y up to isomorphism of topolog- 
ical groups. Such a space Y is obtained as 
follows: Consider the tpath space Q(Y; yo, Y) 
of all paths in Y starting from a fixed point 
Yo€ Y, and define two paths wo, w; :(1,0)—> 
(Y, Yo) such that wo{1)=w, (1) to be equiva- 
lent if and only if there is a thomotopy w,: 

(1, 0)-(Y, yo) with w,(1)=wo(1) (O<t< 1). 


Then we obtain the tidentification space F of — 


Q(Y; yo, Y) by this equivalence relation and 
the mapping p: Y> Y by p{w} = w(1); this F is 
the universal covering space of Y. 

Let (Y, p, Y) be a regular covering with the 
covering transformation group z. Then there is 
a tlocally trivial fiber space (Y’,q, B, Y) such 
that the total space Y’ has the same t(co) ho- 
mology groups as Y, and the base space B is 
an tEilenberg-MacLane space K(z, 1). The (co) 
homology fspectral sequence of this fiber space 
is called that of the given regular covering 
(F, p, Y), Eœ is a bigraded module tassociated 
with a certain filtration of the fsingular (co) 
homology module H(Y), and E, is the (co) 
homology module H(z; H(Y)) of the group z, 
where z operates on the coefficient module 
H(Y) via the induced homomorphisms of 
covering transformations (— 148 Fiber 
Spaces). 
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For any group 7, there is a regular covering 
(E. p, B,) with the covering transformation 
group 7 such that E, is tcontractible; B, is an 
Eilenberg-MacLane space K(z, 1). We can take 
St (1-sphere) as Bz of the infinite cyclic group 
Z, and the following infinite lens space as Bz, 
of the finite cyclic group Z, (— 70 Complexes). 


C. Lens Spaces 


Let k be a positive integer and /,,...,1,, be 
integers prime to k. Let S2""' = {(zo,...,z,)€ 
CH [|z]? +... +]z,|? = 1} be the unit sphere 
in the (n+ 1)-dimensional complex linear space 
C”"*!, and define the rotation y by y(Zo, Z1, 

<-s Zn) = (Zoexp 27i/k, z exp 271, i/k, ..., 

z, exp 271, i/k). Then the torbit space $*"*!/Z, 
=L(k;l,,...,1,), where Z, = Z/kZ is interpreted 
as the cyclic group generated by y, is called 

a lens space. It is an orientable (2n + 1)- 
dimensional tdifferentiable manifold. Also, the 
infinite lens space L®(k)= L(k;1,...,1,...) 

is defined by taking n= œ; the infinite sphere 
S°” is a k-fold covering space of L®(k), and 
L®(k)= Bz, = K(Z,, 1). Its tcohomology ring 

is given as follows. 

(1) For integral coefficients, H*'*!(L(k))= 
0, H?'(L®(k))=Z, (i> 0), and the tcup prod- 
uct of generators of degree 2i and 2j is a gen- 
erator of degree 2(i +j). 

(2) Let k=pk’ (p is a prime). If p#2, or p=2 
and k’ is even, H*(L®(k); Z,)= Ne) © Zp Le. ]- 
If p=2 and Kk’ is odd, H*(L” (k); Z,)=Z,[e, | 
(e; is an element of degree i). Here A indicates 
the texterior algebra over Z,, and Z,[ ] the 
*polynomial ring over Z,. 

Two lens spaces L(k;1,,...,1,) and L(k’3 li, 
...,/,) are of the same homotopy type if and 
only if k=k’ and there is an integer m prime 
to k with 


l... = tm" 1, ... (mod k) 


[8]. Furthermore, the condition k=k’, l= 
+1'*!(mod k) holds if and only if L(k; I) and 
L(k'; l) are homeomorphic. (Sufficiency is 
shown in [1]; necessity follows from the fact 
that the Hauptvermutung is valid for com- 
binatorial 3-manifolds and that the condition 
holds if the polyhedra L(k; l) and L(k; I’) have 
isomorphic subdivisions [6].) Also — [7,11] 
and 65 Combinatorial Manifolds. _ 
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A. Space Groups and Point Groups 


Let G be a discrete subgroup of the group of 
tmotions © in the real n-dimensional Eu- 
clidean space V. If G contains n linearly inde- 
pendent translations, then G is called an n- 
dimensional crystallographic space group, a 
space group, or a crystallographic group. The 
following assertions are known to be equiva- 
lent to each other: (i) A discrete subgroup G of 
© is a space group; (ii) G/G is compact; (iii) 
V/G is compact; (iv) There is a compact subset 
P of V such that V=GP=|_),.¢gP [1]. The 
subgroup T consisting of all translations in G 
is called the lattice group of G. T is a normal 
subgroup of G and is generated by n linearly 
independent translations, say, t,,t>,...,t,- 
Take a point x of V. Then the T-orbit of the 
point x is called a lattice of G (or T). t,,...,¢, 
can be identified with a basis of V. Let K be 
the quotient group G/T and k=gT an element 
of K. Then k gives rise to a linear transforma- 
tion k on T via the formula k(t)=gtg™ (the 
product in G), te T, and K can be regarded 

as a subgroup of the torthogonal group of 

V via the above identification of T with a 
lattice in V. That is, the set of orthogonal 
transformations stabilizing x, {g|(g(y) =x +. 
g(y)— g(x), ye V), ge G} forms a group isomor- 
phic to K. The quotient group K is a finite 
group and is called the point group of G. A 
group isomorphic to a point group of an n- 
dimensional space group is sometimes called 
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an n-dimensional crystallographic group. An 
algebraic characterization of space groups 
among abstract groups is given by “A group G 
is isomorphic to an n-dimensional space group 
<= G has a normal tfree Abelian subgroup of 
trank n, which is maximal Abelian and has 
finite index” [1]. 

By representing the action of K on T in 
terms of a basis of T, K can be regarded as a 
subgroup of GL(n, Z). Hence we obtain the 
following crystallographic restriction: If K 
contains an element of order m, then n> (m). 
Here ¢ is the tEuler function. For example, 
if n=2, 3, then m=1, 2, 3, 4 or 6; and if n= 
4, 5, then m= 1, 2, 3, 4, 5, 6, 8, 10, or 12. The 
number of nonconjugate finite subgroups of 
GL({n, Z) is finite (Jordan-Zassenhaus theorem; 
— (2,3,21]). 

Two space groups G, and G, are called 
equivalent if they are conjugate via an ‘affine 
transformation f of V, i.e., G,=/G,f—!. Let T, 
and K; be the lattice group and the point 
group of G; (i=1, 2), respectively. Then T, = 
hT, and K, =hK h! for the lincar trans- 
formation h induced by f. Moreover, there is 
an orthogonal transformation k such that 
K,=kK,k +; this k can naturally be consid- 
ered as an element of GL(n, Z). G, and G, are 
equivalent <> G, and G, are isomorphic (as 
abstract groups) [1]. In applications it is often 
required that in the definition of equivalence f 
be an torientation-preserving affine transforma- 
tion. Under this definition, there can be a pair 
G,, G, such that G, and G, are mutually iso- 
morphic but not equivalent, since they are 
conjugate only by means of an torientation- 
reversing affine transformation. In this case, 
they are called enantiomorphous to each other, 
or the pair is called an enantiomorphic pair 
(Table 1 below). For a given dimension n, 
there is only a finite number of equivalence 
classes of space groups [1]. We also conclude 
from this that point groups have finitely many 
equivalence classes up to conjugacy in the 
orthogonal group O(V) or in GL(n, Z). 


B. Crystal Classes, Bravais Types 


Let T be an n-dimensional lattice in n- 
dimensional Euclidean space V, and K a finite 
subgroup of the orthogonal group O(V). De- 
note by a pair (T, K) a tfaithful linear trepre- 
sentation of K on T, i.e., a *monomorphism 
of groups K >Aut(T). A space group G deter- 
mines a pair (T, G/T). Conversely, for any pair 
(T, K) the following holds: “Any group texten- 
sion of K over the tkernel T is isomorphic to a 
space group” (— the algebraic characteriza- 
tions in Sections A and C). Thus to each pair 
(T, K) there corresponds a set of space groups. 
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Two pairs (7,, K,) and (T,, K,) are called 
arithmetically equivalent (this is denoted by 
(T,,K,)*(T,, K,)) if there exists an invertible 
linear transformation ge GL(V) such that T, 
=gT,, K=gK g '. These pairs are called 
geometrically equivalent or simply equivalent 
(this is denoted by (T,, K,)~(T), K,)) if there 
exists a ge GL(V) such that K,=gK,g~'. The 
relation ~(~) is an equivalence relation, and 
equivalence classes are called arithmetic (geo- 
metric) crystal classes. The set of geometric 
crystal classes is in one-to-one correspondence 
with the set of equivalence classes of point 
groups up to conjugacy, and is sometimes 
called the set of crystal classes. The geometric 
crystal class of (T, K) is usually denoted by K. 

Now let T be a lattice. Then the group of all 
orthogonal transformations that leave T in- 
variant is called the Bravais group of T and 
is denoted by B(T), i.e., B(T)={geEO(V)|gT= 
T}. The group B(T) is finite and determines 
a pair (T, B(T)). Two lattices T, and T, are 
called arithmetically equivalent (this is denoted 
by T, = T,) if (7,, B(T,)) and (4, B(T,)) are 
arithmetically equivalent. An equivalence class 
is called a Bravais type, and an arithmetic 
crystal class determined by (T, B(T)) is called a 
Bravais class. For each Bravais type, a repre- 
sentative is said to be its Bravais lattice (Fig. 3 
below). 

Define an torder relation on the set of lat- 
tices belonging to an arithmetic crystal class as 
follows. When (T,, K,)*(T,, K3), define T, < T, 
if there exists a ge GL(V) such that T, =gT,, 
K =gK g, and B(T,)<gB(T,)g™'. Note that 
T, = T,<> T, < T, and T, <T. In each arith- 
metic crystal class, there is a pair (T, K) witha 
tminimal T with respect to this order relation. 
In this case the Bravais type of T is referred to 
as the Bravais type of the class of (T, K). This 
lattice T is, intuitively, of the most general type 
appearing in the class. Now define a relation 
T, ~s T, if there are pairs (7;, K,), i=1, 2, such 
that (T,, K) ~(T,, K,) and T, is minimal in the 
class of (7;, K,;). The equivalence class of the 
equivalence relation generated by the relation 
~, is called a crystal family. 

Summarizing, we have defined the following. 
Let F, 4, 8G, B, CF be the set of equivalence 
classes of space groups, arithmetic crystal 
classes, geometric crystal classes, Bravais 
types, and crystal families, respectively. Then 
we have the relations shown in Fig. 1. (All 
arrows in Fig. 1 are surjective.) @F is nothing 
but the tcoproduct of and B over 4, GF 
>g ]],2. 


S— J+ G Hu» G 
| a 
B—>CF CSL —>CF 

Fig. 1 Fig. 2 


92 C 
Crystallographic Groups 


Moreover, there are injective mappings 2 c. 
and /<¥ such that the composites Z œ. > 
B and J4S-MA are identities. The first is 
the mapping sending a Bravais type T to the 
arithmetic crystal class defined by (T, B(T)). 
The second one sends a class of (T, K) to 
a tsemidirect product of T and K (— 190 
Groups N). A space group belonging to a class 
of the image of £ cS is said to be symmorphic 
or symmorphous. A class which belongs to the 
image # of the composite Z œ. >G is called 
a holohedral or holosymmetric class, or a holo- 
hedry. Define a mapping M of Y to the power 
set P(B) by M =(4—.A)- (4 >F) t. For two 
elements C, and C, of Y, define C, ~,C, if 
M(C,)=M(C,). Each equivalence class is 
called a crystal system, and the set of crystal 
systems is denoted by €. Two elements C 
and C’ of belong to the same crystal family 
if and only if there exists a sequence of ele- 
ments of , C,, C,,...,C, such that C, =C 
and C,=C’ and M(C)N M(C,,,)4#@ for i= 
1,...,k—1. Therefore there is a surjective map- 
ping 6S >@F (Fig. 2). When n <4, the com- 
posite #.G—-€F is bijective. It should be 
remarked that when n > 3, there is no map- 
ping >@¥F such that J -9>@F and A —> 
BCF coincide. 

The numbers of elements of these sets are 
shown in Table 1. 


Table 1 

n = | 2 3 4 
EF , crystal ; 
families 1 4 6 23 
CEF? crystal 
systems l 4 7 33 
2, Bravais types 1 5 14 64 
G, point groups 
(isomorphic 
classes)? 2(2) 10(9) 32 (18) 227 (118) 
W, arithmetic 
crystal 
classes 2 13 73 710 
S, space groups 2 17 219° = 47834 


a When n<4, the number of elements of # coincides 
with that of @F. 
° The number in parentheses denotes the number of equiva- 
lence classes under algebraic isomorphism. 
*4The number of equivalence classes under orientation- 
preserving transformations is 230 and 4895, respectively. 


C. Construction of Space Groups 


Take an element (T, K) of an arithmetic crystal 
class. Since K is finite and T is finitely gen- 
erated, the cohomology groups H'(K, T) are 
finite groups (— 200 Homological Algebra G). 
Let « be a tsecond cocycle representing an 
element [«] of H?(K, T). Let (T, K), be the 
textension of K with kernel T correspond- 

ing to [a] (— 190 Groups N). Then the set 

of elements of (T, K), is given by {<t,k>| 
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teT,ke K}, and the product is defined by 

<tr sky > Stz, k2) = Cty thytz+a(ky, k2), kı k2). 
Since H'(K, V)=0, i>1, there is a tfirst co- 
chain BeC'(K, V) such that a(k,,k,)=f(k,)+ 
kı B(k,) —B(k,k,), and the tfirst cocycle $ on 
V/T defined by £ induces an isomorphism 
H!(K, V/T) H?(K, T), [B]++[«]. Define an 
action of <t, k> on V by 


<t,ky(x)=kx+t+ß(k), xev. 


Then (T, K), acts on V as a space group. Con- 
cerning their equivalence, this result holds: 
“The equivalence classes of (T, K),, [w]e 
H?(K, T) are in one-to-one correspondence 
with the orbits of N(K), the normalizer of K 
in Aut(T) acting on H?(K, TY” [4]. The action 
of he N(K) is given by "x(k)=ha(h7! kh), «e 
C?(K, T). 

For a pair (T, K) the weight lattice or the 
weight group 7* of (T, K) is defined by {ve V | 
v—kveT,VkeK}. Let R be an irreducible re- 
duced ‘root system, Q(R) its lattice, and W(R) 
its Weyl group. Then for a pair (O(R), W(R)), 
Q(R)* ts nothing but the usual weight lattice 
P(R). (— 248 Lie Algebras). Suppose that K of 
(T, K) does not contain the central inversion 
— l, v —v, ve V. Denote the group KU 
(—I,)K by +K. Then there is an isomorphism 


H*(+K,V/T)S[H'(K, V/T)], 
x (T*/T)/2(T*/T). 


Here [-], denotes the subgroup of elements 
of order not greater than 2 [5]. This isomor- 
phism provides powerful means for the con- 
struction of space groups. 

In constructing a pair (7, K), it is useful to 
take a basis of T in a special form. Let {t,,t2, 
-s ln} be a basis of a lattice T, and set a; = 
(ti, t;), where (,) is the inner product in V. The 
nxn matrix A =(a;;) is symmetric and tposi- 
tive definite, and the numbers a;; are called 
the lattice constants of T. A basis {t,,...,t,} is 
called a reduced basis if the tquadratic form 
(Ax, x)= La;;x;x; is reduced (> 348 Qua- 
dratic Forms). A lattice admits at least one re- 
duced basis, and in this case + 2a, <a,;<a,;, 
i<j [6, Table 5.1; 7]. The totality of an n- 
dimensional reduced basis forms in a nat- 
ural manner a semialgebraic set of dimension 
(n+ 2)(n—1)/2 in R“*!"”, Its closure, say Q,,, 
is compact. On the other hand, the set L, = 
(R, x O(R"))\GL(n, R)/GL(n, Z) is identi- 
fied with the set of equivalence classes of n- 
dimensional lattices under the relation of 
“same shape.” There is a natural inclusion 
L,U...UL,<@Q,, and L, is dense in Q,. More- 
over, the inclusion gives a bijective homeo- 
morphism when n<4 [8]. 

A finite subgroup K of the orthogonal 
group O(V) is said to be fully transitive if there 
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is a set S={e,,...,e,} that spans V on which K 
acts ttransitively and if K has no tinvariant 
subspace in V. In this case one can choose S as 
either of the following: (i) the primitive hyper- 
cubic type, S={e,,..., en}, (€i €;) = 0, (*Kro- 
necker delta); (ii) the primitive hyperbolic type, 
S={fiso Sth =l i=1,..,n+1, 

(fo fJ= —1/n, i, j=1,...,n+1, i¢j (especially, 
rt fi=0) [10]. 


D. Color Symmetry Groups, Twinning 


A color symmetry group, or a colored symme- 
try group, associated with a space group G is a 
pair (G, G’) such that G’ is a subgroup of G 
with finite index. The index r=[G:G’] is called 
the number of colors. We call (G, G’) a white 
group when r= 1, a black and white group or a 
magnetic group when r= 2, and a polychroma- 
tic group when r>3. G’ is also a space group 
with the lattice group T’ =G NT, where T is 
the lattice group of G. Two color symmetry 
groups (G, Gi) and (G,, G5) are called equiva- 
lent if there exists an taffine transformation f 
such that G,=/G,f~! and G} = fG f !. Then 
(Gi, Gi) and (G,, G3) are equivalent if and only 
if there is an isomorphism o:G,-+G, such that 
o(G,)=G). A color symmetry group (G, G’) is 
called lattice equivalent if T’ = T, K'c K, and is 
called class equivalent if T’< T, K' =K. Take a 
pair (G, G’), and let G” be the tinverse image of 
K’ under the canonical tepimorphism G— K. 
Then (G, G”) is lattice equivalent and (G”, G’) is 
class equivalent. Fix the color number r. Then 
(i) the equivalence classes of lattice equivalent 
color symmetry groups are in one-to-one 
correspondence with the conjugacy classes of 
subgroups K, of index r, and (ii) the number of 
equivalence classes of class equivalent color 
symmetry groups is finite. When r is prime to 
the order of K, the classes are in one-to-one 
correspondence with K-invariant sublattices 
T, of index r in T. Therefore, when the number 
of colors is given, the set of equivalence classes 
of n-dimensional color symmetry groups is 
finite. 

In case (i), a pair (K, K,) is called a color 
point group, and in case (11), (T, T,) is called a 
color lattice. In particular, when r=2, they are 
called a black and white point group and a 
black and white lattice, respectively. Their 
equivalence is defined in a similar way. 

Take a color lattice (T, T,) of color number 
r. Let m be the least natural number such that 
mTcT,, then mis a divisor of r. Let m= 
pi -+ Pe’, Pi # p; (ij) be the decomposition 
of m into its prime factors. Then there is a 
sequence of K-invariant sublattices of T, T; 
i=2,...,k—1, such that (i) Te T,... c a 
T = T; (ii) p Ta c Tj, i=1, ..., k; Gii) the 


343 


submodules T;/p T; of i=1,...,k, Z" @ 
Z/piiZ are uniquely determined independently 
of the choice of such a sequence {7;}. Thus 

the existence of color lattices is related to the 
tmodular representation of K (— 362 Repre- 
sentations G). For example, if r is a prime 
number, then the representation of K on T/rT 
~Z" © Z/rZ is the reduction modulo r of 
that of K on T. 

A mathematical treatment of twinning is 
given as follows. Let V be an n-dimensional 
Euclidean space and T an n-dimensional lat- 
tice in V. The volume of T, vol(7), is defined 
by vol(T)=(|det A|)"”, where A = (a;) is the 
matrix of lattice constants. This definition is 
independent of the choice of a basis of T. If 
T’ is a sublattice of index r in T, vol(T’)=r- 
vol(T) follows. Let K be a finite subgroup 
of O(V), T, and T, K-invariant lattices, and 
U a hyperplane in V. Then the quadruple 
(T,, T2, K, U) is called a twinning structure if 
it satisfies the following: (i) vol(7,) = vol(T,); 
(ii) the lattice 7; is minimal in the arithmetic 
crystal class of (7;, K), i=1, 2; and (ii) T, QU = 
T, 1U=T,, and Tp is an (n— 1)-dimensional 
lattice in U. Two arithmetic crystal classes are 
said to be twinable if there exist representa- 
tives (T,, K), (T>, K), and a hyperplane U such 
that (7,, T;, K, U) is a twinning structure. Let 
(T,, K) and (T,, K) be two arithmetic crystal 
classes. If there exists a K-invariant hyper- 
surface U such that 79 =7,0U=T,NU is 
an (n— 1)-dimensional lattice in U, then two 
classes (T,, K) and (7,, K) are twinable. 


E. Three- and Two-Dimensional 
Crystallography 


Three-dimensional crystal classes are listed in 
Appendix B, Table 5.1V. To name these crystal 
classes, both Schoenflies’ notation and the 
international notation are used. 

Fix an torthogonal tright-handed system 
(x,y,z) in V. For each crystal system, define 
three axes as follows. The first axis is the z- 
axis. The second is the line x = y=z for the 
cubic system and the x-axis for the others. The 
third is the y-axis for orthorhombic, trigonal, 
and hexagonal systems, and the line x= y, z=0 
for the others. Then a crystal class is expressed 
by its generators with respect to these axes. 
The symbol k (k= 1, 2, 3,4, 6) denotes the rota- 
tion by the angle 2x/k around the axis; k is a 
composition of the rotation k and the central 
inversion (v> —-v, ve V), and k/m (k= 2, 4, 6) 
means the composition of k and the reflection 
about the hyperplane perpendicular to the 
above axis. The symbol m denotes 2. Usually, 
1 is omitted. This is the full international nota- 
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tion and a set of generators can be read from 
it. The short international notation is also used. 

Schoenflies’ notation consists of the letters 
C,, Dp» Sn, T, O, and the subscripts h, v, d. The 
letters denote the group and order: C, is the 
tcyclic group of order n; D, is the dihedral 
group of order 2n; T is the ttetrahedral group 
isomorphic to the fourth talternating group; O 
is the toctahedral group isomorphic to the 
fourth ‘symmetric group; and S, =1, S4=4, S, 
=3. The subscripts h, v, d mean that the group 
is generated by reflections in the hyperplane 
which is horizontal, vertical, or perpendicular 
to a diagonal, respectively. The orientation is 
appropriately given. For example, in C,,, and 
D,, there is an nth order rotation around the 
z-axis and the horizontal plane is the xy-plane. 

The above notations are also applied to 
noncrystallographic finite subgroups of the 3- 
dimensional orthogonal group [13]. 

The symbol for Bravais type is specified by 
the name of its crystal system and one of the 
letters P, A, B, C, F, I. Six kinds of simple 
Bravais lattices are given as follows. A simple 
lattice is generated by {a,b,c} and satisfies the 
following condition. Denote the lengths of 
vectors by a= |al|, b= ||bl], c=|le||, and let x = 
Z (b,c), B= Z(¢, a), y= Z (a,b) be the angles 
between the indicated pairs of vectors. For 
triclinic P, a #b # c #a, a # f #y Ha; for mono- 
clinic P, a#b#cH#a, x =y = 90° £ ß; for or- 
thorhombic P, a#b4¢cH#a, a= B =y =90°; 
for tetragonal P, a =b £c, «= f =y =90°; for 
hexagonal P or trigonal P, a =b #c, a = ĝ = 90°, 
y= 120°; for cubic P, a =b =c, c= f =y = 90°. 
Let T be one of the above lattices generated by 
{a,b,c}. Then the lattices generated by {a, b, 
(b+c)/2}, {(c +a)/2, b,c}, {a,(a+b)/2,c} {(a + 
b+ c)/2, b,c}, {(a+b)/2, (b + ¢)/2, (c +a)/2} are 
called A, B, C, I, F lattices determined by T. 
The original lattice T is called the primitive 
lattice. The trigonal R lattice is generated by 
{a',b’,c’} satisfying a’ =b' =c, a =p’=y'< 
120°, 490°. Set a=a’—b’, b=b’—c’", c=a’ + 
b +c’. Then the hexagonal P lattice gener- 
ated by {a,b,c} is defined to be the primitive 
lattice of the trigonal R lattice. The fourteen 
Bravais lattices are illustrated in Fig. 3. 

An arithmetic crystal class (T, K) is denoted 
by the symbol of the Bravais lattice T and the 
symbol of K with respect to the action of K on 
T. Let the primitive lattice of T be generated 
by {a,b,c}. Put the vector a on the x-axis, and 
b on the xy-plane. The action of K ts then 
determined by the international notation for 
K, except when K = 42m, 32, 3m, 3m, 6m2, 
and T is the primitive or / lattice. In these 
cases two kinds of actions interchanging the 
second and the third symbols are not equiva- 
lent. If such is the case, 1 is inserted into the 
third position for trigonal systems. Let (K) be 
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the symbol determined as above and (T) be 
the symbol of T. Then the action (T, K) is 
expressed by ((T), (K)). For example, when K 
= 32, there are three arithmetic crystal classes 
(P, 321), (P, 312) (R, 32). 


P C 
Triclinic Monoclinic 
c | 
b i k 
a X / 
P C I F 
Orthorhombic 
AF AA s ora. 
} 1 
bi KN ! 
1 
Z NA tt 
zra 
P I R P 
Tetragonal Trigonal Hexagonal, 


Trigonal 


Fig. 3 
The conventional unit cell for each of the 14 Bravais 
lattices. 


The notation of space groups determined by 
(T, K) is given by writing both the symbol (T), 
and the symbols of actions of elements on V 
which appear in the symbol (K) defined above. 
The symbol (TXK) denotes the tsemidirect 
product of T and K by the action (T, K), and 
is the symmorphic space group determined 
by (T, K). Other nonsymmorphic groups are 
designated by replacing symbols in (K). If k is 
replaced by k; (j=1,...,k~1), then k; is a k- 
fold screw glide with pitch j/k, that is, the com- 
posite of the rotation k and the translation 
jf/k, where f is the vector of minimum length 
of T along the axis of k. If m is replaced by 
a, b, c, n, d, then it is a glide reflection, that 
is, the composite of the reflection m and the 
translation in the reflecting hyperplane with 
the direction of a, b, c, a face diagonal and 
a diamond, respectively. See [6] for the pre- 
cise meaning of the notation discussed tn this 
paragraph. 

The notations for 2-dimensional point 
groups and space groups can be adopted from 
the ones for 3-dimensional ones by setting the 
reference plane as the xy-plane and the z-axis 
perpendicular to that plane. In this setting, the 
notation of a plane point group is given by the 
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corresponding point group in 3-space, and 
similarly for plane space groups. There is only 
one glide operation, denoted by g. The motion 
g is the composite of the line reflection about 
an axis and half the translation along the axis 
with the vector of minimum length of T. In 
[13], the precise group-theoretic description of 
space groups can be found. It should be noted 
that the notations for the two groups p3m! 
and p3im are frequently interchanged in the 
literature (including earlier editions of [13]). 


F. History 


The first mathematical study of the structure 
of crystals was done by the mineralogist J. F. 
C. Hessel (1830). He enumerated the finite 
subgroups of the 3-dimensional orthogonal 
group. Afterward, his result was rediscovered 
many times. A. Bravais (1850) showed that in 
3-dimensional Euclidean space there are only 
14 different lattices. All space groups consist- 
ing of orientation-preserving transformations 
were determined by C. Jordan (1868, 1869) and 
L. Sohnke (1879). Finally, almost all the space 
groups were determined independently by E. 
S. von Fedorov (1885, 1889) and A. Schoenflies 
(1887, 1889). Probably as a result of com- 
parison of each other’s lists, they established in 
1891 the existence of the 230 space groups. 
Then A. Barlow (1894, 1896) derived these 
space groups by adding reflection operations 
to Sohnke’s 65 groups. 

In 1900 in his eighteenth problem D. Hilbert 
raised the question of whether the number 
of equivalence classes of space groups of a 
given dimension is finite. This was answered 
affirmatively by L. Bieberbach (1910 [1]). An 
algorithm for determining space groups was 
given by H. Zassenhaus (1948 [4]). Their 
results, including [2,3], and work by C. Her- 
mann (1949 [9], 1952 [10]) gave a solid 
foundation for n-dimensional crystallography. 

Then followed a concrete treatment of 4- 
dimensional crystallography. A. C. Hurley 
(1951), using an earlier work of M. E. Goursat 
(1889) on finite subgroups of the 4-dimensional 
special orthogonal group, determined 221 of 
the 4-dimensional point groups. In the 1960s 
A. L. Mackay and G. S. Pawley (1963) and 
others gave 56 of the 4-dimensional Bravais 
types. After E. C. Dade (1965) determined 
9 maximal finite subgroups of GL(4, Z), R. 
Bülow (1967) and H. Brown, J. Neubuser, and 
H. Zassenhaus (1968) independently deter- 
mined 710 arithmetic crystal classes. They 
reconfirmed the result with H. Wondratschek, 
and as a result, 64 Bravais types and 227 
geometric crystal classes were established 
(1971 [15]). They ran Brown’s computer 
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program for the determination of space groups 
based on [4] and obtained 4,783 space groups. 
Their work culminated in a book (1978 [16]). 
There have been several attempts to establish 
a unified treatment of n-dimensional crystal- 
lography (5, 8, 14, 17] and crystallography 

in complex Euclidean spaces. 

Two-dimensional space groups were em- 
pirically known in ancient times, as demon- 
strated by artistic decorations. All the 17 
2-dimensional space groups appear in the 
tile patterns of the Alhambra in Granada. A 
discrete subgroup of the plane motion group 
whose translation subgroup is only rank one 
is called a frieze group. 

The notion of black and white groups was 
introduced by H. Heesch (1930), H. J. Woods 
(1935), and A. V. Shubnikov (1951). Only with 
the introduction of the use of neutron diffrac- 
tion techniques did it become apparent that 
these groups could be used in the description 
of magnetically ordered structures [12]. Color 
symmetry groups were defined by B. L. van 
der Waerden and J. J. Bruckhardt (1961 [11]) 
for an arbitrary number of colors. A mathe- 
matical treatment of the twinning structure 
is found in papers by T. Ito (1938) and R. 
Sadanaga (1959). 


References 


[1] L. Bieberbach, Uber die Bewegungsgrup- 
pen der Euklidischen Räume, Nachr. Akad. 
Wiss. Gottingen (1910), 75-84; Math. Ann. 70 
(1910), 297-336 and 72 (1912), 400-412. 

[2] L. Bieberbach, Uber die Minkowskische 
Reduktion der positiven quadratischen For- 
men und die endlichen Gruppen linear ganz- 
zahliger Substitutionen, Nachr. Akad. Wiss. 
Gottingen (1912), 207-216. 

[3] H. Zassenhaus, Neuer Beweis der Endlich- 
keit der Klassenzahl bei unimidularer Äquiva- 
lenz endlicher ganzzahliger Substitutions- 
gruppen, Abh. Math. Sem. Univ. Hamburg, 12 
(1938), 278-288. 

[4] H. Zassenhaus, Über einen Algorithmus 
zur Bestimmung der Raumgruppen, Comment. 
Math. Helv., 21 (1948), 117—141. 

[5] G. Maxwell (and N. Broderick), The cry- 
stallography of Coxeter groups, J. Algebra, 35 
(1975), 159-177 and 44 (1977), 290-318. 

[6] N. F. M. Henry and K. Lonsdale, Interna- 
tional tables for X-ray crystallography, vol. 1, 
Symmetry groups, Kynoch, Birmingham, 1952, 
1965, 1969. 

[7] B. L. van der Waerden, Die Reduktions- 
theorie der positiven quadratischen Formen, 
Acta Math., 96 (1956), 265-293. 

[8] R. L. E. Schwarzenberger, Classification of 
crystal lattices, Proc. Cambridge Philos. Soc., 
72 (1972), 325-349. 


93 A 
Curves 


[9] C. Hermann, Kristallographie in Raumen 
beliebiger Dimensionzahl I, Die Symmetrie- 
operationen, Acta Cryst., 2 (1949), 139-145. 
[10] C. Hermann, Translationsgruppen in n- 
Dimensionen, Zur Struktur und Materie der 
Festkörper, Springer, 1952. 

[11] B. L. van der Waerden and J. J. Burck- 
hardt, Farbgruppen, Z. Krist., 115 (1961), 
231-234. 

[12] A. V. Shubnikov and N. V. Belov, Col- 
ored symmetry, Pergamon, 1964. 

[13] H. S. M. Coxeter and W. O. Moser, Gen- 
erators and relations for discrete groups, 
Springer, 1957, 1964, 1972, 1979. 

[14] E. Ascher and A. Janner, Algebraic as- 
pects of crystallography, Helv. Phys. Acta, 38 
(1965), 551-571, and Comm. Math. Phys., 11 
(1968), 138—167. 

[15] R. Bülow, J. Neubüser, and H. Wondrat- 
schek, On crystallography in higher dimen- 
sions, Acta Cryst., A27 (1971), 517-5385. 

[16] H. Brown, R. Bülow, J. Neubüser, H. 
Wondratschek, and H. Zassenhaus, Crystallo- 
graphic groups of 4-dimensional space, Wiley, 
1978. 

[17] R. L. E. Schwarzenberger, N-dimensional 
crystallography, Research notes in math., 
Pitman, 1980. 

[18] A. Kelly and G. W. Groves, Crystal- 
lography and crystal defects, Longman, 1970, 
[19] M. J. Buerger, Introduction to crystal 
geometry, McGraw-Hill, 1971. 

[20] G. J. Bradley and A. P. Cracknell, The 
mathematical theory of symmetry in solids, 
Clarendon Press, 1972. 

[21] C. W. Curtis and I. Reiner, Representa- 
tion theory of finite groups and associative 
algebras, Wiley, 1962. 


93 (VI.20) 
Curves 


A. Introduction 


In the beginning of his Elements, Euclid gave 
definitions such as: A line is a length having no 
width; an end of a line is a point. However, he 
left notions such as width and length unde- 
fined. Thus his definitions were far from satis- 
factory. Actually, it was only during the latter 
half of the 19th century that efforts were made 
to obtain exact definitions of lines and curves. 
Euclid, among others, distinguished two kinds 
of curves: straight lines and curves. Nowadays, 
however, lines in the sense of Euclid are called 
curves, and a straight line is considered a 
curve. A first effort to give an exact definition 
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of a curve using analytic methods was made 
by C. Jordan in his Cours d’analyse I (1893). 


B. Jordan Arcs and Jordan Curves 


Following Jordan, we define a continuous plane 
curve C to be the image of a fcontinuous 
mapping sending the interval [0, 1] into the 
Euclidean plane E?. Namely, C is the set of 
points (x, y) in E? such that 


x= f(t), 


with continuous functions f, g defined on 
[0,1]. A continuous curve is also called a 
continuous arc. We call ( f(0), g(0)) and (/(1), 
g(1)) the ends of the arc. Given continuous 
functions f, g defined on (0, 1), the set {(x, y)| 
x= f(t), y=g(t), 0<t< 1} is called an open 

arc. More generally, the image of a continuous 
mapping of (0, 1], (0, 1), (0, 1), or (0, 1] is called 
an are (or curve). Suppose that C is an arc that 
is the image of an interval I and P=(x, y) is.a 
point on C to which there correspond two 
elements t4, tz (t, <t,) of I such that P is the 
image of both t,, t,. In this case, the point P is 
called a multiple point on C. An arc having no 
multiple point is called a simple are or Jordan 
arc. 

An arc with one and only one multiple point 
P=(f(0), g(0))=(f(1),g(1)) is called a Jordan 
curve or simple closed curve (— Section K). A 
Jordan curve can be regarded as a topological 
image in a plane of a circle. Let C be a curve 
that is the image (J) of an interval. Then C is 
said to be of class C* (analytic) if the mapping 
¢ is of tclass C* (tanalytic). In general, if S is a 
topological space, then the image (I) in S of 
an interval is called a curve in S. In particular, 
if S has the structure of a differentiable (ana- 
lytic) manifold, we can define the notion of 
curve of class C* (analytic curve) in S. 


y=g(t), O<t<l, 


C. Ordinary Curves 


A tconnected subset of E? that is the union of 
a finite number of simple arcs meeting at a 
finite number of points is called an ordinary 
curve. An ordinary curve is called a tree if it 
does not contain a subset that is homeomor- 
phic to a Jordan curve. Let p be a point on an 
ordinary curve C. The tboundary of a suffi- 
ciently small tneighborhood of p meets C 

at a finite number of points, and this number 
is independent of the choice of the small 
neighborhood. We call it the order of p in C. 
A point of order 1 is called an endpoint of C, 
a point of order 2 an ordinary point, and a 
point of order >3 a branch point. If we can 
represent an ordinary curve C as a continuous 
curve tracing each simple arc of C just once, 
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we say that C is unicursal (Fig. 1). A necessary 
and sufficient condition for C to be unicursal is 
that the number of points of odd orders in C 
be less than or equal to 2 (L. Euler). 


Fig. 1 


D. Further Consideration of Definitions 


Although the set of ordinary curves as defined 
above contains most familiar curves, 1t does 
not contain a point set defined by y=sin 1/x in 
O0<x<land —1<x<0O, and by —1<y<1at 
x=0 (Fig. 2). This point set is called a sinusoid, 
and it is desirable to obtain a definition of 
curves wide enough to contain a sinusoid. On 
the other hand, the notion of continuous 
curves is, in a sense, too wide, because a curve 
such as a Peano curve (— Section J), which 
covers a whose square, is among such curves. 
The notion of simple arcs is too narrow, 
because even a circle is not a simple arc. As a 
point set in E”, a continuous arc is character- 
ized as a flocally connected tcontinuum and 

is sometimes called a Peano continuum (H. 
Hahn, S. Mazurkiewicz). On the other hand, 
A. Schoenflies, inspired by the statement of the 
Jordan curve theorem (— Section K), consid- 
ered a closed set that divides the plane into 
two parts, forming the common boundary of 
both domains, and called it a closed curve. 
According to this definition, however, a simple 
curve is not a closed curve. Thus as a general 
definition of curves it is not appropriate. 


Fig. 2 


To give a general notion of curves on a 
plane (containing sinusoids), we may define a 
curve as a continuum that is tnowhere dense 
in E? (i.e., a continuum that is a boundary of 
open sets on the plane) (G. Cantor). Further- 
more, to deal with the curves on a topological 
space, P. S. Uryson and K. Menger defined a 
general curve to be a 1-dimensional continuum 
(Menger, 1921-1922 [1]). In E’, the latter 
notion coincides with the notion of curves 
defined by Cantor, while in E? a general curve 
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is a continuum that does not divide any 
tdomain (— 79 Connectedness). 


E. Universal Curve 


Consider a 3-dimensional cube 1° (J = [0, 1). 
Draw two planes parallel to each face so that 
the two planes parallel to a face trisect the 
edges of the cube meeting the planes. Thus 7° 
is divided into 3% cubes. Let M, be the closure 
of the subset of 1° that is obtained from I° by 
deleting the cube J? (L =[1/3, 2/3]) and the 6 
cubes having common faces with I>. Then M, 
consists of 20 cubes (Fig. 3). We apply to each 
of the 20 cubes forming M, the same operation 
that we applied to 7° and denote by M, the 
union of point sets thus obtained (consisting of 
20? cubes, the length of whose edges equals 
1/37). Repeating this process, we obtain a 
point set M, consisting of 20" cubes, the length 
of whose edges equals 1/3". Thus we obtain the 
sequence M, > M,>M,>.... The set U 
=()2., M, is a general curve in the sense of 
Uryson and Menger. Moreover, we can prove 
that an arbitrary general curve is homeomor- 
phic to a subset of U. Hence we call U the 
universal curve. 


Fig. 3 


F. Length of a Curve 


In this section, by a curve we mean a continu- 
ous curve in a Euclidean space E”. Let C be 

a curve in E” defined by x;= f,(t) (i=1, 2,...,n; 
a<t<b; the f; are real-valued continuous 
functions defined in [a, b]). (We sometimes 
write this simply as X = f(t), where X = 
(X,,...,X,).) We divide [a,b] arbitrarily and 
denote the dividing points by a=tọ <t; <t, < 
.. <t,=b. Let X,=f(t,), k=0,...,r, and let 
X,,-,X;,, be the length of the straight line seg- 
ment joining X,_, and X,. If the length / 

= Vie Xz- X, of the broken line (X,X, ... X,) 
(Fig. 4) is bounded for any subdivision of 
[a,b], C is called a rectifiable curve, and the 
upper limit of I with respect to the subdivisions 
is called the length of C. For C to be rectifi- 
able, it is necessary and sufficient that the 

J, (i=1,2, ...,n) be of thounded variation 
(Jordan). Thus if C is a rectifiable curve, then 
each f(t) Gi=1,2,...,n) is almost everywhere 
differentiable (H. Lebesgue). In particular, if C 
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is of class C!, then C is rectifiable, and its 
length can be represented by 


LEO 


(— 246 Length and Area). 


Fig. 4 


G. Shapes of Curves 


In this section by a curve we mean the image 
(J) in a Euclidean space E" of an interval J 
(bounded or unbounded), where ¢ is a continu- 
ous mapping. When a curve C of class C* is 
given in a Euclidean space E", it often becomes 
necessary to examine the shape of C globally. 
The determination of the global shape of C 
from the equation of the curve is called the 
curve tracing of C. The problem has been 
thoroughly studied in the particular case in 
which n=2 and the equation of C is given by 
f(x, y)=0 in a rectangular coordinate system 
(by F(r, €)=0 in a fpolar coordinate system), 
where f (or F) is an analytic function. The 
problems in the case of a rectangular coordi- 
nate system are as follows: 

Let ọ be a single-valued analytic function 
and I a (bounded or unbounded) interval on 
the x-axis. If a curve Co represented by y= 
p(x) (xe) is a subset of C, then Co is called 
a branch of C. According as I is bounded or 
unbounded, Co is said to be a finite branch or 
infinite branch of C. When a curve represented 
by x=w(y) (yeJ) (w is a single-valued analytic 
function and J is an interval on the y-axis) is a 
subset of C, it is also called a branch of C. If it 
is necessary to distinguish these two branches, 
we call the former the x-branch and the latter 
the y-branch. C consists of an at most de- 
numerable set of branches. If P(x9, Yọ)€ C and 
f,=Of/éy #0 at P, then there exists an x- 
branch containing P; if f, = f/x #0, there 
exists a y-branch of C. If éf/éx =0, of/éy=0 at 
P, then P is called a singular point of C. Points 
on C that are not singular points are called 
ordinary points of C. 

When P is an ordinary point of C, a branch 
Co of C containing P is determined, and the 
tangent line and normal line to C at P are 
the same as those to Cy and are uniquely 
determined. The equations of these are (x — 


Xo) fx(X0, Yo) + (y — Yo) fy(Xo, Yo) =O and (x— 
Xo) fy(Xo, Yo) —(¥ — Yo) f(X0, Yo) =0, respec- 
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tively. If we choose a coordinate system with P 
as origin and the tangent line and normal line 
as €-axis and y-axis, respectively, then the 
equation of C with respect to this coordinate 
system is of the form 7=c,é*+c,é7+... in the 
neighborhood of P. If we denote by p the 
tcurvature of C at P, then p=2c,= -(f,,.f7 — 
Sof dy + yf OW +S. When c, = p= 

0, P is a tstationary point. A stationary point 
of a curve of class C? on C is also called a 
point of inflection. When P is not a point of 
inflection and (č, y) are points on C in a neigh- 
borhood V of P, the sign of y is definite if V is 
small enough (Fig. 5). However, if P is a point 
of inflection and c3 #0, then C is of the shape 
shown in Fig. 6. At a point of inflection, if c3 = 
... =C,., =0, c, £0, and v is even, C is of the 
shape shown in Fig. 5, and if v is odd, C is of 
the shape shown in Fig. 6. 


Fig. 5 Fig. 6 


In a neighborhood of a singular point, C 
takes various shapes. For example, consider a 
curve represented by y? = x(x +a), and let P 
be the origin (0,0). If a>0, then there are two 
branches of C passing through P, and they 
have different tangents at P (Fig. 7). As in this 
case, if there are a finite number of different 
branches passing through P with different 
tangents, P is called a node of C. If a<0, then 
PéC, but there is no other point of C in the 
neighborhood of P (Fig. 8). Such a point ts 
called an isolated point of C. If a=0, then there 
are two branches of C starting from P, and the 
tangents to these at P are the same (Fig. 9). 
Such a point is called a cusp of C. When C is 


Fig. 7 


Fig. 8 


Fig. 9 
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an talgebraic curve, we can examine the shape 
of a curve in a neighborhood of a singular 
point using the *Puiseux series. 

When C has an infinite branch Co (for 
example, when Cy: y= @(x) (xe I) is an x- 
branch of C and I =[a, 00)), if the tangent to 
Co at P(X, Yo) (xg €/) has a limiting line for 
X 00, then the limiting line / is called an 
asymptote of C}. In this case, the distance from 
a point P(Xo, Yo) of Co to I converges to zero 
when x) œ. An asymptote of an infinite 
branch of C is also called an asymptote of C. 


H. Special Plane Curves 


The following are the well-known curves (for 
ellipses, parabolas, and hyperbolas — 78 
Conic Sections). 

Among curves of the third order, those 
having an equation of the form 


y? = f(x)/(x—a) (1) 


(f(x) is a rational expression of at most the 
third order in x) are symmetric with respect to 
the x-axis and have x =a as an asymptote. In 
particular, if a>0, f(x)= — x? in (1), then the 
curve is as shown in Fig. 10 and the origin is a 
cusp. Let a half-line starting from the origin 
meet the curve, the circle with diameter [0, a], 
and the straight line x =a at points X, Y, and 
A, respectively. Then we have OX = YA. This 
curve is called a cissoid of Diocles. 

If a=0, f(x)=c?(c— x) (c > 0) in (1), then the 
curve takes the shape shown in Fig. 11. Let A, 
C be the points whose coordinates are (a, 0), 
(c,0) (0<a<c), respectively, and let X, Y be 
the points in the first quadrant at which the 
straight line parallel to the y-axis and passing 
through A meets the curve and the circle with 
diameter OC, respectively (Fig. 11). Then we 
have AX: AY=OC: OA. This curve is called a 
witch of Agnesi. 


Fig. 10 


Fig. 11 


If a<0, f(x)= —x?(x/3 +a) in (1), then the 
curve takes the shape shown in Fig. 12a. If we 
rotate it by 7/4 and put it in the position 
shown in Fig. 12b, then the equation of the 
curve takes the form x? + y? =3cxy (c= — gı 
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a). If we take as parameter t = y/x, then we get 
the parametric representation x = 3ct/(1 + t°), 
y=3ct?/(1 +t). This curve is called a folium 
cartesii (or folium of Descartes). A curve that 
has a parametric representation of the form 
x=(t), y=w(t), where ọ, y are rational func- 
tions, is called an (algebraic) unicursal curve (or 
rational curve). Such a curve is an algebraic 
curve of tgenus 0. 


Fig. 12 


Let r= f, (0), r= f,(0), ...,r=f,(0) be equa- 
tions of curves C,, C,,...,C, with respect to a 
polar coordinate system with origin O. A curve 
C having equation r=, f,(0)+...+/,f,(0) 
(the A; are constants, usually +1 or —1) in the 
same coordinate system is called a cissoidal 
curve with respect to O. (Fig. 13: r= — f,(@)+ 
f,(6)). In Fig. 10, let C, be the circumference 
of the circle with the diameter [0, a], let C, be 
the straight line x =a, and put 4, = —1,2,=1. 
Then we have a cissoid of Diocles. We can 
regard the folium cartesii as a cissoidal curve 
obtained from a straight line and an ellipse. 
When G, is a circle with center at O, we call C 
a conchoidal curve of C, with respect to O. In 
particular, when C, is a straight line and O is 
not on C,, the conchoidal curve is called a 
conchoid of Nicomedes. 


Fig. 13 


As shown in Fig. 14, when C, is perpendic- 
ular to the initial line of the polar coordinate 
system, the equation of the conchoid is r= 
asec +b (b is the radius of C,), and the Car- 
tesian equation of the curve is (x — a)? (x? + 
y?)=b?x?. According as a> b, a=b, or a<b, 
the curve has a node, cusp, or isolated point, 
respectively. When C; is a circle and O is on 
C,, the conchoidal curve of C, with respect to 
O is called a limaçon (or limaçon of Pascal) 
(Fig. 15). The equation of a limaçon C with 
respect to a polar coordinate system having 
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the diameter of a circle passing through O as 
its initial line is r=acos0+b, while the equa- 
tion of C with respect to a Cartesian coordi- 
nate system is (x? + y? — ax}? =b? (x? + y?). In 
this case, if a >b, O is a node of the curve; if 
a=b, O is acusp. When a=b, the curve is 
called a cardioid (the curve shown by a dashed 
line in Fig. 15; see also Fig. 26 below). 


Fig. 14 


Fig. 15 


The locus of a point X having a constant 
product of its distances from two fixed points 
A, B is called Cassini’s oval (Fig. 16). The 
equation of this curve with respect to the 
Cartesian coordinate system whose origin O is 
the midpoint of the segment AB and whose 
x-axis is the straight line AB is (x? + y?)? — 
2a*(x? — y?)=k* —a* (where AB=2a, k? = 
AX: BX). In particular, if a? =k’, then O is a 
nodal point of the curve. In this case, the curve 
(shown by the dashed line in Fig. 16) is called a 
lemniscate (or Bernoulli’s lemniscate) (Jakob 
Bernoulli). 


Fig. 16 


The locus of the foot of the perpendicular 
drawn from a fixed point O to the tangent of a 
fixed curve C at each point of the curve is 
called the pedal curve of C with respect to O. 
The pedal curve of a rectangular hyperbola 
with respect to its center is a lemniscate (Fig. 
17), and the pedal curve of a circle with respect 
to a point is a limaçon. 
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Fig. 17 


When a curve C’ rolls on a fixed curve C 
without slipping and is always tangent to C, 
the locus I of a point X kept fixed with re- 
spect to the curve C’ is called a roulette whose 
base is C, rolling curve is C’, and pole is X. In 
particular, when C is a straight line, C’ is a 
circle, and X is on C’, T is called a cycloid (Fig. 
18). When X is not on C’, F is called a trochoid 
(Fig. 19). A trochoid is represented parametri- 
cally by the equations x =a0@—bsin@, y=a— 
bcos @, where the parameter @ is the angle of 
rotation of C’. When a=b, the equation repre- 
sents a cycloid. The tevolute and finvolute of a 
cycloid are also cycloids (Fig. 20). 


Fig. 20 


Suppose that we are in a gravitational field 
with a given path represented as a cycloid, as 
is shown in Fig. 21. Assuming that there is no 
friction, the time necessary for a particle to 
slide down the path from a point X on the 
curve to the lowest point C of the curve is 
independent of the initial position X (C. Huy- 
gens). Because of this property, the cycloid is 
also called a tautochrone. Suppose that a par- 
ticle starts from a point A in the space and 
slides down to a lower point B along a curve T 
(without friction) under the effect of a gravita- 
tional force. To minimize the elapsed time, 
we simply take T as a cycloid that lies in the 
vertical plane containing AB and has a hori- 
zontal line through A as the base (Fig. 22). 
Because of this property, the cycloid is called 
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also a brachistochrone, i.e., the line of swiftest 
descent (Johann Bernoulli and others). 


B 
Fig. 22 


When the base curve C and the rolling curve 
C’ are both circles and X is on C’, we call T an 
epicycloid if C, C’ are externally tangent (Fig. 
23), and a hypocycloid if C, C’ are internally 
tangent (Fig. 24). When X is not on C’, cor- 
responding to these two cases, we have an 
epitrochoid and a hypotrochoid, respectively. 
Let a, b be radii of C, C’, respectively, c the 
distance from the center of C’ to X, and 0 
the angle of rotation of C’. Then the para- 
metric equations of these curves are x =(a+ 
b)cos 0 F ccos((a+b)/b)0, y=(a+b)sin 0 — 
csin((a + b)/b)@. (Take the upper signs when 
the curve is an epicycloid and the lower signs 
when the curve is a hypotrochoid. When b =c 
the equations are equations of epicycloids and 
hypocycloids.) When the ratio a:b is a rational 
number p/q (p, q are mutually prime), then C’ 
returns to its initial position after rotating q 
times around C; in this case each I becomes 
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an algebraic curve. In particular, when a= 
4b=4c, the hypocycloid is called an astroid 
(Fig. 25). Its equation (with respect to a Car- 
tesian coordinate system) is x79 + y?? =a”. 
The envelope of line segments of length a 
whose endpoints are on the x-axis and y-axis, 
respectively, is an astroid (Fig. 25). When a= 
b=c an epicycloid is a cardioid (Fig. 26). 


Fig. 26 


When the base C is a straight line, the roll- 
ing curve C’ is an ellipse or a hyperbola, and 
the pole is a focus of C’, then the roulette is 
called a Delaunay curve (Fig. 27). 


C! 


WKS 


Fig. 27 


When C is a straight line, C’ is a parabola, 
and X is the focus of C’, the roulette is called a 
catenary (Fig. 28). When we hold two ends of a 
string of homogeneous density in the gravita- 
tional field, the string takes the form of this 
curve. The equation of the catenary with re- 
spect to a Cartesian coordinate system is y= 
acosh x/a=a(e*/*+e *!)/2. The involute 
starting at the point A(0, a) of this curve is 
called a tractrix (Fig. 29). Let Q be the point of 
intersection of the tangent at P to the tractrix 
and the x-axis; then the length of PQ is con- 
stant and is equal to a. Consequently, when we 
drag a weight at A by a string of length OA 
along the x-axis, the curve described by the 
weight is the tractrix. The parametric equa- 
tions of the tractrix are x =a(logtant/2 + 
cost), y=asint. 


Fig. 28 Fig. 29 


Suppose that a point Q moves with constant 
velocity on the x-axis and another point P also 
moves with constant velocity always toward Q. 
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The locus of the point P is called a curve of 
pursuit (Fig. 30). When the velocity of Q is « 
times that of P, the equation of the curve of 
pursuit is 2(x—a)=y'!~“/c(1—a) —cy' **4/(1 +a) 
if a#1 and 2(x—a)=(1/clogy—cy*/2 ifa=1. 
We can consider similar problems when Q 
moves on a general curve instead of on the x- 
axis. 


Fig. 30 


Many plane curves that are called spirals 
can be expressed by r= f(8) (f monotonic) in 
polar coordinates (r, 0). An Archimedes spiral 
is a curve having the equation r=aé (Fig. 31). 
Archimedes found that the area bounded by 
two straight lines 0=0,, 0=0, (6, <0,) and the 
curve is a? (03 — 63)/6. A logarithmic spiral 
(equiangular spiral or Bernoulli spiral) is a 
curve having the equation r= ke (Fig. 32). 
The angle between the straight line 0 =con- 
stant and the tangent to the curve ts constant. 
Johann Bernoulli found that the involute and 
evolute of this curve are congruent to the orig- 
inal curve. A curve having the equation r= 
a/@ is called a hyperbolic spiral (or reciprocal 
spiral), and the one having the equation r?0=a 
is called a lituus. These two spirals are shown 
in Figs. 33 and 34, respectively. Let p = (s) be 
the tnatural equation of a curve. Properties of 
the curves for which the functions p(s) are 
simple have been investigated. Specifically, a 
curve having the natural equation p=ks (k is a 
constant) is a logarithmic spiral. A curve hav- 
ing the equation p =a’/s‘is called a Cornu 
spiral (or clothoid; — 167 Functions of Con- 
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Fig. 31 Fig. 32 
Fig. 33 Fig. 34 
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fluent Type). Its parametric representation is 
t nt? 
x zaz | cos— dt, 
0 2 
t ! nt? 
y= a/n | sin- -dt 
0 


(tFresnel integral). M. A. Cornu used this 
curve in the representation of diffraction in 
physical optics. 

There are also curves that appear as graphs 
of telementary functions. For example, a curve 
having the equation y=sinx is called the sine 
curve, and the graphs of equations y=e* and y 
= log x are called the exponential curve and the 
logarithmic curve, respectively, although they 
are congruent. In contrast to algebraic curves, 
these analytic curves that are not algebraic are 
called transcendental curves. (Regarding the 
differential geometric properties of plane and 
spaces curves — 111 Differential Geometry 
of Curves and Surfaces; for plane algebraic 
curves — 9 Algebraic Curves.) 


I. Envelopes 


Let f(s, t) be a function of class C! of real vari- 
ables s, t. If we fix t=t), then r= f(s, tg) is the 
equation of a curve C,, with a parameter s. If 
s(t) is a function of t, then f(s(to), to) represents 
a point P, on C, Let E be the locus of P,, 
when tọ moves. If s(t) is a function of class C}, 
then E is a curve of class C+. If E and C, are 
always tangent at each point P, , we call E the 
envelope of the family of curves C,. When {(s, t) 
is given, to find E we need only determine the 
function s(t). We note that 0f/és=Aéf/ét is a 
condition that must be satisfied by the func- 
tion s(t). When n=2 and the equation of C,, is 
given in the form f(x, y, £9) =0, the point of 
intersection of C,, and f(x, y, to) =0 (f= éf/ét) 
is P,,. The equation R(x, y)=0 obtained by 
eliminating t from f(x, y,t)=0 and f(x, y,t)=0 
is called the discriminant of f(x, y, t)=0. The 
set of points (x, y) satisfying the discriminant 
R=0 is the union of E and the locus of the 
singular points of C,,. 


J. Peano Curve 


A tcontinuous curve in the Euclidean plane E? 
(i.e., the image f(I) of a segment J =[0, 1] 
under a continuous mapping f:I— E?) may 
cover a square. We call such a curve a Peano 
curve after G. Peano, who gave the first exam- 
ple. D. Hilbert simplified the example and 
constructed a Peano curve as follows (Math. 
Ann., 36 (1890), 38 (1891)). 

Divide a square and a segment into four 
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equal parts (Fig. 35) and let each square D; 
correspond to the segment 7; (i=0, 1, 2, 3). 
Then divide each D; into four equal parts (Fig. 
36) and let D; correspond to T;; (j=0, 1, 2, 3), 
and continue this process (Fig. 37). A sequence 
of squares D; > D; >D; > ... has the unique 
common point p;x.. and we let this point 
correspond to the unique common point tijk... 
of the sequence of segments T; > Tj > Tijk 
The correspondence tijk... Piję... 8 a continu- 
ous mapping of the segment [0, 1] onto the 
square D, and this continuous curve has 
double points, triple points, and quadruple 
points. The set of multiple points has the car- 
dinal number of the continuum and is a 
tdense set in the square. We can improve this 
method so that there are no multiple points 
other than double and triple points, but it is 
impossible to eliminate triple points 
altogether. 


Darrie 


Fig. 36 
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Fig. 37 


We can also construct Peano curves as 
shown in Figs. 38, 39, and 40. That is, we 
“bisect” a triangle and a segment [0, 1] suc- 
cessively and build a correspondence as fol- 
lows: Let a point represented by a binary 
number t=0.ijk ... (i,j, k, ... =0, 1) correspond 
to the unique common point of the sequence 
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A;>Aj> Aj, >... and obtain a continuous 
mapping from the segment [0, 1] onto the 


triangle. 


Fig. 40 


K. The Jordan Curve Theorem 


The Jordan curve theorem states: A Jordan 
curve J in the plane R? separates R? into inner 
and outer fregions (C. Jordan, Cours d'analyse, 
2nd ed., 1893). More precisely, R? —J is the 
disjoint union G, UG, of two regions G, and 
G, whose common tboundary is J. Let p be a 
point of J. Then there is a Jordan arc with p as 
an endpoint such that all points of the Jordan 
arc are contained in G; (i= 1 or 2) except for p 
(A. Schdnflies), that is, J is accessible from G,. 
Conversely, let J be a compact subset of R? 
such that R?—J =G, UG, and G,NG,=@, 
where the G; are regions such that J is acces- 
sible from both G, and G,. Then J is a Jordan 
curve (Schénflies, 1908). A homeomorphism 
between a Jordan curve J and the circle C 
extends to a homeomorphism (more precisely 
to a tconformal mapping) between a plane 
containing J and a plane containing the circle 
C (— 65 Combinatorial Manifolds G; 77 Con- 
formal Mappings). 
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A. General Remarks 


The integral of a function (or more precisely, a 
tdifferential form) along a tcurve (tsurface) is 
called a curvilinear integral (surface integral). 
Because a curvilinear integral is a special case 
of the Stieltjes integral, we shall first explain 
this notion, formulated by T. J. Stieltjes (1894) 
as a generalization of the tRiemann integral. 
The notion was introduced in connection with 
Stieltjes’s study of tcontinued fractions, and 
led to the idea of integrals with respect to 
general measures. 


B. The Riemann-Stieltjes Integral 


Suppose that f(x), x(x) are real-valued 
bounded functions defined on [a, b]. Take 

a partition of the interval a =xọ < X1 < X3 

<.. < X,-1 < X =b (— 216 Integral Calculus) 
and consider the Riemann sum with respect 
to a(x): 


I MENU 41) aC), Peps 


Suppose that the Riemann sum tends to a 
fixed number as max(x;,, — x;) tends to zero. 
Then the limit is called the Riemann-Stieltjes 
integral (or simply Stieltjes integral) of f(x) 
with respect to a(x) and is denoted by 
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f? f(x) da(x). The Riemann integral of f(x) is a 
special case, where a(x)= x. 

The Riemann-Stieltjes integral has the ele- 
mentary properties, such as linearity, of the 
usual Riemann integral. We also have the 
following theorem: The integral f f(x) da(x) 
exists for every continuous function f(x) if and 
only if a(x) is of tbounded variation. Hence 
when we consider the Stieltjes integral of f(x) 
with respect to a(x), we usually assume that 
J(x) is continuous and g(x) is of bounded 
variation. However, the Stieltjes integral can 
be defined if f(x) is of bounded variation (not 
necessarily continuous) and a(x) is continu- 
ous (not necessarily of bounded variation). If 
a sequence f,(n) (n= 1, 2, ...) of uniformly 
bounded continuous functions defined on the 
interval [a, b] converges to a continuous func- 
tion f(x) on the interval [a,b], we have 


b b 
im | In(x) d(x) = | I (x)da(x), 


where «(x) 1s a function of bounded variation. 
Furthermore, if a(x) and «,(x) (n=1,2,...) 

are functions of bounded variation whose 
total variations are uniformly bounded and 
liM,» &,(x) = a(x) at every point of continuity 
of a(x), then we have 


b b 
im | S dato = | fœ)da(x) 


for every continuous function f(x) on [a,b] 
(Helly’s theorem). 

Let a(x) be a tstrictly monotone increasing 
continuous function, and let B(y) be its inverse 
function. Then we have 


b a(b} 
| I(x)da(x)= | — f(B(y)) dy, (1) 
a ala) 
where the right-hand side is the usual Rie- 
mann integral. A function «(x) of bounded 
variation is represented as the difference of two 
strictly monotone increasing functions a, (x) 
and a(x). If we denote by f(y) the inverse 
function of «;(x) (i= 1,2), we have 


b a, (b) 
[oww] S(Bi(y)) ay 


a a (a) 


a(b) 


= f(B2(y))dy. (2) 


azla) 


It «'(x) exists and is continuous, we have 


| f ildats)= | I (x)a'(x) dx. (3) 


C. The Lebesgue-Stieltjes Integral 


Suppose that a(x) is a monotone increasing 
and right continuous function and I =(x,, x3]. 
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We define an finterval function U(I)=«(x,)— 
a(x,). It is nonnegative and countably addi- 
tive. Hence by utilizing U (I) we can con- 
struct the outer measure and also a completely 
additive measure (— 270 Measure Theory; 380 
Set Functions). The Lebesgue integral with 
respect to this measure is called the Lebesgue- 
Stieltjes integral (or Lebesgue-Radon integral) 
and is denoted by |? f(x)da(x). If a(x) is a 
strictly monotone increasing continuous func- 
tion and f(y) its inverse function, then formula 
(1) is true if the left-hand side is a Lebesgue- 
Stieltjes integral and the right-hand side is a 
Lebesgue integral. If «(x) is a function of 
bounded variation, decomposing «(x) into the 
difference of two strictly monotone increas- 
ing functions, we also have formula (2). If 
a(x) is tabsolutely continuous, formula (3) is 
valid, where the right-hand side is a Lebesgue 
integral. 

The Stieltjes integral has the following two 
properties. 

Integration by parts: In the interval [a,b], 
we have 


b b 
[vava] V dU =U(b)V(b)— U (a) V (a) 


a a 


if one of U(x), V(x) is continuous and the other 
is of bounded variation. 

Second mean value theorem: If U (x) is mono- 
tone increasing and V(x) is continuous, then 
there exists a č in [a,b] such that 


b 
[vav 


=U (a)(V(€)— V(a)) + U (b)(V(b)— V(¢)). 


D. The Curvilinear Integral 


A continuous mapping from an interval a< 
t<bin R' into R”: ø(t)=(¢;(t), ..., p(t) is 

an oriented curve. Suppose that a function 
S(X1,-..,X,) is defined in a neighborhood U of 
the image C of the mapping ¢(t) or merely on 
C. The Stieltjes integral 


b 
f So (t), SSNS, p,(t)) dg,(t), i= 1, <A, (4) 


is called the curvilinear integral of the function 
(Xis ---, Xn) along the curve C with respect to 
x; and is denoted by fc fdx;. The curve C is 
called the contour (or path) of the integration, 
o(a) is called the initial point (or lower end), 
and @(b) is called the terminal point (or upper 
end) of the integration. Let C be a rectifiable 
curve, and denote by s(t) the arc length of C 
from the initial point to the point g(t). Then 
the Stieltjes integral 


b 
| f(e(O)ds(t) 


a 
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or simply fe fds is called the curvilinear inte- 
gral with respect to the line element. Here, 
the line element, denoted by ds, means 

(oi (t))? +... + (L(t)? dt when ọ(t) is of class 
C!. If the integrand in (4) is of bounded vari- 
ation as a function of t, the curvilinear integral 
is well defined. If C is a trectifiable curve, the 
curvilinear integral is defined for an arbitrary 
continuous function. In the usual case, we are 
concerned mainly with this sort of situation. 
For a differential form w= f,dx,+...+f,dx, 
defined on U, the curvilinear integral fc% is 
defined by Xr- fe fidx;. 

The curvilinear integral is linear with re- 
spect to its integrand. If the terminal point of 
C; is the initial point of C,, we can construct 
the joint curve C=C, + C3, and we have 
additivity for the contours 


[s=] saxa | fdx; 
C C, Ca 


(a similar formula holds if we replace dx; by 
ds). Monotonicity, which asserts that fe fdx; < 
[cgdx; whenever f < g, holds if ¢,(t) is mono- 
tone increasing, and monotonicity also holds 
for the curvilinear integral with respect to the 
line element. 

If n=2, R? can be identified with the tcom- 
plex plane C={z=x +iy}, and we define 
fefle)dz by 


i uodas- | oteydyh 
Cc C 
+if{ ne)ax+| udv}, 
C c 


where f(z)=u(z)+ iv(z). The integral is then 
said to be an integral in the complex domain. 
(For the application of integrals in the com- 
plex domain to complex analysis — 198 
Holomorphic Functions.) 


E. The Surface Integral 


By an m-dimensional smooth surface § we 
mean the image S of a fregular mapping of 
class C! from a domain G in R” into R" (m< 
n), E(u) = (E1 (Uy, <-s Um) +++» Sula, +++ Un) 
Given a continuous function f(x,,...,X,) 
defined in a neighborhood U of S in R’, the 
multiple integral 


Jal S€ (u), --- , Sulu) 
G 


D(x; > > ing) 
Dice.) du, ...du,,, 
Li piceyptet ella nh, (5) 


is called the surface integral of f along S 
with respect to x;,,...,x;, and is denoted by 
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fsfdxi, dx; or f... fs fdx;,...dx;,. This 
definition depends on the choice of the para- 
meters (t4, ... , Hm) in the following way. Let 
(uis ...,4,,) be another parametrization of S. 
Since 


DG ik): Deena) 


D(Uy, -r Um) DU,- Ui) D(is p Um) 
and 
D(is... Um 
E T E aa Rin tae, 
D(uy, ..- Uy) 


thus according as 


D(u,,..- 


n) 
D(u),, -s Un) 


the foregoing expression remains the same or 
changes its sign. Usually we assign to the 
parameters (u,,...,u,,) positive or negative 
orientation, in which case S is called oriented. 
If we replace the Jacobian D(x, ,..-,;,)/ 
D(u,,...,t,,) in (5) by the quantity 


DO yee) y 
“DN pe 
iK. Kim Duy, ---,Um) 


which corresponds to the surface element of S, 
the integral is called a surface integral with 
respect to the surface element and is denoted 
by J; fdS or fs fdo. The surface integral of a 
differential form of degree m in R” is similarly 
defined. In the case m= 1, the surface integral 
reduces to a curvilinear integral. Just as the 
Stieltjes integral is a generalization of the 
curvilinear integral, there are several ways 

to generalize the notion of surface integral 
without assuming that the mapping ¢(u) is of 
class C!. 


F. The Stokes Formula 


Let S be an m-dimensional smooth surface in 
R" (m<n) and @S be the (m— 1)-dimensional 
surface corresponding to the boundary of S. 
Let w be a differential form of class C! of 
degree (m—1) and dq be its exterior deriva- 
tive. Then we have |,5@=|sda, which is called 
the Stokes formula (or the Green-Stokes for- 
mula). (For the Stokes formula on a general 
differentiable manifold — 105 Differentiable 
Manifolds.) As special cases of the Stokes 
formula, we have the following three classical 
theorems: 

(1) The case of a plane domain: Let D be a 
bounded domain on the xy-plane bounded by 
a finite number of smooth curves C with posi- 
tive directions. If œ = P dx +Q dy is a differen- 
tial form of class C! on D, we have 


f pax+ody=| f (Be avay, (6) 
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since dw =(6Q/0x — 0P/éy)dx A dy. This is 
called Green’s formula (or Green’s formula on 
the plane). Equality (6) is true if P, Q are totally 
differentiable and the integrand on the right- 
hand side is continuous (even if the functions 
6Q/dx, 0P/éy themselves are not continuous) 
(E. Goursat). This formula remains true under 
the following weaker assumptions: (1) C is 
rectifiable; (ii) P and Q are continuous in D 
and 6P/dy and €Q/0x are continuous and 
summable (in the sense of Lebesgue) in D. 

(2) The case of a domain in a 3-dimensional 
space: Let D be a bounded domain in xyz- 
space surrounded by a finite number of 
smooth surfaces S. For a vector field V = 
<P,Q,R» of class C! on D, we put w=Pdy a 
dz+Qdzndx+Rdx a dy. Then since dw = 
(OP/6x + 0Q/dy + @R/6z)dx ^ dy ^ dz, we have 


\| Pdydz+Qdzdx+Rdxdy 
S 


-f div Vdxdydz 
ôP REAN 
a a 


Equality (7) is called the Gauss formula (Ost- 
rogradskii’s formula or the divergence theo- 
rem). The left-hand side of (7) is equal to the 
surface integral {{;(V,n)do, which is the tvec- 
tor flux through S (nere n means the outer 
unit normal vector of the surface S). This 
formula remains true under the following 
weaker assumptions: (i) S is piecewise of class 
C!; (ii) P, Q, R are continuous in D and dP/éx, 
6Q/éy, and 0R/éz are continuous and sum- 
mable in D. 

(3) The case of a bordered surface in a 3- 
dimensional space: Let S:x=x(u, v), y= y(u, v), 

= x(u, v) ((u, v)e G) be a smooth surface in 

xyz-space, and suppose that the boundary F of 
the domain of the parameters G consists of a 
finite number of smooth curves with positive 
direction. The boundary C of the surface S is 
the image curve of T. Now let V = <P, Q, R> be 
a vector field of class C! on S, n be the unit 
normal vector of S (its direction being canoni- 
cally assigned by the parameter (u, v)), and t be 
the unit tangent vector, and set w = Pdx+Qdy 
+ Rdz. Then since dw =(dR/dy ~ 6Q/dz)dy ^ dz 
+(0P/6z—6R/0x)dz ^ dx +(€Q/éx —0P/éy): 
dx Ady, we have 


[frre (C2) 
(Eea) 


-| (Pav + Ody + Rds)= | (V,t)ds. (8) 
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Equation (8) is called the Stokes formula (— 
Appendix A, Table 3.III). 
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The term cybernetics was invented in 1947 by 
Norbert Wiener [1] to denote a field of science 
that treats the system of control and communi- 
cation in animals and machine. The term was 
derived from a Greek word Kkvfepvýtyg, also 
the source of the word “governor.” The stimu- 
lus for establishing such a new area of science 
came from studies of automatic computation, 
automatic control, and information process- 
ing. These fields had given rise to technological 
innovations such as the high-speed electronic 
computer and automatic control instruments. 
Such machines have had a profound influence 
on the information sciences, as well as on 
theoretical investigations in biology. However, 
even now, we can hardly say that cybernetics 
has been formly established as a systematic 
branch of science or of applied mathematics. 
Nevertheless, it has had far-reaching influence 
on both biology and machine engineering as 

a methodology and as a philosophy. In the 
Soviet Union and in European countries, the 
word corresponding to cybernetics is still 
widely used for the “Grenzgebiete” between 
biology and machine engineering in the wider 
sense. Nonetheless, it 1s difficult to say that 
cybernetics as a whole has undergone system- 
atic development. Systematic theories have 
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been established and developed for separate 
parts of the field, and these have become in- 
dependent disciplines; they are, for example, 
tcontrol theory, tinformation theory, the 
theory of artificial intelligence, the theory of 
tautomata, mathematical biology, genetics, 
and ecology. 

The contributions of Wiener to cybernetics 
include not only the invention of the concept 
itself, but also prediction theory or the theory 
of the Wiener filter [2]. The latter was sub- 
sequently formalized by R. E. Kalman from 
a different point of view as an estimation 
problem within a linear system, working from 
finite observation data, and including the non- 
stationary case (— 405 Stochastic Control and 
Stochastic Filtering G). Another contribution 
by Wiener is the input-output identification of 
a nonlinear system using statistical time series 
analysis of the outputs when the inputs are 
white noises [4]. This is achieved by expan- 
sion of the output function in terms of the 
convolution of input functions, which corre- 
sponds to the expansion of functions in terms 
of tHermite interpolation polynomials. The 
kernel of the expansion is called a Wiener 
kernel. Recently, this method has been applied 
to input-output identification for nervous 
systems; it also finds application in nonlinear 
system theory [5]. Wiener also studied brain 
waves [6,9]. 

Cybernetics in the wider sense, although not 
necessarily called by this name, may include 
investigations of the following “Grenzgebiete”: 
information processing of the nervous sys- 
tem [7]; self-organizing systems in the non- 
equilibrium thermodynamics of Prigogine and 
Haken; the self-reproducing machines of von 
Neumann; and the theory of pattern formation 
in biological or chemical systems [8]. 

For related topics — 176 Gaussian Process 
I, 395 Stationary Processes D. 
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Data Processing 


A. General Remarks 


With the development of electronic computers, 
effective systems for data transmission and 
processing have been created on a large scale, 
and there is now a large literature concerning 
data-processing methods and techniques for 
such systems. 

Research on data processing encompasses 
both specific techniques and entire systems of 
processing, for example, the system of pro- 
gramming techniques, as well as mathematical 
problems originating from them, such as the 
complexity of computations (— 71 Complexity 
of Computations). As domains of application, 
we have, for example, information retrieval, 
stock management, program evaluation, and 
review techniques. We denote the set of data 
by D and assume that D is finite. Depending 
on the properties of D, we have various suit- 
able representations and processing problems. 


B. The Notion of Data 


In recent applications of computers, the main 
task has been arranging and searching for 
items or attributes in storage, rather than 
numerical computation. Individual informa- 
tion is called a record. Records arranged and 
stored in the memories of computers are usu- 
ally called data. A collection of such data ts 
called a file. A complex, large-scale data collec- 
tion is often called a data base. 

In an abstract sense, the record consists of a 
string of letters, but it is usually convenient to 
view it as consisting of its identifying mark 
followed by a finite sequence of items. The 
contents can be classified according to their 
properties, such as topological relations, order 
relations, or items representing numerical 
values. For each case, there may be different 
suitable representations and operations. 

We often store information after a suitable 
process of information compression. In some 
cases the operations are reversible, and com- 
plete recovery is possible. In other cases the 
operations are not completely reversible, and 
we must throw away part of the information 
in order to compress the rest. An example of 
recoverable information is the replacement 
of a run of 1’s (or 0’s) in a binary code by its 
length. An example of lost information often 
occurs in the graphical manipulation of num- 
bers by means of hashing. In this process we 
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select a suitable function s= f(a,,...,a,), where 
the vector (a,,...,a,) corresponds to the repre- 
sentation of the record in such a way that the 
value s can be monomorphic for the universe 
of the records as often as possible, and the 
data (a,,...,a,) is kept in the storage corre- 
sponding to the value s. Generally, the vari- 
able representing the location of data is called 
a pointer. 

In the abstract sense, data bases are simply 
sets of items, but in most cases a base will also 
exhibit some mathematical structure, such as 
order relations. In order to handle a data base 
efficiently, we must take its particular structure 
into account, and suitably represent that struc- 
ture. Such structures and their representations 
are called data structures. 


C. Linear Structures 


The component of a vector or the stations 
along a single railway line form linearly 
ordered sets. Their most essential feature is the 
notion of immediate predecessor or immediate 
successor. Such a data structure is called a 
linear structure, and the aligned sequence is 
called an array. Multidimensional arrays, such 
as the elements of a matrix, are stored in the 
form of 1-dimensional arrays in the memory of 
a computer. 

When the set of data D is a linearly ordered 
set, it is usually represented as a suitable array. 
Here, the problems of ordering and table look- 
up are fundamental. Ordering in this sense 
means putting the given elements of D into the 
order defined for D. Since, historically, sort- 
ing machines were used to put punched cards 
in order, this process is also called sorting. 

The process of arranging several individually 
sorted data packs into one sorted pack is 
called merging. 

A fundamental data-sorting operation is 
“comparison” with respect to the order for 
D. There have been many investigations of 
the estimation of the lower or upper bounds 
for the number of comparisons and of devel- 
oping efficient algorithms. Asymptotically, 
O(nlogn) is the theoretical lower bound for 
n elements, and some algorithms are known 
to achieve this bound (— 71 Complexity of 
Computations). 

If there is given a univalent correspon- 
dence f: DD’ and the correspondence table 
is stored in the memory in a suitable form, the 
problem of table look-up arises, which requires 
finding f(d) for a given de D. In this process, a 
fundamental operation is a comparison of 
deD with some xe?D in the table. There have 
been many investigations of efficient arrange- 
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ments and of algorithms with minimal num- 
bers of comparisons. 


D. List Representation 


If the given finite set D has an order satisfying 
the reflexive and transitive laws, addressing is 
often used as a medium for representing the 
order in the memory. Let d,,...,d,, be all the 
immediate successors to an element de D. We 
can represent them by the following sequence 
of triplets: 


Ifn=0, we take (d,0,0); 
Ifn=1, we take (d, df, 0); 
If n=2, we take (d,d*, dž); 
If n=3, we take (d, df, e*), (e,,d¥, dł); 
Ifn>4, we take (d,d*, ež), (e,,d%, e%), 
(e2,d3, e3), ... , (€n-2»dn-1> di), 


where d* means the pointer to d;, e; is an ele- 
ment introduced for convenience, and e* is the 
pointer to e;. This is called a list representa- 
tion. If we denote by d—d,, the fact that d, is 
an immediate successor of d, then the total set 
can be regarded as a fdirect graph. Usually, 
this graph is a ttree, and then the graph is 
called a tree structure and its representation a 
tree representation. The advantage of this 
representation is that addition or deletion ts 
quite easy. 

The set of logical formulas is partially 
ordered, with the order given by the rules of 
inference. The set consisting of a series of 
inferences forms an ordered subset. Thus, if 
a tree representation can be automatically 
treated, so can the process of inference. 

In dealing with linguistic data (words or 
sentences, for example), it is often natural to 
consider a noncommutative tfree semigroup D 
generated by a finite number of generators (the 
alphabet or vocabulary). In this case, if there 
exists a natural order for the generators, it 
determines in D a lexicographic partial order- 
ing. Then the tree representation can be used 
for representing a dictionary whose entires are 
elements of D. This method is not efficient with 
respect to speed of table look-up and economy 
of memory, but it sometimes has the advan- 
tage of simplifying the treatment of compli- 
cated data. 


E. Memory Devices for Processing 


In dealing with algebraic formulas or lan- 
guages with parentheses, data maintenance 
methods such as tree representation or push- 
down storage are often convenient as auxiliary 
memory-controlling methods. The character- 
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istic of the push-down storage method is that it 
returns the data in reverse order with respect 
to the time of acceptance and remittance. 

It is sometimes called a stack or a first-in—last- 
out memory. Contrary to a stack, there is a 
storage that returns the data in the same order 
as the acceptance. It is called a queue or a 
first-in—first-out memory. 


F. Information Retrieval 


A request to obtain all the records qualified 
by some property in a data base Q is called a 
query. The part of the collection of records Q 
qualified for a query, therefore, can be identi- 
fied by its characteristic subset of the space of 
all possible records. A procedure that deter- 
mines the subset Q, of Q composed of all the 
records relevant to a query q is called the 
information retrieval of the query. A system 
which provides such a procedure for every 
query q€Q is called an information retrieval 
system organized for Q with respect to Q. 

It is essential to design the system in such a 
way that records can be retrieved quickly for 
queries in a certain class. Such a selected set of 


` queries can be composed of all the first-order 


queries specifying an item and asking for all 
the records containing the attribute value or 
the key characterizing the item. It can, in some 
cases, include second- or higher-order queries 
specifying a number of items and asking for all 
the records in which attribute values or keys 
characterizing the item occur simultaneously. 

In an information retrieval system, the mas- 
ter file of a data base Q is usually organized by 
way of auxiliary memories on data structures 
such as sequential files, indexed sequential 
files, virtual storage files, or direct access files, 
using the magnitude of the accession number 
or primary key of each record. Various man- 
agement systems, called SAM (Sequential 
Access Method), ISAM (Index Sequential 
Access Method), VSAM (Virtual Storage 
Access Method), or DAM (Direct Access 
Method), exist. 

In addition to the organization of such a 
master file, various directory files or indexes 
are organized in order to retrieve each query 
qEQ quickly, because there are many relevant 
keys or combination of keys other than the 
primary key. An inverted filing scheme (IFS) is 
a typical scheme for organizing such indexes 
or directory files. A bucket B, or an address- 
able set of secondary memories is provided 
for each canonical query q,€Q in a one-to-one 
way. An index or a list of accession numbers of 
pertinent records is organized in each of the 
buckets contiguously so as to make it possible 
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to retrieve all accession numbers of relevant 
records quickly. The essentials of an IFS is 

to define a one-to-one QAT (Query to Ad- 
dress Transformation) from the inverted set of 
queries Q= {q,} to the set of buckets B= {B,}. 

An IFS has efficient retrieval performance 
with respect to the inverted canonical queries. 
The scheme, however, may in some cases 
require a large number of buckets. Moreover, 
the scheme requires quite redundant storage 
of accession numbers by a number of buc- 
kets because a record may be pertinent to a 
number of canonical queries simultaneously. 
Although higher retrieval performance can be 
expected by including higher-order queries in 
the inverted set, the space and machine time 
needed for such organization is prohibitive. 
This is one of the reasons why an IFS for first- 
order queries is preferable in almost all prac- 
tical cases. Another reason is that if the first- 
order queries are inverted, every retrieval can 
logically be performed by certain Boolean 
operations executed among a certain number 
of retrieved sets of accession numbers. A trade- 
off of space and time needed for the organiza- 
tion of a scheme and its retrieval performance 
might be the determining factor for the selec- 
tion of the set of canonical queries to be 
inverted. 

An attempt to overcome the limitations 
inherent in the inverted scheme can be found 
in the work on the balanced file organization 
scheme due to Abraham et al. [7]. By extract- 
ing the essentials, Yamamoto et al. [9, 10] 
defined a BFS (Balanced File-organization 
Scheme) in a wider sense as follows: 

(i) Buckets are organized in such a manner 
that every bucket is associated with more than 
one query. 

(ii) Every canonical query is associated with a 
unique bucket. 

(ii) The accession number of a record with 
some additional information is stored in a 
bucket once if and only if it is pertinent to at 
least one of the associated queries. 

The essentials of a BFS is to define a many- 
to-one transformation from the set of canon- 
ical queries Q to the set of bucket addresses B, 
or an MQAT (Multiple Queries to Address 
Transformation). An MQAT defines a parti- 
tion of Q into mutually disjoint subsets. It is 
a generalization of a QAT which defines an 
IFS. If c is the number of queries to be asso- 
ciated simultaneously with a unique bucket 
in a BFS, then the number of buckets to be 
prepared is 1/c, which is a drastic reduction 
from that for an IFS. This reduction makes 
it possible to extend the feasible range of 
canonical queries. Actually, in the system 
HUNDRED (Hiroshima University New 
Documents REtrieval and Dissemination), a 
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51-to-1 MQAT is used for the organization of 
indexes. 

Although some space overload may occur 
by storing some additional information in 
order to tell which record is pertinent to which 
query in a bucket, reduction of redundancy 
can be expected, because a record may be 
pertinent to more than one query simulta- 
neously. The reduction in the number of buc- 
kets and redundancy would contribute much 
in saving time and space needed for the or- 
ganization of indexes. 

Among those BFS’s defined by MQAT’s 
with a given data base Q, Q, and the number c 
of queries associated with the same bucket, 
one BFS can be called the best if its redun- 
dancy is the least under some reasonable as- 
sumptions imposed on the distribution of keys. 
Several combinatorial problems have been 
raised and solved in this connection [10]. 
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97 (1.10) 
Decision Problem 


Suppose that we are given a set S and a propo- 
sition P(x,,X,...,X,) for elements x, of S. 
Then we have the problem of universal validity 
of P, which is the problem of finding a general 
algorithm (i.e., a finitary procedure) by which 
we can discern whether P(x,,...,x,) is true for 
all n-tuples (x,,...,x,). The problem of finding 
an algorithm by which we can discern the 
validity of P(x,,...,x,) for some specifically 
chosen n-tuples (x,,...,x,) is called the prob- 
lem of satisfiability of P. These two problems 
are customarily called decision problems. The 
problems are such that affirmative solution of 
one of them implies negative solution of the 
other. 

To give a precise definition of decision prob- 
lems, let us note that a tfree semigroup with 
countable generators can be identified with a 
subset of the set N of natural numbers (by 
virtue of tGodel numbering; — 185 Gödel 
numbers). On the other hand, if S is a given 
formal system with countably many symbols, 
the set of all *formulas in © is a subset of the 
free semigroup generated by the symbols in GS. 
Thus the set of all formulas in S is identified 
with a subset of N. A subset M of N (or N x 
N x... x N) is (general) recursive if its tre- 
presenting function is general recursive (— 356 
Recursive Functions). By using the concept of 
recursive function, a precise definition of the 
decision problem can be given as follows: The 
decision problem of M is solved affirmatively if 
and only if we can obtain effectively a proce- 
dure defining the representing function of M, 
and the function is general recursive. The 
decision problem of is solved negatively if 
and only if we can obtain a proof that M is 
not recursive. 

For a set A of formulas in S, we let g(A) be 
the set of all Gödel numbers corresponding to 
the elements of A. Let A’ be the set of formulas 
in A that are deducible in ©, and let t(A)= 
g(A’). The decision problem of the set A of 
formulas is said to be solved affirmatively 
(negatively) if the decision problem of 1(A) is 
solved affirmatively (negatively). By refining 
this concept we arrive at the notion of the 
degree of (recursive) unsolvability. Let A and B 
be subsets of N. The relation “A is recursive in 
B and B is recursive in A” (— 356 Recursive 
Functions) is reflexive, symmetric, and transi- 
tive. Hence this relation decomposes the class 
of all subsets in N into disjoint nonempty 
equivalence classes. A and B are defined to 
have the same degree of unsolvability if they 
belong to the same equivalence class. Thus the 
degrees of unsolvability can be identified with 
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the equivalence classes. The degree of recursive 

sets is 0. 

The relation a<b ts defined between the 
degrees a of A and b of B to mean “A is recur- 
sive in B.” Clearly, for any degree a, we have 
0 <a. The partially ordered system of degrees 
constitutes an tupper semilattice. 

Research on the decision problem has been 
done mostly in areas related to the tfirst-order 
predicate calculus L' and the formal systems 
on it. We now list some important results. 

(I) Results concerning L!. The decision 
problem has been solved negatively for the sets 
of formulas of the following forms. (Here it is 
assumed that no function symbols appear and 
that YW represents a formula involving no 
occurrence of Y, J, or free variables.) 

(1) All formulas in L! (A. Church, A. M. 
Turning), 

(2) 4x,4x,...4x,,Vy, Vy, ... Vy, A (T. Skolem), 

(3) dx,4x,4x3Vy,Vy,... Vy,dz A (K. Gödel), 

(4) 4x, 4x, Vy, Vy. ... Vy, 3z A (L. Kalmar), 

(5) 4x, 4x,Vy4z,Jz, ... 3z, M (J. Pepis), 

(6) Vx dy Vzdu,du,... Ju, MA (W. Ackermann), 

(7) 4x,4x,4x,Vy A or 
dx, Jx Yy 3z A (J. Surányi), 

(8) 3x Vy, Vy,4z,4z, A or 
Yx dy Vz Ju, Ju, A (Surányi). 

The decision problem has been solved 
affirmatively for the sets of formulas of the 
following forms, where it is assumed again 
that no function symbols appear and that A 
is as above. 

(1) All formulas involving variables only on 
predicates with one argument. (L. Lowen- 
heim, Skolem, H. Behmann), 

(2) Vx, Vx, ...Vx,, A (P. Bernays, M. Schonfin- 
kel, Ackermann), 

(3) Vx, Vx... Vx,,4y,4y2... dy, M (Bernays, 
Schonfinkel, Ackermann), 

(4) Vx,Vx,...Vx,,4y,4y,Vz,Vz,... Vz, A 
(Godel, Kalmar, K. Schutte). 

(II) Results concerning forma! systems on 
L!. Throughout the rest of this article we 
assume that no tfunction variables appear. 
Predicate constants, function constants, and 
object constants may appear. By the decision 
problem for a formal system £ we mean the 
decision problem for all tclosed formulas in &. 
Most of the results obtained so far concerning 
the decision problem for formal systems have 
been negative. Such results include those for 
formal systems formalizing natural number 
theory, the theory of rational integers, the 
elementary theory of tgroups, trings, tfields, 
tlattices, and the like, and axiomatic set theory 
(A. Tarski et al.). 

The word problem for groups was solved 
negatively by P. S. Novikov (— 161 Free 
Groups B). In connection with this deci- 
sion problem, there are some investigations 
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by W. W. Boone, G. Higman, and others 
[14-17]. 

The decision problem for a formal system 
formalizing the elementary theory of tAbelian 
groups has been solved affirmatively (W. 
Szmielew). Little is known about the decision 
problem concerning partial systems of for- 
mulas of given formal systems except the fol- 
lowing: (1) the decision problem for the set of 
formulas of the form Vx, Yx, ... Vx,, M in a 
formal system (— 161 Free Groups); (2) the 
Hilbert-type problem, which is the decision 
problem for the set of formulas of the form 
dx, Ix, ... Ix,, (t= 8s) in a formal system. In 
particular, the Hilbert-type problem in a 
formal system formalizing natural number 
theory is called Hilbert’s tenth problem (— 
196 Hilbert). The latter is the problem of find- 
ing an algorithm for deciding whether a *Dio- 
phantine equation has an integral solution. 

This decision problem was studied by M. 
Davis, H. Putnam, J. Robinson, and others, 
and finally Yu. V. Matiyasevich solved it neg- 
atively by showing that every recursively 
enumerable relation is Diophantine [9]. (A 
relation R(m,,...,m,) is called Diophantine if 
there is a polynomial P(x4,.-., Xj, Yis -+3 Yk) 
with integer coefficients such that R(m,, ...,m;) 
holds if and only if P(m,,...,m;,V1,-..,Y,)=0 
has a solution for y,,..., y, in natural num- 
bers.) In addition, some investigations have 
been made about the tsecond-order predicate 
calculus L?, tintuitionistic logic, ete. [1,2]. 
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98 (XXI.18) 
Dedekind, Julius Wilhelm 
Richard 


Julius Wilhelm Richard Dedekind (October 6, 
1831—February 12, 1916) was born in the city 
of Braunschweig in central Germany and 
studied at the University of Göttingen under 
C. F. tGauss, who was then in his later years. 
He received his doctorate at Géttingen with a 
thesis on the tEuler integral. He was professor 
of mathematics from 1858 to 1862 at Ziirich 
and from 1863 to 1894 at the Technische 
Hochschule in Braunschweig. During his early 
twenties, he wrote papers concerning analysis 
and the theory of probability, but in 1857 he 
began publishing papers on the theory of 
numbers. He edited *Dirichlet’s lectures on 
number theory (Vorlesungen tiber Zahlen- 
theorie, first edition 1863, fourth edition 1899) 
and concentrated on research in arithmetic 
and algebra. The theory of tideals, which he 
founded, was originally set out in a supple- 
ment (1863) to Dirichlet’s Vorlesungen. 
Dedekind treated subjects ranging from the 
axiomatic foundations of the theory of ideals 
to tlattices and tgroups as algebraic systems. 
He was a pioneer of the abstract algebra of the 
20th century. Among his notable achievements 
are the {Dedekind zeta functions of talgebraic 
number fields, Dedekind cuts in the theory of 
real numbers, the algebraic theory of talge- 
braic functions (of which he was a coauthor 
with H. Weber), and the theory of natural 
numbers. He was one of the first to support 
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*Cantor’s set theory. His theory of natural 
numbers was founded on the concept of sets 
and included the idea of trecursive functions. 


References 


[1] R. Dedekind, Gesammelte mathematische 
Werke J-III, Vieweg, 1930—1932 (Chelsea, 
1969). 

[2] M. Cantor and R. Dedekind, Briefwechsel, 
Cantor-Dedekind, Actualités Sci. Ind., Her- 
mann, 1937. 


99 (1X.4) 
Degree of Mapping 


A. Degree of Mapping 


Let M" and N” be n-dimensional tclosed 
toriented tC’ manifolds (or tcombinatorial 
manifolds). For example, M" = N” = S" (the tn- 
sphere). Their nth thomology groups with 
integral coefficients H,(M"; Z) and H,(N"; Z) 
are infinite cyclic groups generated by the 
fundamental homology classes [M"] and [N"], 
respectively (— 201 Homology Theory). A 
continuous mapping f:M"— N” induces a 
homomorphism f,:H,(M"; Z)> H,(N"; Z), and 
there exists an integer d; such that /,([M"])= 
d,[N"]. This integer d; is called the degree of 
mapping (or the mapping degree) of f. When 
M" = N", d; does not depend on the orienta- 
tion of M”. 

If a continuous mapping g: M"-~N" is 
*homotopic to f (f =g), then we have d,=d,. 
If f is homotopic to a tconstant mapping 
(f ~0), then d,=0, while if f is a homeomor- 
phism, then d;= +1. When M” = N”, a homeo- 
morphism f: M” =M" is called an orientation- 
preserving mapping if d= 1 and an orientation- 
reversing mapping if d, = —1. 

Suppose that M” and N” are closed oriented 
n-dimensional combinatorial manifolds and 
that f:M"—>N" is a tsimplicial mapping. Let 
Ło; and Ès; (or, s; are n-simplexes of M”, 
N”, respectively) represent [M"], [N"], and let 
p; (resp. q;) be the number of n-simplexes o; 
such that f(g;’) is equal to s; (resp. —s;). Then 
p;—q; is independent of the index j and equal 
to dy. 

Suppose that f, g: S"—S” are continuous 
mappings (n> 1). Then f =g if and only if d; = 
d, (Brouwer mapping theorem). This implies 
that z,(S")=Z (— 202 Homotopy Theory). 
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B. Local Degree of Mapping 


Suppose that M” and N” are n-dimensional 
oriented C’ (or combinatorial) manifolds and 
f:M"—>N" is a continuous mapping. Suppose 
further that a point p of M” has a neighbor- 
hood U such that f(p)# f(q) for any point q 
contained in U—{p}. Then f induces a homo- 
morphism f,:H,(U, U—{p})>H,(N, N — 

{ f(p)}) of n-dimensional tlocal homology 
groups with integral coefficients that are both 
isomorphic to Z. If u and v are generators of 
the groups H,(U, U —{p}) and H,(N, N — 

{ f(p)}) corresponding to orientations, re- 
spectively, then there exists an integer k such 
that f,(u) = kv. We call this integer k the local 
degree of mapping f at p. If M" and N" are 
closed oriented C" manifolds (r > 1) and f: M” 
+N" is a C" mapping, then there exists a point 
r of N” such that the set f ‘(r) is a discrete 
subset {p;,...,p,} of M", and each p; has a 
neighborhood U, satisfying the foregoing con- 
dition (tSard’s theorem). If k; is the local de- 
gree of f at p;, then dp = Dk;. 


C. Linking Numbers 


Given two mutually disjoint smooth closed 
curves C, and C, in Euclidean 3-space, a quan- 
tity Lk(C,, C,) indicating how closely they 

are interlinked with each other was given by 
Gauss as follows: Let C; be expressed by the 
parameters x;=x,(t;) (i= 1, 2), where x;,(t,) are 
tcontinuously differentiable. Then the quantity 
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is an integer called the linking number of C, 
and C}. 

More generally, let M” be an n-dimensional 
oriented tcombinatorial manifold (or C" mani- 
fold (r>1)) and K and K* its fcellular decom- 
positions such that K* is dual to K. Let 2} and 
z5 (r+s=n—1) be tboundaries belonging to 
the complex K and K*. Suppose that C’*? is 
any tchain of K whose boundary is z}. Then 
the tintersection number [C’*!]-[z$] does not 
depend on the choice of such a chain C’*’. We 
set Lk(z',, 25)=[C"*']-[z5] and call it the link- 
ing number of z{ and z3. The linking number 
Lk(24, 25) of tsingular boundaries 24, 25 (r +s = 
n— 1) of M” is similarly defined by consider- 
ing the approximations z{, z5 of 21, Z3 belong- 
ing to a suitable cellular decomposition K and 
its dual K*. The number Lk(Z}, 25) is bilinear 
with respect to 21, Z3, and we have Lk(Z{, Z5)= 
(—1)'5"! Lk(Z5, 24). In the example in 3-dimen- 
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sional Euclidean space R? shown in the left 
half of Fig. 1, we have Lk(Z!, z3)=1, while 
Lk(2!, 23)=2 for the example shown in the 
right half of the same figure. In particular, if Z{ 
is homologous to 0 in M"—|25|, then we have 
Lk(Z{, 75) =0 (Fig. 2). Generally, if Z] and Z7 
are homologous in M"—|Z5|, then Lk(Z}, 75) 
= Lk(27, Z5). 


Fig. 1 


D. Order of a Point with Respect to a Cycle 


Let M” be an n-dimensional oriented com- 
binatorial manifold (or C” manifold (r> 1)) 
with the nth Betti number b, =0, 2""? an 
(n—1)-dimensional singular boundary of M”, 
and o a point of M” that is not contained in 
|2"- "|. We set ord(2""', 0)= Lk(Z""', o) and call 
it the order of the point o with respect to 2""'. 
For example, when M"=R? and 2! ={ f()|O< 
t<1,f(0)=/(1)}, where f is a continuous 
function, the order ord(Z!', o) is equal to the 
rotation number around o of a moving vector 
of (0 as t varies from 0 to 1. This ord(zZ!, o) 
stays invariant as the point o moves in a tcon- 
nected component of the complement R? —|2"| 
(Fig. 3). On the other hand, if 2! = { f(]0 < 
t<1, f,(0)=f,(1)} (i=0, 1) are closed curves in 
R? and the distance p( f,(t), fi (©) is smaller 
than p( folt), 0) for all t in the interval [0, 1], 
then we have ord(z}, 0)= ord(Z!, 0) (Rouché’s 
theorem). 


Fig. 2 Fig. 3 
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100 (X.17) 
Denjoy Integrals 


A. History 


For a real-valued function f(x) of a real vari- 
able to be tLebesgue integrable, it is necessary 
and sufficient that there exist an tabsolutely 
continuous function F(x) such that F’(x)= f(x) 
at talmost all points x (— 221 Integration 
Theory D). In general, the derivative of a func- 
tion is not necessarily Lebesgue integrable. A 
function f(x) is Lebesgue integrable if and only 
if | f{(x)| is integrable. Hence a function which 
has an improper Riemann integral is not neces- 
sarily Lebesgue integrable (— 221 Integration 
Theory A). For this reason, it is desirable to 
extend the concept of Lebesgue integrals. In 
1912, A. Denjoy constructively defined a new 
concept of integrals (Denjoy integral in the 
restricted sense; — Section D), which is an 
extension of both Lebesgue and Riemann 
integrals. Later, N. N. Luzin provided the 
descriptive theory of this integral. Indepen- 
dently, and nearly simultaneously, A. J. Khin- 
chin and Denjoy defined a more general in- 
tegration (Denjoy integral in the wide sense 
(1916); — Section D). 

In 1914, O. Perron, independently of Den- 
joy, defined a concept of integrals (Perron 
integrals) that is equivalent to that of Denjoy 
integrals in the restricted sense. To establish 
this concept, Perron considered the differen- 
tial equation y'= f(x) and utilized a method 
similar to the one used in the proof of the 
existence theorem for the solution of the dif- 
ferential equation y'= f(x, y). However, the 
concept of Denjoy integrals is inadequate to 
treat unbounded functions. Thus to extend 
the concepts of Riemann and Lebesgue inte- 
grals, various ideas have been introduced; for 
example, Denjoy (1921), J. C. Burkill (1951), 
and R. D. James (1950) introduced new con- 
cepts as byproducts of investigations concern- 
ing the coefficients of trigonometric series 
[2,3]. The A-integral concept devised by A. N. 
Kolmogorov was meant to deal with the pro- 
blem of the conjugate function of Fourier 
series [4]. As a certain completion of the space 
of functionals of step functions, K. Kunugui 
defined the notion of E. R. integrals (1956), 
which coincides with that of A-integrals in a 
special case [5,6]. 

What has been stated so far deals only with 
functions of a real variable. Concerning the 
extension of Denjoy integrals to the case of 
several variables, research has been done by 
M. Loomis, S. Kempisty, S. Nakanishi (née 
Enomoto), and others [7, 8]. 
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B. Approximate Derivative 


If we have limy 94.0 MEN(E—h, č +k) (ht 
k)=1 at a point č of a tmeasurable subset E 
of the real line, where m is the 1-dimensional 
Lebesgue measure, the point é is called a point 
of density for E. Almost all the points of E are 
points of density for E (Lebesgue’s density 
theorem). Let E be a measurable set having xo 
as a point of density, and let F(x) be a measur- 
able function on E. If there exists a number / 
such that for each e>0, xọ is a point of density 
for the set {x|!—e<(F(x)— F(x9))/(x —x9) < 
I+¢,xeE}, then l is called the approximate 
derivative of F(x) at xg and is denoted by 

AD F(x,). If AD F(x) exists, F(x) is said to 

be approximately derivable at xo. If F(x) is ap- 
proximately derivable at each point of E, then 
F(x) is said to be approximately derivable in 
E. If F(x) exists at a point x, then AD F(x) 
exists at x, and we have AD F(x)= F(x). How- 
ever, there exists a continuous function F(x) 
that is approximately derivable at almost all 
points of an interval and yet not differentiable 
at any point of a set of positive measure. 


C. Generalized Absolute Continuity 


Let E be a set in R, and let F(x) be a real- 
valued function whose domain contains E. If 
for each ¢>0, there is a ô >0 such that for 
every sequence {[a,, b,]} of nonoverlapping 
intervals whose endpoints belong to E the 
inequality 2(b,—a,)<6 implies }|F(b,)— 
F(a,)|<é, then the function F(x) is said to be 
absolutely continuous on E. We denote by AC 
the set of all functions that are absolutely 
continuous on E. If F(x) is continuous on E 
and E is the union of a countable sequence of 
sets E, on each of which Fe AC, then F(x) is 
called a generalized absolutely continuous 
function, and we write Fe GAC. If Fe GAC, 
AD F(x) exists almost everywhere. 

If, for each ¢>0, there is a ô >0 such that for 
every sequence {[a,, b,]} of nonoverlapping 
intervals whose endpoints belong to E the in- 
equality (b, —a,)<e implies ©), O{F;[a,,b,]} 
<«(O{F;[a,,b,]} denotes the oscillation of 
the function F(x) in [a,, b,], i.e., the difference 
between the least upper bound and the great- 
est lower bound of the values assumed by F(x) 
on [a,,5,]), then F(x) is said to be absolutely 
continuous in the restricted sense (or abso- 
lutely continuous (*)) on E; and we write Fe 
AC(*). Just as we defined the notions of gen- 
eralized absolute continuity and absolute con- 
tinuity, so we define the notions of generalized 
absolute continuity in the restricted sense and 
generalized absolute continuity (+). Thus Fe 
GAC(«) means that F(x) is a generalized ab- 
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solute continuous (*) function on E. If Fe 
GAC(«), then F’(x) exists almost everywhere. 


D. Definitions of Denjoy Integrals 


Let f(x) be a real-valued function defined on 
I=[a,b]. If for f(x) there exists a function 
F(x) that belongs to GAC on J and for which 
AD F(x)= f(x) holds almost everywhere, then 
f(x) is said to be Denjoy integrable in the wide 
sense (or D-integrable) on J. We call F(b)— 
F(a) the definite D-integral of f(x) over J, 

and denote the value by (D){? f(x)dx. The 
function F(x) is called an indefinite D-integral 
of f(x) on I. Similarly, we obtain the definition 
of Denjoy integral in the restricted sense (or 
D(+)-integral) by replacing GAC by GAC(x) 
and AD F(x) by F(x) in the definition of the 
D-integral. If a continuous function F(x) satis- 
fies the equality AD F(x) = f(x) # +00 (F'(x)= 
f(x) # +00) for all except countably many 
points in 7, then F(x) is an indefinite D- 
integral (D(x )-integral) of f(x). A Lebesgue- 
integrable function is D(«)-integrable, a D( + )- 
integrable function is D-integrable, and a D- 
integrable function that is almost everywhere 
nonnegative is Lebesgue integrable. 


E. Constructive Definition of Integrals 


Let S be a functional whose domain |_), K(S; 1) 
consists of the union of sets K(S; J) of real- 
valued functions defined on closed intervals 
I=[a,b]. If f belongs to K(S; I), we denote 
the value S(f) by S(f;1). Such a functional S 
is called an integral operator if the following 
three conditions are satisfied: (1) If fe K(S; Ip) 
and J is an arbitrary interval contained in Ip, 
then the trestriction f, to I of f also belongs 
to K(S; I). Also, S( f; I) is a fcontinuous addi- 
tive function of the interval I c Jy. (2) Let I, = 
[a, b], I, =[b, c], and I=[a,c] (a<b<c). If for 
a function f defined on J, f, € K(S;1,) and 

f,€ K(S; L), where fı =f;, and fz = fi, then 
feK(S; J). (3) If f is identically 0 on J, then 
feK(S;1) and S(f; D=0. For two integral 
operators S, and S,, we say that S, includes S, 
(or S; is weaker than S,) if K(S,; Dc K(S,; 1) 
for every I and S,(f; 1) =S,(/; J) for every 
feK(S,; 1). The D-integral (D( *)-integral) is 
the weakest integral operator containing the 
Lebesgue integral and satisfying the following 
two conditions, (C) and (H) (resp. H(*)): (C) 
Cauchy’s condition. If, for every function f 
defined on lọ, we have fe K(S; I) for any I= 
[a+06,b—e] ẸṢ lo =[a, b], and also if the finite 
limit lim;_.9,-.0 S( f; J) exists, then fe K(S; Ip) 
and S(f;J,) coincides with the foregoing limit 
value. (H) Harnack’s condition. Let E be a 
closed subset of Ip, {1,} be a sequence of inter- 
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vals contiguous to the set consisting of the 
points of E and the endpoints of lọ, and f be a 
function on Jy satisfying the following three 
conditions: (i) fse K (S; lọ), where f-(x) = f(x) 
whenever xe E and f, =0 otherwise; (ii) f = 
J, € K(S; 1,) for each k; and (iii) Dy |S( fx; Il < 
+% and lim,.,,, O(S; f,; I) =0 when the se- 
quence {/,} is infinite. Then it follows that 
feK(S; lo) and S(f; 1o) =S(fe; fo) + Lx S( Je; I). 
(Here O(S; fx; 1,) denotes the variation of S(/,) 
on Jp, that is, the least upper bound of the 
numbers |S(f,;J)|, where J denotes any sub- 
interval of J,.) We obtain condition (H(*)) by 
replacing condition (iii) in (H) with a more 
restrictive condition: ©, O(S; f,; I) < +œ. The 
constructive definition of the Denjoy integral 
in the wide sense (the Denjoy integral in the 
restricted sense) is obtained by ttransfinite 
induction starting with the Lebesgue integral 
and using two methods, (C) and (H) (resp. 
(H()), of extensions. 


F. Perron Integrals 


Given a function f(x) defined on an interval 
[a,b], suppose that F(x) is a function defined 
on the same interval such that (1) F(x) > f(x); 
(2) F(x) # —o0 (resp. (1) F(x) < f(x); (2) F(x) # 
+00) at every point x, where F(x) (resp. F(x)) 
denotes the tlower (upper) derivative of F(x). 
In this case, F(x) is called a major (minor) 
function of f(x). If for any ¢>0 there is a major 
function w(x) and a minor function g(x) of 
f(x) such that y(b)— p(b)<e, then f(x) is 

said to be Perron integrable. We denote by 
(P){? f(x) dx the value inf, {y(b)—y(a)} = 

sup, {p(b)— 9(a)}. 


G. Properties of Integrals 


If { fi} is a nondecreasing sequence of func- 
tions that are D-integrable on an interval 
[a,b] and whose D-integrals over [a,b] consti- 
tute a sequence bounded from above, then the 
function f(x)=lim,,_,,, f(x) is D-integrable on 
[a,b], and we have 


b b 
Éf fœ)dx= lim | Sn (x) dx. 


If F(x) is a function of tbounded variation 
and g(x) is a D-integrable function on an 
interval [a, b], then F(x)g(x) is D-integrable on 
[a, b]; moreover, denoting by G(x) the inde- 
finite D-integral of g(x), the following formula 
is valid: 


b 
o| F(x)g(x)dx 


a 


b 
= G(b) F(b)— cora- | G(x) dF (x), 


a 
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where the last term is the tStieltjes integral 
(integration by parts). 

If F(x) is a nondecreasing function and g(x) 
is D-integrable on [a,b], there is a point č in 
[a,b] for which the following formula is valid: 


of g(x) F(x) dx 


č b 
=F(a)-(D) | glx)dx-+ F(b): (D) | g(x)dx 
a č 
(the second mean value theorem). 
The foregoing theorems remain valid if D 
is replaced by D( *) in the hypotheses and 
conclusions. 
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Descartes, Rene 


René Descartes, (March 31, 1596—February 
11, 1650), philosopher, mathematician, and 
natural scientist, was born in the province of 
Touraine in France. He became dissatisfied 
with his studies of scholastic philosophy in the 
Jesuit Academy in La Flèche, and later, in 
1619, while stationed in Ulm during a tour of 
duty in the army, he underwent a philosoph- 
ical conversion. He had an idea of methodo- 
logically unifying the various fields of interest 
to him using mathematics as a model. He 
returned to Paris in 1621, but moved to Hol- 
land in 1628 to concentrate on his work. Swe- 
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den’s Queen Christina invited him in 1649 
to that country, where he died the next year, 
evidently from a combination of cold and 
overwork. 

Descartes, often considered the founder of 
modern philosophy, discarded early the tra- 
ditional theological view of the world and 
stated that all knowledge should be recognized 
as logical only after it has been submitted to 
rational criticism. This ushered in the modern 
view of the world based on mathematics and 
physics. In 1637, he published Géométrie as an 
appendix to his Discours de la méthode, which 
also contained his works on optics and meteo- 
rology. In it, he promoted F. tViéte’s symbolic 
algebra, which he applied to geometric prob- 
lems. His idea that algebra could be used as a 
general method for geometry established him 
as the founder of tanalytic geometry. 
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Design of Experiments 


A. General Remarks 


The design of experiments is a part of the 
statistical planning required to collect the data 
appropriate to the purpose of statistical in- 
ference (— 401 Statistica! Inference) in various 
fields of scientific research and application. 
The main purposes of the design of experi- 
ments are (1) to analyze a given statistical 
linear model (— 403 Statistical Models) and 
(2) to devise a “good” statistical linear model. 
Sometimes this term also refers to a statistical 
method including the fanalysis of variances. 
Thus the purpose of designing an experiment 
is to provide the most efficient and economical 
methods of reaching valid and relevant conclu- 
sions from that experiment (— 403 Statistical 
Models). 

R. A. Fisher, whose contributions to statis- 
tical theory were remarkable and far-ranging, 
propounded three required principles to con- 
trol the experimental field in order to guaran- 
tee the validity of statistical methods and to 
increase the sensitivity of experiments: (i) re- 
plication, or the repetition of the set of all the 
phenomena to be compared in the experiment, 
for the evaluation of experimental error var-- 
iance; (ii) randomization, or the procedure 
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allotting various experimental treatments at 
random, in order to change systematic errors 
into random errors; and (iii) local control, 
or the procedure which makes the variation 
within each experimental block as small as 
possible, in order to minimize or remove sys- 
tematic errors. These are called Fisher’s three 
principles. A design satisfying principles (i) and 
(ii) is called a completely randomized design; it 
enables us to attach a probability statement to 
estimated treatment differences by obtaining 
valid estimate of experimental error variance. 
Let an n-dimensional frandom variable Y = 
(Y,,..., Y y be represented by a linear model 


Y=XéE+W, (1) 


where X is a given n x s real matrix, €=(€,, 
<... É) is an s-vector, and W=(W,,..., W, y isa 
random vector with the texpectation E(W) = 
0. Then Y is called the observation vector, W 
the error vector, č the effect vector of Y, and 
X the design matrix. 

According to the properties of the effect 
vector č, the linear model (1) is separated 
into three classes: (i) The class of fixed-effects 
models for which € is a fixed unknown para- 
meter. In this case, the component €; of é is 
called a fixed effect, and a linear function z= 
F’é of é, with a given coefficient vector F is 
called a linear parameter or parametric func- 
tion. (ii) The class of random-effects models for 
which the components ©, of € are random 
variables. In this case, each component &; is 
called a random effect, and é is denoted by =. 
(iii) The class of mixed models for which there 
are both fixed effects č; and random effects 5; 
in é. In this case, the model (1) becomes 


Y=X,214+X,224+W, (2) 


where €'=(é,,...,¢,)' is a fixed-effect vector 
and S?=(£,,,,...,&,) is a random-effect vec- 
tor. The conditions frequently assumed for the 
tdistribution law of Y are: (a) The errors W, 
(i=1,...,n) are uncorrelated and E(W,)=... 
= E(W,)=0. (b) The errors W, (i=1,...,) have 
a common unknown tvariance o°. (c) The 
errors W, (i=1,...,n) have the tnormal distri- 
bution. (d) The random effects =; are uncor- 
related and independent of the error vector W. 
(e) The random effects =, have a common 
unknown variance o?. (f) The random effects 
=; have the normal distribution. 

Let L(X) be a ‘linear subspace of R” span- 
ned by the column vectors of X. The linear 
model 


Y=Xé+W (3) 


is called a hypothesis on the linear model (1) if 
L(X) cL(X). 
The main issues of the theory of design of 


experiments are concerned with (I) statis- 
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tical inferences, such as estimation or testing 
hypotheses, under models (1), (2), (3), (1), (11), 
(iii); AI) determination of the design matrix X 
satisfying certain optimal conditions; (IIT) 
construction of a theoretical foundation that 
can explain the validity of the statistical treat- 
ment of the observed data by means of the 
above models. 


B. Block Design 


The design of experiments is described here in 
terms of the so-called block design. There are n 
experimental units x= 1, ...,n called plots, and 
an observation Y, is assigned to each plot x. A 
block is constructed with several plots under 
Fisher’s principle (iii), and the number of plots 
in a block is called the block size, the jth one 
being denoted by k, j=1,...,b, with Dj k;= 

n. One of v operations, called treatments or 
varieties, is applied to each plot. It is assumed 
that the observation Y, at the plot « in the 

jth block under the ith treatment has the 
structure 


Y =é ++ W, 
The €;,i=1,...,v, are called treatment effects, 
and the 7;,j,...,b, block effects. It is also as- 


sumed that $}; č;=0. In this case, Y is repre- 
sented in matrix notation as 


Y=0¢+ Yy+W, (4) 
where ® =(Q,;), e=1,...,n, i=1,...,v, with 


when the ith treatment is applied 


1 
Pai a 
0 otherwise, 


to the plot «, 
and Y =(,;), =1,...,n,j=1,...,b, with 


1 
n-d 
0 


Here it is assumed that X; pu = 1, Xa Qai = 
n21, Dinan, Lj =!, and X, Yaj kj l. 
We call r; the number of replications of the ith 
treatment. We set N =(n;)=P'Y. Then n; is 
the number of observations in the jth block to 
which the ith treatment is applied. The matrix 
N is called the incidence matrix of the block 
design. 


when the plot « belongs to 
the jth block, 
otherwise. 


In any experiment, each plot has its own - 
effect. The blocks are constructed so that this 
plot effect in each block becomes as homoge- 
neous as possible, although it is impossible 
to eliminate the effect completely. For this 
purpose, randomization is adopted. Suppose 
that we are given k plots in a block and v 
treatments (k <v). Then randomization is 
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utilized to select a treatment out of v treat- 
ments to be allocated to each plot so that the 
selection is “at random.” Then the plot effects 
are random, and the error term in model (4) 
can be considered to be the sum of a plot effect 
and an original error. 

A block design satisfying Fisher’s three 
principles is called a randomized block design. 
Blocks that can accommodate all the treat- 
ments to be studied are called complete blocks. 
Those that do not contain all the treatments 
are called incomplete blocks. Blocking can be 
considered to be an extension of pairing. In 
the terminology of block design, we can say 
that many experiments are block experiments, 
or even (tautologically, if we allow those with 
only one block) that all experiments are block 
experiments. 

In a block design N =(n,;), treatments io 
and i, are said to be connected if there exists a 
chain 
lojii j2 -eh jil 
of integers such that 1 <i, <v(p=0, 1, 2,...,0), 
1<j,<b (q=1,2,...,), and n; j, >0,n;,;,>9, 
Ni j, > 0, ni > 0, n> 0. If all pairs of 
treatments are mutually connected, then the 
design is said to be connected. In this case, the 
rank of the matrix C defined in Section C is 
v— 1. Hf the design is disconnected, then the 


incidence matrix N can be partitioned into 
two or more connected portions, e.g., 


N= 
0 N, 


Thus without loss of generality we can restrict 
ourselves to the connected case. 


C. Estimation under the Fixed-Effects Model 


Conditions (a) and (b) of Section A are as- 
sumed here. The tnormal equation which gives 
the tleast square estimates Ê and 4 of é and y, 
respectively, is 


(enG) 
y ( > ) i iv y - (5) 


Set ®'@ = diag(r,,...,7,)=D,, YY = 
diag(k,,...,k,)=D,, C=D,— ND; 'N', Q= 
(®© — ND; 'Y)Y, where diag(...) means a 
diagonal matrix with the diagonal elements 
.... Then (5) reduces to 


Cé=Q, 
Let L be an orthogonal matrix that transforms 
the matrix C to a diagonal form; that is, L'CL 


= diag(p,,.-., Pp-1,0)=A, pi> 0 for all i. Set 
A* =diag(p;',...,9,,,0), C* =LA*L’. Then Ê 


=D; (YY —N’E). (6) 
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=C*Q is a particular solution of (6) and £; Ê 
=. A parametric function z= F’€é with coeffi- 
cient vector F=(F,, F,,..., F,) is called a treat- 
ment contrast if the sum È; F; of coefficients 
vanishes. A treatment contrast n = F'é is called 
a normalized contrast if F'F = 1. It is called the 
elementary contrast if F has only two nonzero 
elements 1 and —1. Elementary contrasts of 
treatment effects show the comparison of 
treatments involved in them. When a design is 
connected, any contrast z is festimable, and 
the tbest linear unbiased estimate of z is #= 
Fé. Furthermore, if f; is the eigenvector of 

the matrix C with unit length corresponding to 
an eigenvalue p; and F = È;a;f;, then the var- 
iance of the estimate 7 is given by o7,a?/p;. 
The following properties are equivalent: (i) A 
design is connected. (11) Any treatment con- 
trast is estimable. (iit) The rank of the matrix 
C is v— 1. (iv) The minimum eigenvalue 0 of 
D; “? CD; *? is simple and other eigenvalues 6, 
satisfy 0< 0, <1. (v) The maximum eigenvalue 
1 of D712 ND + N'D] +} is simple and other 
eigenvalues 0, satisfy 0<0, <1. (vi) There 
exists a positive integer p such that each ele- 
ment of (D7 Y? ND, + N’D,1*)? is positive. It 


r 


holds that 0,=1—6,. 


D. Test of a Hypothesis H : č, =... = č, in the 
Fixed-Effects Model 


Conditions (a), (b), and (c) of Section A are 
assumed here. The hypothesis H:é,=...=€, is 
represented by 


Y=Py+W. (7) 


Consider a direct sum decomposition R” = 
L(V) + LEC) + Le(®, Y) + Lé.y of R”, where 
L}(A) and L stand for the torthocomple- 
ments of L(B) with respect to L(A) and R”, 
respectively, and F =(1,1,..., 1 e R”. The 
tprojection operator matrices for the decom- 
posed subspaces L(T), LE(¥), Lå (®, Y), and 
Lg.y are denoted by P,, P}, P}, and P,, respec- 
tively. Then we have 


Pi =n Ens P,=PD,'Y'~n VE 


P,=(1,—PD, ' P)OC*O'(L, — YD; ' Y’), 

P,=1,—P, —P,—P, 

where E, is an a x b matrix whose entries are 

all unity and J, is the n x n identity matrix. 
The analysis of variance for the hypothesis 

(7) in model (4) is given by 

Y'Y=Y'P,¥Y+Y'P,Y+Y'P;¥+Y’P,Y. 


This is called an intrablock analysis. A usual 
test for the hypothesis H is given by a critical 
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region with 


p-”Z 2b Y’P,Y 
~  p=1 YPY 


> constant 


(— 403 Statistical Models). 


E. Optimal Block Design 


A block design is said to be optimal when it 
minimizes the variance of the estimate # of a 
normalized contrast z. Suppose that the num- 
ber v of treatments, the number b of blocks, 
and each block size k,, j=1,...,b, are given. 
Under each of the following criteria the corre- 
sponding block design is optimal in the sense 
indicated: For positive eigenvalues, p,, p2, 

<->, Po-1, Of the matrix C in Section C, (I) 
T1?=} p; is maximal (D-optimality); (IT) min p; is 
maximal (E-optimality); (III) 5?=} p+ is mini- 
mal (or the average variance of the estimates 
of all normalized contrasts of the parameters 
č; is minimal) (A-optimality). 

If py =... =p,- =(n—b)/(v— 1) (=p, say) 
and n,; is either 1 or 0, then the design is opti- 
mal for each of the optimality criteria (T), (11), 
and (III). In this case, we have 


C=p(l,—v 'E,,). 


Such a design is called a variance-balanced 
block design in which every normalized con- 
trast is estimable with the same variance. If 
D'ND; N'—n E, r =pu(l,—n E, r’) (or 
C=(1—yp)(D,—n“'rr’)), every normalized con- 
trast is estimable with the same efficiency 1 — p, 
where r= D,E, and 0< u< 1. Such a design 
is called an efficiency-balanced block design. 
For a block design, any two of the following 
properties imply the third: (i) The design is 
variance-balanced; (ii) the design is efficiency- 
balanced; (iii) the design is equireplicated. 
When all block sizes k; equal some number 
k independent of j, all numbers r; of replica- 
tions equal some number r independent of i, 
and A;r = nn; ; (number of times that both 
treatments i and i’ are applied to the same 
block) equals some number å independent of 
i and 7’, then the design is combinatorially 
balanced. The design is usually called the 
balanced incomplete block design (BIBD) if 
these three conditions are fulfilled and k < 
v. A BIBD is both variance-balanced and 
efficiency-balanced. If all the treatments are 
replicated the same number of times and the 
blocks are of the same size in a block design, 
then the only variance-balanced design is a 
BIBD, provided such a design exists. A BIBD 
is often denoted by BIBD(v, b,r, k, A) and we 
have the relations vr = bk, A(v —1)=r(k—1) 
and v <b, among the parameters; the last 
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relation is called Fisher’s inequality. The design 
is said to be symmetric when v=b. Further- 
more, if v=b is even, then r—A must be a 
perfect square. If v=b ts odd, then the equa- 
tion x? =(r—A)y? +(—1)°"? 2z? must have a 
solution in integers x, y, z not all zero. This is 
called the Bruck-Ryser-Chowla theorem. Neces- 
sary conditions for the existence of a BIBD 
have been obtained. One of these conditions 
is stated in terms of the Hasse-Minkowski p- 
invariant C,(A)=(—1, —1), H, (D; —D;-1) 
where (m, m’), is the *Hilbert norm-residue 
symbol, D, = 1, and D; is the principal minor 
of the n x n matrix A. Another is described in 
terms of the embedding of a quasiresidual 
design into the corresponding symmetric 
BIBD. No effective necessary and sufficient 
condition for the existence of a BIBD has been 
obtained. In general, it is conjectured that for 
a positive integer k, with finitely many excep- 
tions, BIBD (v, b, r, k, Ays exist for all pairs v, A 
of positive integers satisfying v> k, A(v—1)=0 
(mod k—1)and Av(v—1)=0 (mod k(k—1)). 
The known methods of constructing block 
designs are of two main types, direct and re- 
cursive. Recursive methods are a way of con- 
structing designs from smaller ones. Direct 
methods yield easier constructions, but are 
applicable only for special values of the para- 
meters. A direct method of constructing a 
BIBD designates the treatments and blocks, 
respectively, as the points and subspaces of the 
*projective space and the taffine space over a 
‘finite field. To explain another method of 
constructing a BIBD, we let G be an tadditive 
group of order n and x‘), ..., x be m treat- 
ments corresponding to each element x“ of the 
group (i=1,2,...,n). The treatment x“ is said 
to belong to the ath class («=1,2,...,m), and a 
pair (x®, xi) of treatments is called a dif- 
ference of type (x, $, x!) if xX —x = xP (40). 
We can form t blocks of size k 


p? 


By = {x,..,, x®), 


a, stag Doge 
Bem {xg sXe ; 
such that each block B; contains exactly r 
treatments belonging to the ath class («= 
1,...,m) and among all pairs of treatments 
in the same block there are 2 differences of 
each type (a, B, x"). Such a set of t blocks is 
called a difference set. The ¢ blocks in a dif- 
ference set are called initial blocks. Given 
such a difference set, we can obtain nt blocks 
by joining elements of G to the elements of 
each B, (s=1,2,...,t). These nt blocks form 
a BIBD(v= mn, b=nt,r,k=rm/t, A) (— 66 
Combinatorics). 

As a generalization of Fisher’s inequality, 
for an unequal-replicated block design with 
unequal block sizes, b >v—6 holds, where 6 is 
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the multiplicity of the maximum eigenvalue 1 
of the matrix D, "? CD, *?. The equality sign 
holds if and only if the projection operator cor- 
responding to zero eigenvalue of D7 1? CD, "° 
is a zero matrix. 


F. Estimation in a Mixed Model 


Consider a block design (4), where every block 
has the same size k and every treatment has 
the same number r of replications. Let € be a 
fixed effect and y a random effect denoted by 
H, and assume that W satisfies conditions (a) 
and (b) of Section A and that H satisfies con- 
ditions (d), (e), and (f) (where the £, are re- 
placed by the coordinates H; of H). If E(H) =y, 
j=1,...,5, then, changing the notation H — 
E.,,y to H, we can rewrite (4) as 


Y=ly+0€+PH+W, (8) 


with E(H)=0. The normal equation that gives 
the least square estimate of é is 


(C+07(07 +ko?) 1 C,)é 
=Q+0°(o*+ko?) 'Q,, (9) 


where C and Q are the same as in (6) and 
C,=ND,'N'—rvE,,, Q, =(ND OW — 

v`! E, I ^)Y. Equation (9) cannot be solved 
unless the ratio g?:0? is given. When 07:4? is 
not known, substituting in o° + ka? its tun- 
biased estimate given by analysis of variance, 
one obtains a solution of (9) which tends to a 
tconsistent estimate of č as the number of 
blocks tends to infinity. 


G. Estimation in a Random-Effects Model 


Let € (denoted by £) and H in the model (8) be 
random effects. Suppose that =, H, and W are 
mutually independent, and that the distribu- 
tions of =, H, and W are N(0,031,), N(0,671,), 
and N(0,o7I,), respectively. The distribution of 
Y in (8) contains four parameters y, 07, a7, 03. 
When k;<v, the ‘minimal sufficient statistic is 
generally incomplete, and therefore the op- 
timal estimate of y, o”, 07, and a? cannot be 
determined. As an example, the minimal suffi- 
cient statistic for the random-effects model of a 
BIBD (v, b, r, k, A) is 


(£; Y, Y' Pa, Y, YP), Y, Y'P,Y, Y' PLY, Y’ PY), 
where 
Pa =k (r—/) | BTB—kr(r—A) tn E 
P, =k((k-1)r +A) r 

x (T—k-! BTXT ~k TB), 


nn? 


P,=k`B—k~'(r— À)! BTB 
+0A(r—A) nn E 


nn? 


nn? 
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with T=0@’ and B= YY". In this case, 
E(Y’P,Y)=(n—v—b + 1)o?, E(Y’P; Y) =(b— 
v)(a? + ka?). From these equations unbiased 
estimates of o? and a? can be derived, but their 
optimality is not guaranteed (— 396 Statistic). 


H. Factorial Experiments 


Suppose that there are h factors F,,..., Fp 
which affect Y, and each factor F, has s; levels 
(i=1,...,h). It is assumed that v=s, xs. x... X 
s, treatment combinations are derived by all 
the combinations of the levels of h factors, and 
that v treatment effects are represented by the 
sum of subeffects called main effects and inter- 
actions, or factorial effects covering both. Such 
an experiment is called a factorial experi- 
ment or factorial design; it allows us to ex- 
amine the effects of two or more factors, each 
factor being applied at two or more levels, 

by testing all possible treatment combinations 
formed from the factors under study. Specifi- 
cally, suppose that we have the case h=2, 
called a two-way layout. Let main effects be 
denoted by č' =(č1, -s čs) 7 =(P; -e Ča)’ 
and interaction by č"? =(¢if, čiZ, -s Ča) 
where X; č} =0, Lé? =0, 2,65? =2, 657 =0. 
When there is no restriction on the number of 
observations, each of v=s, x s, treatment com- 
binations is replicated t times. Components Yj, 
of the observation vector Y are represented 
by a linear model 


Y= +E] t+ G+ EL? + Wigs 


te Je bense. OK Let 


or, in vector notation, 
Y=Ty+X,é! +X,€74+-X,,€17+ W. 
The analysis of variance in this case 1s given by 


Y'Y=F YPY, 


where P, is the projection operator matrix 
on the subspace derived by a direct sum 
decomposition 


R" = L(V) + LE(X,) + Lr(X2)+ Ly, x, (X12) 
+Ly,,, 


where n=5s,5,t. Denoting by Y. the arith- 
metic mean of Y; over the subscript k and 


using similar notation Y,.., Y and Y..., we 
have Y'P, Y=nY2, Y'P, Y=s,tD(Y,..— Y..)’, 
Y'P,Y=s,tX(¥,.— ¥.), YP Y =t2(¥%+ 
Y...— ¥,.—Y,)*, ¥’PsY= (Yin. — Yj)’. The 
tanalysis of variance table is given in Table 1. 
For h>3 similar models can also be con- 
sidered for dealing with interactions up to h- 
factor. The factorial experiment is said to be 
symmetric ifs, =s,=...=s,=s and ts called 
the s* factorial experiment. Otherwise it is 
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Table 1 Analysis of Variance 


Sum of Degrees of 
Source Squares Freedom 
F, main effects Y’P,Y s,—1 
F, main effects Y’P,Y S,—1 
F, F, interactions Y’P,Y (s, —1)(s,—1) 
Error YP; Y $153(t— 1) 


said to be asymmetric and is called the 
S4 X S2 X... X Sp factorial experiment. 


I. Fractional Factorial Designs 


Factorial designs require at least an experi- 
ment for all the combinations of levels of 
factors under consideration. When there are a 
large number of treatment combinations re- 
sulting from a large number of factors to be 
tested, it will often be beyond the resources of 
the investigator to experiment with all of them. 
For such cases Finney (Ann. Eugen., 12 (1945)) 
proposed a method in which only a fraction of 
the treatment combinations are experimented 
with. A design of this type is called a fractional 
factorial design. One reason fot the usefulness 
of fractional factorial designs in preference to 
factorial designs is that they involve a smaller 
number of treatment combinations, since, in 
most scientific experiments, it is usually found 
that a large number of the higher-order inter- 
actions are negligible. The crucial part of the 
specification of fractional factorial designs is 
the suitable choice of the defining or identity 
relationship. Equating the nonestimable fac- 
torial effects for the selected fraction of treat- 
ment combinations with 7 gives the identity 
relation. Each factorial effect is not estimable 
after selecting a fraction of treatment com- 
binations, and any contrast of the selected 
treatment combinations represents more than 
one factorial effect. All factorial effects repre- 
sented by the same treatment combinations 
are called aliases. In aliases, by assuming that 
other interactions are negligible when com- 
pared to the one of interest, estimation can 
be made by means of the corresponding con- 
trast of the selected treatment combinations. 
When all factorial effects of order higher than 
l are assumed to be zero, a fractional factorial 
design is said to be of resolution 2/+ 1 if it 
satisfies the condition that under the usual 
model all factorial effects up to order / are 
estimable, whereas a fractional factorial design 
is said to be of resolution 2/ if it satisfies the 
condition that under the usual model all fac- 
torial effects up to order /!—1 are estimable. 

In the beginning, the theory was developed 
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for an orthogonal fractional factorial design in 
which the estimates of various effects of inter- 
est are all uncorrelated. However, these are 
available only for special values of the number 
n of treatment combinations and are in general 
uneconomical in the sense that they require a 
large value of n in comparison with the num- 
ber of unknown effects. As a generalization of 
orthogonal fractional factorial designs, I. M. 
Chakravarti (Sankhya, 17 (1956)) introduced 
the concept of balanced fractional factorial 
designs. In these designs, the estimates of 
effects are not always uncorrelated. However, 
the tcovariance matrix of the estimates ts 
invariant under any permutation of factors. 
Balanced fractional factorial designs are flex- 
ible in the number of treatment combinations, 
with the result that more experimental situ- 
ations can be handled. These two kinds of 
fractional factorial designs can be constructed 
by using orthogonal arrays and balanced 
arrays, defined in Section L. All alias relations 
can be derived from the identity relation in 
orthogonal designs. 

In the model (1) for a design, the variance of 
the estimates of estimable linear functions of é 
depends on the matrix X’X which is called the 
information matrix. For eigenvalues of X’X, 
the D-, E-, and A-optimatlities of fractional 
factorial designs can be defined similarly to (I), 
(II), and (IIT) given in Section E. An ortho- 
gonal fractional 2" factorial design is D-, E- 
and A-optimal. A fractional factorial design is 
said to be saturated if the number of treatment 
combinations is equal to that of parameters in 
the model to be estimated. However, since 
saturated designs do not provide an estimate 
of experimental error, their use should, in 
general, be confined to those situations in 
which a prior estimate of experimental error 
is available. The existence of a symmetric 
BIBD(4t— 1, 4t— 1, 2¢~1, 2t—1, t— 1) implies 
that of an orthogonal saturated fractional 
2“! factorial design of resolution III. 


J. Application of Algebras 


In the theory of design of experiments, the 
ideas of association algebra and relationship 
algebra play an important role. Let A; be a 

v xv symmetric matrix with entries 0 or 1 (i= 
0,1, ...,m). Ifa set {A,;|i=0,1,...,m} satisfies 
the conditions 


o=, 3 A;= Ep 
=0 


and there is a nonnegative integer pi for every 
i, j, k such that 


m 
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then the A; are called association matrices. In 
this case, there exists a natural number n; for 
every i such that A;E„,=n;E,„, holds. If an («, f) 
component a, of A; is unity, then the treat- 
ments « and f are said to be the ith associates. 
The talgebra . over the real number field 
generated by the matrices Ay, A,,..., Am iS 
called the association algebra. .o/ is commuta- 
tive and tcompletely reducible. A,> A, =(p},) 
is the tregular representation of .. There is a 
nonsingular matrix U = (u;;) that transforms 
all Z into diagonal matrices simultaneously: 


UPUT! =diag(Zo;, «.., Zm i=0,1,...,m. 


m 


m —1 
A? = ( 2 msz) 3 ujAj, 
f=0 i 


=0 


i=0,1,...,m, 


are mutually orthogonal idempotent elements 
of £. For example, consider the case where 
Ag =]; ® oe A, = (Ess, = L) & Les A, = 
is © (Ess czy L) A, = (Ess, —=1,) & (Ess, — 1) 
with m=3, where ® means the Kronecker 
product. The algebra ./ generated by Ao, A,, 
A, A, is called an F, type association algebra. 
The association matrices Ap, A,, 42, and A, 
correspond to the relationships between the 
treatments in a two-way layout. The mutu- 
ally orthogonal idempotent elements in this 
case are AF =s] "Es s, O s3" Es s AP =U, — 
si Ess) & s2 Ess,» Ay 7 si Bye @ (s, na 
s3 Ess.) and AŤ = (L, —s,'E, ..) & (s, = 
s3 "E, s) For the factorial experiment with h 
factors (h > 3), association matrices can be 
constructed in a similar way. If h=3, the num- 
ber m of association matrices is 7. Many types 
of association schemes are known, and some 
of them for m=2 are classified as group divis- 
ible, triangular, Latin square, cyclic type, and 
so on. For a group divisible type of v=s; s3, 
Ag=1,, Ay =I, ® Ess Ag, Az = Ey, — Ag— 
Ai; A =v Ew, AŽ =v"! {(s,—1) (Ag + At)— 
A}, AŽ =s3' {(s2—1)Ao— Aj}. 

An experimental design consists of a set of 
n experimental units called plots. Define a 
relationship R between the plots as a set of 
ordered pairs (i,j) of plots. A relationship R 
among a set of n plots can be expressed as a 
symmetric n x n matrix (r;) of 0’s and 1’s: 


1 ifthe ith plot is related to the jth 
r= plot by the relationship R, 
0 otherwise, 


and this matrix is also denoted by R. If there 
are k types of relationships R,,..., R, among n 
plots, the algebra # over the real number field 
generated by the matrices R,,..., R,, is called 
the relationship algebra of the design. 2 is a 
tsemisimple algebra. 

Example (1). The relationship algebra & of 
the factorial experiment with h=2 and t repli- 
cations is generated by the following matrices 
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over the real number field: B, =A, © I, B= 
Ao Q (Ey —1,), Bs = Ay Q Ey, By= Az ® En, 
B; = A, Q E,,, where A;, i=0, 1, 2, 3, are the 
association matrices of F, type. The mutually 
orthogonal idempotents that correspond to 
the two-sided ideal decomposition of 2 are 
BÝ = AŽ Ot “Ey, BŽ = Ař Ot 'E„n, BÝ = 
A¥ @t'E,,, BŽ = AŽ Qt E„, and BÝ = 
Ao ®(I,—t7'E,,). These are the same as the 
projection P, in the two-way layout design 
given in Section H. 

Example (2). Consider a block design with v 
treatments, each having the same number r of 
replications, and with b blocks, each having 
the same size k (<v). Suppose that association 
matrices A;,i=0, 1,...,m, are given, by which 
the associations among the treatments are 
defined. Let Ai, be the number of blocks to 
which the ith associate treatments a and f are 
applied. The design is called the partially bal- 
anced incomplete block design (PBIBD) if 4:,, 
=4,>0 independently of « and p. When m= 1, 
the design is a BIBD. Let the observation 
vector be represented by (4). The relationship 
algebra 2 of a PBIBD is generated by n x n 
matrices I, Enn, B= YY’, T =@A;,®’, i= 1, 
2,...,m, where NN' =E oA A= Diop; A}, 
Pi= Xio å;Zy (O< p; <rk; i=0,1,...,m), and po 
=rk= E" on; Write T,* = ®A# ®’. Accord- 
ing as p;=rk, 0< p;<rk, or p;=0, L(T,*) is said 
to be confounded with the blocks, partially 
confounded with the blocks, or orthogonal to 
the blocks. 2 is noncommutative, completely 
reducible, and isomorphic to the algebra of 
matrices of the type shown in Fig. 1. 


h 0 
* 
** 
** 
x 
** 
E 
9 * 
Fig. 1 


The analysis of variance of a PBIBD is 
given by a decomposition of I, into mutually 
orthogonal idempotent elements of 2. For a 
PBIBD, the matrix C in (6) is of the form C= 
Eg AŽ, where t,=r—k +p; For a connected 
PBIBD, all A¥ č, i=1, 2, ...,m, are estimable. 
If there exists a group divisible PBIBD(m = 2) 
with 2, =A, +1, then it is E- and A-optimal. 


K. Design for Two-Way Elimination of 
Heterogeneity 


Consider a design with v treatments in a u x w 
rectangular block. The row effect and the 
column effect of this block are denoted by y 
and v, respectively. Thus the observation vec- 
tor Y is of the form 


Y= +é + Py+lly+wW, (10) 
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where the definitions of F, ®, ¥ and IT are 
similar to those for block designs. Set L = 0'TT, 
M=90'Y, D,=0'O, F=D.—w 'LL'’—u-'MM’ 
+(uw) 'LE,,,L’. The matrix F plays a role 
similar to that of the matrix C in (6). When the 
rank of F equals v— 1, the design is said to be 
connected. If 


F=t(I,—v 'E,,), (11) 


then the design satisfies the optimum criteria 
(I), (II), and (ITI) given in Section E. When u = 
w=v and (11) holds, the design is called the 
Latin square. When u =v and (11) holds, the 
design is called the Youden square. When u>v 
and (11) holds, the design is called the Shrik- 
hande square. In a Youden square design treat- 
ments are grouped into replications in two 
different ways, i.e., rows and columns, where 
rows constitute a BIBD, whereas columns are 
complete blocks. The existence of a Youden 
square design is equivalent to that of a sym- 
metric BIBD. 

If the associations among v treatments are 
defined in terms of association matrices, the 
partially balanced design for two-way elimi- 
nation of heterogeneity can be defined in a way 
similar to the PBIBD. In this case, the equa- 
tion F=),1,A? holds, and if «={I,,E,,—I1,}, 
then (11) holds, hence the optimum criteria 
are fulfilled. The definition of the relationship 
algebra 2 of a partially balanced design for 
two-way elimination of heterogeneity is similar 
to that used for the PBIBD. & is isomorphic 
to the algebra of matrices shown in Fig. 2. 


Fig. 2 


The analysis of variance of this design is 
given by a decomposition of I, into mutually 
orthogonal idempotent elements of 2. 


L. Balanced Array and Orthogonal Array 


Suppose that T is an n x h matrix with entries 
from a set A of s (> 2) distinct elements. Con- 
sider the s‘ 1 x t matrices X =(X,,X,...,X,) 
that can be formed by giving different values 
to the x;(€ A), i=1, 2, ..., t. Suppose that asso- 
ciated with each 1 x t matrix X there is a non- 
negative integer A(x,,X ,.-.,X,) which is invar- 
iant under any permutation of a given set 
{x1,X2,-.-,X;}. If, for every t-columned sub- 
matrix of T, the s' 1 x t matrices X occur as 
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rows A(x,,X>,.--,X,) times, then the matrix T 
is called the balanced array (BA) of size n, h 
constraints, s levels, and strength t having the 
index set {A(x,,...,X,)}. Such an array is de- 
noted by BA(n, h, s, t); n is also called the num- 
ber of treatment combinations. In particular, 
when A(x,,...,X,)=A for every x,,...,X,, such 
an array is called an orthogonal array (OA) of 
size n, h constraints, s levels, strength t, and 
index 4, which is denoted by OA (n, h, s, £). 
We have n= ås*. Balanced arrays have the 
advantage that they can be constructed 
with fewer treatment combinations than 
the orthogonal arrays for given h, s, t para- 
meters. The transpose of the incidence ma- 
trix of a BIBD(v, b, r, k, À) is, for A={0,1}, a 
BA (b, v, 2,2) with 4(0,0)=b— 2r + J, A(1, 0)= 
A(0, t)=r—A, and A(t, 1)=2. There is a close 
relation between the existence of an OA and 
that of a BIBD. For example, the existence 
of an OA(s”, h, s, 2) is equivalent to that of 
h—2 mutually forthogonal Latin squares of 
order s that is for h=s+ 1 equivalent to the 
existence of a BIBD(s’, s(s+ 1),s+ 1,s, 1). The 
existence of an OA(4t, 4t — 1, 2, 2) is equivalent 
to that of a symmetric BIBD(4t— 1, 4r—1, 
2t—1,2t—1,t—1) which is also equivalent to 
the existence of an OA (8t, 4t, 2, 3). 

Balanced arrays and orthogonal arrays play 
a vital role in the construction of symmetric 
and asymmetric confounded factorial experi- 
ments and fractional factorial designs. In an 
OA(n, h, s, t), regarding columns and entries as 
h factors and levels of a factor of the column, 
respectively, each row corresponds to a treat- 
ment combination. In this case, this ortho- 
gonal array gives an orthogonal fractional s‘ 
factorial design of resolution t+ 1 with n treat- 
ment combinations. A necessary and sufficient 
condition for a fractional s* factorial design 
of resolution 2/+ 1 to be orthogonal (resp. 
balanced) is that the design be an orthogonal 
(resp. balanced) array of strength 2/, provided 
that the information matrix of the design is 
nonsingular. 


M. Response Surface 


If all the factors represent quantitative vari- 
ables, such as time, temperature, amount of 
ingredients, etc., it is natural to think of the 
yield or response Y of the experimental results 
as a continuous function of the levels of these 
factors. We can write a functional relationship 


Y, = f (Xia X20 kee Wo  &«=1,2,...,n, 


where Y, represents the «th observation in the 
designed experiment and x;, represents the 
level of the ith factor in the ath observation. 
The function f or the surface defined in (h + 1)- 
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dimensional *Euclidean space by f is called the 
response surface. The residual W, measures the 
experimental error of the ath observation. A 
knowledge of the function f gives a complete 
summary of experimental results and also 
enables us to predict the response for values of 
the x,, that were not tested in the experiment, 
or to determine such combinations of the 
values of variables x;, that give the max- 

imum (or minimum) value of f. When the 
mathematical form of f is not known, the 
function can sometimes be approximated 
satisfactorily within the experimental region, 
which is relatively small, by flexible graduating 
functions, such as polynomials of degrees 1 
and 2 in the variables x;: 


h 
Y= Bot F BiXia +t Wo 
i=1 


h h h 
Y,=Bot >, BiXiat ¥. Bit, t X BijXiaXja t Wa 
izl i=1 i<j 


The coefficients fo, B,,... are parameters to be 
estimated from the data. Thus, (1) when the 
form of the true f is assumed known, the ob- 
ject is to estimate the parameters; (ii) when the 
form of the true f is unknown, the object is to 
approximate the f by some graduating func- 
tion. Designs appropriate for (i) and (it) are 
called designs for estimating parameters and 
designs for exploring a response surface, respec- 
tively. Some experimental designs that have 
been developed for fitting polynomials of the 
first and second degrees are called first-order 
designs and second-order designs, respectively. 
The problem here is to increase the precision 
of fitting response surfaces by appropriately 
choosing n points in a given experimental 
region S of variable x. That is, how do we find 
the experimental region that interests us in the 
sense of finding optimum conditions and, 
having found it, how do we design experiments 
to map f over the region? As optimum criteria 
of allocations, corresponding to (I) and (II) 
given in Section E, consider the following: 
(IV) the *generalized variance of the estimates 
of coefficients is minimal; (V) the supremum of 
the variance of estimates of the expectation 
E(Y) of response Y is minimal for all variables 
xes. For example, when an observation Y is 
given by a polynomial regression of a variable 
x as Y,= Po + By Xa t baxa +... + Bixg t+ W,, if 
for the Legendre polynomial P,(x) we allocate 
an experiment replicated at x= +1 and h— 1 
roots of P;(x)=0 the same number of times, 
then it is an optimum design in the sense of 
(IV) and (V). 

As a design for fitting first-order response 
planes, we can use a 2" factorial design (or its 
fraction) of resolution HI, since it is sufficient 
to consider only linear effects for each of h 
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factors. In comparing first-order designs, we 
take as an optimum criterion the minimum of 
the average variance of the estimates of the 
coefficients, corresponding to (III) of Section 
E. For a design for fitting second-order re- 
sponse surfaces, we need at least three levels 
of each factor for estimating all effects up to 
quadratic ones; then we can use a 3" factorial 
design (or its fraction) of resolution V for esti- 
mating the linear x linear component of inter- 
actions of two factors. In this case, treatment 
combinations are too large in number, so we 
can adopt central composite designs, which are 
constructed by adding further treatment com- 
binations to those obtained from a 2" factorial 
design (or its fraction). If the coded levels of 
each factor are —1 and +1 in the 2" factorial 
design, the (2h + 1) additional combinations 
are (0,0, ...,0), (£4,0,...,0), (0, +4,0,...,0), 
...,(0,...,0, +d). The total number of treat- 
ment combinations to be tested is 2"+ 2h+ 1 
(< 3" for h>3). The value of d can be chosen 
to make the coefficients in the quadratic poly- 
nomials as orthogonal as possible to each 
other or to minimize the bias that is created 

if the true form of the response surface is not 
quadratic or to give the design with the prop- 
erty of being rotatable. If Y, is the estimated 
response at (X9,X209,---,Xp0)ES, the response 
surface design is said to be rotatable if and 
only if the variance of Y, is a function only 

of the distance p=(x79 +x39 +... +p)!” of 
(Xios X20; ---, Xpo) from the origin, so that the 
variance contours in the experimental region 
of the variables are circles, spheres, or hyper- 
spheres centered at the origin. Now, when Y= 
Bo + Pixi + BaX2+ Br 1X7 + B22x3 + 2Bi 2X1 X2 + 
W with |x,|<1 for i=1, 2, an optimum experi- 
ment in the sense of (V) is given by allocating 
0.0960 at the origin, 0.0802 at experimental 
points (1,0), (—1, 0), (0, 1), (0, —1), and 0.1458 
at experimental points (1, 1), (1, —1), (—1, 1), 
(—1, —1), respectively. However, this design is 
not rotatable. Instead of an optimum criterion 
(V), we can consider a region § which is not 
always equal to the original experimental 
region S and a criterion: (V’) The supremum 
of the variance of estimatesof the expectation 
E(Y) of response Y is mintmal for all variables 
xeS. Within x? +x3 <c?, an optimum alloca- 
tion depends on the value of c, and it is shown 
that a rotatable design is obtained only if c 

is in a certain range. 

There are many topics in the theory of ex- 
perimental design besides the ones mentioned 
in this article (— [4] for multiple comparison, 
[9] for confounding designs that are factorial 
experiments in which the block size is reduced 
and in which for each block a fractional of all 
the treatment combinations are tested, [10] for 
split-plot designs in which certain main effects 
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are confounded with blocks, [15] for weighing 
designs). 
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A. Definition 


The determinant of an n x n tmatrix A = (aip) 
in a tcommutative ring R is defined to be the 
following element of R: 


$ (sgn P)ai p, Arp, -+ np,» 


where 


( Os ah a 
P= 
Pı P2 > Dn 


is a permutation of the numbers 1, 2,...,n, 
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sgn P denotes the sign of the permutation P 
(that is, sgn P=1 if P is an even permutation 
and sgn P= —1 if P is an odd permutation), 
and the summation extends over all n! per- 
mutations of 1,2,...,n. The determinant of A 
is denoted by 


dii 442 din 
Gz, a22 An 
dni an2 Ann 


It is written |a] or |A| and is also denoted by 
det A. Usually we suppose that R is the field R 
of real numbers or the field C of complex 
numbers, but the following theorems are also 
valid for cases in which R is any commutative 
ring, unless otherwise stated. 


B. Relation to Exterior Algebras 


Consider an texterior algebra (‘Grassmann 
algebra) of a linear space (*free module) of 
dimension n over R with a basis (e4, €23, ..., €p} 
Set 


+ n 
e= di1 +a 2+... + Ay, ey, 


where a,,¢ R. Then we have e1 Aez A... Ae) 
=|a,,\e;Ae A... ^e, Conversely, we can 
define the determinant |a,,| by this relation. 
The properties of determinants can be easily 
deduced from those of exterior algebras. 


C. Fundamental Properties of Determinants 


(1) The determinant of the *transposed matrix 
‘4 of a matrix A is equal to the determinant of 
A. Hence the theorems stated for rows are also 
valid for columns. 

(2) If the elements of one row (column) ofa 
matrix are multiplied by a factor c, the deter- 
minant of the matrix is also multiplied by c. If 
the elements of one row (column) of a matrix 
are zero, its determinant is equal to zero. 

(3) If from a matrix A =(a;,), we obtain two 
matrices A’ and A” by replacing one row, for 
instance the ith row, by aj,,-..., aj, and by 
Gj, +}, -.-,Gin + ain, respectively, then |A”|= 
|A|+]A’|. This relation is equally valid for a 
column. 

(4) If we obtain Ag by a permutation Q on 
the rows of a matrix A, then |Ag|=(sgnQ)|Al. 
In particular, tf two rows (columns) of a ma- 
trix are interchanged, then the determinant 
changes sign. 

(5) The determinant of a matrix is zero if 
two rows (or columns) are identical. 

(6) The determinant of a matrix is not 
changed if the elements of any row (column), 
each multiplied by the same factor, are added 
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to the corresponding elements of another row 
(column). 

(7) Suppose that R has unity element. Let x;, 
(i,k=1,...,n) be n? variables in R, and denote 
a function (having its values in R) of these 
variables by p(X), X =(x,,). Assume that (X) 
has the following properties: (i) if the elements 
of one row of X are multiplied by a factor 4, 
the value of ¢ is also multiplied by 4; (ii) if we 
obtain two matrices X’ and X” by replacing 
one row of X, for instance the ith row, by 
Xita esos Xin AMG by Xj +X; -+43 Xin + Xin, TESPEC- 
tively, then ọ(X")= @(X)4+ o(X’); and (iii) if 
two rows of X are equal, p(X)=0. Then (X) 
=c|X| for some constant c (in R). 

(8) Suppose now that R is a field K. Assume 
that a function (X) (in K) has the following 
properties: (i) if the elements of one row in 
X are multiplied by 4, the value of ¢ is also 
multiplied by /; and (ii) the value of ọ is not 
changed if the elements of any row are added 
to the corresponding elements of another row. 
Then ~(X)=c|X| for some constant c (in K). 


D. The Laplace Expansion Theorem 


Let A =(a,,) be an n x n matrix. Take r-tuples 
(ii-i) and (k,,...,k,), where i, and k, be- 
long to {1,...,n} andi, <...<i,,k,<...<k,. 
Let (i,,,,..-54,) and (k,,,,...,,) be (n—r)- 
tuples such that i,,,<...<i,,k,4;<...<k, and 
Abpea dey pe aera bay = Wis Basse Mya E A 
={1,...,m}. Let ag. ij...) be the deter- 
minant of an r xr matrix whose (p, q)- 
component is the (i,,, k,)-component of A 

for each p and q. We call this determinant 

a minor of degree r of the matrix A. (The 
corresponding submatrix of A is sometimes 
also called a minor of A.) In particular, if 
(i,,.--,1,)=(k,,...,,), then it is called a princi- 
pal minor. Furthermore, we define the cofactor 
of the minor 44. i k) OF A to be 


pA —{—_ tu 
iip penne (T D a ikp ok)? 


where A=i,+...4+i, and w=k,+...4+k,. In 
the particular case r= 1, the cofactor of a,, is 
Gi, =(—1)'**A,,, where A; is the determinant 
of the (n— 1) x (n— 1) matrix obtained from A 
by eliminating its ith row and kth column. For 
simplicity, we abbreviate (i,,...,i,), (ky, -<3 Kp) 
and (j,,..-,j,) as (i), (k), and (j), respectively. 
Then we have 


Taoa -fA HO=O 
a HG) 0) (i) 0 if (i) €(K), 
Saud ofA O= 
T OTC 9 bed O01 08) 0 if (i) A(k), 


where Ł, means that the sum is taken over all 
combinations (j). This is called the Laplace 
expansion theorem. If a matrix A has the form 
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B 0 Bx 
A= or A= i 


and B and C are square matrices, then by this 
theorem we have |A| =]|B] |C]. If we number 
the combinations (i) =(i,, ...,i,) and (k) = 
(k,,-...,,) appropriately (for instance, in lexi- 
cographical order) and regard the numbers 
assigned to them as row numbers and column 
numbers, respectively, to form a matrix (dig) 
then the Laplace theorem can be expressed as 


(Aya (@agiy) T (Gain Miya) 


|A| 0 


0 IAL 
In the particular case r=1, we have 


IA} ifi=k, 
O ifi<k, 


n 
È ajăyj= 
jl 


3 fs |A| ifi=k, 
d d: = 
E FF LO ifižk. 


E. Product of Determinants 


Let A =(a;,) and B=(b,,) be two n x n matrices. 
For the product AB=C =(c,,), where c = 
DF =1 jb, (i,k =1,...,), we have |AB|= 

|A| |B|. The tinverse matrix A~! exists for an 
nxn matrix A = (a) if and only if |A| 40, and 
then A~!=(b,,) with elements b; = 4;,;/| Al. 
Moreover, we have |A '|=|A|~'. (In the case 
where the elements a; are in the commutative 
ring R with unity element, A~! exists if and 
only if |A| is a tregular element of R.) 


F. Theorems on Determinants 


(1) Let @,, be the cofactor of a; in the deter- 
minant of ann x n matrix A=(a;,,). Then the 
determinant |d;,| is equal to |A|"_'. In general, 
GD 
CREET seek l= IAL A 
7 n 
CORRET E T i EA . 
(2) The determinant of a submatrix of the 
matrix (ã), composed of the i, th, ...,i,th rows 
and k,th,...,k,th columns of (d;,) is equal to 


|A Pay, pees dp) (Key seers kp) 


EN: 


Let A 
ODE e. 


Jt the determinant 


of the (n—r) x (n—r) matrix obtained from an 
nxn matrix A by eliminating its i,th,...,i,th 
rows and k,th,...,k,th columns. Then 


watea JA 


i<j, k<l. 
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(4) Sylvester’s theorem. Let b, (i,k =1,..., 
n—r) denote the Minor 4). r rtd.. r th) 
ofan n x n matrix A=(a,). Then 


G1, © Gy, 
|bi,|=|A| 


a ux ad 


(5) Let A be an n x m matrix and Banmxn 
matrix. Then AB is an n x n matrix. If n>m, 
then |AB| =0. If n <m, let (i1)=(i,,...,i,) (i <.-.. 
<i,) be a combination of 1, 2,..., m, taken n at 
a time. Let A, be the n x n matrix composed 
of the i, th, ..., i,th columns of A, and B,, the 
nxn matrix composed of the i,th,..., i th rows 
of B. Then |AB|=2 {Aw | Bal, where the 
summation extends over all possible combi- 
nations (i). 

(6) Determinant of a t*Kronecker product. If 
Ais an m xm matrix and B is ann x n matrix, 
then |A © B|=|A|"|B|”. 

(7) Let H be ann x n *tHermitian matrix, and 
let H, denote the matrix composed of its first k 
rows and columns. Then H is positive definite 
if and only if |H,|>0 for all k=1,...,n. 


G. Special Determinants 


(1) A determinant of the form 


| 1 
1 X2 Xn 
2 2 2 
n=l n-1 n-t 
x} X2 Xn 


is called a Vandermonde determinant. It is 
equal to the *simplest alternating function 
IT. (x; =X): 


(2) A cyclic determinant is one of the form 


Xo Xi X2 tt Xai 
Xn-1 Xo Xi cu Xa-2 


n-1 
=|] Worl eee ag oh ei) 
i=0 


where ¢ is a *primitive nth root of unity. 

(3) Consider the vectors %;=(4;,,4;2,---, Gin) 
(i=1,2,...,n), and let («;,«;) denote the tinner 
product of a; and a,. Then the following deter- 
minant is called the Gramian of these vectors: 


(%1.%1) (01, %) (1, Xn) 
(22,01) (%2,%) (22, %,) 
(Xn X) (Ans %2) (Ens Xn) 
Qi, y2 Uo yy |? 
hezi G27 a 


ni An2 nog 


nn 
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(4) For an talternating matrix (namely, a 
square matrix X such that 'X = — X), we have 
the identity 


0 X12 Xiz °° Xin 
— X2 0 X23 °° Xan 
—=Xi3 — X23 0 vt Xan 
— Xin Xon —X3n 0 
_ SPC Xp)? ifn is even, 
0 if n is odd, 


where P,(...,X;j,...) is a polynomial of vari- 
ables x,,, which (equipped with appropriate 
sign) is called the Pfaffian of these variables. 
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A. General Remarks 


Let y be a function of a real variable x defined 
on an interval J and let Ax be a fixed quantity. 
When two points x and x + Ax are in I, we 
define the difference Ay(x) of y at x by Ay(x)= 
y(x + Ax) — y(x) and the difference quotient 

by Ay(x)/Ax; Ax is called the difference of x. 
Without loss of generality, we can take Ax = 
1, for otherwise there is a constant b such 

that Ax’=1 for the new independent vari- 
able bx =x’. If Ax=1, the second difference 
A’y(x)= A(Ay(x)) is given by 


A? y(x)= Ay(x + 1)—Ay(x) 
= y(x + 2)—2y(x + t)+ y(x). 
Similarly, the difference of the nth order is 


defined by A"y(x)=A(A" "y(x)), and 


Ary) = Y _1yr-k (i) VED. 
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Conversely, y(x +n) is expressed by differences 
as 


yortn)= Y (o) Aty(x) 


k=0 


(— 223 Interpolation). 


B. Summation 


Given a function g(x) and Ax, a function y(x) 
that satisfies Ay(x)/Ax = g(x) is called a sum 
of g(x). Summation of g(x) is to find a sum of 
g(x). Given a sum y(x) of g(x), an indefinite 
sum of g(x), written as Sg(x)Ax, is given by 
Sq(x)Ax = y(x)+ c(x), where c(x) is an arbitrary 
periodic function of period Ax. In many cases, 
c(x), which corresponds to an arbitrary con- 
stant in an indefinite integral, is omitted. For 
example, a sum of g(x)=nx" t! for Ax=1 is the 
nth-order tBernoull: polynomial B,,(x) for 
n#0; a sum of x`! is W(x), given by w(x)= 
dlogI(x)/dx (— 174 Gamma Function). 

When the series —Ax £go g(x + kAx) or 
Ax È, g(x — kAx) converges, both can be 
sums of g(x). Since the requirement of conver- 
gence for these series was found to be too 
strict, the following requirement was given by 
N. E. Nörlund instead: Let x be a real variable 
and g(x) be continuous for x >b. Define A(x) 
by A(x) = x?(log x)? (p> 1,q 20). Then if for a 
positive y 


oe 


F(x, Ax, n=| g(zje dz 


a 


~Ax 2 g(x + kAx)e etka) 
k=0 


is convergent for a >b, F satisfies AF (x, Ax, 
n)/Ax =g(x)exp(—/(x)). Accordingly, if 
F(x, Ax, n) approaches a limit F(x) as y-0, 
F(x) is a solution of AF(x)/Ax = g(x). F(x) 

is called the principal solution of AF(x)/Ax = 
g(x). 


C. Difference Equations 


Let Ax = 1. An equation F(x, y(x), Ay(x), 
...,A”y(x)) =0 in x and differences of an un- 
known function y(x) is called a difference 
equation. If the substitution y= ¢(x) satisfies 
the equation for x in some interval, (x) is a 
solution of the equation. Because of the rela- 
tion between y(x), y(x+ 1), ..., y(x +n) and the 
differences of y at x, we can transform the 
given difference equation in the form G(x, y(x), 
y(x + 1),...,¥(x+n))=9. This form appears 
more often in applications and is called the 
standard form of a difference equation. 
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If the equation is linear in y(x), y(x + 1), 
.., ¥(x+n), namely, if it is given by 


P(x) y(x +1) = q(x), 


ims 


i 


the difference equation is said to be linear. 
When q(x)=0, it is homogeneous; otherwise, it 
is inhomogeneous (or nonhomogeneous). 


D. Linear Difference Equations 


Assume that po(x), ..., p,{x) are single-valued 
analytic functions without poles and common 
zeros in some domain. Consider the linear 
difference equation 


2 pAdy(x-+i)=0. (1) 


If p(X), P(x), ..., n(x) are solutions of (1), 
then a linear combination a,(x)@,(x)+ 
az(x)p(x)+...+4,,(x)@,,(x) with arbitrary 
periodic functions a, (x), a,(X), -..,@,{x) of 
period 1 is also a solution of (1). 

Let £i, B,,... be singular points of p,(x), 
P2(x), ..-,D,(X), 41, %2,... be the zeros of po(x), 
and y,, y2,.-. be the zeros of p,(x-+n). Then the 
set of singular points of the linear difference 
equation (1) is the set {a;, B;, y;}. 

A function 9g,,(x) is said to be linearly de- 
pendent on the functions (x), @2(x),..., 
Qm- (x) with respect to the difference equa- 
tion (1) if (x) =a, (x) — 1(x) +a (x)@2(x) + 
-+ am-1(X)Pm-1 (X), Where a, (x), a2(X),..., 
Qm— (x) are functions of period 1, every one 
of which takes a nonzero finite value at least 
at one point not congruent (mod Z) to any of 
the singular points, where Z is the additive 
group of integers. 

A set of m functions is called linearly inde- 
pendent if none of the functions is dependent 
on the other m— 1 functions. When a set of n 
solutions of equation (1) is linearly indepen- 
dent, it is a fundamental system for (1). Any 
solution of (1) can be expressed as a linear 
combination of n solutions of a fundamental 
system. 

The determinant 


- PAX) 


p(x) . 
. OAX+ 1) 


p2(x+ 1) 


p(x) 
pı(x+1) 


go (x+n—1) pa(x+n-1) ... o,(xt+n—-1) 


formed from n functions @,(x), @2(x), -.., @,(x) 
is called Casorati’s determinant and is denoted 
by D(~, (x), p2(x), ..., @,(x)). A necessary and 
sufficient condition for a given set of n func- 
tions to be independent is that Casorati’s 
determinant be nonzero at every point except 
those which are congruent to singular points 
of (1). Casorati’s determinant is used to deter- 
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mine whether a given set of solutions is 
fundamental. 

Let y(x) be a solution of a nonhomogeneous 
linear difference equation 


Pe(y)= X pyi =a. 2) 


If @,(x), @2(x), ..., @,(x) are n linearly inde- 
pendent solutions of (1), then an arbitrary 
solution of (2) is given by 


Y= 4, (x) Q1 (x) + 42(x)Po(x) +... 
+ y(X)Pu(x) + W(X), 


where a(x), ...,a,(x) are arbitrary periodic 
functions of period 1. Then the expression for 
y is called a general solution of (2). If we ab- 
breviate Casorati’s determinant of a funda- 
mental system of solutions @,(x), @2(x),..., 
E(x) of (1) by D(x) and write p(x) as the quo- 
tient of the cofactor of g(x +n) of D(x + 1) by 
D(x + 1), we have 


Ws) = È SONA 


assuming that the summation S on the right- 
hand side 1s known. This is the analog of 
Lagrange’s tmethod of variation of constants 
in the theory of linear ordinary differential 
equations. 


E. Linear Difference Equations with Constant 
Coefficients 


If all the coefficients in 


piy(x+i)=0, Po #0, Pa #9, (3) 


im» 


i 
are constants, n linearly independent solutions 
are obtained easily. Indeed, if A is a root of the 
algebraic equation Ečo p;4'=0, 4* is a solu- 
tion of (3). This algebraic equation is called 
the characteristic equation of (3). If it has n 
distinct roots A4, A, -..; An, then AÑ, A3, ..., 4% 
are n linearly independent solutions. In gen- 
eral, if å is an m-tuple root of the characteristic 
equation, then 4*, xå*, ...,x™714* are solu- 
tions of (3). Accordingly, if A; is a root of multi- 
plicity m; (Xj-;mj=n,j=1,2,...,s), then 47, 
xA¥,..., x" A (j= 1, ...,s) constitute a set of 
n linearly independent solutions. 

Even if all the p; are real, the characteristic 
equation may have complex roots. In such a 
case real solutions are obtained as follows: 
When å = u+ iv is a root of multiplicity m, 7 = 
u—iv is also a root of the same multiplicity. 

If we write p=,/p?+v’, tan o = v/m, then 

p* COS OX, p* SIN MX, XP* COS PX, XP* SiN MX, ..., 
x™ 1o* cos px, x" 'p* sin px are 2m indepen- 
dent real solutions. 

Nonhomogeneous equations with con- 
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stant coefficients can be generally solved by 
Lagrange’s method with these solutions. How- 
ever, when the nonhomogeneous term has a 
special form such as 


È pint +i) =p 


where p(x) is a polynomial in x and å is a root 
of multiplicity m of the characteristic equation, 
we can use the method of undetermined coeffi- 
cients. In this particular case, the substitution 
(Aot A,xt+... + A,x")x™1* with undetermined 
coefficients Ag, A,,..., A; gives solutions, k 
being the degree of p(x). 


F. Difference and Differential Equations 


The differential operator d/dx acts on the 
family of functions {x”|m=0, +1,...} accord- 
ing to dx"/dx =mx""", just as the difference 
operator A acts on the family {x =I (x+ 
1)/T'(x—m-+ 1)|m=0, +1,...} according to 
Ax =mx'"~)). Hence by using the factorial 
series > a,,x'” and its similarity to the power 
series £ a„x”, we may obtain some analogies 
with the theory of differential equations. For 
example, the Frobenius method in the theory 
of tregular singular points can be applied to 
the system of difference equations 


(-1)A m= 3 ay (2)w((2), 


k=1,2,...,n. 


However, there are certain essential differences 
between functions defined as solutions of 
differential and difference equations. For ex- 
ample, Hölder’s theorem states that no solu- 
tion of the simple difference equation y(x+ 1) 
—y(x)=x ! satisfies any talgebraic differential 
equation. Consequently, the gamma function, 
which is related to a solution of the equation 
W(x) =dlogI(x)/dx, cannot be a solution of 
any algebraic differential equation. For the 
numerical solution of ordinary differential 
equations by difference equation approxima- 
tion — 303 Numerical Solution of Ordinary 
Differential Equations. 


G. Geometric Difference Equations 


For an arbitrary complex number q, an equa- 
tion of the form y(qx)= f(x, y(x)) is called a 
geometric difference equation. For example, the 
ordinary difference equation (1) can be trans- 
formed into 


p,(z)U (2q") = B(z) (1’) 


= 
iMs 


by the change of variable z=q*. Although it is 
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possible to transform an equation of the type 
(1°) into that of the type (1), there are theories 
developed specifically for the type (1’), since 
the coefficients of the equation may become 
more complicated by such a transformation. 
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A. General Remarks 


The rudimentary concept of n-dimensional 
manifolds can already be seen in J. Lagrange’s 
dynamics. In the middle of the 19th century n- 
dimensional tEuclidean space was known as a 
continuum of n real parameters (A. Cayley; H. 
Grassmann, 1844, 1861; L. Schlafli, 1852). The 
notion of general n-dimensional manifolds was 
introduced by B. Riemann as a result of his 
differential geometric observations (1854). He 
considered an n-dimensional manifold to be a 
set formed by a 1-parameter family of (n—1)- 
dimensional manifolds, just as a surface is 
formed by the motion of a curve. Analytical 
studies of topological structures of manifolds 
and their local properties were initiated and 
developed by Riemann, E. Betti, H. Poincaré, 
and others. To avoid the difficulties and dis- 
advantages of analytical methods, Poincaré 
restricted his consideration to those topo- 
logical spaces X that are tconnected, ttri- 
angulable, and such that each point of X 


383 


has a neighborhood homeomorphic to an n- 
dimensional Euclidean space. We often refer to 
such spaces as Poincaré manifolds; Poincaré 
called them n-dimensional manifolds. In 1936, 
H. Whitney published a monumental paper 
[14] on differentiable manifolds in which the 
various fundamental concepts on differentiable 
manifolds were established. This and sub- 
sequent papers written by Whitney during 
nearly twenty years greatly influenced the 
rapid advance of the theory of differentiable 
manifolds since 1950. 


B. Topological Manifolds 


An n-dimensional topological manifold M is by 
definition a tHausdorff space in which each 
point p has a neighborhood U(p) homeo- 
morphic to an open set of R”. 

Let M’ be a Hausdorff space in which 
each point p of M’ has a neighborhood U(p) 
homeomorphic to an open set of H”, where H” 
is the half-space {(x,,x ,...,xX,)E€R"|x, >0}. 
Let 6M’ denote the set consisting of points p 
of M’ such that p corresponds to a point of 
Hg = {(X,,---,X,)€ H" |x =0} c H” under the 
homeomorphism from U(p) to an open set of 
H”. M' is called an n-dimensional topological 
manifold with boundary if 0M’ 4 Ø, and 0M’ is 
called the boundary of M’. On the other hand, 
M defined as above or M’ with ôM’ = @ is 
called an n-dimensional topological manifold 
without boundary. The interior of M’ is the 
complement Mo = M'— 0M’ of the boundary. 
The boundary of an n-dimensional topological 
manifold is an (n— 1)-dimensional topological 
manifold. A topological manifold without 
boundary is called closed or open according as 
it is compact or has no connected component 
which is compact. There exist connected topo- 
logical manifolds that are not tparacompact; 
among them, the 1-dimensional ones are called 
long lines. A connected paracompact topolog- 
ical manifold M has a tcountable open base 
and is tmetrizable. 


C. Local Coordinates 


Let M be an n-dimensional! topological mani- 
fold. A pair (U, y) consisting of an open set U 
of M and a homeomorphism w of U onto an 
open set of R” is called a coordinate neighbor- 
hood of M. If we denote by (x! (p), ...,x"(p)) 
(pe U) the coordinates of the point w(p) of R”, 
then x!, x’,...,x" are real-valued continuous 
functions defined on U. We call these n func- 
tions the local coordinate system in the coordi- 
nate neighborhood (U, w) and the n real num- 
bers x'(p),...,x"(p) the local coordinates of the 
point pe U (with respect to (U, w)). 
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A set S= { (Up, Wa) }ac4 of coordinate neigh- 
borhoods is called an atlas of M if {U,} 
forms an topen covering of M. 


acA 


D. Differentiable Manifolds 


Let S= { (Uz, Ya) }aca be an atlas of an n- 
dimensional topological manifold M. For each 
pair of coordinate neighborhoods (U,, w,) and 
(Us. We) in S such that U,NU,# Ø, Ypo Ya’ is 
a homeomorphism of the open set y,(U, N Up) 
of R” onto the open set Y(U, N Up) of R”. Let 
x=(x',...,x”)ey,(U,M Up). Then we can 

write (pow, =A), --.ff4(%)). If the 

n real-valued functions fj, ...,.ff, defined in 
w,(U, Up) are of tclass C” (1 <r < œ) (resp. 
treal analytic) for any a, $ in A such that 

U, N Up # Ø, then we call S an atlas of class C” 
(resp. C°) of M. When an n-dimensional topo- 
logical manifold M has an atlas S of class 

C (1<r<q), we call the pair (M, S) an n- 
dimensional differentiable manifold of class C’ 
(or C’-manifold). A C”-manifold is also called 
a smooth manifold, while a C®-manifold is 
called a real analytic manifold. We call M the 
underlying topological space of (M, S), and 

we say that S defines a differentiable structure 
of class C’ (or C’-structure) in M. 

In particular, a C°-structure is called a real 
analytic structure. A C’-manifold whose under- 
lying topological space is compact (tparacom- 
pact) is called a compact (paracompact) C’- 
manifold. A coordinate neighborhood (U, y) of 
M is called a coordinate neighborhood of class 
C” of (M, S) if the union SU {(U, p)} is also an 
atlas of class C’ of M. In particular, each co- 
ordinate neighborhood of M belonging to S is 
of class C”. The set Ẹ of all coordinate neigh- 
borhoods of class C” of (M, S) is an atlas of M 
containing S, and we call § the maximal atlas 
containing S. Let S and S’ be two atlases of 
class C” of M. If S=5’, then we say that S and 
S’ define the same differentiable structure of 
class C” on M and that the differentiable mani- 
folds (M, S) and (M, S’) of class C” are equiva- 
lent. In particular, (M, S) and (M, §) are equiv- 
alent C’-manifolds. Let S and S’ be atlases 
of class C” and class C5, respectively, where 
l<r<s<wo. Since s>r, we can consider S’ 
an atlas of class C”. If S and S’ define the same 
C’-structure in M, then we say that the C*- 
structure defined by S’ is subordinate to the 
C’-structure defined by S. If M is paracom- 
pact, then there exists a C°-structure subordi- 
nate to a C’-structure of M (Whitney [14]). 


E. Differentiable Manifolds with Boundaries 


Let U and U’ be open sets in the half-space H”, 
and let gp: U->U’ be a continuous mapping. If 
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there exist open sets W and W’ in R” contain- 
ing U and U’, respectively, and a mapping 
y:W—W' of class C” that extends ọ, we call o 
a mapping of U into U’ of class C”. Let M be 
a Hausdorff topological space. A structure of 
a C’-manifold on M is defined by a set S = 
{(U,, Wa) neas Where {U,},-4 is an open covering 
of M, and, for each a, y, is a homeomorphism 
of U, onto an open set of H” such that for 

any a, BEA with U,NU;# Ø, Wow, ' isa 
mapping of class C” from W,(U, Us) onto 
W,(U,1 Up). Let 6M denote the set consisting 
of points p of M such that pe U, and y,(p)e Hë = 
{(x1,...,X,)€H"|x,=0} for some «e A. If 

OM # Ø, the pair (M, S) is called an n-dimen- 
sional differentiable manifold with boundary of 
class C’ (or C’-manifold with boundary), and 
ôM is called the boundary of M. ðM forms an 
(n— 1)-dimensional C’-manifold. If we put 
U;=U,éM and denote the restriction of y, 
to U; by Ya, then S’ = {(Uj, Ya) }aca is an atlas 
of class C” of ôM. If ôM is empty, then (M, S) 
is a C’-manifold. In this sense a C’-manifold 

is sometimes called a C’-manifold without 
boundary. 


F. Orientation of a Manifold 


Let S={(U,,W,)},.4 be an atlas of class C” in 
M, and for each « let {x},...,x%} be the local 
coordinate system in a coordinate neighbor- 
hood (U,, W,). If U, and U, intersect, then there 
exist n real-valued functions F! (i=1,...,n) 
defined on w,(U, Up) such that xj(p) = 
F'(xi(p), --.,x2(p)) for pe U,N U, and i= 
1,...,n. The Jacobian D,,=D(F",..., F")/ 
D(x}, ..., x”) is different from zero at each 
point (xz, ...,x%) of Ya(U, N Up). If we can 
choose an atlas S of M so that, for any a, B 
such that U,U, is nonempty, the Jacobian 
D,, is always positive, then we say that the 
C’-manifold M is orientable, and we call S an 
oriented atlas. 

Let S={(U,, Ya) aca and S' = { (Vz, P1) bien 
be two oriented atlases of a connected C’- 
manifold M. If M is connected, then the sign of 
the Jacobian D,,(p) of the transformation of 
local coordinates is independent of the choice 

_of ae A, AEA, and peU,NV,. We say that S 
and S’ define the same (opposite) orientation if 
D,, is always positive (negative). Hence if M is 
connected, the set of all oriented atlases of 
class C’ is composed of two subsets such that 
atlases belonging to one of them have the 
same orientation, while two atlases belonging 
to different ones have the opposite orientation. 
Each of these subsets is called an orientation 
of the connected C’-manifold M. When we 
assign to M one of two possible orientations, 
M is called an oriented manifold; the assigned 
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orientation is called its positive orientation 
and the other its negative orientation. If S = 
{(U,,W,) taca belongs to the positive orien- 
tation, S and (U,, w,) are called an atlas and 
local coordinate system, respectively, compa- 
tible with the positive orientation. 


G. Differentiable Functions 


Let f be a real-valued function defined in a 
neighborhood of a point p of a C®-manifold 
M. Let (U, y) be a coordinate neighborhood of 
class C® such that pe U. If the function foy ~+ 
is of class C’ (1 <r < œ) in a neighborhood of 
the point w(p) in R”, then the function f is 
called a function of class C’ at p. This defi- 
nition is independent of the choice of a coor- 
dinate neighborhood of class C”. If we de- 
note the local coordinate system in (U, y) by 
(x',...,x”), there exists a function f(x’, ..., x") 
of n variables defined in a neighborhood of 
W(p) in R” such that f(q) = f(x! (q), ...,x"(q)) for 
each point q in the neighborhood of p. Here 
we use the same symbol f for the function f 
defined in a neighborhood of p in M and for 
the function fow~! defined in the image of the 
neighborhood by y in R”. The function f is 

of class C” at p if and only if f(x',..., x”) is 

of class C” in a neighborhood of the point 
(x'(p), ...,x"(p)) of R”. A function of class C” 
(or C’-function) in M is a real-valued function 
in M that is of class C” at every point of M. 


H. Tangent Vectors 


Let M be a C”-manifold, and let §%(M) be the 
real vector space consisting of all C*-functions 
in M. (For the sake of simplicity, we denote a 
manifold (M, S) by M.) A tangent vector L at a 
point p of M is a linear mapping L: §ğ(M)>R 
such that L(fg)= L(f)g(p)+f(p)L(g) for any f 
and g in %&(M). For any two tangent vectors 
L,, L, and any pair of real numbers /,, 4. we 
define 4,L,+4,L, by (A L; +4, L (f )= 
ALi (P)+A2La(f), f EFM). 

Thus tangent vectors at p form a real vector 
space T,, which we call the tangent vector 
space (or simply the tangent space) of M at 
the point p. The dimension of the tangent 
vector space T, equals the dimension of M. 
The set of all tangent vectors of M forms a 
tvector bundle over the base space M, called 
the tangent vector bundle (or tangent bundle) 
of M. 

By a tangent r-frame (r <n) at p we mean an 
ordered set ofr linearly independent tangent 
vectors at p. The set of all tangent r-frames 
also forms a fiber bundle over M called the 
tangent r-frame bundle (or bundle of tangent 
r-frames) (— 147 Fiber Bundles F). 
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I. Differentials of Functions 


For a C®-function f in M and a point p of M 
we can define a linear mapping df,:7,-R by 
df,(L)= L(f) for all Le T,, and we call df, the 
differential of f at p. The totality of differen- 
tials at p of C®-functions in M forms the tdual 
vector space of the tangent vector space T,. 


J. Differentiable Mappings 


Let o be a continuous mapping of a C®- 
manifold M into a C®-manifold M’. We call o 
a differentiable mapping of class C’ (or simply 
a C’-mapping) ({ <r < œ) if the function fo 
is of class C” for any C’-function f on M’. If 
ọ is a homeomorphism of M onto M’ and @ 
and ~' are both of class C”, then we call g a 
diffeomorphism of class C’. If there exists a 
diffeomorphism of class C” of a C®-manifold 
M onto a C”-manifold M’, then M and M’ are 
said to be diffeomorphic. 

Let M and M’ be C”-manifolds and ọ be 
a C“-mapping of M into M’. For a tangent 
vector L of M at p, a tangent vector L’ of M’ 
at o(p) is defined by L'(g)= L(g o p), ge G(M’). 
The mapping L>L' defines a linear mapping 
(de), of the tangent vector space T, of M at p 
into the tangent vector space Typ of M'at 
o(p). The linear mapping (dọ), is called the 
differential of the differentiable mapping ¢ at p. 
If (de), is surjective, p is called a regular point 
of g. A point on M which is not a regular 
point is called a critical point of p. A point q 
on M’ which is an image of a critical point is 
called a critical value of ọ, and a point on M’ 
which ts not a critical value is called a regular 
value. In R”, the diffeomorphic image of a set 
of Lebesgue measure zero has Lebesgue mea- 
sure zero. So the set of Lebesgue measure 
zero is well defined on a (paracompact) C®- 
manifold. Then Sard’s theorem states: Let 
p:M—M' be a C”-mapping; then the set of 
critical values of ọ has Lebesgue measure zero 
in M’. 


K. Immersions and Embeddings 


Let M and M’ be C®-manifolds and ọ be a 
C”-mapping of M into M’. If (dq), is injective 
at every point p of M, then ¢ is called an im- 
mersion of M into M'. If ọ is an immersion, 
then for some neighborhood U, of any point 
p of M the restriction ọ |U, gives rise to a 
homeomorphism from U, into M’. If an im- 
mersion @ is injective, then ¢ is called an 
embedding (or an imbedding) of M into M’. An 
alternative definition is often used, which says 
that p is an embedding if, in addition to the 
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above conditions, ọ gives a homeomorphism 
from M onto @(M), where @(M) has the rela- 
tive topology of M’. If the former definition is 
adopted as embedding, then a mapping ¢ 
satisfying the conditions of the alternative 
definition is sometimes referred to as a regular 
embedding. If M is compact, the two defini- 
tions coincide. In Sections L and M, embed- 
ding always means regular embeddings. 

The theory of embeddings and immersions 
is mainly concerned with ways to embed and 
immerse a given manifold M into a manifold 
M’ of a particular type with lowest possible 
dimension. M’ ts usually the Euclidean space 
R”, the projective space PR or a certain stan- 
dard manifold. The theory was initiated by 
Whitney (1936). He proved by “general posi- 
tion” argument that an n-dimensional C®- 
manifold M with countable basis can always 
be immersed in the 2n-dimensional Euclidean 
space and can always be embedded in the 
(2n+ 1)-dimensional Euclidean space as a 
closed set (Whitney’s theorem). 


L. Submanifolds 


A C*-manifold M is said to be a submanifold 
of a C*-manifold M’ if M is a subset of M’ 
and the identity mapping of M into M’ is an 
immersion. If the identity mapping of M into 
M' is an embedding, then M is called a regular 
submanifold of M’. A regular submanifold M 
of M’ is called a closed submanifold if M is a 
closed subset of M’. 

Let ọ be a C”-mapping from M into M’ 
and M” be a submanifold of M’. Then for each 
qe M" the tangent space T; of M” at qisa 
linear subspace of the tangent space T} of M’ 
at q. Denote by z, the projection of quotient 
vector space onto T/T. A C®-mapping ¢ is 
called transverse to M” if for each peg '(M") 
the composite Top) 0d4@,: Ty > Tyipy/Toipy 18 
surjective. If @ is transverse to M” then 
y~'(M”") is a submanifold of M. For any C®- 
mapping ọ: M—M’ and any submanifold M” 
c M’, we can find a C®-mapping @’: M >M’ 
which is transverse to M” and arbitrarily close 
to ọ (transversality theorem). Let M, and M, 
be submanifolds of M’. Then we say M, in- 
tersects transversely to M, if the inclusion 
M, <M’ is transverse to M3. 

A C®-mapping is called a submersion if it 
has no critical point. Let g be a submersion 
from M into M’. Then for each point qe M’, 
o` (q) is a regular submanifold of M, and M is 
covered by a mutually disjoint family of sub- 
manifolds: M =( J emp *(q). 

Let M be a submanifold of an n-dimensional 
Euclidean space R". We can identify the tan- 
gent vector space T, of M at p with the geom- 
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etric tangent space of M at p in the Eucli- 
dean space R”. A vector in R” that is ortho- 
gonal to the tangent vector space T, of M at p 
is called a normal vector to M at p. The set of 
all vectors normal to M forms a vector bundle 
over M, which we call the normal vector bundle 
(or normal bundle) of M. If M is compact, then 
the totality of vectors normal to M whose 
length is <e (where g is a sufficiently small 
positive real number) forms a neighborhood 
N(M) of M in R” which we call a tubular 
neighborhood of M. Int N(M) is called an open 
tubular neighborhood. 


M. Vector Fields 


Let N be a subset of a C°-manifold M. By a 
vector field on N we mean a mapping X that 
assigns to each point p of N a tangent vector 
X, of M at p. We can consider X as a tcross 
section over N of the tangent vector bundle of 
M. Let X be a vector field in M, and let f be a 
C®-function in M. Then we can define a func- 
tion Xfin M by (Xf)(p)=X,f. We call X a 
vector field of class C” if the function Xf is 

of class C” for any C®-function f in M. Let 
(x',...,x") be the local coordinate system in 

a coordinate neighborhood (U, y), and let 
(0/dx'),f =(6f/0x')(p) for pe U and fe (M). 
Then 6/0x' (i=1,...,n) are vector fields in U, 
and the (0/0x'), form a basis of T, at every 
point pe U. A vector field X in U is written 
uniquely as X, = &,¢'(p)(@/6x'), at each point 
peU. Then ët, ..., č” are real-valued func- 
tions defined in U, called the components of X 
with respect to the local coordinate system 
(x',...,x"). A vector field X in M is of class C” 
if and only if its components &' with respect to 
each coordinate system are functions of class 
C’ (0<r < œ). Let (x+, ..., X") be another local 
coordinate system in a neighborhood U of 

p, and let (t, ... , €") be the components of 

X with respect to (x',..., X”). Then we have 
E'(q) = X,(0x'/dx-)(q)E4(q) at each point qe U. 

For the rest of this article we mean by a 
vector field in M a vector field of class C”, and 
we denote by X(M) the set of all vector fields 
in M. Then X(M) is an %(M)-tmodule, where 
&(M) denotes the algebra of all C*®-functions 
in M. In fact, for f, ge (M) and X, Ye X(M), 
we can define a vector field fX +gY by (fX 
+gY),=f(p)X,+9(p) Y, and this defines an 
(M)-module structure in ¥(M). 

In a coordinate neighborhood (U, y), we can 
write X =; ¢'(6/0x'). The right-hand side of 
this equation is sometimes called the symbol of 
the vector field X. A vector field X can also be 
interpreted as a linear differential operator 
that acts on (M). 

Let X and Y be vector fields in M. Then 
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there exists a unique vector field Z in M such 
that Zf= X(Yf)— Y(Xf) for any C®-function f 
in M. We denote Z by [.X, Y] and call it the 
Poisson bracket (or simply bracket) of X and 
Y. If €' and y‘ denote the components of X 
and Y, respectively, in a coordinate neighbor- 
hood (U, y), then the components ¢' of [X, Y] 
are given by ¢ = X {2*(dy'/dx*)—n*(0E'/0x")}. 
The bracket of vector fields has the following 
properties: (i) [X, Y] f= X(¥f)— Y(Xf), i) 
L£X, g¥]=folX, Y1+f(X9) ¥—9(Y)X, (ii) 
[X + Y,Z]=[X,Z]+LY, Z], (iv) [X, Y]= 
—{Y, X], and (v) [[X, Y], Z]+[[Y, Z], X] + 
[EZ, X], Y]=0 (Jacobi identity). These iden- 
tities show that ¥(M) is a Lie algebra (— 248 
Lie Algebras) over R. 

If ọ is a diffeomorphism of M onto M’, 
then for any vector field X in M we can define 
a vector field p, X in M’ by the condition 
(PX) =4P;(X4)}, p= (q). Then p, is an iso- 
morphism of the Lie algebra ¥(M) onto the 
Lie algebra ¥(M’). 


N. Vector Fields and One-Parameter Groups 
of Transformations 


A one-parameter group of transformations of 
M is a family 9, (teR) of diffeomorphisms 
satisfying the following two conditions: (i) the 
mapping of R x M into M defined by (t, p)—> 
(p) is of class C”; and (ii) 9,0 p, = 9,4, for 

s, teR. 

Let p, be a one-parameter group of trans- 
formations of M. Then we can define a vector 
field X by X,f=lim,o( fep) — f/i, 
where pe M and fe (M). The vector field X 
thus defined is called the infinitesimal trans- 
formation of @,. We also say that ¢, is gen- 
erated by X, and sometimes we denote p, by 
the symbol exptX. In this case, if (x',..., x”) is 
a local coordinate system, then at each point p 
of the coordinate neighborhood, we have X p= 
Xi(dx'(@,(p))/dt),=0(0/0x'),. 

If M is compact, then every vector field in M 
is the infinitesimal transformation of a one- 
parameter group of transformations; that is, 
every vector field generates a one-parameter 
group of transformations. For M noncompact, 
this is not always true. Nevertheless, for each 
vector field X we have the following result 
concerning local properties of X: For each 
point p of M, there exist a neighborhood U of 
p, a positive real number g, and a family (|t| 
<£) of mappings of U into M satisfying the 
following three conditions. (1) The mapping 
of (—«,e) x U into M defined by (t, g)>@,(q) 
is of class C”, and for each fixed t, ¢, is a dif- 
feomorphism of U onto an open set ¢,(U) 
of M. (2) If isl, [¢|, and |s+¢| are all smaller 
than c and q and @,(q) both belong to U, then 
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Ps(P:(9)) = Ps+4). (3) X (Q) f = lim, o( f(9,(9)) 

— f(q))/t for qeU and fe (M). We call o, the 
local one-parameter group of local transfor- 
mations around p generated by X. 

Let X and Y be vector fields in M, and let 
y, be the local one-parameter group of local 
transformations around p generated by X. 
Then [X, Y],=lim,..o(¥, —((@), Y)p)/t, where 
(~,), Y is a vector field defined as follows: Let 
U be a neighborhood of the point p where o, 
({t|<«) is defined. Then (g,), Y is the vector 
field on g,{U) that is the image of Y under the 
diffeomorphism ¢,. In particular, if X gener- 
ates a one-parameter group of transforma- 
tions of M, then we have [X, Y]=lim,..(Y— 
(~,)4 Y)/t for any vector field Y in M. 


O. Tensor Fields 


Let 7,’(p) be the vector space consisting of all 
r-times contravariant and s-times covariant 
tensors over the tangent vector space T, of a 
C”-manifold M, that is, 


rn-(1,)0( ®t), 


where 7,* denotes the dual linear space of T, 
(— 256 Linear Spaces). A tensor field (more 
precisely, contravariant of order r and covariant 
of order s, or simply a tensor field of type (r, s)) 
on a subset N of M is a mapping K that as- 
signs to each point p of N an element K, of the 
vector space 77’(p). In particular, if r=s=0, K 
is a real-valued function on N, and we call K a 
scalar field. If r= 1 and s=0, K is a vector 
field, called a contravariant vector field. When 
r=0 and s=1, we call K a covariant vector 
field (or differential form of degree 1). If r #0, 
s=0 or r=0, s40, we call K a contravariant 
tensor field of order r or a covariant tensor 

field of order s, respectively. A contravar- 

iant or covariant tensor field K is said to be 
symmetric (alternating) if K, is a symmetric 
(alternating) tensor at every point p of M. 

Let (x!,...,x") be the local coordinate system 
in a coordinate neighborhood (U, y). Then at 
each point p of U, the (0/éx'), (i=1,...,n) form 
a basis of the tangent vector space T,, the 
differentials (dx'), (i=1,...,n) form a basis of 
the dual space T*, and these bases are dual to 
each other. A tensor field K of type (r, s) de- 
fined on M is written at any point p of U in 
the following form: 


K=} Kit ii(p)(0/6x"), ® ... @ (6/6x"), 
Q (dxi), @ ... O (dx*),. 


The functions Kj" defined in U are called 
the components of the tensor field K of type 
(r,s) with respect to the local coordinate sys- 
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tem (x', ..., x"). If K; are the components 
of K with respect to the local coordinate sys- 
tem (xt, ..., X”) in another coordinate neigh- 
borhood (U’,’) such that U N U’ # @, then for 
each qe U N U’, the following relations hold: 


Kir (q) =F p, (0X4 /Ox"),... (X"/dx"), 
x (Ox"1/dx4),.. (Ox*/0x"), Rees 


A tensor field K in M is called a tensor field of 
class Œ (0<t< œ) if the components are func- 
tions of class C! for any coordinate neighbor- 
hood of M. 

The sum K + L and the tensor product 
K ® L of two tensor fields K and L are defined 
by the rules (K+L),=K,+L, and (K @ L), 
=K, ® L, respectively. The contraction of 
two tensor fields is also defined by taking the 
contraction pointwise. 

Let ọ be a diffeomorphism of M into M’. 
Then the differential (dọ), is an isomorphism 
of T, onto T, (p= (q)) for each qe M and 
hence induces an isomorphism @, of the vector 
space 7;'(q) onto the vector space T;(p) (— 256 
Linear Spaces). For any tensor field K in M we 
can define a tensor field K in M’ by (K), = 
(Kı), P= 9(q), GEM. Then (K +L)= GK + 
PL, PIK © L)=@K ®@ @L, and the mapping 
@ commutes with contraction. 

Let K be a tensor field and X be a vector 
field (both of class C”) in M. We define a 
tensor field Ly K by (Ly K),=lim,..o(K, — 
(@,K),)/t, where ~, denotes the local one- 
parameter group of local transformations 
around p generated by X. We call L,K the 
Lie derivative of K with respect to the vector 
field X. The operator Ly: K-L,K has the 
following six properties: (i) Ly(K + K) = 
LyK+L,K’; (ii) Ly(K © K’)=(LyK)® K'+ 
K @L,K’; (iit) the operator Ly commutes 
with contraction; (iv) Ly f= Xf for a scalar 
field f and Ly Y=[X, Y] for a vector field Y; 
(v) Lyx, y= LyLy~LyLy, that is, Ly nK = 
L,(LyK)— Ly(LyK); and (vi) K is invariant 
under p, i.e., @ K =K for all t, if and only if 
LyK=0. 

Let K be a covariant tensor field of order r 
in M. We always assume that K is of class C®. 
The value K, of K at pe M ts an element of the 
vector space T,* ® ... © T,* (r times tensor 
product of 7;*); hence we may consider K, an 
r-linear mapping of T, into R (— 256 Linear 
Spaces). If X,,..., X, are vector fields in M, 
we define a C®-function K(X,,..., X,) by 
K(X,, ...,X,)(p)=K p((X1)p, -..,(X,),). Then 
the mapping that assigns to each r-tuple 
(X,,...,X,) of vector fields the C*-function 
K(X,,...,X,) is an r-linear mapping on the 
&(M)-module X(M) consisting of all vector 
fields of class C” in M into %(M); that is, 
K(X,,...,fX;+ 9%, ....X)=fK(X,,..., X35 
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XJ) +gK(X,,..., ¥,---,X,) (i=1,..., 7) for 
J, g€ X(M). Conversely any r-linear map- 
ping of the §(M)-module X(M) into (M) 
can be interpreted as a covariant tensor field 
of order r in M. If the tensor field K is sym- 
metric (alternating), the corresponding r- 
linear mapping K(X,,..., X,) is symmetric 
(alternating) with respect to X,,...,X,. For 
the Lie derivative Ly K of a covariant tensor 
field of order r of K, we have the following 
formula: (Ly K)(X,,...,X,)=X(K(X,,...,X,)) 
—Di-, K(X1,..., LX, Xz, -.-, X). 


P. Riemannian Metrics 


A symmetric covariant tensor field g of order 
2 and of class C” in M is called a pseudo- 
Riemannian metric if the symmetric bilinear 
form g, on the tangent vector space T, is non- 
degenerate at each point pe M; and g is called 
a Riemannian metric if g, is positive definite 
for all p. If g is a Riemannian metric, the 
length || L|| of a tangent vector Le T, is defined 
by ||L\j?=g,(L, L). On a paracompact C®- 
manifold there always exists a Riemannian 
metric. A pair consisting of a differentiable 
manifold and a Riemannian metric on it 

is called a Riemannian manifold (— 364 
Riemannian Manifolds). 


Q. Differential Forms 


An alternating covariant tensor field in M of 
order r and of class C! (0 <t < œ) is also called 
a differential form (or exterior differential 
form) of degree r. A differential form of degree 
1 is sometimes called a Pfaffian form. Let œ be 
a differential form of degree r. Since each alter- 
nating covariant tensor of order r at a point p 
is an element of A'T% , the r-fold texterior 
product of T*, the form œ is a mapping that 
sends each point p of M to an element œ, of 
/\'T,*. We can also regard @ as an alternating 
r-linear mapping of X(M) into ¥(M). Let 
(x!,..., X") be the local coordinate system in a 
local coordinate neighborhood (U, w). Since 
(dx'), (i=1,...,n) is a basis of T*¥ at each point 
p of U, we can express œ, (pe U) uniquely in 
the form 
o= Yai, ()(dx')y 0... A(dx*)p 
ip<...<i, 
where the sum extends over all ordered r- 
tuples (i,,...,i,) of indices such that 1 <i, <i, 
<...<i,<n. For an ordered r-tuple (i,, ...,i,) 
of indices with repeated indices we put a; _; =0, 
and for (i,,...,i,) with r distinct indices we put 
ai i =(sgno)a; j, where (jise Gi <e 
<j,) is a permutation of (i,,...,i,) and sgno 
denotes the sign of the permutation 0: i,j, 
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(k=1,...,r). Then we can write 


1 n 
== 3 


Fiiil 


a; i,(p)(dx"), Aw A (dx*)p. 


The functions a;,..;, are the components of the 
tensor field œ, and w is of class C! if these 
components are of class C' for any coordinate 
neighborhood. By the support (or carrier) of a 
differential form œw we mean the closure of the 
subset of M consisting of all p such that w, 40. 

In the rest of this article, differential forms 
are always of class C”, and we denote by 
D’(M) the real vector space consisting of all 
differential forms of degree r and of class C®. 
In particular, D°(M)=%}(M) and D'(M)= {0} 
forr>n,n=dimM. 

For differential forms we have the following 
five important operations. 

(1) Exterior product. Let w and 0 be dif- 
ferential forms of degree r and s, respectively. 
The exterior product w ^ 0 of œw and 0 is the 
differential form of degree r + s defined by 
(wa 0),=0,/9,, PEM. Let Xj,...,X,+5 be 
r+s vector fields in M. Then we have 
(A8)(X1, ++ Xp45) 


=) sgn(ij)o(X;,, ata , X OX; mince ,X;), 


Js 


where the summation runs over all possible 
partitions of (1,2, ...,r +s) such that i, <i, < 
... <i, andj, <j: <... <j, and sgn(i; j) means 
the sign of the permutation (1,2,...,r+s)> 
(iieii, Ja). In particular, if w,,...,@, 
are differential forms of degree 1, then we have 
(w A...A@,)(X,,.-.,X,) = det(w,(X;)). 

(2) Exterior differentiation. Let be a dif- 
ferential form of degree r, and let @ =(1/r!) 
Èa; dx" A... adx" in a coordinate neigh- 
borhood (U, yw). Then we can define a differen- 
tial form dw of degree r+ 1 by the condition 
dw=(1/r!)È da; Adx't a... Adx" in U. The 
differential form dw is called the exterior de- 
rivative (or exterior differential) of a. A dif- 
ferential form œ satisfying the condition dw =0 
is called a closed differential form, and a dif- 
ferential form y that can be expressed as y= 
dw for some w is called an exact differential 
form. If a, be R and w, œw e D(M), then we 
have d(aw + bw’)=adw + bdw’. Therefore the 
set €’(M) of all closed differential forms of 
degree r and the set ©’(M) of all exact differ- 
ential forms of degree r are linear subspaces 
of D(M). 

For the exterior derivative dw, we have the 
following formula: 


(da) (X,,..., X,) 


r+] 
== 2 (—1¥+ X ({o(X,, oes Xi, wit X41) 


+5 (D oX;X], X 


W AEE CRONE AE 
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where the variables under the sign ^ are to be 
omitted. 

(3) Interior product with vector field. Let 
w be a differential form of degree r and X a 
vector field. When r> 1, we can define a dif- 
ferential form :(X)@ of degree r—1 by the 
formula (1(X)@)(X,,...,X,-,)=@(X, X,--., 
X,-1) for any r—1 vector fields X,,..., X,_,; if 
r=0, we put 1(X)w=0. The differential form 
1(X)q is called the interior product of œ with 
X. 

(4) Lie derivative. The Lie derivative Lyœ of 
a differential form of degree r with respect to 
a vector field X is a differential form of the 
same degree. For any r vector fields X,,...,X, 
we have, by definition, (Lyq@) (X,,...,X,= 
X(@(X,,..., X,))— Liz, @(X,,..., LX, XG], 

.., X,). 

(5) Let o be a C*-mapping of M into M’, 
and let ož: Täpp > Ty be the transpose (or 
‘dual) of the linear mapping (dọ )p: T, > Typ) 
for pe M, i.e., the mapping defined by the 
condition ((dọ),L, «)=(L, ofa) for each Le T,, 
xE Top We denote the linear mapping of 
AT gp) into AT% induced by ož by —F also. Let 
w be a differential form of degree r in M. Then 
a differential form @*qw in M, the pullback by 
o of œ, is defined by (p*w), = GF gp), PEM. 

The operations defined previously satisfy 
the following six important relations: (i) d? = 
0, that is, d(dw) =0; (ii) d(w A 0)=da ^ 0+ 
(—1)’w A d0, where œ is of degree r; (iti) 

p*(w A 9)=eE*w ^ p*0, p*(dw) = d(p*w); (iv) 
Llw A 0)=(Ly@) A04+@ A (Ly9); (v) Ly = 
(X)-d+d-i(X), Ly(dw)=d(Lyq); and (vi) 
Lix,y= Lx: Ly —Ly: Ly, (LX, Y]J=Ly(Y)— 
(Y) Ly. 


R. De Rham Cohomology 


Let D(M)=X?_, D’(M), where n=dim M. 
Then D(M) is a tcochain complex with 
tcoboundary operator d. We denote by H’(D) 
the r-dimensional cohomology group of 

this cochain complex, and we call it the r- 
dimensional de Rham cohomology group of the 
differentiable manifold M. If we denote by 
C(M) and &’(M) the subspaces of D"(M) con- 
sisting of closed differential forms and exact 
differential forms, respectively, then H’(D) 

= €"(M)/€"(M) (0<r<n) by definition. If 
we€'(M) and 6€€"(M), then w a 0eC'*"(M), 
and if we €'(M) and 0€€'(M) (or we E'(M) 
and #e€’(M)), then w a Ge E'*"(M). So if we 
put H(D)=>?_, H’(D) (direct sum), we can 
define a product in H(D) by [œ]: [0] =[ma 8] 
for each [w]e H'(D), [6]¢ H"(D). With respect 
to this product, H(D) forms an algebra over R 
called the de Rham cohomology ring of M. 
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S. Partitions of Unity 


Let M be a paracompact C®-manifold, and let 
{V }ier be a tlocally finite open covering of M 
such that the closure of V, is compact for each 
index i. Then there exists a C®-function f; in 

M for each i satisfying the following three 
conditions: (1) 0< f;< 1, (ii) the support of f; is 
contained in V, and (iti) Xer f;(x) =1 for every 
xéM. The family of C”-functions f;, ie J, is 
called a partition of unity of class C” subordi- 
nate to the open covering {V;} <7. 


T. Integrals of Differential Forms 


Integrals over an Oriented Manifold. Let M be 
an n-dimensional paracompact oriented C®- 
manifold and œ a differential form of degree n 
in M with compact support. We can choose a 
positively oriented atlas S={(U,, Wa) łasa such 
that {U,},.4 is a locally finite covering; U, is 
compact for each a. Suppose first that the 
support of w is contained in U, for some index 
a. Then (y; ')*w, is a differential form of de- 
gree n in w,(U,), and we can express (w, ')*w, 
in the form adx' a ... a dx", where (x', ..., x") 
are the coordinates in R” such that xi=x'ow, 
(i=1,...,n) give a local coordinate system 
compatible with the orientation of M and a is 
a C®-function with compact support. Then we 
define the integral of w over M by 


| o=[ adx! ...dx". 
M WalUa) 


For the general case let { f,\,., be a partition 
of unity of class C” subordinate to {U,},.4. 
Then the support of f œ is contained in U,, 
and except for a finite number of the indices «, 
jf, vanishes identically. Therefore we may 
define the integral of œ over M by 


jo" 


and we can show that this definition of the 
integral is independent of the choice of or- 
iented atlas S and of a partition of unity sub- 
ordinate to S. 


Integrals over a Singular Chain. We fix rectan- 
gular coordinates in R”. Let dọ be the origin 
and d; be the unit point on the ith coordinate 
axis. Let S’ denote the oriented r-simplex 

(do, d,,...,d,) with vertices dy, d,,...,d,. When 
we regard S” as a point set, we denote it by 
|S”|. An oriented singular r-simplex of class C” 
in M is, by definition, a pair (S’, ø) consisting 
of S” and a C*-mapping ¢ of an open neigh- 
borhood of |S’| into M. An element of the free 
Z-module generated by singular r-simplexes of 
class C” is called an integral singular r-chain 
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of class C” in M. We define a real singular r- 
chain of class C” analogously. Let œw be a 
differential form of degree r and (S’, o) be an 
oriented singular r-simplex of class C” in M. 
Then ~*w is a differential form of degree r in a 
neighborhood of [S’|, and we can express p*œ 
in the form p*w=adx! a dx? a... a dx". We 
define the integral of œw over (S’, g) by 


| o=| adx! ...dx", 
(S', p) IS” 


and the integral of œ over a singular r-chain C 
of class C” by 


f o=> m; | w, 
C i (S".p;) 


where C =X; m;(S", 9i), mE Z (or m;e R). 
When r=0, then w is a function in M, and 
S° is a point o. In this case we put fcœ = 
Zim olg;lo)). 

Let C,(S, Z) (C,(S, R)) be the Z-module (vec- 
tor space over R) of integral (real) singular 
r-chains of class C” in M, and let w(C) be 
the value of the integral of œ over a chain C. 
Then w is a linear function in the vector space 
C.(S, R), and hence we can consider œ a sin- 
gular r-cochain of class C”. 


U. Stokes’s Formulas 


(1) Let D be a fdomain in an n-dimensional 
C®’-manifold M, and let ôD and D be the 
boundary and the closure of D, respectively. 
Let S={(U,, Ya) }zea be an atlas of class C” of 
M, U! = U,N D, wW be the restriction of y, to 
Uj, and T={(U;,W))}aea- If the pair (D, T) is a 
C*”-manifold with boundary under a suitable 
choice of S, then the domain D is called a 
domain with regular (or smooth) boundary. 
The boundary ôD of (D, T) is then an (n—1)- 
dimensional closed submanifold of M, and if 
M is orientable, 0D is also orientable. Now let 
M be a paracompact and oriented manifold 
and D be a domain with regular boundary. 
Let C be a characteristic function of D in M, 
i.e., a function defined by the condition C(p)= 
1 for peD and C(p)=0 for p€D. Let 6 be a 
differential form of degree n in M with com- 
pact support. We define the integral of 0 over 
D by 


[pe 


Let w be a differential form of degree n— 1 in 
M with compact support. We then have 
Stokes’s formula: 


| da -Í i*w, 
D 6D 


where i denotes the identity mapping of the 
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submanifold ðD into M with ôD having the 
orientation induced naturally from that of M. 
(2) Let C be a singular r-chain of class C” in 
M, and let ðC be the boundary of C. Then for 
any differential form œw of degree r—1, we have 


| t= f Oo. 
c ac 


This formula is also called Stokes’s formula. 


V. De Rham’s Theorem 


Let M be a connected paracompact C®- 
manifold. If we consider w as a singular co- 
chain, we have (dw)(C)=w{0C); by Stokes’s 
formula, this means that the exterior differen- 
tial dw of w is equal to the coboundary of the 
singular cochain œ. Let œw and C be a closed 
differential form of degree r and a singular r- 
cycle of class C”, respectively, and let [œ] and 
[C] be the de Rham cohomology class and the 
singular homology class represented by w and 
C. Using Stokes’s formula, we can define the 
inner product ([@],[C]) by 


testen- w. 


Through this inner product, it follows that 
the de Rham cohomology group H’(D) is 
isomorphic to the rth singular cohomology 
group H"(M,R), the dual space of the rth 
homology group of the complex of real sin- 
gular chains of class C”. Moreover, the de 
Rham cohomology ring H(D) is isomorphic to 
the singular cohomology ring H*(M, R) (de 
Rham’s theorem). 


W. Divergence of a Vector Field 


Let M be an n-dimensional oriented C®™- 
manifold, and let S be an oriented atlas. Let 

œw be a differential form of degree n, and let 
(x',...,x") be the local coordinate system in a 
coordinate neighborhood in S. Then we can 
express œ in the coordinate neighborhood 
uniquely in the form @=adx' a... a dx". If the 
function a is positive for any coordinate neigh- 
borhood in S, we call w a volume element of 
M. In a paracompact oriented manifold, there 
always exists a volume element. (We remark 
that an n-dimensional differentiable manifold 
M is orientable if and only if there exists an 
everywhere nonvanishing differential form of 
degree n.) Let f be a C®-function in M with 
compact support. Then f-« is a differential 
form of degree n with compact support, and so 
the integral 


| re 
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is defined. We call this integral the integral of 
the function f with respect to the volume ele- 
ment œ. 

Let g be a Riemannian metric in M and g; 
the components of g with respect to the local 
coordinate system (xt, ..., x") as before. Then 
we can define a volume element w in M by 
putting œ= JGax! A... A dx", G=det(g,;) in 
each coordinate neighborhood. The volume 
element thus defined is called the volume ele- 
ment associated with the Riemannian metric g. 

Let w be a volume element and X a vector 
field in M. Then the Lie derivative Lyw is also 
a differential form of degree n, and we can 
express Lyw in the form Lyw = fy’ œ, where fy 
is a scalar field, i.e., a function in M. We call fy 
the divergence of the vector field X with re- 
spect to the volume element w and denote it 
by div X. If w is associated with a Riemannian 
metric, then div X is called the divergence of X 
with respect to the Riemannian metric. 

If M is compact, we have 


| div X-w=0 
M 


for any vector field X. This result is known as 
Green’s theorem. 


X. Jets 


Let M and N be C®-manifolds. We define 
an equivalence relation in the set of all C%- 
mappings of M into N. Let f and g be such 
mappings and p be a point of M. Choosing 
local coordinate systems, we write f(p)= 
O, s A, 9(P) = (91 00) -+ gal), = 
(X1,--+5Xm). We say f and g are equivalent at 
p if f(p), g(p), and the values at p of all the 
partial derivatives of f; and g; up to the order r 
(r an integer, r>0) are equal (i=1,...,n). An 
equivalence class with respect to this equiva- 
lence relation is called a jet of order r at p. A jet 
of order r at p represented by a function f is 
denoted by j; f, and the points p and f(p) are 
called the source and the target of the jet jp f, 
respectively. We denote by J5(M, N) the set of 
all jets of order r with source at p and target in 
N and let J'(M, N)=|) pea Jp(M, N). For any 
jet j, let z (j) and z,(j) denote the source and 
the target of j, respectively. We can introduce 
the structure of a C®-manifold in J’(M, N) 
in a natural way such that the projections 
m,.J"(M, N)>M and 7,:J"(M, N)—>N are both 
of class C” and J’(M, N) is a fiber bundle over 
M(N) with projection z,(z,). As examples, we 
have: 

(1) JE (R, N) is identified with the tangent 
vector bundle of N. 

(2) The set J’(€) of all jets of order r deter- 
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mined by the sections of a vector bundle € of 
class C® is also a vector bundle of class C°”. 

Let f: MN and g: N >L be C*-mappings. 
We define a composition of jets by jig ` 
j=j (go f). A jet jp fes’(M, N) is inver- 
tible if there exists a mapping g: N>M such 
that ji()9 Jp f =Jp(1a), where 1, denotes the 
identity mapping of M onto itself. We de- 
note by I’(M, N) the set of all invertible jets 
in J"(M, N), and put I5(M, N)=1"(M, N)N 
J (M, N). 

(3) Let G" (n) be the set of all invertible jets 
in F(R”, R”) whose source and target are the 
origin of R”. Then with respect to the compo- 
sition of jets, G’(n) forms a Lie group that is an 
textension of G!(n)=GL (n, R) by a simply 
connected nilpotent Lie group. The projec- 
tion G’(n)>G!(n) is a special example of the 
natural projection J’(M, N)>J°(M, N) (r2s), 
which is defined in general. 

(4) We can identify 14 (R”, M) (m=dim M) 
with the tangent m-frame bundle over M. 
More generally, 1§(R”, M) is a G'(m)-bundle 
over M. 


Y. Pseudogroup Structure 


Let X be a topological space, and let I be a 
set consisting of homeomorphisms f: U;>V,, 
where Up, V; are open subsets of X. We call a 
pseudogroup of topological transformations if 
I satisfies the following four conditions: (i) F 
contains the identity mapping of X onto X; (ii) 
if fel, then the restriction of f onto any open 
subset U of U,; is also contained in T; (iii) if f 
and g are in I and V,cU,, then go f is con- 
tained in T; and (iv) if fel, then f7! : V,— U; is 
also in T. 

Following the definition of differentiable 
manifolds we define a pseudogroup structure of 
M (or, more precisely, a T -structure of M) as a 
set A of bijections, with each member « defined 
on a subset U, of M onto an open set V, of X, 
satisfying the following three conditions: (i) 
.U,=M; (ii) if x, Be A, then «o p ~ter, 
where the domain of definition of xo B™ is 
B(U,0 Up); and (iii) A is the maximal set of 
bijections that satisfies conditions (i) and (ii). 
We introduce in M the weakest (coarsest) 
topology such that every bijection « is a 
homeomorphism. If I” is a pseudogroup of X 
containing F and A’ is aI’-structure of M 
such that Ac A’, then we say that A’ is sub- 
ordinate to A. If X =R" (or H”, a half-space of 
R”), F is the totality of diffeomorphisms of 
class C of open sets of X onto open sets of 
X, and M is a space with Hausdorff topol- 
ogy, then the I’-structure is the C’-structure 
with or without boundary which we have 
already defined. We give three examples of T- 


105 Z 
Differentiable Manifolds 


structures subordinate to I’, where I’ is the 
totality of local transformations of class C” 
(r>1) in R”. 

(1) When n is even, we identify R” with C”? 
and denote the totality of holomorphic trans- 
formations of connected open domains by I. 
The I’-structure in this case is called a complex 
structure. 

(2) When n is odd, we define F as the totality 
of transformations of connected open domains 
in R” that leave invariant a Pfaffian form 
Dr, xidx™*!+dx?"*! (n=2m-+ 1) up to scalar 
factors. The I’-structure in this case is called a 
contact structure. 

(3) We consider R” =R? x R"? and define F 
as the set of all diffeomorphisms U —> V (where 
U, V are open in R") such that each set of the 
form U N(R? x {y}) is mapped onto a set of the 
form VM(R? x {y’}). The T-structure in this 
case is called a foliated structure. 

The problem of determining whether there 
exists a T -structure for given I and M involves 
widely ranging problems of topology and 
analysis. The classification of with reason- 
able conditions is another important open 
problem. 

Haefliger has constructed the classifying 
space BT for T’-structures. 


Z. Infinite-Dimensional Manifolds 


Let B and B’ be Banach spaces and @ be a 
mapping from an open set of B to B’. Then, 
using the notion of Fréchet derivatives, we can 
define ọ to be smooth. For a smooth mapping 
Q, the Jacobian J(@) at each point x is a linear 
mapping from B to B’, for which the inverse 
function theorem holds true as a straightfor- 
ward extension of the corresponding one in 
the finite-dimensional case. Similar extension 
also holds for the existence and the unique- 
ness theorems of solution of ordinary differen- 
tial equations with value in B. These facts 
permit us to generalize the notion of differ- 
entiable manifolds to infinite-dimensional 
ones. Actually, infinite-dimensional manifolds 
can be defined in a way similar to the finite- 
dimensional case, taking open sets of a certain 
Banach space as local coordinate neighbor- 
hoods. Such a manifold is called a Banach 
manifold. Various formal definitions of dif- 
ferentiable manifolds can also be stated for 
Banach manifolds in extended form. However, 
while differentiable manifolds are locally com- 
pact and admit a partition of unity by smooth 
functions subordinate to a locally finite open 
covering, Banach manifolds generally lack 
these properties. Actually, local compactness 
gives a criterion for whether a manifold is 
finite-dimensional or not. Banach manifolds 
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often provide a basic functional-analytic view 
to nonlinear analysis and global analysis. 

The most important category of Banach 
manifolds is provided by the Hilbert manifold, 
that is, a Banach manifold whose local coordi- 
nates are modeled on a Hilbert space. For a 
Hilbert manifold, a partition of unity subordi- 
nate to a locally finite open covering can be 
taken from smooth functions. In the follow- 
ing, we refer to a separable Hilbert manifold 
simply as a Hilbert manifold. An infinite- 
dimensional Hilbert manifold M can be 
smoothly embedded as an open set of a Hil- 
bert space and thus covered by a single coordi- 
nate neighborhood. Hence, in particular, the 
tangent bundle of M is trivial. Historically, 
this fact was the first instance showing that 
the distinguishing properties are shared by 
infinite-dimensional manifolds; this was first 
recognized as a consequence of the theorem 
stating that the unitary group of an infinite- 
dimensional Hilbert space is contractible. Two 
infinite-dimensional Hilbert manifolds are 
diffeomorphic if and only if they are homo- 
topically equivalent. A typical example of a 
Hilbert manifold is provided by the space of 
L?-loops on a compact Riemannian manifold. 
Morse theory can be extended in a suitable 
way to a Riemannian Hilbert manifold under 
the Palais-Smale condition, which makes it 
possible for the integral curve of grad f to tend 
to a critical point, where f is a Morse function. 


AA. GePfand-Fuks Cohomology 


The space X(M) consisting of all the smooth 
vector fields on a smooth manifold M has the 
structure of a Lie algebra under the bracket 
operation [X, Y]=X Y— YX, where the vector 
fields X and Y are regarded as derivations on 
the algebra C”(M) of smooth functions on M. 
X(M) is a topological Lie algebra when en- 
dowed with the topology defined by uniform 
convergence of the components of vector fields 
and all their partial derivatives on each com- 
pact set of M. When X(M) acts continuously 
on a topological vector space V, the continu- 
ous cohomology H*(X(M), V) of X(M) with 
coefficients in V is the cohomology of the 
cochain complex @ {C?=C?(X(M), V), d}. 
Here C? = V, C?(p>1) is the space of all the 
alternating p-linear continuous mapping ¢ of 
X(M) x ... x ¥(M) (p-times) into V, and d: C? 
—+C?*! is defined for pe C? and X;e ¥(M) by 
dp(X,)=X, 9 (p=0) and dg(X,, -X p11)= 
Li<(—Lo(LX;, Xs X, o Xis i pales 
Xp HEDY X;(X%,..., Xi, -s X p41) 
(p21). When V is a topological algebra and 
the elements of ¥(M) act on V as derivations, 
the exterior multiplication of cochains induces 
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a graded algebra structure in H*(X(M), V). 
When V=R is the trivial ¥(M)-module, 

then H*(X(M))= H*(X(M), R) is called the 
Gel’fand-Fuks cohomology of M. Gel’fand and 
Fuks proved that, for any compact oriented 
manifold M, we have dim H?(X¥(M))< +œ 
for all p and H?(X(M))=0 for 1<p<n(n= 
dim M). For example, if M is the circle S', 
then the algebra H*(X(S')) is generated by 
two generators «e H?, Be H?, which are expli- 
citly described as cochains in the following 


way: 
d d 
a(s di” |, 


d d d ff Ff 
Ary IF ra= g g g'\at. 
Si kK k h” 


f’ f” 
t n 


dt, 
g 


Localization of the concept of Gel’fand- 
Fuks cohomology naturally yields the co- 
homology of formal vector fields. Here a 
formal vector field means the expression 
E fulX1, -+ X,)0/0x,, Ja being formal power 
series in X,,...,X,- The set of all the formal 
vector fields forms a Lie algebra a, and the 
continuous cohomology of a, with respect 
to the Krull topology is denoted by H*(a,). 
Let By be the universal tclassifying space of 
the unitary group U(n), let (By)5, be its 2n- 
skeleton, and let P,,, be the canonical principal 
U(n)-bundle restricted to (By),,. Then there is 
an algebra isomorphism H*(a,)= H*(P>,,; R). 
This cohomology and its variants play im- 
portant roles in the theory of foliations. 

An important subcomplex @ {C}, d} of 
@ {C?’,d}, the diagonal complex, is defined as 
CkK={peC?|o(X,,...,X,)=0 if supp X, N 
... supp X,= Ø}. Here, supp X; denotes 
the support of X;,, that is, {x | X,(x) #0}. Let 
Py be the principal U(n)-bundle associated 
with the complexified tangent bundle of M. 
U(n) acts freely on the product P,, x P,,, and 
the quotient space Ey = Py x Pa„/U (n) is a 
fiber bundle over M with fiber P,,,. Then, if 
M is a compact oriented manifold, the co- 
homology H¥*(X(M)) of the diagonal com- 
plex is completely determined by the isomor- 
phisms H?(¥(M))=H?'"(E,,;R) for all p. In 
particular, if all the Pontryagin classes of M 
vanish, then Hf(¥(M))=Dj4;=p+nH'(M; R) @ 
Hi(a,). 

The Gel’fand-Fuks cohomology has a topo- 
logical interpretation: H*(X(M))= H* (T(E m), 
R) as graded algebras, where T(E) denotes 
the space of all the continuous cross-sections 
of E,,>M with the compact open topology. 
Moreover the differential graded algebra 
C*(X(M)) has a model in the sense of Sullivan 
constructed purely algebraically from a model 
of the de Rham algebra of M and the Pon- 
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tryagin classes of M. This model shows in 
particular that H*(X(M)) is not necessarily 
finitely generated as a graded algebra. 

The cohomology theory of X(M) in the case 
where the representation is nontrivial has also 
been investigated. The natural representa- 
tion on C“(M) is a typical example. There is 
also a topological interpretation: H*(X(M), 
C%”(M))= H*(Yy,R), where Yy is the fiber 
product of the evaluation mapping M x T(E) 
— Ey, and the inclusion Py GE,, correspond- 
ing to a fiber inclusion U (n) S Pzp. 
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A. First-Order Derivatives 


Let y= f(x) be a real-valued function of x 
defined on an interval F of the real line R. If for 
a fixed xjeI, the limit 

i I (Xo +h)—f(Xo) 

im ————__ 
ENET h 
exists and is finite, then f is called differenti- 
able at the point x), and the limit is the deriva- 
tive (differential coefficient or differential 
quotient) of f at the point xọ. If f is differenti- 
able at every point of a set Ac 1, then f is said 
to be differentiable on A. The function that 
assigns the derivative of f at x to xe A is called 
the derivative (or derived function) of f(x), 
which is denoted by dy/dx, y’, ý, df(x)/dx, 
(d/dx) f(x), f(x), or D, f(x). The process of 
determining f’ is known as the differentia- 
tion of f. The derivative of f at the point x, is 
written f (xo), (df/dx)(x9), D. (Xo), [dy/dx]x=x, 
etc. We say that f is right (left) differentiable or 
differentiable on the right (left) if the limit on 
the right, lim,- 40(f(Xo + 4) — f(x0))/h (the limit 
on the left, lim,- .9( (xo — A) — f(x9))/h), exists 
and is finite. This limit is called the right (left) 
derivative or derivative on the right (left) and is 
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denoted by Dy’ f(x9) or f4 (Xo)(Dy f(Xo) or 
f- (xo)). For instance, if f is defined on I= 
[a, b), then D, f(a) is identical to D,' f(a). 


B. Differentials 


In the definitions given above, neither dx nor 
dy in dy/dx has a meaning by itself. In the 
following, however, we give a definition of dx 
and dy, using the concept of increment, so that 
dy = f'(x)dx. Let Ay denote the increment 
J(x+Ax)— f(x) of f corresponding to the in- 
crement Ax of x. Suppose that f(x) is differen- 
tiable at x. We set Ay/Ax = f'(x)+ £. Then we 
have lim,, 9 ¢=0. This can be written utiliz- 
ing fLandau’s notation as Ay= f'(x)Ax + 
o(|Ax|) (Ax-+0); in other words, Ay is the 

sum of two terms, of which the first, f’(x)Ax, is 
proportional to Ax and the second is an tin- 
finitesimal of an order higher than Ax. Here 
the principal part f‘(x)Ax of Ay is called the 
differential of y= f(x) and is denoted by dy. 
The differential dy thus defined is a function of 
two independent variables x and Ax. In partic- 
ular, if f(x) =x, from the definition we get 

dx =1-Ax=Ax. Hence, in general, we have 
dy=f'(x)dx and f’(x)=dy/dx. 

With respect to the rectangular coordi- 
nates (x, y), the straight line with slope f’(xo) 
through a point (xo, f(xo)) on the graph of y= 
J (x) is the ttangent line of the graph at the 
point (xo, f(xo)). A function is continuous at a 
point where the function is differentiable, but 
the converse of this proposition does not hold. 
In fact, Weierstrass showed that the function 
defined by the infinite series 1°) a" cos b" xx, 
where 0<a< 1 and b is an odd integer with 
ab > (3/2)n+ 1, is continuous everywhere and 
nowhere differentiable on (—0o, 00) [3]. 


C. Differentiation 


For two differentiable functions f and g de- 
fined on the interval J, the following formulas 
hold: (af + Bg)’ = af’ + Bg’, where œ and B are 
constants; (fg) = f'g + fg’; and (f/g) =( f'g — 
fg')/g? (at every point where g #0). Let y= 

f(x) be a function of x defined on the interval 
(a,b) and x= ¢(t) a function of t defined on 

(a, P). If o(t)e(a, b) whenever te(a, £), then the 
composite function y= F(t)= f(g(t))=(f o @)(t) 
is well defined. Assume further that f and ọ 
are differentiable on (a, b) and (a, f), respectively. 
Then the composite function F(t)=(f o @)(t) is 
differentiable on (a, £), and we have the chain 
rule, F'(t)=f'(x)e(t) (x= ¢(0), or dy/dt = 
(dy/dx)(dx/dt). Assume that a function y= 

f(x) is tstrictly increasing or decreasing and 
differentiable at xo. If furthermore we have 
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f'(x) 40, then the inverse function x = f !(y) is 
also differentiable at yọ (= f(x,)) and satisfies 
(dx/dy),-, (dy/dx),-., = 1. However, if f'(xo)= 
0, then even though f(y) is not differenti- 


able at yo, limy,.o(f (yo + 4y) — f" (Yo))/Ay 
exists and is +00 or —o0. 


D. Higher-Order Derivatives 


If the derivative f'(x) of a function y= f(x) is 
again differentiable on Z, then (f'(x)Y = f(x) is 
well defined as a function of x on J. In general, 
if f(x) is differentiable on I, then f(x) is 
called n-times differentiable on J, and the nth 
derivative (or nth derived function) f(x) of 
f(x) is defined by f(x) =(f)(x))’ and is 
also denoted by d"y/dx" or D™ y. The nth 
derivative for n>2 is called a higher-order 
derivative. 

Concerning the nth derivative of the product 
of two functions, Leibniz’s formula holds: 


d= sogea ay 


("a a 


Analogous to dy = y'Ax, which is a function of 
x and Ax, we can define d? y in the notation 

d?’ y/dx? by d? y=d(dy)=d(y'Ax) =(y'Ax) Ax 

= y"Ax?. Since Ax = dx, it follows from the 
above that d? y= y” dx?. Similarly, d"y = y dx" 
and is called the nth differential (or differential 
of nth order) of f(x). 


E. The Mean Value Theorem 


Let f(x) be a continuous function defined on 
[a,b], and suppose that for every point x, on 
(a,b) there exists a limit lim, .9(f(x9 +h) — 
J (Xo))/h, which may be infinite. (These condi- 
tions are satisfied if f(x) is differentiable on 
[a, b].) Then there exists a point č such that 
LOL) = 51, a<é<hb. 

—a 
This proposition is called the mean value 
theorem. A special case of the theorem under 
the further condition that f(a)= f(b) is called 
Rolle’s theorem. If we put b—a=h, €=a+ 06h, 
then the conclusion of the theorem may be 
written as f(a+h)=f(a)+h-f'(a+@h) (0< 
0<1). 

This theorem implies the following: Let 
f(x) be a function as in the hypothesis of the 
mean value theorem, and assume further that 
A< f'(x)<B holds for all x witha<x<b. 
Then A <(f(b)—f(a))(b — a) < B. (French 
mathematicians sometimes call this the “théo- 
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rème des accroissements finis.”) Using the 
mean value theorem, the following theorem 
can be proved: If f(x) is continuous on [a,b] 
and f'(x) exists and is positive on (a, b), then 
f(b)> f(a). Accordingly, if f'(x)>0 at every 
point x of an interval J, then f(x) is tstrictly 
increasing on that interval. (If f‘(x)<0 on J, 
then f is strictly decreasing.) The converse of 
the previous statement does not always hold 
(f'(x)=x? is a counterexample, since f'(0)=0). 
Furthermore, from the mean value theorem 
it follows that if f’(x)=0 everywhere in an 
interval, then f(x) is constant on that inter- 
val. Consequently, two functions with the 
same derivative on an interval differ only by 
a constant. 

Suppose that f(x) is n-times differentiable on 
an open interval J. For a fixed ae I and an 
arbitrary xe I, we put 


fep= sas? (x—a)+... 
f"V(a) ot 
Em A +R,- 


Then R, =f (2)(x—a)"/n! for some é be- 
tween a and x. This is called Taylor’s formula, 
where R, is the remainder of the nth order 
given by Lagrange. We also have several other 
forms for R, (= Appendix A, Table 9). If 
f'”(x) is continuous at x =a, then č>a as 
x—a, and accordingly, f ®(€)—> f(a). Hence 
f(x) = Ek- f M(@)/k!)(x — a) + 0((x — a)"). If 
f(x) is continuous at x =a, then, by Tay- 
lor’s formula, the value of the polynomial 

E; -o( f(@/k!)(x — a) can be considered an 
approximate value of f(x) for x near a. This 
approximation is called the nth approximation 
of f(x), and its error is given by |R,,,,|. By 
applying this formula, it is sometimes possible 
to calculate a limit such as A=lim,.., f(x)/g(x), 
where f(x)>0 and g(x)-+0 as x->a. For in- 
stance, if f'(x) and g'(x) are both continuous at 
x =a and g(a) #0, then by taking the first 
approximations of f(x) and g(x) it is easily 
seen that A = f’(a)/g’(a). A limit of this type is 
often called a limit of an indeterminate form 
0/0. Similarly, we can calculate limits of such 
indeterminate forms as 0- œ or 0” (for limits 
of indeterminate forms — [6]). 


F. Partial Derivatives 


Let w= f(x, y,...,Z) be a real-valued function of 
n independent real variables x, y,...,z defined 
on a domain G contained in n-dimensional 
Euclidean space R”. We obtain a function of a 
single variable from f by keeping n—1 vari- 
ables (say, (x, p,...,Z), Le., all except y) fixed. If 


106 G 
Differential Calculus 


such a function g(y)= f(X9, y, ..., Zo) is dif- 
ferentiable, that is, if 


goj- lim 220+ 4¥)— V0) 
Oo” by70 Ay 
= Yim S02 PoF AY, ---» 20) S a Yor +70) 
Ay70 Ay 


exists and is finite, then f is called partially 
differentiable with respect to y at (Xo, Vo, 
.+,Zo), and the derivative is called the partial 
derivative (or partial differential coefficient) 
of f(x, y,...,2) with respect to y at (Xo, Vo, 
---» Zo). It is denoted by [éw/0y],-.,, 
(C/0Y) f (X0; Yo» «++» Zo), Sy(Xo2 Yo» ---> Zo), OF 
Dy f (Xo, Yo» +- Zo), etc. We usually assume that 
the point (x, y, ..., z) where partial derivatives 
are considered is an tinterior point of the 
*domain of the function. Since in a space of 
dimension higher than 1, the tboundary of a 
domain may be complicated, partial deriva- 
tives at boundary points are usually not con- 
sidered. If a function f possesses a partial de- 
rivative with respect to x at every point of an 
open set G, then f, is a function on G and is 
called a partial derivative of f with respect to 
x. The process of determining partial deriva- 
tives of f is called the partial differentiation of 


Í. 


very Z=ZQ? 


G. Total Differential 


Let w= f(x, y, ..., z) be a function defined 

on a domain G, and let P=(x, y,...,z) be an 
interior point of the domain G of a function 
w= f(x, y,...,Z). Put Aw= f(x9 + Ax, Vo + 

Ay, ...,Z9 +Az)— f(X9, Yos ---, Zo). If there 

exist constants «, ß, ...,y such that Aw = 
aAx + BAy+...+yAz+o(p)(p—0), where 

p= /Ax*+Ay?+...4+Az?, then f is called 
totally differentiable (or differentiable in the 
sense of Stolz) at P. In this case, f is partially 
differentiable at P with respect to each of the 
variables x, y,...,2z, and «= f,(X9, Yo» ---» Zo), B 
=fy(X03 Yos -+s Zoh =- Y= J2(X0; Yo» «++» Zo). The 
principal part of Aw as p>0 is aAx+ BAy+... 
+ Az, which is called the total differential of w 
at P. If f is totally differentiable at every point 
of G, then f is said to be totally differentiable 
on G. The total differential of w is denoted by 
dw. Since the total differentials of x, y,...,z are 
dx = Ax, dy=Ay,...,dz= Az, respectively, we 
can write dw= f.dx+ f,dy+...+f,dz; and 

dw is a function of independent variables x, 

Y, ...,2, dx, dy, ...,dz. The total differentiability 
of f implies the continuity of f, whereas the 
partial differentiability of f with respect to 
each variable does not imply that f is continu- 
ous. (Example: Define f(x, y)=xy/(x? + y?) 

for (x, y)#(0, 0), and (0, 0)=0; then the func- 
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tion f is not continuous at (0,0), even though 
both f, and f, exist at (0,0).) The function f is 
totally differentiable on G if all f,, fp, -3 fz 
exist and are continuous on G, or, more weak- 
ly, if all fe, fys --- > fz exist and, with possibly 
one exception, are continuous. Suppose that 
w= f(x, y) is totally differentiable at (x, y), and 
let Ax = pcos, Ay=psin0. As p—>0, for a 
fixed 0 there exists the limit lim, ..(Aw/p) = 
SAX, y)cos 0+ f,(x, y)sin 0. This limit is called 
the directional derivative in the direction @ at 
(x, y). The partial derivatives f, and f, are 
special cases of the directional derivative for 
0=0 and z/2, respectively. Suppose that we 
are given a curve lying in the tinterior of the 
domain of f and that the curve passes through 
the point (x, y), where the curve is differenti- 
able. Then the partial derivative of w= f(x, y) 
in the direction of the normal to the curve at 
(x, y) is called the normal derivative of w at the 
point (x, y) on the curve and is denoted by 
éw/én. Analogous definitions and notations 
have been introduced for functions of more 
than two variables. 

To see the geometric significance of the total 
differentiability of w= f(x, y), we consider the 
graph of the function w= f(x, y) and a point 
(a, b, f(a, b)) on the graph. Then the plane 
represented by w— f(a, b)=a(x—a)+ B(y—b) is 
the ‘tangent plane to the surface at (a, b, f(a, b)) 
if and only if «= f,(a,b) and B= f,(a, b). The 
existence of f, and f, depends on the choice of 
coordinate axes, while the total differentia- 
bility of f does not. 


H. Higher-Order Partial Derivatives 


Suppose that a partial derivative of a function 
w= f(x, y,...,z) defined on an open set G again 
admits partial differentiation. The latter partial 
derivative is called a second-order partial 
derivative of f. We can similarly define the nth 
order partial derivatives. Higher-order partial 
derivatives are denoted as follows: 


6 (éw\ ĉ&w 
( ) =f Acca Z), 


ex \ ax Ox? 


o {ow 8? w f ) 
—|— |=— = s Pieas Z) 
ðy\ðx) axay 2Y 


ô (&w ow ft ) 
= Sf uN Vane cece 
Ox\dxdy] dxdydx “PO” 


In general, fx, and f,, are not equal. (Peano’s 
example: Let f(x, y) = xy(x* — y*)/(x? + y?) for 
(x, y) #(0,0) and f(0,0)=0. Then f= —1, 
fix = 1 at (0,0).) However, if both fe, and fx 
are continuous on an open set G’, then they 
coincide in G’. Furthermore, if f., fp, and f,, 
exist in a neighborhood U of a point P belong- 
ing to the domain of f and f,, is continuous 
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at P, then f,, exists at P and f,,=/,, (H. A. 
Schwarz). If f, and f, exist in U and are totally 
differentiable at P, then f,,=/f,. at P (W. H. 
ees Similarly, if the partial derivatives of 

E n = Í yx... Hence we can change thie’ 
order of differentiation if all the derivatives 
concerned are continuous. 


I. Composite Functions of Several Variables 


Let w be a function of x, y,...,z, and let each 
xX, y,...,Z be a function of t. Suppose that the 
range of (x(t), y(t), ..., z(t)) is contained in the 
domain of w. Then w is a function of t. If fur- 
ther w is totally differentiable and x, y,...,z 
are all differentiable, then w as a function of t 
is differentiable, and we have 


dw éwdx 5 OY 
dt dx dt 


TA ae dz 
widi ôzdt ` 


If partial derivatives of order >2 are totally 
differentiable, then d? w/dt?, d>w/dt?, ... are 
obtained by repeating the above procedure. A 
similar consideration is valid when x, y, ...,2Z 
are functions of several variables. 


J. Taylor’s Formula for Functions of Several 
Variables 


Suppose that f(x, y) is defined on an open 
domain G, f(x, y) has continuous partial de- 
rivatives of orders up to n, and the line seg- 
ment (a+(x—a)t,b+(y—b)t) (O<t<l)is 
contained in the domain G. Then there exists a 
number @ (0<@< 1) such that 


f(x,y) 
é fa) 

= f(a, b+ (0-92 + fla b) 
x oy 


1 ô a\? 
tzr aaa, f(a, b)+... 


1 
Faiy 


1 0 8 \" 
+(e antl 7 
x f(a+(x—a)0, b+(y—b)0), 


ô ĝ n~1 
(1-0 2+0-2 ) flab) 


where, for instance, the third term ((x — a): 
(6/6x) + (y — b)(8/ðy)} f(a, b) means 

(x —a)?(6*f/0x*) (a, b) + 2(x —a){y— b): 

(6? f/Gx0y)(a, b)+ (y —b)?(0*f/0y’) (a, b), 

with (67f/0x”) (a, b), (67f/0xdy)(a, b), and 

(67 f/dy7)(a, b) denoting the values of (3?f/ðx?), 
(d7f/dxdy), and (6*f/éy7) at (a, b), respectively. 
The displayed formula is called Taylor’s for- 
mula for a function of two variables. A similar 
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formula ts valid for a function of n variables 
(n> 3). As in the case of functions of one vari- 
able, we can derive approximation formulas 
for f from Taylor’s formula. 


K. Classes of Functions 


If all the partial derivatives of order n of {(P) 
are continuous on an open set G, then f is said 
to be a function of class C” (or n-times contin- 
uously differentiable) on G. The set of all n- 
times continuously differentiable functions is 
denoted by C” (n=1,2,...). A continuous 
function is of class C°. A function of class C! is 
also called a smooth function. It is obvious 
that C° >Ct>C?°>.... A partial derivative of 
order r <s of a function belonging to class C* 
does not depend on the order of the differenti- 
ation. A function belonging to C® = £, C” is 
said to be of class C® or infinitely differenti- 
able. We sometimes say that a function has a 
certain “nice” property or is “well behaved” if 
it belongs to some C’ (r>1). 


Let w= f(x, y,...,z) be a function defined on 
an open set G in R” and P=(a,b,...,c)eG. If 
f(x,y, ...,z)= f(a, b, ...,6) 


(z= c)" 


ime 


holds in some open neighborhood U of P, 
where the right-hand side of the equality is an 
absolutely convergent series, then f is said to 
be real analytic at P. In this case, f is r-times 
differentiable at P for any r, and we have 


Sera =ayi(y= by” tee 


rir! ra! 
(ritr +... +r)! 
rtrt- +r 
7 A 
Ox" y"... 0z" 


le A —4 
Tiarn 


(a, b, ...,). 


If fis real analytic at every point P of the 
domain G, then f is called a real analytic func- 
tion on G. Sometimes, a real analytic function 
is called a function of class C®. A real analytic 
function belongs to C”, but the converse is not 
true (— 58 C”-Functions and Quasi-Analytic 
Functions E). 


L. Extrema 


Let f be a real-valued function defined on a 
domain G in an n-dimensional Euclidean 
space R” that has the point P, in its interor. 
If there exists a neighborhood U of P, such 
that for every point P (#P,) of U we have 
S(P)>f(Po), then we say that f has a relative 
minimum at P), and f(P,) is a relative mini- 
mum of f. Replacing > by <, we obtain the 
definition of a relative maximum. f{(P,) is 
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called a relative extremum if it is either a rela- 
tive maximum or a relative minimum. 
To find relative extrema, the following facts 


concerning the sign of the derivative are useful. 


Suppose that a function of a single variable is 
differentiable on an interval J. Then we have 
the following: (1) If f has a relative extremum 
at an interior point xo of J, then f'(xọ)=0. 

(2) If f’(xo)=0 and f(x) changes its sign at xo 
from positive (negative) to negative (positive), 
then f has a relative maximum (minimum) at 
Xo. (3) If f '(xo)=0 and f is twice differentiable 
on some neighborhood of xo, then f has a 
relative maximum or minimum according as 
S’'(Xo) <9 or >0. If f"(x9)=0, then nothing 
definite can be concluded about a relative 
extremum of f at xo. In general, if there exists 
a neighborhood of x, in which f is r-times 
differentiable (r is even) and f” is continu- 
ous, and if f'(x) =f" (Xo) =... =f (x9) =0, 
f(x) >0 (or <0), then f has a relative mini- 
mum (maximum) at x). On the other hand, if 
this condition holds with odd r, then f(xọ) is 
not a relative extremum. If f’(x))=0, then 

J (Xo) is called a stationary value of f. 

If a function f on n variables x, y,...,z has a 
relative extremum at (Xo, Yo, ---, Zo), then we 
have f.(Xo, Yo» +++» Zo) =0, (Xo, Yo» +++» 20) = 
0, ...,f,(X93 Yos «--»Z9) =0, provided that the 
partial derivatives of f exist. Assume that for a 
function f of class C? of two variables x and y, 
we have f,(X9, Yo) =0 and f,(Xo, Yo) =9, and let 
ô= frx(Xo, Vo) fyy(Xo, Yo) — fi(Xo. Yo). Then we 
have the following: (1) If 6>0, then according 
as fex(Xos Yo) <Q or >0, f has a relative maxi- 
mum or minimum at (xo, Yo). (2) If ô <0, then 


f does not have a relative extremum at (xo, yo). 


(3) If 6=0, then without further information 
nothing definite can be said about a relative 
extremum of f at the point. 

Let x,,...,x, be independent variables. If a 
function f of variables x,,...,x,, has a relative 
extremum at a point P)=(x?,...,x°), then 
Ji= fa(Po)=0 (i= 1,...,n), provided that all 
the partial derivatives of f exist. In general, a 
point P, where f is totally differentiable and 
these conditions are satisfied is called a critical 
point of f. The value f(P,) at a critical point is 
called a stationary value. If further f is of class 
C?, then consider a tquadratic form of n 
variables Q=Q(X,,...,X.)= Din SueXiXns 
where fi, = fsix (Po). Suppose that | fil 40. 
Then according to whether Q is tpositive 
definite, tnegative definite, or findefinite, f 
has a relative minimum, relative maximum, or 
no relative extremum at P}. If | f;,|=0, then 
nothing can be said in general. A critical point 
P of f is said to be nondegenerate if | f| 40 
and degenerate if | f;,|=0. 

We can also apply the method of differenti- 
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ation of timplicit functions to find relative 
extrema of functions defined implicitly. Given 
functions ~,,...,9,,(m<n), the problem of 
finding a relative extremum of f(x,,...,x,,) 
under the condition that g,(x,,...,x,)=0, 

<-s Pm(X1, +--+, Xp) =0 is called the problem 
of finding a conditional relative extremum. 
This problem can be reduced to the problem 
of finding a relative extremum of an implicit 
function. Actually, if the functions f, ¢,, 

--, Øm are of class C! and the tJacobian 
O(@ 15 +++ Pm)/O(Xp—m41> +++» Xn) does not vanish 
in the domain considered, then y; =X,_m41; 

-s Ym =X, can be regarded as implicit func- 
tions of x,,...,x, (=n —m). Hence we can set 


PIX FEV 5 oes Ym =J * (X4, ---, Xi). Then f 
has a relative extremum at (x°, ..., x?) under 
the condition g, =...=@,,=0 if and only if 


f* has a relative extremum at P) =(x9, ..., x9). 
The latter condition implies that all df */ôx; 
(j=1,...,/) vanish at P), which holds if and 
only if for arbitrary constants /,,...,4,, the 
function F(x,,...,x,)=f+4,@, +... +4mOm 
satisfies 0F/Ox;=0 (i=1,...,n), and further p; 
=0,...,Pm=0 at (x?,...,x®). From this system 
of equations we can often find the values of 
x?,...,x°. This method of finding conditional 
relative extrema is called Lagrange’s method of 
indeterminate coefficients or the method of 
Lagrange multipliers (— 208 Implicit Func- 
tions; 216 Integral Calculus H; 379 Series H). 
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A. Ordinary Differential Equations 


It was Galileo who found that the accelera- 
tion of a falling body is a constant and thence 
derived his law of a falling body x(t)=gt?/2 as 
what we would now view as a solution of the 
differential equation x” (t)=g, where x(t) de- 
notes the distance the body has fallen during 
the time interval t and g is the constant gravi- 
tational acceleration. This pioneering work 
may be regarded as the first example of solu- 
tion of a differential equation. Also, the tequa- 
tions of motion, proposed by I. tNewton as 
the mathematical formulation of the law of 
motion, including Galileo’s law as a special 
case, are differential equations of the second 
order. Thus differential equations appeared, 
simultaneously with differential and integral 
calculus, as an indispensable tool for the uni- 
fied and concise expression of the laws of 
nature. Such laws are generally called dif- 
ferential laws. 

Newton completely solved the equations of 
the ttwo-body problem proposed by himself; 
G. W. tLeibniz also succeeded in solving many 
simple differential equations. 

In the 18th century, many mathematicians, 
such as the tBernoullis, A. C. Clairaut, J. F. 
Riccati, L. Euler, and J. L. tLagrange, at- 
tacked and solved differential equations of 
various types independently. In that period, 
the emphasis was on solution by quadrature, 
that is, applying to telementary functions a 
finite number of algebraic operations, trans- 
formations of variables, and indefinite inte- 
grations. It was toward the end of the 18th 
century that new methods, such as integration 
by infinite series, came to be discussed. A 
method of variation of constants for the solu- 
tion of linear ordinary differential equations 
was invented by Lagrange in 1775. At the 
beginning of the 19th century, C. F. tGauss 
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initiated the study of differential equations 
satisfied by thypergeometric series. 

The problem of existence of solutions, which 
supplies a foundation of modern differential 
equation theory, was first treated by A. L. 
tCauchy. His proof of the existence theorem 
was later improved by R. L. Lipschitz (1869). 

Pioneers in the function-theoretic treatment 
of differential equations were C. A. A. Briot and 
J. C. Bouquet, who investigated the singular 
points of a function defined by an analytic 
differential equation. Also, B. Riemann pro- 
posed a new viewpoint which influenced L. 
Fuchs in his development of the theory of 
linear ordinary differential equations in the 
complex domain (1865). Works of A. M. Le- 
gendre on telliptic functions and of H. *Poin- 
caré on tautomorphic functions should also be 
mentioned in this connection. 

After the Cauchy-Lipschitz existence the- 
orem for the equation y'= f(x, y) was known, 
efforts were directed toward weakening the 
conditions imposed on f(x, y). G. Peano first 
succeeded in giving a proof under the continu- 
ity assumption only (1890), and his results 
were sharpened by O. Perron (1915). 

Regarding the uniqueness of solutions of 


tinitial value problems, there are various re- 


sults by W. F. Osgood (1898), Perron (1925), 
and many Japanese mathematicians. In the 
course of this work the necessary and sufficient 
condition for uniqueness was successfully 
formulated in a concise form (— 316 Ordi- 
nary Differential Equations (Initial Value 
Problems)). 

For linear differential equations with peri- 
odic coefficients, investigations were carried 
out by C. Hermite (1877), E. Picard (1881), 

G. Floquet (1883), G. W. Hill (1886), and 
others. For instance, solutions satisfying y(x + 
w) = Ay(x) were found to exist, where w is 

the period of the coefficients. Analogous re- 
sults followed in the case of doubly periodic 
coefficients. 

Techniques of factorization of linear dif- 
ferential equations developed by G. Frobenius 
(1873) and E. Landau (1920) should also be 
noted. Picard (1883), J. Drach (1898), and 
E. Vessiot (1903, 1904) established a remark- 
able result on the solvability (in the sense of 
solution by quadrature) of linear differential 
equations, successfully extending the tGalois 
theory in this new direction. 

The concept of fasymptotic series, which in 
a sense approximate the solution of differential 
equations, was introduced by Poincaré (1886) 
and extended by M. A. Lyapunov (1892), J. C. 
C. Kneser (1896), J. Horn (1897), C. E. Love 
(1914), and others. Poincaré was also the 
founder of topological methods in differential 
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equation theory, and his ideas were developed 
extensively by I. Bendixson (1900), Perron 
(1922, 1923), G. D. Birkhoff, and others (— 126 
Dynamical Systems). 

In 1890, Picard invented an ingenious tech- 
nique of tsuccessive approximation for the 
proof of existence theorems, and his technique 
is now widely used in every application of 
functional equations. The technique of reduc- 
ing linear differential equations to linear 
tintegral equations of Volterra type was also 
developed. 

On the tboundary value problems and 
teigenvalue problems that appear in many 
areas of physics, there was extensive research 
by mathematicians such as J. C. F. Sturm 
(1836), J. Liouville, L. Tonelli, Picard, M. 
Bôcher (1898, 1921), Birkhoff (1901, 1911), 
and others. In this connection the problem 
arises of expanding a given function by an 
torthogonal system of functions obtained as 
teigenfunctions of a given boundary value 
problem. Those problems were brought into 
unified form by D. tHilbert (1904) in his theory 
of tintegral equations. Subsequently boundary 
value problems of ordinary and partial dif- 
ferential equations came to be discussed in this 
framework. 

Finally, it should be mentioned that the 
tcalculus of variations created by Euler and 
Lagrange gave rise to the study of a certain 
class of differential equations bearing the name 
of Euler (— 46 Calculus of Variations). 


B. Partial Differential Equations 


The origin of partial differential equations can 
be traced back to the study of hydrodynamic 
problems by J. d’Alembert (1744) and Euler. 
However, perhaps Lagrange and P. S. tLa- 
place were the first to investigate the general 
theory. Subsequently, during the 18th and 
19th centuries, it was developed by G. Monge, 
A.-M. Ampère, J. F. Pfaff, C. G. tJacobi, 
Cauchy, S. Lie, and many other mathema- 
ticians. The fundamental existence theorem 
for the initial value problem, now called the 
Cauchy-Kovalevskaya theorem, was proved 
by S. Kovalevskaya in 1875 (— 321 Partial 
Differential Equations (Initial Value Problems) 
B). 

Because of their close connection with prob- 
lems of physics, linear equations of the second 
order have been a chief object of research. Up 
to the 19th century, classification into ‘elliptic, 
thyperbolic, and tparabolic types and the 
study of boundary and initial value problems 
for each of these types constituted the main 
part of the theory. 

In the 20th century, more complicated 
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problems—?nonlinear problems appearing in 
the study of viscous or compressible fluids, or 
the study of equations of tmixed type in con- 
nection with supersonic flow—have emerged 
as important topics; and the newly developed 
techniques of functional analysis have brought 
about remarkable changes. Especially in the 
study of the tSchrédinger equations of quan- 
tum mechanics and of more general tevolution 
equations , this method has proved to be a 
powerful tool. 

Finally, we should not fail to mention that 
the development of electronic computers has 
made it possible to obtain numerical solu- 
tions and to discover many important facts. 
*Numerical analysis is now becoming an indis- 
pensable part of the theory (— 304 Numerical 
Solution of Partial Differential Equations). 
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A. Introduction 


The study of differential games arose from the 
study of pursuit and evasion problems and 
various tactical problems. The first work was 
done by R. P. Isaacs [1] in a series of RAND 
Corporation memoranda that appeared in 
1954. He applied to many illustrative examples 
the method of Hamilton-Jacobi differential 
equations (— eq. (4) below). The heuristic 
results of Isaacs were made rigorous by W. H. 
Fleming [2], L. D. Berkovitz [3], A. Friedman 
[4,5], and others. 


B. Zero-Sum Two-Person Games 


Suppose that there are two antagonists, each 
exerting partial control over the state of a 
system. One wishes to maximize a given payoff 
that is a functional of the state and the control 
exerted, while the other wishes to minimize 
this payoff. Let the state of a differential game 
at time t be represented by an n-dimensional 
vector x(t)e R”. In a zero-sum differential game 
between two players I and II, we are given a 
system of n differential equations 


dx/dt = f(t, x, u, v) (1) 


with an initial condition x(t)=ée¢R", where 
ue R? is chosen at each instant of time by 
player I and ve R? is chosen by player II. We 
assume that the function f(t, x,u, v) is con- 
tinuous in t and continuously differentiable 
on the entire (x, u, v)-space. 

It is usually assumed that both players 
know the present state of the game and that 
they know how the game proceeds; that is, 
they know the system (1). Each player can take 
the state of the game into account in making 
his choice. Thus player I can let his choice of u 
be governed by a vector function u(t, x) defined 
on D, where D c {[0, œ) x R” is a fixed region of 
the (t, x)-space. Similarly, player II can let his 
choice of v be governed by a vector function 
v(t, x) defined on D. 

A finite collection of subregions D,,..., D, of 
a region D is said to constitute a decomposi- 
tion of D whenever the following conditions 
hold: (i) Each D; (i=1,...,7) is connected and 
has a piecewise smooth boundary; (i1) D;N 
D;=@ if i#j. A function defined on D is said to 
be piecewise C! in x on D if there is a decom- 
position of D such that on each D; the function 
and all its derivatives with respect to x are 
continuous in (t, x) on D;. Let U and V be 
fixed closed subsets of R?” and R4, respectively. 
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Let S, denote the class of functions u(t, x) that 
are piecewise C! in x on D and have their 
range in U. Similarly, let S, denote the class of 
functions v(t, x) that are piecewise C! in x on D 
and have their range in V. 

Let weS, and veS,, and consider the dif- 
ferential equation 


dx/dt = f(t, x, u(t, x), v(t, x)), (2) 
subject to the initial condition 
x(t)=6¢. (3) 


We say that a pair (u, v}e S, x S, is playable 
if for every (t, č) in D every solution of (2) 
satisfying (3) stays in D and reaches a terminal 
manifold F in finite time, where F is a smooth 
manifold contained in D. Let O,cS,, Quc S, 
be the maximal pair of subclasses such that 
each pair (u, v)e Q, x Q, is playable. We call the 
functions in Q, and Q, the strategies for the 
players. 

For each strategy pair (u, v) we can define a 
functional 


I(t, ¢, u,v) 


= g(ty,x(t,)) + | ” h(t, x(0), u(t, x(0),o(t, x(0))) dt 


to 


where x(t) is a solution of (2) and (3) and ¢, is 
the first time that (t, x(t)) reaches the terminal 
manifold F. The functional J is called the 
payoff. 

Let (u*, v*)EQ, x Q, be a strategy pair. 
Suppose that for any ue Q, and veQ,, the 
inequalities 


I(t, 6, u, v*) < J (t, Č, u*, v*) < J (t, č, u*, v) 


hold for all (t, č)e D. We say that (u*,v*} is a 
saddle point relative to the classes Q, and Q,. 
The function 


Wt, x)=J(t, x, u*, v*) 


defined on D is called the value function of 
the game. Berkovitz [3] proved that the 
value function W(t, x) is continuous on D and 
continuously differentiable on each D; and 
satisfies 


max [A(t, x, u,v") + Welt, x) x, u, 07) J 
=min [h(t, x, u*, v) + Wt, x) f(t, x, u*, v)] 


=h(t, x, u*,v*)+ W(t, x) f(t, x, u*, v*) 
= — W(t, x). (4) 


Equation (4) is called the Hamilton-Jacobi 
equation. 

Let x*(t; t, č) be the optimal trajectory corre- 
sponding to the saddle-point strategies (u*, v*) 
and resulting from an initial point (t, č)e D. 
Then there exists an n-dimensional continuous 
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vector function A(t; t, č) such that the following 
hold [3]: 

(i) The functions x* and å satisfy the system 
of differential equations 


dx/dt = f(t, x, u*(t, x), v*(t, x)), 
dd/dt = — H(t, x, A, u*(t, x), v*(t, x)), 
where 
H(t, x, A,u, v)=h(t, x, u,v) + Af (t, x, u, v). 
(ii) If x = x*(t; t, č), t2 t, then 
W,(t,x)= A(t, 8). 
(iii) At t=t,, the transversality condition 


ôT dg0T dgGX ðX 


tc =0 
ðc tös Gx do Co 


holds, where the terminal manifold F is given 
parametrically by the relations 


t=T(o), x=X(o), 


o ranging over a cube in some finite- 
dimensional Euclidean space. 
(iv) For all t<t<t,, 


max min H(t, x*(t), A(t), u, v) 
ueU veV 


=minmax H(t, x*(t), A(t), u, v) 
veV uc 


= H(t, x*(t), A(t), u*(t, x*(t)), v*(t, x*(t))). 


P. Varaiya and J. Lin [6] and Friedman 
[4,5] have defined certain special classes of 
differential games, and have shown that under 
their definitions the games have nonzero value 
functions. 


C. N-Person Differential Games 


In a differential game between many players, 
the state vector x(t)e R” is governed by 


dxjdt=f(t,x,Uy,...,Uy)  x(t)=č, 


where u;e R”! is chosen at each instant of time 
by player i. Each u; is constrained to lie ina 
fixed closed subset U; of R”:. Let S; be the class 
of functions u;(t, x) that are piecewise C! in x 
on D and have their range in U;. 

We say that an element u=(u,,..., uy) of 
S, x... x Sy is playable if, for every (t, č)e D, 
every solution of the differential equation 


dx/dt = f(t,x,u,(t,x),...,Uy(t,x)), x(t) =€, 
(5) 


stays in D and reaches a terminal manifold 

F in finite time. Let Q;<S; (i=1,...,.N) be 

the maximal subclasses such that each ele- 
ment u=(u,,...,Uy)EQ, x ... xX Qy is playable. 
We call the functions u;e Q; (i=1,..., N) the 
strategies. For each strategy N-tuple we define 
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a payoff 


Ji(t, Č, Ui, ves Uy) =Gi(t,, X(t,)) 


ti 
-| ht, x(t), u; (t, x(t)), ..., uy(t, x(t)))dt, 
to 

where x(t) is a solution of (5) and t, is the first 
time that (t, x(t)) reaches the terminal manifold 
F. Each player i is to choose his strategy u;eEQ; 
so as to maximize his own payoff J;. 

There are many definitions of “solution” for 
games involving more than two players. A 
strategy N-tuple u* =(u,*, ...,uy*) is called an 
equilibrium point for the game if the 
inequalities 
Ji(u,*,.. 


* * 
5 Uj-1*, Uj, Ua.) 


<Ji(u,*, eng ap ay se) (i= I, v N) 


hold for any u,€Q,,...,uyeQy. J. H. Case [9] 
has shown that the conclusions drawn for 
zero-sum two-person games also hold for N- 
person differential games. 
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109 (VII.1) 
Differential Geometry 


In differential geometry in the classical sense, 
we use differential calculus to study the prop- 
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erties of figures such as curves and surfaces in 
Euclidean planes or spaces. Owing to his 
studies of how to draw tangents to smooth 
plane curves, P. *Fermat is regarded as a 
pioneer in this field. Since his time, differential 
geometry of plane curves, dealing with cur- 
vature, ‘circles of curvature, tevolutes, ten- 
velopes, etc., has been developed as a part of 
calculus. Also, the field has been expanded to 
analogous studies of space curves and surfaces, 
especially of tasymptotic curves, ‘lines of cur- 
vature, tcurvatures and ‘geodesics on surfaces, 
and truled surfaces. C. F. tGauss founded the 
theory of surfaces by introducing concepts of 
the tgeometry on surfaces (Disquisitiones circa 
superficie curvas, 1827). Gauss recognized 
the importance of the intrinsic geometry of 
surfaces, and it is generally agreed that differ- 
ential geometry as it is known today was ini- 
tiated by him. Thus differential geometry came 
to occupy a firm position as a branch of 
mathematics. The influence that differential- 
geometric investigations of curves and surfaces 
have exerted upon branches of mathematics, 
physics, and engineering has been profound. 
For example, E. Beltrami discovered an in- 
timate relation between the geometry on a 
tpseudo-sphere and tnon-Euclidean geometry. 
The study of tgeodesics is a fertile topic deeply 
related to dynamics, the calculus of variations, 
and topology, on which there is excellent 
work by J. Hadamard, H. tPoincareé, P. Funk, 
G. D. Birkhoff, M. Morse, R. Bott, W. Klin- 
genberg, and M. Berger, among others. The 
theory of minimal surfaces initiated by J. L. 
Lagrange was an application of the calculus 
of variations. At the early stages of develop- 
ment, G. Monge, J. B. M. C. Meusnier, A. M. 
Legendre, O. Bonnet, B. Riemann, K. Weier- 
strass, H. A. Schwarz, Beltrami, and S. Lie 
contributed to the theory. Weierstrass and 
Schwarz established its relationship with the 
theory of functions. J. A. Plateau showed 
experimentally that tminimal surfaces can be 
realized as soap films by dipping wire in the 
form of a closed space curve into a soap solu- 
tion (1873). The Plateau problem, i.e., the prob- 
lem of proving mathematically the existence 
of a minimal surface with prescribed bound- 
ary curve, was solved by Tibor Rado in 1930 
and independently by J. Douglas in 1931. 
Although the relationship to function theory 
is lost for higher-dimensional minimal sub- 
manifolds, their study is intimately related to 
the calculus of variations and topology. 
Euclidean geometry is a geometry belonging 
to F. Klein’s Erlangen program (— 137 Erlan- 
gen Program). For other geometries in the 
sense of F. Klein we may also consider the cor- 
responding differential geometries. For instance, 
in tprojective differential geometry we study 
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by means of differential calculus the properties 
of curves and surfaces that are invariant under 
projective transformations. This subject was 
studied by E. J. Wilczynski, G. Fubini, and 
others; taffine differential geometry and tcon- 
formal differential geometry were studied by 
W. Blaschke and others (— 110 Differential 
Geometry in Specific Spaces). 

Influenced by Gauss’s geometry on surfaces, 
in his inaugural address at Gottingen in 1854 
Riemann advocated an intrinsic differential 
geometry completely independent of embed- 
dings (Uber die Hypothesen, welche der Geo- 
metrie zugrund liegen; Werke, 2nd ed. 1892, 
272-287) (— 364 Riemannian Manifolds). 
Removing the restriction to two dimensions 
and considering abstract manifolds of dimen- 
sion n, he introduced what is now known as 
the Riemannian metric; actually he considered 
the more general metrics that had formed the 
subject matter of the dissertation of P. Finsler 
in 1918 (— 152 Finsler Spaces). Riemann- 
ian geometry includes Euclidean and non- 
Euclidean geometry as special cases, and is 
important for the great influence it exerted 
on geometric ideas of the 20th century. Under 
the influence of the algebraic theory of invar- 
iants, Riemannian geometry was then studied 
as a theory of invariants of quadratic tcovar- 
iant tensors by E. B. Christoffel, C. G. Ricci, 
and others. Riemannian geometry attracted 
wide attention after A. Einstein applied it to 
the tgeneral theory of relativity in 1916. 

In the same year, T. Levi-Civita introduced 
the notion of tLevi-Civita parallelism, which 
contributed greatly to the clarification of geo- 
metric properties of Riemannian spaces. Ob- 
serving parallelism to be an affine-geometric 
concept, H. Weyl and A. S. Eddington devel- 
oped a theory of Riemannian spaces “affinely” 
based on the notion of parallelism without 
using metrical methods. Such a geometry is 
called a geometry of an affine connection (— 
80 Connections). 

Every straight line in a Euclidean space has 
the property that all tangents to the line are 
parallel. In a space with an affine connection, 
we may define a family of curves called tpaths 
as an analog of straight lines. Such curves are 
solutions of a system of ordinary differential 
equations of the second order of a certain type. 
Coefficients of such differential equations 
determine a parallelism and hence an affine 
connection. H. Weyl discovered transforma- 
tions of coefficients that leave the family of 
paths invariant as a whole, namely, projective 
transformations of an affine connection. A 
geometry that aims to study properties of 
paths or affine connections that are invariant 
under these transformations is called a projec- 
tive geometry of paths. Such geometry was 
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studied by L. P. Eisenhart, O. Veblen, and 
others. The concept of projective connections 
was an outcome of such studies. Similarly, the 
concept of conformal connections was devel- 
oped from the consideration of tconformal 
transformations of Riemannian spaces. 

These geometries cannot in gencral be re- 
garded as geometries in the sense of Klein. 
Actually, any one of these geometries generally 
has no transformations that correspond to 
‘congruent transformations of geometries in 
the sense of Klein; even if it has such trans- 
formations, they do not act transitively on the 
space. Thus geometries are naturally divided 
into two categories, one consisting of geome- 
tries in the sense of Klein (based on the group 
concept) and the other of geometries based on 
Riemann’s idea. Under such circumstances, 

E. Cartan unified the thoughts of Klein and 
Riemann from a higher standpoint and con- 
structed his theory of connections in a series of 
papers published between 1923 and 1925. He 
developed the theory of affine, projective, and 
conformal connections from a viewpoint con- 
sistent with that of Klein. Just as each tangent 
space of a Riemannian manifold is viewed as a 
Euclidean space, an affine connection regards 
the tangent space at each point as an affine 
space and develops it onto the tangent space 
at an infinitesimally nearby point. In discuss- 
ing projective connections Cartan attached a 
projective space to each point of a manifold as 
an infinitesimal approximation, and similarly 
for conformal connections. More generally, he 
attached to each point of a manifold a fixed 
Klein space, i.e., a homogeneous space of a Lie 
group, called the structure group. Thus Cartan 
introduced the concept of fiber bundle (— 147 
Fiber Bundles). Then he defined a connection 
as a development of the fiber, i.e., the gen- 
eralized tangent space, at each point onto the 
fiber at an infinitesimally nearby point (— 80 
Connections B). If G is the group of congruent 
transformations in Euclidean space, a mani- 
fold with connection having G as its structural 
group is called a manifold with Euclidean con- 
nection. Among manifolds with Euclidean 
connection, Riemannian manifolds are charac- 
terized as those without ttorsion. If we take 
the group of congruent transformations of 
projective (conformal) geometry as G, we have 
manifolds with projective (conformal) connec- 
tion in the sense of Cartan. Among these, there 
are remarkable ones called manifolds with 
normal projective (or conformal) connections, 
which are essentially the same as the ones 
studied by Veblen and others. Cartan’s idea 
had a profound influence on modern differen- 
tial geometry. The method of moving frames, 
created by G. Darboux and extensively used 
by Cartan in his theory of connections, was a 
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forerunner of the theory of fiber bundles. Com- 
bining *Grassmann algebra with differential 
calculus, Cartan developed a powerful com- 
putational tool known as calculus of differen- 
tial forms (— 105 Differentiable Manifolds Q). 
Differential forms have become indispensable 
in topology, algebraic geometry, and in studies 
of functions of several complex variables, as 
well as in differential geometry. 

The work of Lie on transformation groups 
also had a profound influence on Cartan. The 
latter’s work on ‘Lie groups, particularly on 
simple Lie groups, and on differential geome- 
try culminated in 1926 in his discovery of 
*Riemannian symmetric spaces. These spaces 
are natural generalizations of the spherical 
surface and the unit disk in the complex plane 
with Poincaré metric, and play essential roles 
in unitary representation theory and in other 
areas of mathematics. 

A tangent line to a curve C at a point P of C 
is the limit line of the line PQ, where Q is a 
point on C aporoaching P; hence we can de- 
fine it locally. A concept (or property), such as 
this, that can be defined in an arbitrary small 
neighborhood of a point of a given figure or a 
space is called a local concept (or local prop- 
erty) or a concept (or property) in the small. 
In the early stages of the development of dif- 
ferential geometry, differential calculus was 
the main tool of study, so most of the results 
were local. On the other hand, a concept (or 
property) that is defined in connection with a 
whole figure or a whole space is called global 
or in the large. In modern differential geome- 
try, the study of relations between local and 
global properties has attracted the interest of 
mathematicians. This view was emphasized 
by Blaschke, who worked on the differential 
geometry of tovals and tovaloids. The study of 
trigidity of ovaloids by S. Cohn-Vossen be- 
longs in this category, and many works on 
geodesics and minimal surfaces were done 
from this standpoint. 

From the viewpoint of modern mathemat- 
ics, the basic concepts on which we construct 
Riemannian geometry and geometries of con- 
nections are global concepts of differentiable 
manifolds. However, in Riemann’s time the 
theory of tLie groups and topology were not 
yet developed; consequently, Riemannian 
geometry remained a local theory. In 1925, H. 
Hopf began to study the relations between 
local differential-geometric structures and the 
topological structures of Riemannian spaces. 
However, except for the work of Cartan, Hopf, 
and a few others, differential geometry in the 
1920s was still largely concerned with surfaces 
in the 3-dimensional Euclidean space or local 
properties of Riemannian manifolds, and with 
affine, projective, and conformal connections. 
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Gradually the concept of differentiable mani- 
folds was clarified, the global theory of Lie 
groups made progress, and topology devel- 
oped; and the trend toward global differential 
geometry began slowly in the early 1930s. The 
dissertation of G.-W. de Rham published in 
1931 showed that the cohomology of a mani- 
fold can be computed in terms of differential 
forms (— 105 Differentiable Manifolds R). His 
theorem provides the theoretical foundation 
for expressing cohomological invariants of a 
manifold in terms of differential geometric 
invariants. In a series of papers immediately 
following de Rham’s, W. V. D. Hodge es- 
tablished that, on a compact Riemannian 
manifold, every r-dimensional cohomology 
class can be uniquely represented by a har- 
monic form of degree r (— 194 Harmonic 
Integrals). 

An important class of complex manifolds 
with compatible Riemannian metric was dis- 
covered by J. A. Schouten, D. van Dantzig, 
and E. Kahler around 1929—1932. This class of 
manifolds, called Kahler manifolds today, 
comprises the projective algebraic manifolds. 
Hodge’s theory of harmonic integrals is most 
effective when applied to compact Kahler 
manifolds, (— 232 Kahler Manifolds). 

The most celebrated global theorem in 
classical differential geometry of surfaces is the 
Gauss-Bonnet formula (1848) (— 364 Rie- 
mannian Manifolds D). The formula was gen- 
eralized to closed hypersurfaces of Euclidean 
space by Hopf in 1925, to closed submanifolds 
of Euclidean space by C. B. Allendoerfer and 
W. Fenchel in 1940, and finally to arbitrary 
closed Riemannian manifolds by Allendoerfer 
and A. Weil in 1943. But the simple proof 
given by S. S. Chern in 1944 contained the 
notion of transgression, which has become 
essential in the theory of characteristic classes 
(— 56 Characteristic Classes). The discovery of 
Pontryagin classes for Riemannian manifolds 
(1944) and Chern classes for Hermitian mani- 
folds (1946) culminated in the tindex theorem 
and the tRiemann-Roch theorem of F. E. P. 
Hirzebruch, and finally in the tAtiyah-Singer 
index theorem. 

A simple but fruitful idea of S. Bochner, 
relating harmonic forms to curvature, es- 
tablished tvanishing theorems for harmonic 
forms of Riemannian manifolds and for holo- 
morphic forms of Kahlér manifolds under 
suitable positivity conditions for curvature. 
His idea has led to the vanishing theorems of 
K. Kodaira and others (— 232 Kahler Mani- 
folds D). 

The work of C. Ehresmann in 1950 on con- 
nections in principal fiber bundles established 
a solid foundation to Cartan’s theory of con- 
nections. Gauge theory in physics ts largely 
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based on the theory of connections in prin- 
cipal fiber bundle. 

R. H. Nevanlinna’s value distribution theory 
and its subsequent generalization by Chern 
and others can be best described in differential- 
geometric terms. 

Differentiable manifolds are currently ob- 
jects of research in both differential geometry 
and differential topology. While topology 
studies manifolds per se, differential geometry 
may be considered as the study of differenti- 
able manifolds equipped with geometric struc- 
tures, such as metric tensors, connections, 
(almost) complex structures, and various other 
tensors. Through these geometric structures, 
differential geometry enjoys close contact 
with many branches of mathematics. From 
its early days, differential geometry has had 
close ties to topology (as exemplified by the 
Gauss-Bonnet formula) and to partial dif- 
ferential equations and analytic functions 
(through, e.g., the study of minimal surfaces). 
The bonds with topology were strengthened 
by Morse theory and, more recently, by the 
theory of characteristic classes. In the most 
recent proof of the Atiyah-Singer index theo- 
rem, differential geometry is an important 
intermediary between topology and analysis. 
Differential geometry and algebraic geometry 
have enriched each other through Kahler 
manifolds. The theory of functions of several 
complex variables also has points of contact 
with differential geometry, such as value distri- 
bution theory and Cauchy-Riemann struc- 
tures. Contact and symplectic structures are 
basic to mechanics. Lorentz manifolds and 
connections in principal bundles are essential 
mathematical tools in the general theory of 
relativity and in gauge theory. Topics such as 
minimal submanifolds, manifolds of positive 
curvature, and closed geodesics are active and 
important areas of research belonging to Rie- 
mannian geometry proper; at the same time, 
differential geometry provides a language and 
methods that are important in wider areas of 
mathematics. 
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A. The Method of Moving Frames 


The main theme of this article is the theory of 
surfaces (i.e., submanifolds) in a differentiable 
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manifold V on which a tLie transformation 
group G acts. 

If a *Lie group G of dimension r acts ftransi- 
tively on a space G, and the stability group 
for any point of G, consists of the identity 
element only, then G, is called the group mani- 
fold of G, and an element of G, is called a 
frame. If f, is a fixed frame, then the mapping 
a>afy€ G, (ae G) gives a tdiffeomorphism of 
G to G,,. Let 1(G) be the set of all tdifferen- 
tial forms @ of degree 1 on G, that are invar- 
iant under transformations of G. Then (G) 
is a linear space of dimension r and is the 
tdual space of the Lie algebra g of G. A basis 
{@,|1<A<r} of I(G) is called a set of rela- 
tive components of G. The structure equations 
hold: 


r 


D Ci Oe ^ Oy, 
u, y=l 


dw, == 


where czu, are tstructure constants of the Lie 
algebra g. 

Let G be a Lie transformation group of a 
space E. Then a G-invariant submanifold of E 
on which G acts transitively is called an orbit. 
Each point ye E determines an orbit contain- 
ing y. When there exist parameters k; (1 <j<t) 
such that any G-invariant on E is a function of 
k,,...,k,, then these parameters are called the 
fundamental invariants of E. Let H (c G) be the 
stability group at a point yọ on an orbit M; 
then M is identified with the thomogeneous 
space G/H by the diffeomorphism @:M—>G/H, 
(ayo) =aH (ae G). Furthermore, a tprincipal 
fiber bundle (G,,, M, H, t) is determined by the 
projection t:G,— M, t(afo)= ayo (ae G). The 
‘fiber H, on a point ye M is a group manifold 
of H. H, is called the family of frames on y, 
and an element of H, 1s a frame on y. Local 
coordinates 6, (1 < u <s) of the group H are 
called the secondary parameters and are used 
to indicate frames in H,. When H is not con- 
nected, let H? be the connected component 
of the identity of H and M be the covering 
manifold G/H? of M. An element je M over 
yéM is called an oriented element. Now as- 
sume that the group H is connected. Then the 
family of frames H, on each ye M is given as 
an tintegral manifold of a tcompletely inte- 
grable system of total differential equations 7; 
=0(1<i<r—s, 7,€1(G)) on the group mani- 
fold Gy. Here the x, are linearly independent 
and are called the horizontal components of M. 
The 7; are linear combinations of the relative 
components w, of G, and their coefficients are 
generally functions of the fundamental invar- 
iants k;. For simplicity, we assume that the 
relative components {œ} are chosen such that 
the horizontal components z; and components 
@, (r—8<a<r) are linearly independent. Then 
the a, (1 <} <s) are called the secondary com- 
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ponents. The differentials d0, of the secondary 
parameters are linear combinations of the œg. 
Furthermore, let {x,} (1 <ø <n) be local co- 
ordinates of E; then the differentials dx, are 
linear combinations of the differentials dk; and 
the horizontal components 7;. 

Let G be a Lie transformation group of a 
space V. We regard two m-dimensional sur- 
faces W, and W, passing through xe V as 
equivalent if they have a tcontact of order p at 
x. Then an equivalence class of submanifolds 
is called a contact element of order p at the 
point x. Let E, be the set of all contact ele- 
ments of order p at x, where x runs over all 
the points in V. A contact element of order p 
naturally determines a contact element of 
order p— 1, and we denote this correspon- 
dence by y:E,—>E,_,. Thus we obtain the 
series of correspondences 


V=Ep Ee... Ep- OE ©... > 


where a contact element of order 0 is identified 
with a point of V. Since a transformation on 
the space V induces a transformation on E,, 

G is also a Lie transformation group of E,, 
and this transformation commutes with the 
mapping Ņ. The fundamental invariants k; of 
E, are said to be of order p. We use similar 
terminology (such as frames of order p, etc.) 
throughout this article. The fundamental in- 
variants k; of order p (1 <j<t,) can be chosen 
such that they contain the fundamental invar- 
iants k; (1 <i<t,_,) of order p—1. The ad- 
ditional t, —t,-, invariants k, (tp-1 <& < t,) are 
called the invariants of order p. The family H? 
(yeE,) of frames of order p can be chosen 
such that H? is contained in the family H?~' 
(z= wyeE,_,) of frames of order p—1. If neces- 
sary, the family H? of frames of order p can be 
made connected by defining an orientation of 
contact elements of order p. Furthermore, the 
horizontal components 7; (1 <j<r—s,) of 
order p can be chosen such that they contain 
the horizontal components z; (1 <i<r—s,_4). 
The additional s,_, —s, components 7, (r 
—Sp-1 SASF — Sp) are called the principal 
components of order p. 

Let W be an m-dimensional surface of a 
space V. The contact element of order p (>0) 
is determined at every point of W and ex- 
pressed by the family of frames of order p and 
the values of invariants of orders less than or 
equal to p. Let {u;} (1 <i<m) be local coordi- 
nates on W. Then the differentials du; are given 
as linear combinations of linearly independent 
differential forms 2; (1 <i<m), where the 7;, 
called the basic components of W, are certain 
linear combinations of the differentials of the 
fundamental invariants of V and of the horizon- 
tal components of orbits of V. Let F?(W) be 
the set of all families of frames of order p; then 
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F?(W) depends on m parameters u; and s, 
secondary parameters 0, of order p. On the 
space F?(W), the differentials of invariants of 
orders less than or equal to p— 1 and the prin- 
cipal components of orders less than or equal 
to p—1 are linear combinations of the basic 
components, whose coefficients are functions 
of the invariants of orders less than or equal 
to p. The differentials of invariants of order 

p and the principal components of order p are 
linear combinations of the basic components: 


ONT + -H gag lige. tpg SOS Es 


RNa = brini hic + Dpto F—Sp-1 SASF Ss 


where the coefficients h,,, b,; are functions of 
the invariants of orders less than or equal to p 
and, in general, the secondary parameters 0, of 
order p. These coefficients are called the coeffi- 
cients of order p. Let I, be a subgroup of G 
preserving a family of frames of order p and D, 
be a space whose coordinates are coefficients 
(h,;, bai) of order p. Then T, acts on D, as a 
transformation group. Knowledge of the prop- 
erties of contact elements of order less than 

or equal to p can be utilized to obtain infor- 
mation about the invariants of order p+ 1, etc. 
In fact, if we can choose in the I,-space D, a 
subspace C, that intersects each orbit in D, at 
one and only one point, then in general the 
secondary parameters of order p associated 
with the points in C, correspond to the frames 
of order p, and the parameters associated with 
the points in C, are the invariants of order p+ 
1. The restrictions of the coefficients of order 

p to C, are functions of the invariants of orders 
less than or equal to p+ 1; they are indepen- 
dent of the secondary parameters of order p. 
Thus the frames of order p+ 1 and the in- 
variants of orders less than or equal to p+ 1 
determine the contact elements of order p of W 
and their differentials; generally, the latter can 
be utilized to determine the contact elements 
of order p+1. 

This process of obtaining information of 
“order p+ 1” utilizing a suitable subspace C, 
of D, is the so-called general method of moving 
frames. However, the surface W may contain 
points for which the general method does not 
apply. Actually, there are surfaces W for which 
the method does not apply for any point in W. 
Thus various methods of moving frames are 
necessary to cope with different kinds of sur- 
faces. In the actual application of the method 
of moving frames, we use certain devices that 
help to simplify the calculations. In fact, an 
infinitesimal transformation (6h,,, 6b,;) of the 
group I, acting on the space D, is expressed as 
a linear combination of the secondary compo- 


ai’ 


nents of order p; this expression is easily ob- 
tained by means of the structure equations of 
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G. The group I,,, is a subgroup of T, fixing 
every point of the subspace C,, and its in- 
finitesimal transformation is such that 6h,;=0, 
6b,;=0. The secondary components of order 
p+1 are immediately obtained from the 
equations for dk, and z,. Furthermore, when 
m> 2, the condition for the principal compo- 
nents of every order to satisfy the structure 
equations of G is essential to the problem of 
the existence of (m-dimensional) surfaces. 

As we apply the method of moving frames 
consecutively to a surface W, we eventually 
arrive at the order q having the following 
properties: The families of frames of order 
q+ 1 coincide with those of order q, and the 
invariants k, of order q+ 1 are expressed as 
functions ¢,(k,) of invariants k, of order less 
than or equal to q. In this case, the families of 
frames of orders q +j (j2 1) are all equal, and 
the invariants of orders q +j are partial (j~1)- 
derivatives of functions g,{k,). The family of 
frames of order q is called the Frenet frame. 
The differential invariants on a surface are 
defined to be differential forms generated by 
the basic components and the invariants of 
each order. 

Specifically, assume that the group G is an 
analytic transformation group of V, and the 
m-dimensional surfaces W,, W, are analytic. 
Then there exists an element g of G such that 
gW, =W, if and only if W, and W, are of the 
same kind and have the same relations among 
the invariants of orders less than or equal to 
q+ 1. These relations are called the natural 
equations of the surface. The theory of sur- 
faces based on the analysis of the natural 
equations of surfaces is called natural geome- 
try. The reduction formula can be obtained by 
utilizing the Frenet frame; it gives the equation 
of the surface in the form of power series con- 
taining the invariants of each order. 

Various results are known concerning the 
theory of surfaces of the spaces V,, V, sharing 
the same transformation group G. We also 
have a theory of special surfaces whose in- 
variants satisfy specific functional relations. 
Furthermore, we have problems concerning 
the deformation of a surface (preserving some 
differential invariants). Actually, the theory of 
surfaces of dimension m other than curves and 
hypersurfaces is in general quite difficult. The 
methods of tensor calculus can be applied to 
the study of surfaces. The theory of fconnec- 
tions can be considered to be an outgrowth of 
the study of surfaces by means of the method 
of moving frames and tensor calculus. 


B. Projective Differential Geometry 


The rudiments of differential geometry sub- 
ordinated to the tprojective transformation 
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group, or projective differential geometry, can 
be found in the Theory of surfaces by J. G. 
Darboux. The subject has been systematically 
studied by H. G. H. Halphen, E. J. Wilczynski, 
and G. Fubini. The Fubini theory was en- 
riched substantially by E. Cartan, E. Cech, E. 
Bompiani, and J. Kanitani. 

In this section we consider a surface S in a 
3-dimensional projective space. Let A(u', u?) 
(u',u? are parameters on S) be a point of S, 
and associate with A all the frames [.A, A,, 
Az, A3] (|A, A1, Az, A3|= 1), where A,, Az, A3 
are points of the tangent plane to S at A. A 
family of such frames is called the family of 
frames of order 1, and we express its differen- 
tial by 


3 
dA, = 5 wf Ag, a=0, 12,3: Ag=A. 
g=0 


The œf are tPfaffian forms that depend on two 
principal parameters determining the origin A 
and ten secondary parameters determining the 
frame. We have œ$ +w! +3 +w3=0, w =0. 
Furthermore, œw! =w}, œ? =? are indepen- 
dent of each other and depend on the principal 
parameters only. Let z!, z?, z? be tnonhomo- 
geneous coordinates with respect to a frame of 
order 1. Then in a neighborhood of the origin, 
S is expressed by z? =£ 2, f,, where the f, are 
homogeneous functions of degree r with re- 
spect to z}, z?. 

If we write f) =(a9(z')* + 2a, 2z!2z? + a,(z”)*)/ 
2, then it follows from the structure equations 
of the projective transformation group that 
(} =dayw' +4,07, 03 =a,w!+a,07. If we 
put p, =ao(w')? + 2a, w! w? + a,(w7)?, then 
a curve on S defined by p, =0 is called the 
asymptotic curve and its tangent the asymp- 
totic tangent. At any point of this curve, the 
plane tangent to S is in contact of order 2 
with this curve, and there are in general two 
asymptotic curves through any point of S. 
Equations of the asymptotic tangent at A 
are given by z*>=0, f, =0. A point of S at 
which the asymptotic tangents coincide is 
called a parabolic point. If every point of S is 
parabolic, then S is a tdevelopable surface, and 
the general theory is not applicable to such a 
surface. 

Among the family of frames of order 1, a 
frame satisfying ag =a, =0, a, =1 is called the 
frame of order 2. For this frame, the straight 
lines AA,, AA, are asymptotic tangents. With 
respect to this frame, if f3 = —(bp(z')? + 
3b, (z')?2? + 3b,21(2*)? +b; (z7)°)/3, then œ? = 
baw! +b, @*, —0§ +w! +03 —w3=2(b,o!+ 
bw), œ =b,w'!+b,@7, and the quadric 
surface z? =z!z? —z3(b,z'+b,z7 + pz?) (with p 
arbitrary) is called Darboux’s quadric at A, an 
especially interesting one among contact quad- 
rics of S. Darboux’s curve is a curve on S such 
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that Darboux’s quadric is in contact of order 3 
at any point of it. Its tangent is called Dar- 
boux’s tangent and is given by z? =0, by(z')> + 
b,(z?)° =0. 

We have b,b; £0, except in the case of 
truled surfaces. We take special frames of 
order 2 determined by b, =b, =0, bb) =b3=1 
and call them the frames of order 3. If a frame 
of order 3 satisfies f4 = —(Co(z')* + 4c, (z1)? z7 
+ 6(c, —1)(z!z)? +4c3z! (z? +¢,(z7)4)/12, 
then w§—2a}+03=cyw'+c,m7, 03 — o, 
=c,0'+c,07, w§-wl=c,w!+c,0, wo! 
—2m%=c,@1+c,@. With respect to this 
family of frames of order 3, ((@')? +(w7)°)/ 
2w' w is an invariant associated with two 
neighboring points of S, called the projective 
line element. Also with respect to this frame, 
two straight lines 4A,, A, A, are polar with 
respect to Darboux’s quadric. 

Among the families of frames of order 3, a 
frame satisfying c, =c, =c¢3 =0 is called a 
frame of order 4. With respect to this frame, 
there exist A, u, v, p such that w? =A! + pw?, 
w? = vwt + pw, w = pw' + Aw. Hence if we 
put cy = — 3a, c4= — 3b, it follows that 


te o! œ 0 
o th o! œ 
(o)=| 4 3 
a 0 2 21 1 ]> 
o w tt w 
Of œ Of —% 


t? = —(3/2)(aw' +b’), t? =(1/2)(aw' + be’). 


Thus the frame of order 4 is the Frenet frame 
and is attached to every point of S. This frame 
is called the normal frame, and the invariants 
a, b, A, u, v, p are called the fundamental differ- 
ential invariants. The straight lines AA, and 
A, A, associated with the normal frame are 
called directrices of Wilczynski of the first and 
second kind, respectively. With respect to the 
normal frame, S is expressed by 


2 =ziz2 —((z')? +(z7)9)/3 
+ (a(z')* + b(2?)*)/4 + (24 2?)2/2 +... 


A necessary and sufficient condition for two 
surfaces S, S to be projectively equivalent is 
that there be normal frames having the same 
«1, w? and the same six fundamental differen- 
tial invariants. For six quantities a, b, A, 4, V, p 
to be fundamental differential invariants of a 
surface, they must satisfy a certain condition of 
existence [6]. 

A frame of order 1 such that AA, and A, A, 
are polar with respect to Darboux’s quadric 
is called Darboux’s frame. With respect to 
this frame also, a theory of surfaces has been 
established. 

Consider a pointwise correspondence be- 
tween two surfaces S, S, and denote by AeS 
the point corresponding to A€ S. If there exists 
a projective transformation ¢ that transforms 
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A into A and the image ¢(S) is in contact of 
order 2 with S at A, then the pointwise corre- 
spondence is called a projective deformation. 
A necessary and sufficient condition for the 
existence of a projective deformation between 
two surfaces is that these surfaces have the 
same projective line element [6]. A ruled sur- 
face is projectively deformable only to a ruled 
surface. Given an arbitrary surface S, it is 
generally impossible to find a surface that is 
different from S and projectively deformable to 
S; some conditions must be satisfied [6, 8]. 

Let p°, p°?, p®3, p12, p13, p?3 be *Pliicker 
coordinate of a straight line in a 3-dimensional 
projective space P?. Then we have p°! p?3 — 
p°? p+? +p% pt? =0, and there is a one-to- 
one correspondence between the ratios of { p°} 
and straight lines (— 90 Coordinates B). If the 
př are regarded as homogeneous coordinates 
of a 5-dimensional projective space P, then 
the previous equation defines a hyperquadric 
Q in P°. Thus there is a one-to-one correspon- 
dence between points of Q and straight lines in 
P?. A curve on Q corresponds to a set of one- 
parameter families of straight lines, or a ruled 
surface. Sets of 2-parameter or 3-parameter 
families of straight lines corresponding to 
surfaces of 2 or 3 dimensions on Q in P* are 
called congruences of lines or complexes of 
lines, respectively. Thus by using a theory of 
surfaces in P5, it is possible to establish the 
theory of congruences and complexes [2, 6, 8], 
which is an important part of projective dif- 
ferential geometry. 

Specifically, if the surface is either a curve or 
a hypersurface, there are numerous interesting 
results [2, 4, 6]. 


C. Affine Differential Geometry 


The theme of general affine differential geome- 
try is the study of differential-geometric prop- 
erties of a point or set of points in a space 

that are invariant under the action of the 
taffine transformation group. Affine differen- 
tial geometry is the study of the properties 
invariant under the action of the tequivalent 
affine transformation group, Le., a subgroup of 
the affine transformation group formed by 
elements sending (x;) to (x;) such that 


X=a;t i ajx i=1,...,n,  det(a,)=1. 
j=1 


The latter transformation leaves invariant the 
volume surrounded by an oriented closed 
hypersurface. The method of moving frames is 
effective in affine differential geometry. 

Let C be a plane curve, and associate with 
any point A = A(t) of C a family of frames 
[A,e,,e,], where the area of the parallelogram 
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determined by the two vectors e,, e, is equal 
to 1. This frame is called the frame of order 0. 
Its differential is expressed by 


2 2 

dA= 5 œe, de=)> ose, r=1,2, 
s=1 s=1 

wi +o3=0. 


The frames of order 1, 2, and 3 are character- 
ized by w* =0; w? =0, w? =w! ; and w?=0, 

w? =w!, œw! =0, respectively. Then a frame of 
order 3 can be associated with each point of C 
and coincides with the Frenet frame. We call 
w'= doc the affine arc element; the affine curva- 
ture x is defined by œt = — x do. Then the 
Frenet formula is given by 


dA=doe,, de,=doe,, de,=—xkdoe. 


With respect to this frame, C is expressed as 
y=x?7/2+Kx*/8 + (dk/do)x>/40+.... 
Further, do and x are given analytically by 
do=|dA,d*A|'?, x =|d?A/do*, d°A/do?|, 


where |M, N| =det(M, N). M, N are column 
vectors with two entries. We call o the affine 
arc length. The straight line on which e, is 
situated is called the affine normal, the dia- 
meter of the parabola osculating C at A. If x is 
constant, then C is a conic section. Further- 
more, C is an ellipse, hyperbola, or parabola 
according as the constant x is positive, nega- 
tive, or zero. In affine geometry, parabolas play 
a role similar to that played by straight lines in 
Euclidean geometry. 

There are numerous results concerning the 
theory of skew curves and surfaces [1]. Con- 
cerning the theory of skew curves, results on 
affine length, affine curvature, affine torsion, 
affine principal normals, and affine binormals 
are similar to those in Euclidean geometry. 
The affine transformation group is situated 
between the projective transformation group 
and the congruent transformation group and 
hence has properties analogous to theirs. The 
theory of surfaces has a character similar to 
that of projective differential geometry [1]. 
We may also consider the variation of the 
affine area of a surface surrounded by a closed 
skew curve C. We call the extremal surface the 
affine minimal surface. W. Blaschke and others 
obtained many results on the global properties 
of such surfaces. 


D. Conformal Differential Geometry 


Let S” be a tconformal space of dimension 

n, and associate with each point AES” a 
frame R[ Ap, Ay, ..-, Ans A] Of the t(n + 2)- 
hyperspherical coordinates with origin Ao 
(— 76 Conformal Geometry). Then denoting 
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by A- B the tinner product of hyperspheres 
A, B, we obtain 


A, Ap = Gap; a, B=0,1,...,n, 00, 
where 
0 0 -1 
(Gag) = 0 Jij 0}, 
—-1 0 0 


ij=1,...,⁄, Jj = jie 


Let z" be homogeneous coordinates with 
respect to R. Then the tMöbius transforma- 
tion z-+Z of S" is characterized by Z*=c¢%z*, 
where g,gc¢c? = gan |cf| #0. The differential of 
the family of the frames is defined by 


dA,= wf Ag, (1) 
p=0,1 
where 
we wù 0 
(wf) = |E ga ok, o È Giz >, 
0 Oo, —we 


X (Jof T Jro) = dgj;. 


There are (n+ 1)(n+ 2)/2 linearly independent 
forms among œ, and this is the number of 
parameters of the Mobius transformation 
group. The structure equations of this group 
are 


dof =) win wf. (2) 


The theme of conformal differential geometry 
is the properties of Pfaffian forms w? satisfying 
(1) and (2). 

Consider a transformation c: È z*4,—> 
> 27(A, +dA,). (1) If all œ vanish except w9, 
then all the circles through Ap, A% are in- 
variant, and any point P is transformed to a 
neighboring point P on the circle, such that 
the cross ratio (P, P; Ap, A») is constant. This 
transformation is called the homothety with 
centers Ap, A». (2) If all œ vanish except wi, 
wo” => gaw}, then all the circles tangent to a 
fixed direction at A,, are transformed into 
themselves, and any hypersphere through A,, 
and orthogonal to those circles is transformed 
into a hypersphere having the same property. 
This transformation is called the elation with 
center A,,. (3) If all œ vanish except w? = 
Egok, œ, then the transformation is an 
elation with center Ap. (4) If all œ vanish 
except wf, then the transformation is an in- 
finitesimal rotation with center Ay, with A,, 
regarded as a point at infinity. Thus any in- 
finitesimal Möbius transformation is de- 
composed into the previous four types of 
transformation. 

To study the theory of curves and hypersur- 
faces in S$", we again utilize the Frenet frame 
chosen from a family of frames associated with 
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Ao. For example, the Frenet formula of a curve 
in S? is given by 


(wf)=| -do 0 0 rdo Q 
—tda 0 0 
0 kdo —do 0 0 


We call do, x, and t the conformal arc element, 
conformal curvature, and conformal torsion, 
respectively. There are many results on con- 
formal deformation [5]. 

Concerning Laguerre differential geometry, 
we have results dual to those in conformal 
differential geometry (a point is replaced by a 
straight line and an angle by a distance be- 
tween the points of contact of the common 
tangents of two oriented circles). 


E. Contact Manifolds 


Consider a (2n + 1)-dimensional differentiable 
manifold M?"*! with a 1-form y such that 

n A(dn)" #0, where dy is the exterior derivative 
of 4 and ^ denotes exterior multiplication. 
(Note that this is true for the 1-form in the left- 
hand side of eq. (2) in 82 Contact Transforma- 
tions A.) Such a manifold is called a contact 
manifold with contact form y. The structure 
group of the tangent bundle of a contact mani- 
fold M?"*! reduces to U (n) x 1, where U (n) is 
the unitary group; hence every contact mani- 
fold is orientable. Simple but typical examples 
are given by the unit sphere $?"*' in Euclidean 
space E?"*? and the tangent sphere bundle of 
an (n+ 1)-dimensional Riemannian manifold 
M"*!, both with natural contact forms (S. S. 
Chern [9]). Every 3-dimensional compact 
orientable differentiable manifold is a contact 
manifold (J. Martinet [12]). 

Now a differentiable manifold M?"*? is said 
to be an almost contact manifold if it admits a 
tensor field @ of type (1, 1), a vector field €, and 
a 1-form y such that 


oP X=—X+n(X)b, n=, (3) 


where X is an arbitrary vector field on M2"*?; 
and the triple (ọ, č, n) is then called an almost 
contact structure. (3) implies that gé =0 and 
n(oX}=0 (S. Sasaki [14, I]). The structure 
group of the tangent bundle of an almost 
contact manifold M?"*! reduces to U(n) x 1. 
Indeed, J. W. Gray [10] took this property as 
his definition of almost contact structure. For 
any pair of vector fields X and Y on M?"*?, let 


N(X, Y)=LX, Y]+ọ[ọX, Y] 
+ LX, eY]—-[eX, oY] 
—{X-n(Y)—Y-n(X)} 6, 
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where [ , ] is the Poisson bracket; then N is a 
tensor field of type (1,2) over M?"*!, which we 
call the torsion tensor of the almost contact 
structure (o, č). When N vanishes identically 
on M?"** we say that the almost contact 
structure is normal. 

An almost contact structure (Q, č, n) on 
M?*** induces naturally an almost complex 
structure J on M?"*! x R (resp. M?"*! x S?), 
which reduces to a complex structure if and 
only if (g, €,4) is normal. A similar statement is 
also valid for the product space of two almost 
contact manifolds (A. Morimoto [13]). 

If M?"*! is an almost contact manifold 
with structure tensor (ọ, €,7), we can find a 
positive definite Riemannian metric g so that 
g(pX, pY)=g(X, Y)—n(X)n(Y) for any pair 
of vector fields X and Y, and the set (9, €, y, g) 
is then said to be an almost contact metric 
structure. 

When M2"*! is a contact manifold with 
contact form y, there exists a unique vector 
field é which satisfies dyn(X, €)=0, n(€)=1 for 
any vector field X. We can then find a tensor 
field ọ of type (1, 1) and a positive definite 
metric tensor g so that (i) dy(X, Y)=g(X, oY) 
is satisfied for any pair of vector fields X and 
Y, and (ii) (ọ, €,4, g) is an almost contact metric 
structure. The almost contact metric structure 
determined in this way by a contact form 7 is 
called a contact metric structure. A differenti- 
able manifold with normal contact metric 
structure is called a normal contact Rieman- 
nian manifold or a Sasakian manifold. Bries- 
korn manifolds are examples of such mani- 
folds. They include, besides the standard 
sphere S*"*!, all exotic (2n + 1)-spheres that 
bound compact oriented parallelizable mani- 
folds. An almost contact manifold is said to be 
regular or nonregular according as the integral 
curve of € is regular or not as a submani- 
fold. A compact regular contact manifold is a 
principal circle bundle over a symplectic mani- 
fold, and it admits a normal contact metric 
structure if and only if the base manifold is 
a Hodge manifold (Boothby and Wang [8], 
Hatakeyama [11]). 

Many research papers on the topology 
and differential geometry of manifolds with 
the structures defined above have been pub- 
lished by S. Tanno, S. Tachibana, D. E. Blair, 
M. Okumura, K. Ogiue, S. I. Goldberg, and 
others. 
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A. General Remarks 


Let f be an timmersion of an m-dimensional 
tdifferentiable manifold M of class C” into an 
n-dimensional Euclidean space E”. More pre- 
cisely, f is a differentiable mapping of class C” 
such that the ‘differential df, is injective at 
every point p of M. The pair (M, f) is called 


an immersed submanifold (or a surface) of E”. 
When m= 1, we call it a curve of E”, and when 
m=n-—1,a hypersurface in £”. The cases of 
n=2 and n=3 have been the main objects of 
study in differential geometry of curves and 
surfaces. The differential-geometric properties 
for the general case of immersion are discussed 
in 365 Riemannian Submanifolds. 


B. Frames in £” 


Every tEuclidean motion in E” can be ex- 
pressed as the product of a parallel translation 
and an forthogonal transformation that keeps 
the origin of E” fixed. The set of all parallel 
translations is a commutative group that can 
be identified with R”. It is a normal subgroup 
of the group of motions I(E") of E”. So we see 
that I(E”) is a tsemidirect product of R” and 
the torthogonal group O(n). The Lie algebra 
of I(E”) is the direct sum of R” and the Lie 
algebra o(n) of the orthogonal group, where 
both are regarded as additive groups. Corre- 
sponding to this decomposition, we can write 
the fMaurer-Cartan differential form over 
T(E") as ow + Q, where w belongs to R” and Q 
to o(n). The tstructural equation d(a+Q)= 

— (1/2 +9) A (œ +9) can be divided into 
the following two parts: dw =Q ^ œ; dQ = 
—(1/2)Q ^ Q. These are known as the structure 
equations of E". By an orthogonal frame in 

E" we mean an ordered set (x,e,,...,€,) con- 
sisting of a point x and a set of torthonormal 
vectors €;, €2, -..,€„. We denote by O(n) the set 
of all orthogonal frames in E". If we denote 
the translation identified with xe R" by T,,, 
then there is a one-to-one correspondence g: 
I(E")> @(n) given by @(T;, A) =(x, Ae,, ..., Ae,) 
(AeO(n)). We can make O(n) into a differ- 
entiable manifold so that is a tdiffeomor- 
phism. We denote the differential forms over 
O(n), which are images of w and Q under the 
‘dual mapping of p `!, by the same letters œ 
and Q, respectively. For O(n) as a ‘principal 
fiber bundle over R” with the projection z: 
m(x,€,,-.-,€,) =x and n vector-valued func- 
tions ~;:@;(x,e,,...,€,) =e; over O(n), we have 


w=) wie, Q=} QE, 
i i<j 
w’ =(dn, p), QË =(do;, p;), (1) 


do'=F Qno, dQËÏS=Y Q" Aa Q¥, 

j k 
where {E;;} is a basis of o(n) defined by E,e;=e;, 
E,e;=—e,, Eje,=0 (k4#i,j) and (, ) is the 
scalar product of vector-valued forms induced 
from the scalar product of E”. Any diffeomor- 
phism of @(n) onto itself preserving œw and Q 
must be a Euclidean motion. 
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C. Theory of Curves 


Let (M, f) be an immersion of a 1-dimensional 
differentiable manifold M into E”. We identify 
the tangent space of E” at each point with E” 
itself. Then df, maps the origin of the tangent 
space M, to f(x), and the image df (M) of M, 
by df, is a straight line passing through f(x) 

in E”, called the tangent line of f(M) at f(x). 
By ©,(M) we mean the set of all ordered sets 
(x,e€,,...,€,), where xe M and {e;} is an ortho- 
normal basis of E” such that e; edf (M). Then 
0,(M) can be naturally immersed in O(n) by 
the mapping f: f(x,e,,...,e,)=(/(x),e,,...,€,)- 
We can pull back the differential forms w, Q, 
w', QË over O(n) to O,(M) f* and denote them 
by 6, ©, 64, and OË, respectively; then we have 
6'=0(i>1). Let f, and f, be two immersions 
of M into E". Then in order for there to exist a 
Euclidean motion « of E” such that fi =a0 fz, 
it is necessary and sufficient that there exist a 
diffeomorphism ¢ of ©, (M) onto 0, (M) such 
that 6, = p*(0;,), Or, =—*(O,,). Let z; be the 
projection of the fiber bundle (,(M), and let o; 
be naturally defined vector-valued functions 
over 0,(M). Then we have d(fo2,)=6' @,, 
dg,=DX}_, O”. If we put ds? (X) = |ldf,(X)|i? 
(X €M,), then we have (0'} = x#(ds’). For each 
point xe M there are two possibilities for the 
choice of e, corresponding to two orientations 
of the curve. But since (dy, ,d,) = nf (p° ds’), 
p° depends only on the point x of M. We call 
p (20) the absolute curvature. We now choose 
an orientation of the curve and then e, in 
accordance with the orientation. Thus we get a 
submanifold of ©,(M), which we again express 
by the notation (,(M). If we define the form ds 
by ds(X)=(df(X),e,), we have 0* = x}*(ds), and 
ds is called the line element. Any tlocal cross 
section R:z,0 R=1 of the bundle 0,(M) is 
called a moving frame. Putting 


R*(91)=ds, R*(O")=pds, 


we see that the following equation holds over 
M: 


J 


For two immersions (M, f,) and (M, f2), we 
have fi =«0 f (a is a Euclidean motion) if and 
only if they have the same ds and pË for some 
moving frames. 


D. Frenet’s Formulas 


In order to study local properties of curves it is 
sufficient to consider them on ‘Jordan arcs of 
class C”. With respect to orthogonal coordi- 
nates (x',...,x”) in E”, such a curve is repre- 
sented parametrically by x‘= f(t) (te [a,b], 

> (dx'/dt)? > 0) or by a vector representation 
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x=x(t). If ọ is a diffeomorphism of a closed 
interval [a’,b’] onto [a,b], then fog and f 
are representations of the same arc in £”, and 
ọ is called a transformation of the parameter. 
Any curve of class C! is trectifiable, and its 
arc length is given by s = [2(27.,(dx‘/dt)”)"? dt. 
We may choose the arc length s measured 
from a point on the arc as a parameter, called 
the canonical parameter of the arc. Consider 
an arc C of class C” given by the vector repre- 
sentation x =x(s), sé[a,b]. We assume that its 
*Wronskian |x’(s), ...,x(s)| is not identically 
zero (we denote by ’ the derivative with respect 
to the canonical parameter), which means 
that the arc C is not contained in a hyper- 
plane in E”. A point at which the Wronskian 
vanishes is called a stationary point, and we 
assume that there exists no stationary point 
on C. By the tGram-Schmidt orthonormaliz- 
ing process we obtain an orthonormal basis 
€,,---,€, (€;,...,€,| >0) from n vectors x’(s), 
...,x"(s) at each point of C. We call the frame 
thus determined the Frenet frame. With re- 
spect to the Frenet frame, (2) is rewritten as 


e;(s) = — K;_1(s)e;_,(s) + K;(s)e;+1 ($), 


Ko(s)=x,(s)=0; (3) 
K;(s) > 0, 


These are called Frenet’s formulas (or the 
Frenet-Serret formulas). We call «,, «2, ..., 
K„-2 the first, second, ..., (n—2)nd curvature, 
respectively, while we call x,_, the torsion for 
n> 3. For a curve in a lower-dimensional sub- 
space E” c E”, we set k;=0 (i>m). The curva- 
tures and the torsion of a straight line are 
zero. To get Frenet’s formulas in these special 
cases, we fix e; (i>m) in the subspace ortho- 
gonally complementary to E” in E” and pro- 
ceed as in the general case. Suppose that C,, 
C, are arcs such that both of their Frenet 
frames are of class C’. If there exists a diffeo- 
morphism of C, to C, that preserves arc length 
and the x; (i=1,...,n—1) are equal at corre- 
sponding points, C, and C, are mapped onto 
each other by a motion of E". This is the fun- 
damental theorem of the theory of curves. 
Given n—1 functions of class C' «,(s)20,..., 
K,-2(S) 20 (we assume that the equality signs 
occur at most at a finite number of points) 
and x,,_,(s) for O< s< L, there exists an arc 
that has k,,...,K,-2, Ka-1 as its first, ..., 
(n—2)nd curvatures and its torsion, respec- 
tively. The equations x; =x,(s) are called the 
natural equations of the curve. 


E. Plane Curves 


Let x = x(s) be a curve of class C? in E?, and 
(x(s),e,,€,) its Frenet frame. The tangent and 
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the normal of this curve at x(s) have para- 
metric representations x(s)+te,, x(s)+te,, 
respectively (with parameter t). Frenet’s for- 
mulas are written as x'=e,, x” =e, =Ke,, and 
p is called the curvature of the curve C. The 
natural equation is given by k=x(s). If k(s)= 
constant 40 along C, C must be a portion of 
a circle. Another way of defining the curvature 
is as follows: We take a fixed direction (for 
example, the positive direction of the x-axis on 
E?) and denote by 0(s) the angle made by the 
tangent T, of the curve C at x(s) with the direc- 
tion. Then we have x(s)=d6@/ds. If n=2, the 
curvature can take both positive and nega- 
tive values. Figs. 1 and 2 suggest a geometric 
meaning of k >0 and x <0, respectively. The 
circle with center x =x(s)+(1/x)e, and radius 
1/« has a contact of higher order than any 
other circle in E?. We call this circle the oscu- 
lating circle (or circle of curvature) at the point 
x = x(s), its center the center of curvature, and 
1/« the radius of curvature. The locus C’ of the 
center of curvature of a curve C is called an 
evolute of C. Conversely, C is called an involute 
of C’, the fenvelope of the family of normal 
lines of C. When a curve is given in terms of its 
canonical parameter s, the curvature is given 
by [x(s), x"(s)|; when the curve is given by 
another parameter t as x = x(t), the curvature 
is given by x(t)=|x’(t),x"()|/[x’(O[°, where ’ 
means d/dt. 


Fig. 1 


Fig. 2 
K>0. k<0. 


The facts we have just stated concern local 
properties of plane curves. We shall now dis- 
cuss the global theory of curves, which deals 
with properties of each curve as a whole. Let 
C:x=x(s), a<s<b, be a closed curve. Let 
0(s), O< 0(s)< 2r, be the angle that e(s) makes 
with the x axis. Put © = {? 0'(s)ds. Intuitively, 
© measures the total rotation of e; (s) as we 
run along the curve C from a to b. Since C is 
closed, © is an integer multiple J of 2x. The 
integer I is called the rotation number of C, 
and is equal to (1/2m){?x(s)ds. Let D be a 
closed domain consisting of points in the inte- 
rior and on the boundary of a simple closed 
curve C. C is called a closed convex curve or 
an oval if D is tconvex in E?. Among all ovals 
of given length, the circle has the maximum 
area. Various generalizations of this theorem 
have been obtained, and the collection of 
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problems of this kind ts called the isoperimet- 
ric problem. This problem has intimate con- 
nections with fields such as integral geometry. 
The oval has a convenient parameter other 
than the arc length parameter s. Given a num- 
ber t, 0<t <2z, there exists a unique point 
x(t) in the oval such that e, =(cost, sint) at 
x(t). When we describe the oval in terms of 
the parameter t, the tangent vector at x(t) is 
parallel to that at x(t + 7), and we can define 
the width W(t) at x(t). W(t) is called the width 
of the oval. A curve is called a curve of con- 
stant width if the curve is an oval whose width 
W(t) does not depend on t. The circle is a 
typical example of a curve of constant width. 
Reuleaux’s triangle is another well-known 
example of a curve of constant width (Fig. 3). 
For a curve of constant width of width W 
and length L, we have L=2w. 


aN 


Fig. 3 


There are also some results concerning the 
relations between local properties (for exam- 
ple, curvature) and properties of the whole 
figure. An example is given by the four-vertex 
theorem. A vertex on a curve C is by defini- 
tion a point where dx/ds=0. Then there are at 
least four vertices on an oval of class C*. A 
simple closed curve with x >0 (<0) must be 
convex (— 89 Convex Sets). 


F. Space Curves 


Let x =x(s) (se [a, b]) be a curve C of class C? 
in E? defined in terms of the canonical para- 
meter s. Let (x(s), e; , €32, €3) be Frenet frames 
along C. Then we have the Frenet formulas 


e1 =K,e2, e3 = — K€; +K363, e3 = — K3€3. 


We call 1/«,, 1/x, the radius of curvature and 
the radius of torsion, respectively. The line 
X=X(sọ)+ te, is the tangent of C at x(sp). 
The two straight lines through the point x(sọ) 
defined by X=x(sy)+te, and X=x(sy)+ te, 
are called the principal normal and the binormal 
of C at x(so), respectively. The three planes 
through x(sp) defined by X=x(sy)+1te, + fe3, 
X=x(So) + tez +te;, and Xx=x(so)+te, + fe, 
are called the normal plane, the rectifying 
plane, and the osculating plane, respectively. 
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At a point x(sọ) of a curve x =x(s) of class C®, 
we take e; (so), €,(So), and e,(Sọ) as unit vectors 
of the coordinate axes. Substituting the Frenet 
formulas into the Taylor expansion of x(s), we 
see that the new coordinate x; (s$), x,(s), X3(s) of 
C are given by 


X1 =(S—Sp)—(K (59)/6)(s—So)? + ---, 
X2 =(K;(S0)/2)(S— So)? + (x4 (S)/6)(s— s0)? +... 
X3 =(K1 (S0)K2(S0)/6)(S— So)? +... 


These are called Bouquet’s formulas. Utilizing 
these formulas we can see the nature of the 
curve with given x, and x. A curve and its 
osculating plane at a point on it have contact 
of order higher than any other plane through 
that point. The family of osculating planes 

of C tenvelops a tdevelopable surface S and 
coincides with the locus of tangent lines to C. 
We call S the tangent surface of C, and C the 
line of regression of S. The family of rectifying 
planes of C also envelops a developable sur- 
face called the rectifying surface, and C is a 
tgeodesic on this surface. The family of normal 
planes of C envelops either a cone or a tangent 
surface of another curve C. When the natural 
equation of a space curve has a special form, 
the shape of the curve is simple. For example, 
K,(s)=constant, «,{s)=constant represent a 
curve, called an ordinary helix, on a cylinder 
which cuts all the generators of the cylinder at 
a constant angle. More generally, it is known 
that if k/k, constant, the tangent at each 
point of the curve makes a constant angle with 
a fixed direction. Such a curve is called a gen- 
eralized helix or a curve of constant inclination. 
Each curve satisfying ax, + bk, =c (ab 40) is 
called a Bertrand curve. For a Bertrand curve 
there exists another curve C and a corre- 
spondence of C onto C such that they have a 
common principal normal at corresponding 
points. Conversely, this property is also a 
sufficient condition for C to be a Bertrand 
curve. A Mannheim curve is defined analo- 
gously as a curve having a correspondence 
with another curve C such that the principal 
normal of C and the binormal of C coincide at 
corresponding points. When a correspondence 
of C and C has the property that tangents at 
corresponding points are parallel, then the 
correspondence is called a correspondence of 
Combescure. 

We have stated mainly local properties of 
space curves. There are also several results 
about global properties of curves in E? analo- 
gous to the case of plane curves. For a sim- 
ple closed curve C of length L, we call K = 
fó «,(s)ds the total curvature of C. Generally 
we have K <2z, while K =2z7 if and only if C 
is a closed convex curve lying in a plane (W. 
Fenchel) [5, 6]. The total curvature is deeply 
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related to the properties of tknots. If a simple 
closed curve in E? is knotted, then the total 
curvature is at least 47 [7,8]. We fix an origin 
O in E? and draw a unit tangent vector with 
initial point O parallel to the unit tangent 
vector at each point of a space curve C; then 
the endpoint of this vector traces a curve C on 
the unit sphere with the center O. We call C 
the spherical indicatrix of C and the corre- 
spondence of C to C a spherical representation. 
The total curvature K of a curve C is equal to 
the length of C. Consequently, we have K = 
fcd0, where @ is the angular deflection of the 
tangent line along the closed curve C. 


G. Theory of Hypersurfaces 


Let (M, f) be an immersion of an (n — 1)- 
dimensional differentiable manifold M of class 
C into E”. Then we can define on the hyper- 
surface M a positive definite differential form 
g of degree 2 induced from the inner product 
of E":9,(X, X)=(df,(X), df,(X)), X € M,,. Then 
M becomes a tRiemannian manifold with 
*Riemannian metric g. We call g the first fun- 
damental form of (M, f). The Riemannian 
geometry on a surface with its first funda- 
mental form as Riemannian metric is called 
geometry on a surface (— 364 Riemannian 
Manifolds). 

By ©,(M) we mean the set of all the ordered 
sets (x,e€,,...,€,)€O(n), where xe M and {e;} 
(i=1,2,...,n) is an orthonormal system of E" 
such that eẹ;edf (M) (i=1,...,n—1). Then 
©,(M) with natural projection z, and natural 
differentiable structure is a tprincipal fiber 
bundle over M and has a natural immersion 
f:f(x e, ee en) =(f(x), e,,---,€,) in the prin- 
cipal fiber bundle O(n). We can tpull back the 
forms on O(n) to ©,(M) by f* and put 0 = 
f *(w), O= f *(Q); then the structural equa- 
tions of E” are transformed to dł =0 ^9, 
d®=(—1/2)0 ^ ©. Furthermore, if we put 
6'=f*(w'), Of = f*(Q#), then 0" =0 and 6, 
©'(i,j <n) depend only on the first funda- 
mental form of (M, f). Let fı and f, be two 
immersions of M into E". Then in order that 
there exist a Euclidean motion « of E” such 
that fi =a0o fz, it is necessary and sufficient 
that there exist a diffeomorphism ¢ of ©, (M) 
onto ©, (M) such that 6, =9*(6,,) and ©, = 
¢*(,,). Suppose that M is forientable and 
oriented. Then the unit vector field normal to 
df (M) at every point xe M in E” defines a 
mapping of M into the unit sphere in £” called 
the spherical representation of M or the Gauss 
mapping (Gauss map). Regarding the unit 
normal vector field č of M as a vector-valued 
function over M, we can define a symmetric 
product of df and dé by —(df,dé)(X, Y)= 
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(1/2) Lf (X), dč(Y))+ (df (Y), dč(X))], called 
the second fundamental form of (M, f). 

Two immersions f, and f, of M that induce 
the same first and second fundamental forms 
have a Euclidean motion « such that fı = 
ao fa; and the converse is also true. This fact 
is called the fundamental theorem of the theory 
of surfaces. 


H. Theory of Surfaces in E? (— 365 Riemann- 
ian Submanifolds; Appendix A, Table 4.1) 


A surface in E” is locally expressed by para- 
metric equations x;=x,{u,) (i= 1,..., n; a= 

1, ... m) or by a single vector equation x = 
x(u,). We are mainly concerned with the case 
n=3,m=2, and we express the surface by a 
vector representation x =x(u, v). The first and 
second fundamental forms are written as 


Edu? +2F dudv+Gdv’, 
Ldu? +2M dudv+N dv. 


If we use the usual notation of tensor analy- 
sis, then the first and second fundamental 
forms are also denoted by g,,du*du* and 
H,,du* du”, respectively, where u” (x= 1,2) are 
parameters (with } omitted by tEinstein’s 
convention). We call {g.g}, {H,,} the first and 
second fundamental quantities, respectively (— 
365 Riemannian Submanifolds). At the point 
Po =X(Up, Vo) on a Surface S that corresponds 
to parameter values (uo, Vo), the curves ex- 
pressed by v= vo and u =u are called a u- 
curve and a v-curve through po, respectively. 
Let x,, x, denote the tangent vectors éx/éu, 
Ox/6v at po to the u-curve and v-curve, respec- 
tively, through the given point pọ and é de- 
note the unit vector orthogonal to x, and x,. 
Then € is called the normal vector of S at po 
and (x,,x,,¢) the Gaussian frame of S at po. 
Although a Gaussian frame is not in general 
an orthogonal frame, it is intimately related 
to local parameters. The plane that passes 
through the point pọ and is spanned by x,, x, 
is called the tangent plane to S at pọ. The 
coefficients of the second fundamental form 
L(u, v), M(u, v), N (u, v) are expressed by the 
inner products L=(—x,, Ča), M =(— Xn €,), 
N=(—X,,¢,) (Ču =06/0u, Č, =0€/0v). 

Let (X, Y) be the coordinates of a point on 
the tangent space at pọ with respect to the 
Gaussian frame. We call the curve of the sec- 
ond order defined by LX*+2MXY+NY?=e 
(e is a suitable constant) the Dupin indicatrix. 
The point pọ is called an elliptic point or a 
hyperbolic point on S according as the Dupin 
indicatrix at the point is an ellipse or a hyper- 
bola. If pọ is an elliptic point, then points near 
Po on the surface lie on one side of the tangent 
plane at pọ, whereas if py is a hyperbolic point, 
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points near pọ on the surface lie on both sides 
of the tangent plane at pọ (Figs. 4, 5). A hyper- 
bolic point is also called a saddle point, since in 
a neighborhood of the point the surface looks 
like a saddle. A point that is neither elliptic 
nor hyperbolic is called a parabolic point; at a 
parabolic point we have LN — M? =Q. If at 
least one of L, M, N does not vanish at po, 
then there is a neighborhood of po of the sur- 
face that lies on one side of the tangent plane 
at Po (Fig. 6). Ifa vector (X, Y) on the tangent 
plane of the surface at pọ satisfies the equation 
LX? +2MXY+ NY? =0, then the direction of 
the vector is called an asymptotic direction. If 
the point pọ is elliptic, such a direction does 
not exist; if pọ is hyperbolic, the direction is an 
tasymptotic direction of the Dupin indicatrix 
on the tangent plane at pọ. A curve C ona 
surface such that the tangent line at each point 
of the curve coincides with an asymptotic 
direction of the surface at the point is called an 
asymptotic curve. 


Fig. 4 
Elliptic point. 


Hyperbolic point. 


Fig. 6 
Parabolic point. 


Let C:x=x(u(t), v(t)) be a curve through po 
on the surface x =x(u, v). Then the curvature xK 
of C as a space curve is given by 


Ldu?+2M dudv + N dv? 


par ae 
weoi e Pu FaF dudo Gan 
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where dudv is the direction of C on the surface 
at po and @ is the angle between the normal of 
the surface at pọ and the principal normal of C 
at po. The center of curvature at a point po of 
a curve C of class C? on a surface of class C? is 
the projection on its osculating plane of the 
center of curvature of the section C* (of the 
surface) cut by the plane determined by the 
tangent to the curve at pọ and the normal of 
the surface at the point (Meusnier’s theorem). 
The curvature of the curve C* at pọ is called 
the normal curvature of the surface at the point 
for the tangent direction. Since the normal 
curvature for a direction at a point is a con- 
tinuous function of this direction that can be 
represented as a point on a unit circle, there 
exist two directions that realize the maximum 
and minimum of the normal curvature. These 
directions are given by the equation 


Edu+Fdv Fdu+Gdbv = 
Ldut+tMdv Mdu+Ndv| ` 


When this quadratic equation in du/dv has 
nonzero discriminant, it determines two direc- 
tions defined by its two roots. These directions 
are called principal directions at the point. A 
curve C on a surface such that the tangent line 
at each point of the curve coincides with a 
principal direction at the point is called a line 
of curvature. When all lines of curvature of 

a surface are circles, the surface is called a 
cyclide of Dupin. The two normal curvatures 
corresponding to two principal directions are 
given by 1/R, satisfying the following second- 
order equation: 


1\2 EN +GL—2FM 1, LR-M? _ 
R EG-F? R EG—F? — 


They are called principal curvatures, and each 
of their inverses is called a radius of principal 
curvature. The mean value H =(x,+«,)/2 of 
two principal curvatures x;=1/R; (i= 1, 2) is 
called the mean curvature (or Germain’s curva- 
ture), and the product K =x,k, is called the 
total curvature (or Gaussian curvature). These 
are given by 


_1EN+GL—2FM 
2 #EG-F2 ’ 


_LN-M? 
~ EG—F?2° 


A point on a surface is elliptic, hyperbolic, or 
parabolic according as K >0, K <0, or K=0 
at the point. A point where the second fun- 
damental form is proportional to the first 
fundamental form is called an umbilical point, 
and a point where the second fundamental 
form vanishes is called a flat point or a geo- 
desic point. If a surface consists of umbilical 
points only, the ratio (L(du)* +2M dudv+ 
N(dv)*)((E(du)? + 2F dudv + G(du)’) is a con- 
stant, and the surface is either a sphere or a 
portion of it. If a surface consists of flat points 
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only, the surface must be either a plane ora 
portion of it. The mean curvature and the 
Gaussian curvature of a sphere are constant, 
and those of a plane are both equal to zero. If 
we use the spherical representation of a surface 
stated in Section G, we can give to the Gauss- 
ian curvature the following geometric mean- 
ing: Let A be the area of the domain enclosed 
by a closed curve C around a point pọ ona 
surface, and let A* be the area of the domain 
on the unit sphere enclosed by the curve that 
is the image of C under the spherical represen- 
tation of the surface. Then the limit of A*/A 
as the closed curve C tends to the point pọ is 
equal to K at po. 

Let us denote by (g*) the inverse matrix of 
the matrix (g,,) whose elements are coefficients 
of the first fundamental form g,ş du” du’. We 
easily see that g'' =G/(EG — F?), g+? =g?! = 
— F/(EG— F?), g*? = E/(EG — F?). We intro- 
duce the symbols 


1/05 00 89 
tbr) =( at +g) 


ĝu? ĝu? ðu" 


x = ad ô 
E g” Epy, 6], 


which are called the Christoffel symbols of the 
first and second kinds, respectively. Suppose 
that a surface is given by the vector represen- 
tation X =x(u;, u3), and put x,=0x/du", Xag = 
ôx,/ĉðu". Then for the derivatives of the Gaus- 
sian frame, we obtain 


Y 
Xag = H x +Hag%, a= — g” HpaX,. 


We call the former Gauss’s formula and the 
latter Weingarten’s formula. The integrability 
conditions of these partial differential equa- 
tions are 


R” by = Hap H; — H, H, H =g" Hs, 


CA, OH ay oly Tl 20 
ôu? uf lap)  layf P? 


where 


Remas lear a bar “ant od 
a 


are components of the curvature tensor. The 
former are called the Gauss equations, and the 
latter the Codazzi-Mainardi equations. In 
connection with these equations, Bonnet’s 
fundamental theorem states the following: 
Suppose that a positive definite symmetric 
matrix (g,g) and a symmetric matrix (H,,) are 
given that are functions of class C? and C}, 
respectively, defined over a tsimply connected 
domain D in R?. If they satisfy the Gauss 
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equations and the Codazzi-Mainardi equa- 
tions, then there exists a surface x = x(u,,u>) 
with the given (g,,) and (H,,) as coefficients of 
its first and second fundamental forms, respec- 
tively. Such a surface is determined uniquely 
if, for an arbitrary fixed point (u?, u$) of D, 

we assign an arbitrary point pọ and a frame 
(x?, x9, €°) at po so that x?, x? are orthogonal 
to the unit vector €° and (x2, x3) =g,,(u?, u9) as 
the Gaussian frame at pọ. On the other hand, 
we can investigate surfaces as Riemannian 
manifolds defined by the first fundamental 
form. 

A tdiffeomorphism between two surfaces 
preserving arc length is called an isometric 
mapping. The condition of preserving arc 
length is equivalent to the condition that the 
first fundamental quantities of the surfaces 
coincide at each pair of corresponding points, 
provided that we have introduced parameters 
on the two surfaces so that corresponding 
points have the same parameter values. In 
such a case two surfaces are said to be iso- 
metric. From the Gauss equation we can see 
that the total curvature depends only on the 
first fundamental quantities. So K is a quantity 
that is preserved under isometric mappings 
(Gauss’s theorema egregium). 

A vector field 4%(t)x, defined along a curve 
u* =u*(t) on a surface is said to be parallel in 
the sense of Levi-Civita along the curve if its 
tcovariant derivative along the curve vanishes, 
i.e., if 


5i%/dt =dat/dt + Z% du’/dt=0. 

By 
The length of a vector belonging to a vector 
field that is parallel along a curve C is constant 
along C. The angle of two vectors both be- 
longing to vector fields that are parallel along 
C is also constant along C. Choose two vector 
fields A(,, parallel along a curve C on a surface 
that satisfy gag 4n Af) = ôa. Then the tangent 
vector to C is expressed by du*/dt = Xav" (t). 
Take a 2-plane and fix an orthogonal coordi- 
nate system on it; then the integral curve C of 
a set of ordinary differential equations dx*/dt 
= Cia t (t) (Ci = Ala)(Po)) is called the develop- 
ment of C (— 80 Connections). We denote by 
k the curvature of a curve C of class C? ona 
surface S of class C? and by ø the angle be- 
tween the binormal of C and the normal of 
S at the same point. Then the quantity x, = 
«cosa, belonging to the geometry on the sur- 
face, is called the geodesic curvature of the 
curve at the point. A curve with vanishing 
geodesic curvature is called a geodesic. It satis- 
fies the differential equations 


aut (a) dub du’ 
adi gg siecle) 
ds? {By} ds ds 
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The development of a geodesic is a straight 
line (— 364 Riemannian Manifolds). 

Let us consider a simply connected, orient- 
able bounded domain D on a surface such 
that the boundary of D is a simple closed curve 
C that consists of a finite number of arcs of 
class C*. If we denote by a; (i=1,2,...,m) the 
external angles at vertices of the curvilinear 
polygon C (Fig. 7), we have 


| K,ds+ } a;+ {| Kdo=2n. 
C i=1 D 


This is called the Gauss-Bonnet formula. In 
particular, if all the arcs of C are geodesics, we 
have 


> at || Kdo=2n. 
i=1 D 


This formula implies as special cases the 
following well-known theorems in Euclidean 
geometry and spherical trigonometry: (1) The 
sum of interior angles of a triangle is equal to 
z. (it) The area of a spherical triangle is pro- 
portional to its spherical excess. The formula 
also implies the following theorem: On any 
closed orientable surface we have f f K do = 
2nz, where y is the tEuler characteristic of the 
surface. We call f f K do the integral curvature 
(or total Gaussian curvature). 


Fig. 7 


I. Special Surfaces in E* 


A surface is called a surface of revolution if it is 
generated by a curve C on a plane z when z 
is rotated around a straight line / in z. Then 
land C are called an axis of rotation and 

a generating curve, respectively. A surface 

of revolution having the x3-axis as the axis 

of rotation is given by the equations x, = 
rcos@, x, =rsin 0, x,= (vr); its first funda- 
mental form is (1 + @’?)dr? +r? dO. The sec- 
tion of a surface of revolution by a half-plane 
through its axis of rotation is called a meri- 
dian. According as the meridian is a straight 
line parallel to the axis of rotation or a straight 
line intersecting the axis nonorthogonally, 

the surface of revolution is called a circular 
cylinder or a circular cone, respectively. If the 
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meridian is a circle that does not intersect the 
axis of rotation, it is called a torus. 

A surface of class C? whose mean curvature 
H vanishes everywhere is called a minimal 
surface (— 275 Minimal Submanifolds). A 
surface of class C! realizing a relative mini- 
mum of areas among all surfaces of class C! 
with a given closed curve as their boundaries 
is an tanalytic surface such that H =0. Con- 
versely, a surface of class C? with vanishing 
mean curvature is an analytic surface. The 
equation of a surface of revolution with a cate- 
nary as its generating line is given by x? + x3 = 
a(e** + e-*3/9)/2. This surface is called a cate- 
noid and is a minimal surface. Conversely, a 
minimal surface of revolution is necessarily a 
catenoid. For a surface obtained by rotating a 
tDelaunay curve around its base line, the mean 
curvature H is equal to a constant (#0). Con- 
versely, a surface of revolution with nonzero 
constant mean curvature must be such a sur- 
face. A surface with constant Gaussian curva- 
ture is called a surface of constant curvature, 
and is a 2-dimensional Riemannian space of 
constant curvature (— 364 Riemannian Mani- 
folds). A non-Euclidean plane can be repre- 
sented locally as a surface of constant curva- 
ture (— 285 Non-Euclidean Geometry). Two 
surfaces of the same constant curvature are 
locally isometric to each other. 

Surfaces of revolution of constant curvature 
are Classified. The simplest surface of constant 
negative curvature is a pseudosphere, which is 
a surface of revolution obtained by rotating a 
ttractrix x; =acos@, x,=alogtan((@/2)+ 
(z/4))—asing (—2/2<g<zx/2) around the 
x-axis. A surface generated by a 1-parameter 
family of straight lines is called a ruled surface; 
a hyperboloid of one sheet, a hyperbolic para- 
boloid, a circular cylinder, and a circular cone 
are examples. The first two can be regarded 
as ruled surfaces in two ways. Each of the 
straight lines that generate a ruled surface is 
called a generating line. A surface consisting of 
straight lines parallel to a fixed line and pass- 
ing through each point of a space curve C is 
called a cylindrical surface with the director 
curve C. A surface generated by a straight line 
that connects a certain point o with each point 
of a curve C is called a conical surface. Both a 
cylindrical surface and a conical surface are 
ruled surfaces such that K =0 everywhere. For 
ruled surfaces we have K <0. In particular, a 
surface such that H #0 and K =0 everywhere 
is called a developable surface. A developable 
surface must be either a cylindrical surface, a 
conical surface, or a tangent surface of a space 
curve. There exist ruled surfaces that are not 
developable, for example, hyperboloids of one 
sheet and hyperbolic paraboloids. A nondevel- 
opable ruled surface is called a skew surface. A 
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ruled surface generated by a straight line that 
moves under a certain rule intersecting a 

fixed straight line / orthogonally is called a 
right conoid. If we take l as the x3-axis, the 
surface is given by the equations x, =ucos», 
x, =usinv, x, = f(v). A surface generated by a 
curve C (C may be chosen as a plane curve) 
that moves in the direction of a fixed line / 
with constant velocity and turns around / with 
certain constant angular velocity is called a 
helicoidal surface. If we take | as the x-axis, 
the surface is given by the equations x, = 
UCOSU, X, =usinv, X3 = f(u)+ kv, where k is a 
constant and x, = f(x,) is the equation of C. In 
particular, if C is a straight line that intersects | 
orthogonally, then f(u)=0, and the surface is 
called a right helicoid (or ordinary helicoid). 

A right conoid is both a ruled surface and a 
minimal surface. Conversely, a ruled surface 
that is also a minimal surface is necessarily a 
right conoid. A helicoidal surface with a trac- 
trix as the curve C is called a Dini surface and 
is a surface of constant negative curvature. On 
the normal of a surface S two points q; (i= 1,2) 
are centers of principal curvature at p. The 
locus of each of these points is a surface called 
a center surface of S. When S is a sphere, two 
center surfaces degenerate to a point; if Sis a 
surface of revolution, one of the center surfaces 
degenerates to the axis of revolution and the 
other is a certain surface of revolution. If S is 
general, each of the center surfaces is the locus 
of an edge of regression of the developable 
surface generated by normals of S along a line 
of curvature. 

When a 1-parameter family of surfaces S, is 
given by the equation F(x,,x,,x3,t)=0,a 
surface E that does not belong to this family is 
called an enveloping surface of the family of 
surfaces {S,} if E is tangent to some S, at each 
point of E, that is, if E and S, have the same 
tangent plane. The equation of E is obtained 
by eliminating t from F(x,,x,,x3,t)=0 and 
(OF /6t)(x1,X2,X3,t)=O. In general, if we de- 
note by (x1, X2, X3)=0 the equation ob- 
tained by eliminating t from F =0 and 6F/ét= 
0, then the surface defined by @ =0 is either 
the enveloping surface of {S,} or the locus 
of singular points of S,. The intersection Car 
of the enveloping surface E of {S,} and S, 
is a curve defined by F(x1, X2, X3,to)=0, 

(OF /0t)(x,,X2,X3,l9)=0. We call C, a charac- 
teristic curve of {S,}. Since {C,} is a family of 
curves on the enveloping surface E, there may 
exist an envelope F on E. In such a case, F is 
called the line of regression of {S,}. The equa- 
tion of F is obtained by eliminating t from F = 
0, 6F/ét=0, and 6?F/ét? =0. In particular, 

the enveloping surface of a family of planes is a 
developable surface, and their characteristic 
curves are straight lines. Moreover, the line of 
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regression coincides with the line of regression 
of the tangent surface. 

If there exists a diffeomorphism between 
two surfaces such that first fundamental forms 
at each pair of corresponding points are pro- 
portional, then the surfaces are said to be ina 
conformal correspondence. In particular, when 
the proportionality factor is a constant, they 
are said to be in a similar (or homothetic) 
correspondence. There exists a local conformal 
correspondence between any analytic surface 
and a plane. Namely, if we choose suitable 
parameters, we can reduce the first funda- 
mental form of any analytic surface to the 
form A(é,y)(dé? + dn’). Such parameters are 
called isothermal parameters. From the exis- 
tence of isothermal parameters we can see 
that there exists a local conformal correspon- 
dence between any two analytic surfaces. The 
assumption of analyticity in these theorems is 
not necessary [10]. If there exists a diffeomor- 
phism between two surfaces under which geo- 
desics are mapped to geodesics, then the sur- 
faces are said to be in geodesic correspondence. 
A surface has a locally geodesic correspon- 
dence with a plane if and only if it is a surface 
of constant curvature. If two surfaces are in 
geodesic correspondence, then with respect 
to parameters with the same values at corre- 
sponding points, we have the relation 


x x a a 
Bif By es 


for coefficients of connections of the two 
surfaces. 

By the tAlexander-Pontryagin duality theo- 
rem, a submanifold M in E? that is homeo- 
morphic to $° divides E? into two domains, 
and two points belonging to different domains 
cannot be connected by a broken segment 
unless the segment meets the surface. Such a 
manifold M is called a closed surface. One of 
the two domains consists of those points with 
bounded distance from a point belonging to 
the domain. Such a domain is called the inte- 
rior of the closed surface M. If the set M* con- 
sisting of M and its interior is convex in E°, 
the surface M is called a closed convex surface 
(or ovaloid). 

The Gaussian curvature of an ovaloid can- 
not be negative at any point. A closed surface 
with K >0 must be an ovaloid. Moreover, it is 
known that on any closed surface there exists 
at least one point where K > 0 (J. Hadamard). 
If there exists no umbilical point and K is 
strictly positive in a domain on a surface, then 
the two principal curvatures regarded as con- 
tinuous functions on the domain cannot take 
their local maximum and local minimum 
values at the same point (D. Hilbert). A com- 
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pact, connected surface of class C* with con- 
stant Gaussian curvature is a sphere. A closed 
surface with K >0 and H=constant is a 
sphere (H. Liebmann). 

A problem proposed by H. Hopf asks 
whether an orientable compact surface with 
constant mean curvature is a sphere. In con- 
nection with this problem, Hopf showed that a 
closed orientable surface of class C? of genus 
zero with constant mean curvature is a sphere. 
If there is a certain relation W(k,,k,)=0 be- 
tween the two principal curvatures k,, k, 

(k, =k.) of a surface, the surface is called a 
Weingarten surface (or W-surface). There are 
many interesting results for W-surfaces. As an 
extension of the convex surface, tight immer- 
sions have been studied (— 365 Riemannian 
Submanifolds). 

The Gaussian curvature K is invariant 
under isometries. Hence a sphere is trans- 
formed to a sphere by each isometry. This 
fact is sometimes described as the rigidity of a 
sphere. More generally, if two ovaloids are iso- 
metric, then they are congruent (Cohn-Vossen’s 
theorem). It is known that if we remove a small 
circular disk from a sphere, then the remaining 
portion of the sphere is isometrically deform- 
able. On the existence of closed geodesics on 
ovaloids, G. D. Birkhoff proved the follow- 
ing theorem: There exist at least three closed 
geodesics on any ovaloid of class C?. It is also 
known that there exist surfaces of revolution 
that are not spheres but whose geodesics are 
all closed (— 178 Geodesics). On a hyperbolic 
non-Euclidean compact tspace form of genus 
p (p> 2) there exists a geodesic whose points 
are everywhere dense in it (E. Hopf) (for the 
ergodicity of flows along geodesics on this 
surface — 136 Ergodic Theory; also 126 
Dynamical Systems). 


J. Singular Points of a Surface 


Suppose that a neighborhood of a point py of 
a surface S in E? is given by a certain vector- 
valued function f of class C” as r= f(u, v). Then 
a point pọ where two vectors (Of/du),,, (Cf/Cv),, 
are linearly independent is called a regular 
point. A point on S that is not regular is called 
a singular point. If for suitable parameters we 
have (6f/du),, =O but (2f/év),,, (07 f/éu’),,. 

(67 f/du ðv), are linearly independent, then 
such a singular point is called a semiregular 
point. In general, shapes of neighborhoods of 
singular points are extremely complicated. 
However, we note the following: (i) by a small 
deformation of the function f (and its deriva- 
tives of orders at most r) we can reduce py toa 
regular or semiregular point of the deformed 
surface; (ii) if pọ is semiregular, we can choose 
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suitable parameters and curvilinear coordi- 
nates of class C” in E? near pọ so that the 
surface S in the neighborhood of the origin po 
is expressed by the equations x, =u?, x, = 

v, x3 =uv (H. Whitney’s theorem [15]). (The 
higher-dimensional case has also been consid- 
ered (Whitney [16]).) 
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A. Definition 


A mapping (or an operator) A of a function 
space F, to a function space F, is said to be 
a differential operator if the value f(x) of the 
image f = Au (ue F, feF,) at each point x is 
determined by the values at x of u and a finite 
number of its derivatives. If u and f are tdistri- 
butions, the definition applies with the deriva- 
tive interpreted in the sense of distributions 
(— 125 Distributions and Hyperfunctions). In 
this article we restrict ourselves to the case of 
linear differential operators and consider only 
those of the form 
P(x, D)= Ý a,(x)D*, (1) 
la| <m 
where « denotes n-tuples (&;, €23, ..., p) of 
nonnegative integers, called multi-indices; | «| 
the length of «:|a|=«, +2 +... +æ, and D* 
the differential operator D” = Df}: D}... D%", 
with D;=(—i)ð/ôx;. The coefficient (—i) is 
sometimes omitted. The coefficients a,(x) 
are functions defined on an open set Q in n- 
dimensional space. We call P(x, D) an ordinary 
differential operator if the dimension n of Q is 
1 and a partial differential operator if n> 2. 
Ordinary differential operators and partial 
differential operators behave quite differently 
in many respects. 
We set 
P(x, 9= 2 a(x), =% Ca, 
where €=(€,,&,,...,¢,)€R” or C". The order 
of P(x, D) is the greatest integer |«| for which 
a,{x)#0. In expression (1) m is assumed to be 
equal to the order, and in that case 
P(x, D)= > a,(x)D* 
jal=m 
is called the principal part of P(x, D), and the 
corresponding polynomial P,,(x, č) the charac- 
teristic polynomial. 
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Differential operators have been investt- 
gated for a long time in connection with the 
linear differential equations 


P(x, D)u(x)= f(x),  xeQ. (2) 


Except for ordinary differential operators, 
however, it is rather recently that the prop- 
erties of such operators have been studied 
from the general viewpoint. 

We denote a differential operator with 
constant coefficients by P(D). In general, 

P(x, D) is assumed to be a linear differen- 

tial operator with coefficients that are C®- 
functions. However, many of the results for 
C*-coefficients also hold when the coefficients 
are sufficiently differentiable. (For tfunction 
spaces BQ), B(Q), &(Q), &(Q), CQ), C°(Q), 
Co (Q), L,(Q), (9), BQ), ete., — 125 Distri- 
butions and Hyperfunctions, 168 Function 
Spaces). 

Differential operators are classified accord- 
ing to their properties. The most important are 
the elliptic, hyperbolic, and parabolic types. 

A differential operator P(x, D) is said to be an 
elliptic operator if the characteristic poly- 
nomial P„(x, č) has no real zero except for 
€=0 for each xeQ. Typical examples are 

the Laplacian A= —(D? + ... + DŻ) and the 
Cauchy-Riemann operator 0/0z =(1/2)(6/éx + 
id/dy). 

A differential operator is said to be hyper- 
bolic if the associated Cauchy problem is well 
posed (— 325 Partial Differential Equations of 
Hyperbolic Type). The d’Alembertian D? — 
(D3 +... + D?)is an example. A differential 
operator of the form 6/ét+ P(t, x, D,) is called 
parabolic if P(t, x, D,) is strongly elliptic (— 
Section G) in x. The heat operator iD,,, ~A is 
typical. 

These three types of operators appear most 
often in applications, and if n=2, then any 
operator of order 2 with real coefficients in 
(6/6x;)* belongs to one of them at a generic 
point. In other cases, however, there are dif- 
ferential operators that do not belong to any 
of them. 


B. Fundamental Solutions 


If a differential operator P(x, D) with AQ) as 
its domain has a left inverse F that is expressed 
as an tintegral operator with tkernel distri- 
bution (in 2’, ,; = 125 Distributions and 
Hyperfunctions F), then the kernel is said to be 
a fundamental solution (or elementary solution). 
F is usually a right inverse of the weak exten- 
sion (— Section F) of P(x, D) and maps Z(Q) 
into 6(Q). The image is mapped to the original 
function by P(x, D). Nevertheless, F is not a 
genuine right inverse, and hence the funda- 
mental solution is not unique if it exists. 
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When P(D) is a differential operator with 
constant coefficients, we call a distribution 
E(x) a fundamental solution if it satisfies 


P(D)E(x) = 6(x), (3) 


where 0(x) is tDirac’s distribution (ô function). 
If E(x) is a fundamental solution in this sense, 
then F(x, y)= E(x —y) is the kernel of a left 
inverse of P(D) and is a fundamental solution 
in the sense of the preceding paragraph. 
Every differential operator P(D) with con- 
stant coefficients has a fundamental solu- 
tion in the sense of (3) (Ehrenpreis-Malgrange 
theorem; see L. Hörmander [4] for a proof). 
General operators P(x, D) with variable 
coefficients do not necessarily have funda- 
mental solutions. However, if P(x, D) belongs 
to one of the classical types of operators (ellip- 
tic, hyperbolic, or parabolic), then it has a 
fundamental solution at least locally. (See 
F. John [8] for elliptic operators, J. Leray 
[11] for strongly hyperbolic operators, and S. 
Mizohata [12] and S. D. Eidel’man [9] for 
parabolic operators.) Leray has generalized 
John’s method to strongly hyperbolic opera- 
tors in an enormous work [13]. 


C. Ranges of Differential Operators 


Let P(D) be a differential operator with 
constant coefficients. Then it follows from 

the Ehrenpreis-Malgrange theorem that 
P(D)2'(Q)> BQ) holds for any open set Q. 
However, there are differential operators 

P(x, D) with variable coefficients such that for 
any Q, P(x, D)Z(Q)f BQ). H. Lewy first 
devised such an example: 


P(x, D) = —iD, + D, —2(x, +ix2)D3. 


Let C,,,-1(x, D) be the homogeneous part of 
order 2m—1 of the commutator 


P(x, D) P(x, D) — P(x, D) P(x, D). 


Then in order that P(x, D)G(Q)> A), it is 
necessary that 


Pail ¢)=0 imply Com-1 (X, ¢)=0 


for all xeQ, ¢e R” (H6rmander’s theorem [4]). 
When P(x, D) is a differential operator that 
does not satisfy this condition (e.g., Lewy’s 
operator), choose an f(x)e AQ) that is not in 
P(x, D)G'(Q). Then the differential equation (2) 
has no distribution solutions at all. P. Scha- 
pira extended this result to the case of thyper- 
functions (also — 274 Microlocal Analysis). 
Concerning the ranges of differential oper- 
ators P(D) with constant coefficients, we have 
the following detailed results due to L. Ehren- 
preis [28], B. Malgrange [14], and Hormander 


[4]. 
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An open set Q is said to be P-convex for a 
differential operator P(D) if for each compact 
set K <Q there exists a compact set K'cQ 
such that ọ e C5 (Q) and supp P(—D)gc K 
imply supp ọ c K’. Convex sets are P-convex 
for any P(D). All open sets are P-convex if and 
only if P(D) is an elliptic operator. 

Theorem: The following conditions are 
equivalent: (i) Q is P-convex; (ii) P(D)F(Q)> 
&(Q); (iii) P(D)&(Q) = &(Q). Property (tii), the 
Mittag-Leffler theorem, and the solvability of 
*Cousin’s first problem for the solutions of 
P(D)u=0 are equivalent. 

An open set Q is said to be strongly P- 
convex if for each compact set K <Q there 
exists a compact set K’ such that we &'(Q) and 
supp P(—D)u< K imply supp uc K’; and 
ueé& (Q) and sing supp P(—D)uc K imply sing 
suppu € K’; where the singular support of u is 
the closure of the set of all points at which p is 
not a C”-function. Convex sets are strongly P- 
convex, and strongly P-convex sets are P- 
convex. 

Theorem: Q is strongly P-convex if and only 
if P(D)Z'(Q)= BQ). 

R. Harvey has shown that every domain Q 
is P-convex in the sense of hyperfunctions, i.e., 
the equation P(D)u= f always has a hyper- 
function solution u on Q for any hyperfunction 
f on Q. For the treal analytic functions .e(Q), 
however, P(D).o(Q) = (Q) does not hold for 
convex open set Q in general. H6rmander [15] 
gave a necessary and sufficient condition for 
P(D) and Q in order that this hold. 


D. Hypoellipticity 


A differential operator P(x, D) is called hypo- 
elliptic in Q if for any distribution u(x)e 2'(Q), 
Pue C”(Q) implies ue C”(Q). Further, a differ- 
ential operator with real analytic coefficients 
P(x, D) is called analytically hypoelliptic in Q 
if P is hypoelliptic and if for any distribution 
u(x)e B(Q), Pue £(Q) implies u{x)e (9). 

There are two fundamental facts about 
such operators. Let P have constant coeffi- 
cients; then P(D) is hypoelliptic if and only if 
P(č +iņ)=0 and |ë +iņn|>œ imply that |y|— œ 
(Hörmander’s theorem [4]). Furthermore, P(D) 
is analytic hypoelliptic if and only if P is ellip- 
tic (Petrovskis theorem [16]). The heat opera- 
tor is not elliptic, but hypoelliptic. If P(D) is 
elliptic, then actually any hyperfunction u(x)e 
ABQ) such that Pue (Q) is real analytic (R. 
Harvey, G. Bengel). On the other hand, if P(D) 
is not elliptic, there is a hyperfunction solution 
u of Pu=0 that is not a distribution. 

Strictly speaking, the notion of the hypo- 
ellipticity for general differential operators 
was first formulated explicitly by L. Schwartz 
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[27]. Before that time, D. Hilbert, E. E. Levi 
and K. O. Friedrichs, and others investigated 
this problem for some elliptic operators, and 
the hypoellipticity was called Weyl’s lemma 
(— 323 Partial Differential Equations of Ellip- 
tic Type). 

Similarly to the constant coefficient case 
the following two theorems are fundamental: 
Elliptic and parabolic operators P(x, D) with 
C® coefficients are hypoelliptic (Schwartz [27], 
Mizohata [12]). Elliptic operators P(x, D) 
with real analytic coefficients are analytic 
hypoelliptic (I. G. Petrovskii [16], C. B. Mor- 
rey and L. Nirenberg). The latter holds also 
for hyperfunction solutions (M. Sato and 
Schapira). Moreover, the following result is 
known: If for each compact set K in an open 
set Q, there exists a constant C such that 
| P¥ ull e < C**!(mk)!, then ue.(Q), where ||-|| x 
denotes the L,-norm on K (H. Komatsu, T. 
Kotake, and M. S. Narasimhan). 

However, as seen by the example D, +ix{ D, 
in R? (k is even), the analytic hypoellipticity 
also holds for nonelliptic operators. Such an 
operator is called a subelliptic operator; subel- 
liptic operators have been investigated by 
Hormander, Yu. V. Egorov, F. Treves, and 
others [19]. 

Hormander has obtained a fairly complete 
result on the hypoellipticity of the operators of 
the form 


L=} X?+Xo+c(x), 

i=1 
where X,,..., X, are homogeneous first-order 
differential operators with real coefficients 
({17]; O. A. Oleinik and E. V. Radkevich 
[18]). Hypoellipticity was investigated exten- 
sively after the introduction of pseudodifferen- 
tial operators and Fourier integral operators 
(— 345 Pseudodifferential Operators). 


E. Differential Operators in Banach Spaces 


We consider differential operators P(x, D) de- 
fined on a domain Q as operators in the func- 
tion spaces C(Q) or L,(Q). Differential oper- 
ators of order m>1 are always unbounded 
operators in the Banach space X = C(Q) or 
L,(Q). Moreover, their domains of definition 
as operators in X are not generally determined 
uniquely by the expressions P(x, D) as differen- 
tial operators. 

P(x, D) is a linear operator that maps 

o (Q) into X. This operator has a closed 
extension. The minimal closed extension P, is 
called the minimal operator of P(x, D) in X. 
We have ue 2(P,) and Pou=/f if and only if 
there exists a sequence pE Cy (Q) such that 
P, >u, P(x, D)o, > f. On the other hand, the 
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closed linear operator P, whose domain is 

the set of all ue X such that P(x, D)ue X in 
the sense of distribution is called the maxi- 
mal operator (or weak extension) of P(x, D). 
We have ue 2(P,) and P,u= f if and only if 
<u, ‘P(x, Dp) =< f, p> for any pe C(O), 
where ‘P(x, D) is the transposed operator 

‘P(x, D) g(x) z > (—D)*(a,(x) p(x). 
Integration by parts shows that P, is an exten- 
sion of P, and that when X is the tdual space 
of a space Y, the weak extension P, in X is the 
tdual of the minimal operator of ‘P(x, D) in 

Y. Let X =L,(Q) (1 <p < œ), Q be a bounded 
open set with smooth boundary, and P(D) 
have constant coefficients. Then P, coincides 
with the smallest closed extension of the oper- 
ator P(D) having as its domain the set of all 
ueC”(Q)NX such that P(D)ue X. The latter 
closed extension is called the strong extension. 
The difference between the weak and the 
strong extension is not obvious in the vari- 
able coefficient case. 

P, coincides with P, when Q is the entire 
space and P(x, D) is an elliptic operator whose 
coefficients are constants or close to constants 
(J. Peetre, Medd. Lunds Univ. Mat. Sem., 16 
(1959); T. Ikebe and T. Kato, Arch. Rational 
Mech. Anal., 9 (1962)). In general, we have 
P, #P,. Let P(x, D) be an ordinary differential 
operator with bounded coefficients such that 
(a,,{(x)| 26 >0 and Q be the bounded interval 
(a,b). Then the domain of P, coincides with the 
set of all (m—1)-times continuously differenti- 
able functions u such that the (m— 1)st deriva- 
tive is absolutely continuous and P(x, Djue X, 
while the domain of P, is the set of all func- 
tions u which satisfy in addition the boundary 
conditions 


u(a)=u'(a)=...=u'™ (a) 
=u(b)=...=u'"")(b)=0. 


(Moreover, u’”)(a)=u(b)=0 when X = 
C({a, b).) 

Let G(P)), G(P,)(< X x X) be the tgraphs of 
Po, P,. Then the quotient space # = G(P,)/ 
G(P,) is called the boundary space, and an 
element of the dual #’ of &, i.e., a continuous 
linear functional on G(P,) which vanishes on 
G(P,), is called a boundary value relative to 
P(x, D). For the ordinary differential opera- 
tors discussed above, the boundary space is 
the set of all linear combinations of u(a) and 
u)(b). When P(x, D) is an ordinary differen- 
tial operator, we can explicitly determine the 
boundary values also in the case where the 
interval (a, b) is infinite, the coefficients a,(x) 
are not bounded, or a„(x)>0 (xa, b), and we 
can show that 2 is finite-dimensional. When 
P(x, D) is a partial differential operator, 2 is 
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generally of infinite dimension, and the con- 
crete forms of the elements of # and #' are 
not known. However, we have some informa- 
tion by M. I. Vishik (Amer. Math. Soc. Transl., 
(2) 24 (1963)) about the boundary values of 
elliptic operators of the second order. Combin- 
ing this with the results by J.-L. Lions and E. 
Magenes (J. Anal. Math., 11 (1963)), we can 
obtain information for elliptic operators of 
higher order. 


F. Differential Operators with Boundary 
Conditions 


A closed operator between the minimal opera- 
tor P) and the maximal operator P, is deter- 
mined by designating a closed subspace B 

of the boundary space 2. This operator is 
called the operator with the boundary condition 
B. Particularly important are boundary condi- 
tions expressed in the form 


Ox, Dyu(x)=0, xeôQ, i=1,...,k, (4) 


with differential operators Q;(x, D) (i=1, ...,k) 
defined on the boundary ôN of Q. 

When P(x, D) is an ordinary differential 
operator defined on a finite interval and the 
orders of Q; are at most m — 1 (or m), (4) always 
has a definite meaning. However, for partial 
differential operators, we need an interpreta- 
tion of (4), i.e., (4) does not necessarily deter- 
mine the subspace B of # uniquely. 

Let P, be the smallest closed extension of 
P(x, D) with {ueC?(Q)N.X | O,(x, D)u(x)=0, 
xeE6Q; P(x, D)ue X} as its domain. P, is called 
the strong extension of the differential operator 
P(x, D) with boundary condition (4). 

On the other hand, when Q, P, and Q; 
satisfy suitable conditions, we can define the 
transposed differential operator 'P(x, D) with 
the transposed boundary operators R,(x, D) 
(j=1,...,1). Namely, there are differential 
operators R,(x, D) on the boundary such that 
a necessary and sufficient condition for ue 
C%(Q) to satisfy (4) and Pu(x)= f(x) is 


| flsts)ax= | u(x)'Pv(x)dx (5) 
Q Q 

for all v(x)e C” (Q) with the boundary con- 
ditions Rjv=0, xeôQ. Then the operator P, 
defined by P,,u(x)= f(x) for the pairs u(x), 
f(x)eX satisfying (5) is called the weak exten- 
sion of the differential operator P(x, D) with 
boundary condition (4). As in the case of 
operators without boundary conditions, the 
weak extension is an extension of the strong 
extension, and generally is the dual of the 
strong extension in the dual space X’ of the 
transposed differential operator with the trans- 
posed boundary condition. 
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Regularity up to the boundary. The funda- 
mental problems for the differential operator 
P(x, D) with the homogeneous boundary 
condition (4) are to determine, for both the 
strong and the weak extensions, the spaces 
of solutions of the homogeneous equations 
Pu=0 and their ranges. The problems mostly 
reduce to determining when the strong and 
the weak extensions coincide and, including 
this, also to the problem of regularity on the 
closed domain © containing the boundary 
of the solutions u of the equation Pu = f. 

This problem was solved by Nirenberg for 
strongly elliptic operators in L?(Q) with the 
Dirichlet boundary condition 


Ft u(x)/on?'=0, j=1,2,...,m/2, 


and generalized later by F. Browder, M. 
Schechter [20], S. Agmon, Lions, and others 
for elliptic operators in L,(Q) with a kind of 
coercive boundary condition (— Section H). 
Consequently, when Q is bounded and 
smooth, P,, is equal to P, for those operators. 
Write P for P,,. Then the space N(P) of the 
solutions of Pu=0 is a subspace of finite di- 
mension, and the range R(P) is a closed sub- 
space of finite codimension. In particular, 
it follows that the tindex of P, dim N(P)— 
codim R(P), is finite. 


G. Strongly Elliptic Operators 


A differential operator P(x, D) is said to be 
strongly elliptic if its characteristic polynomial 
satisfies 


ReP,,(x, c)2C|c(">0, č#0. 


Many of the elliptic operators, such as the 
Laplacian, that appear in applications are 
strongly elliptic. L. Garding treated the 
boundary value problem with the Dirichlet 
condition (in the generalized sense) for strong- 
ly elliptic operators. His work initiated the 
general study of differential operators (— 323 
Partial Differential Equations of Elliptic 
Type). His theory is based on the following 
inequality, called Garding’s inequality: 


ivmPulf, <C( Re | Puri jul, ), 


uEeCe(Q). 


H. Coercive Boundary Conditions 


The boundary condition (4) is said to be coer- 
cive if 

WW"ull <C(| Pull + ull) (6) 
holds for any ue C” (0) that satisfies (4). In 
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order that a differential operator P have a 
coercive boundary condition, it is necessary 
that it be a special type of elliptic operator. In 
this case, Agmon, N. Aronszajn, Schechter, 
and others found conditions under which (4) is 
coercive. Agmon, A. Douglis, and Nirenberg 
show that the inequality (6) holds in L,(Q) and 
in the normed spaces of Hdlder continuous 
functions under a suitable condition. The 
classical boundary conditions au + béu/én=0 
(a>0,b>0, a+ b=1) are coercive for elliptic 
operators of the second order (— 323 Partial 
Differential Equations of Elliptic Type). How- 
ever, problems remain when the coefficients 

of Q,(x, D) are discontinuous. In order to 

have coincidence of the strong and weak exten- 
sions or regularity up to the boundary, it is 
necessary neither that P be elliptic nor that the 
boundary condition be coercive. But it is not 
known to what extent these conditions can be 
weakened. At present, major contributions are 
H6érmander’s work (Acta Math., 99 (1958)) 
dealing with operators with constant coeffi- 
cients and flat boundaries, and works by J. J. 
Kohn [21], Nirenberg, and Hormander con- 
cerning noncoercive boundary conditions. 
The latter works are connected with the theory 
of several complex variables, and have at- 
tracted much attention. 


I. Self-Adjoint Extension 


One of the fundamental problems in the case 
X =L,(Q) is whether the minimal operator Po 
has a self-adjoint extension. P, is symmetric 
if and only if P(x, D) is formally self-adjoint: 
P(x, D)='‘P(x, D). Under this condition the 
boundary space # turns out to be the direct 
sum of two subspaces &, = {(x, P,x)|xEF(P,), 
P,x= +ix}+ G(P)). The numbers n, = dim 2, 
are called the deficiency indices of P,, and P, 
has a self-adjoint extension if and only if n4 
SMa 

H. Weyl gave a method for computing n4 
for the Sturm-Liouville operators: 


d d 
P(x, D)= ~ (Ewi rat x €(a, b). 


We say that a (resp. b) is of limit circle type 

if the solutions u(x) of P(x, D)u(x)+lu(x)=0 
(leC) always belong to L, in a neighborhood 
of a (resp. b) and of limit point type if a solu- 
tion does not belong to L,. This classification 
does not depend on the choice of leC. (1) If 
both a and b are of limit point type, then 

n, =n_=0 and hence Py is self-adjoint. (2) If 
ais of limit circle type and b is of limit point 
type, then n, =n_ =1, and the self-adjoint 
extensions of P, are the operators P, that are 
obtained from P, by assigning the boundary 
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condition 
p(a)u'(a)cos «+ u(a)sin«=0. 


The same is true when a and b are inter- 
changed. (3) If both a and b are of limit circle 
type, then n, =n_ =2, and we can impose two 
boundary conditions to obtain the self-adjoint 
extensions. 

These results have been extended by K. 
Kodaira [23] and N. Dunford and J. Schwartz 
[3] to the case of ordinary differential opera- 
tors of order m. There are formally self-adjoint 
operators that have no self-adjoint extensions. 
For example, the operator —id/dx in L,(0, œ) 
has deficiency indices n, =14n_=0. 

For partial differential operators, it is dif- 
ficult to determine explicitly all self-adjoint 
extensions of a given formally self-adjoint 
operator because the boundary space is com- 
plicated. If the boundary condition (4) is for- 
mally self-adjoint and coercive, then it follows 
from the results of Schechter and others that 
the differential operator with boundary condi- 
tion (4) is self-adjoint. Furthermore, condi- 
tions under which P, is self-adjoint or has a 
self-adjoint extension are known. The follow- 
ing theorem is often used as a condition of the 
latter type. If a tsymmetric operator defined on 
a dense subspace of a Hilbert space X is posi- 
tive definite: 


(Tx,x)20, xe2(T), 


then there is a positive definite self-adjoint 
extension T (Friedrichs’s theorem). The self- 
adjoint extension obtained by this theorem is 
called the Friedrichs extension. 


J. Generators of Semigroups 


From the point of view of probability theory, 
W. Feller investigated the extensions of the 
Laplacian d?/dx* and similar operators that 
are the generators of order-preserving semi- 
groups. Recently various attempts have been 
made to generalize his results to the multi- 
dimensional case (— 115 Diffusion Processes; 
378 Semigroups of Operators and Evolution 
Equations). 

P. D. Lax and A. N. Milgram proved that 
if P(x, D) is a strongly elliptic operator, then 
— P(x, D) with the Dirichlet condition in L,(Q) 
is the generator of a semigroup [1]. 


K. Boundary Value Problems 


There are two methods of solving the inhomo- 
geneous boundary value problem 


P(x, D)u(x)= f(x), xeEQ, 
O,(x, D)u(x)=9,(x), xEéQ, i=1,...,k. 
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In the first method, we take a function v(x) 
that satisfies Q;(x, D)v(x)=g,(x) and reduce 
the problem to the homogeneous one for ug = 
u—v. In the second method, we consider the 
pair P(x, D) and Q,(x, D) as an operator that 
maps a function u to the pair of functions 
(Pu, Q;u) and investigate it directly. The latter 
method was adopted by Peetre and Hör- 
mander [4]. 


L. Estimates in Weighted Spaces 


J. F. Treves, H6rmander [4], and H. Kumano- 
go obtained estimates similar to (6) in L,- 
spaces relative to the weighted measure 

w,(x) dx instead of the usual L,-spaces, and 
applied them to the proof of the uniqueness 
of Cauchy problems for differential equations 
with variable coefficients (— 321 Partial Dif- 
ferential Equations (Initial Value Problems)). 
Hormander [22] applied similar estimates to 
the proof of the fundamental theorems of 
Stein manifolds. 


M. Eigenfunction Expansions 


When a self-adjoint operator P in the Hilbert 
space L,(Q) is a self-adjoint extension of a 
differential operator P(x, D), the tspectral 
decomposition of P is concretely expressed by 
the expansion of functions ue L,(Q) into eigen- 
functions of P(x, D). 

If Q is bounded, P(x, D) is an elliptic oper- 
ator defined on a neighborhood of Q, and 
the boundary condition is coercive, then the 
tspectrum of P is composed solely of eigen- 
values, and the eigenvectors of P are eigen- 
functions of P(x, D) in the classical sense and 
are of class C” up to the boundary. 


N. Asymptotic Distribution of Eigenvalues 


If P= —A, the number v(A) of eigenvalues less 
than å satisfies the asymptotic relation 


An? A 


f ae 3 A 
winn n S TS 


v(A) 
regardless of the shape of the domain and the 
boundary condition, where n is the dimension 
and A is the volume of Q [2]. This was first 
proved by Weyl and extended by T. Carleman, 
Garding, and others to the case of operators 
of higher order with variable coefficients (— 
323 Partial Differential Equations of Elliptic 


Type). 
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O. Weyl-Stone-Titchmarsh-K odaira Theory 


When Q is unbounded or the coefficients of 
P(x, D) have singularities near the boundary of 
Q, the spectrum of P may have a continuous 
part. 

Let P(x, D) be an ordinary differential oper- 
ator on an interval (a, b). Then for each Ae 
C the equation (P(x, D)—A)p(x)=0 has m 
linearly independent solutions ¢,(x, A) (k= 
1,...,m), and any solution is represented as 
a linear combination of them. Weyl and M. H. 
Stone obtained the spectral decomposition of 
P (of the second order) in the form 


ux)= F | p(x, A) dp, (A) 


pk=l 


b 
«| p,(y, Alu(y) dy, 


a 


Pu(x)= H f Leplx, 4) 4 Pilà) 


b 
«| ay, Au(y) dy, 
where the p,,{4) are functions of bounded 
variation and their variations represent the 
spectral measure. This formula shows that 
linear combinations of the ¢;(x, 4) form gen- 
eralized eigenfunctions even when å belongs to 
the continuous spectrum. Later, E. C. Titch- 
marsh and Kodaira gave a formula to obtain 
the density matrix p;(4) and completed the 
theory (Titchmarsh [6], Kodaira [23], Dun- 
ford and Schwartz [3]). This expansion theo- 
rem makes it possible to deduce in a unified 
manner expansion theorems for classical 
special functions, such as the tFourier series 
expansion theorem, the expansions by tHer- 
mite polynomials and tLaguerre polynomials, 
the tFourier integral theorem, and various 
expansions in terms of tBessel functions [6, 7]. 

The relation between the coefficients of the 
differential operator and the spectral distri- 
bution of P is important in applications and is 
the subject of many papers [3, 5, 6]. 

For example, let P(x, D)= —d?/dx? + q(x) 
and 2=(—o, œ). If g(x) 0 as |x| 00, then 
P(x, D) is essentially self-adjoint, the spectrum 
is entirely composed of the point spectrum, 
and a detailed estimate of the jth eigenvalue /, 
is also known [6]. If q(x) converges rapidly to 
0 as |x| 00, then P(x, D} is essentially self- 
adjoint, and there is only a continuous spec- 
trum for 4>0 and eigenvalues for 1 <0 with 
at most 0 as accumulation point. If q(x) is a 
periodic function of x, then P(x, D) is again 
essentially self-adjoint, and the spectrum con- 
sists of a continuous spectrum decomposed in 
a sequence of nonoverlapping intervals. The 
converse problem of determining q(x) when 
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the spectral measure is given has been studied 
by I. M. Gelfand and B. M. Levitan (Amer. 
Math. Soc. Transl., (2) 1 (1955); original in 
Russian, 1951). 


P. Eigenfunction Expansion for Partial 
Differential Operators 


The theory of eigenfunction expansion for 
partial differential operators with continuous 
spectra is not complete, as it is for ordinary 
differential operators. What causes difficulties 
is that the solutions of (P(x, D)—A)u=0, which 
should be the generalized eigenfunctions, form 
an infinite-dimensional space, and except for 
special cases it is impossible to introduce 
convenient parameters in it. Many proofs are 
known for a general result, saying that any 
self-adjoint elliptic operator has an eigenfunc- 
tion expansion into generalized eigenfunctions 
of the form 


© k(A) 


X g(x, A)dp,(A) j o;(y, A)u(y) dy, 


-œ j=l 


u(x) = 


[10]. There are few operators, however, for 
which we know how to construct (x, 4) and 
the measure dp,(A). The Fourier transform 
gives the expansion for operators with con- 
stant coefficients defined on the whole space. 
By means of a generalized form of the Fourier 
transform, Ikebe gave an eigenfunction expan- 
sion for the Schrödinger operator —A+q(x) 
in R? under the condition that g(x)¢L, and 
O(|x|~*~*) as |x| 00 [24]. Y. Shizuta, Mizo- 
hata, Lax and R. S. Phillips, N. A. Shenk, 
Ikebe, D. K. Fadeev, and others proved the 
same results for similar operators defined 
on an exterior domain with a bounded set 
deleted from a higher-dimensional Euclidean 
space. These theories are closely related to the 
tscattering theory of the Schrödinger equa- 
tion (—id/dt — P)u=0 or the wave equation 
(d*/dt? + P)u=0 associated with those opera- 
tors. Lax and Phillips have developed the 
latter scattering theory [26] (— 375 Scattering 
Theory). 

Many of the problems in fquantum mechan- 
ics reduce to finding the spectral distribution 
of self-adjoint partial differential operators. 


Q. Expansion Theorems for Non-Self-Adjoint 
Operators 


A kind of eigenfunction expansion theorem 
may hold for non-self-adjoint differential 
operators or for non-Hilbert spaces. (See 
papers by the Russian school for ordinary 
differential operators and those by Browder, 
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Agmon, and others for partial differential 
operators.) 


R. Systems of Differential Operators 


We have so far dealt with single differential 
operators that map functions u to functions 
J. A linear differential operator that maps a 
p-tuple (u,,...,u,) of functions to a q-tuple 
(fi> ---» Ja) of functions can be written as 


fix) = 2 P(x, D)u;(x), i= l, see Gy 


where P(x, D)=(P,,(x, D)) is a matrix of single 
differential operators. Such a matrix is called 
a system of differential operators. A system 
P(x, D) is said to be underdetermined if p >q, 
determined if p =q, and overdetermined if p < 
q. Many propositions that hold for single 
operators hold for determined systems under 
appropriate conditions. 

However, there is a fundamental difference 
between overdetermined (underdetermined) 
systems and determined systems, as is seen 
from the theory of several complex variables, 
which is the theory of a typical overdeter- 
mined system 0/0z =(6/0z;). The theory is 
much more difficult for overdetermined (under- 
determined) systems. The general theory of 
overdetermined and underdetermined systems 
with constant coefficients has been constructed 
by Ehrenpreis [28], Malgrange, V. Palamodov, 
and Hormander [22] for C%-functions and 
distributions. It has been extended by H. Ko- 
matsu to the case of hyperfunctions. T. Miwa 
has discussed the same problem for real ana- 
lytic functions. 


S. Symmetric Systems of the First Order 


Determined systems of the first order are 
important in applications. Many problems in 
mathematical physics are formulated in terms 
of them. Also, single equations of higher order 
can be reduced to determined systems of the 
first order by regarding the derivatives as 
unknown functions. In some cases determined 
systems of the first order are easier to handle 
than single operators of higher order. In partic- 
ular, a system of differential operators 


P(x, D)=S° A(x)0/0x;+ B(x) 


is said to be a symmetric positive system if the 
matrices A,{x) and B(x) satisfy the following 
conditions: A¥ = A;; B+ B* + > 64A,/6x; is 
positive semidefinite. Symmetric positive sys- 
tems have been studied in detail by Friedrichs 
[25], Phillips, C. S. Morawetz, Lax, and 
others. 
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Let R be a tcommutative ring with a unity 
element 1. Ifa map 6 of R into R is such that 
for any pair x, y of elements of R, (i) 6(x + y) 
=6dx + dy, and (ii) 6(xy)=6x-y+x- dy, then 6 
is called a derivation (or differentiation) in R. A 
ring R provided with a finite number of mutu- 
ally commutative differentiations in R is called 
a differential ring. In this article we consider 
only the case where R contains a subfield that 
has the unity element in common with R. In 
particular, if R is a field, we call it a differential 
field. 

In the above definition of differential ring, it 
is not necessary to mention the tcharacteristic 
of the subfield contained in R. However, to 
make it more effectively applicable in the case 
of nonzero characteristics, we may define 
differential rings using higher differentiation in 
place of the differentiation defined above. If a 
sequence d= {06,} of maps dp, 6,, 62,... of R 
into R satisfies the following conditions (i)~—(iv) 
for any pair x, y of elements of R and any pair 
A, H of nonnegative integers, then ô is called a 
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higher differentiation in R: (i) 6,(x + y)=6,x 
+ ôy; (i) ô (xy)= È 6,x + òy (the addition is 
performed over all pairs «, 8 of nonnegative 
integers that satisfy «+ B= A); (iii) 6,(6,x) = 


A 

ferentiations ô= {6,} and 6’ = {6,} are said to 
be commutative if and only if 6, and 6), com- 
mute for all pairs A, u of nonnegative integers. 
Higher differentiations were introduced by H. 
Hasse (1935) for the study of the field of talge- 
braic functions of one variable in the case of 
nonzero characteristics. 

These two definitions of differential rings 
coincide if the characteristic of R is zero. For 
simplicity, we shall restrict ourselves to that 
case. 

Let 6,,...,6,, be the differentiations of the 
differential ring R. If x is an element of R, 

OP OF ... On" X(S1, S2, -++ s Sm are nonnegative 
integers) is called a derivative of x. We call x 
constant if and only ifd,x= ... = „X =0. An 
tideal a of R with 6,aca (i=1,2,...,m) is 
called a differential ideal of R. If it is a tprime 
ideal (semiprime ideal (i.c., an ideal containing 
all those elements x that satisfy x’ ea for some 
natural number g)), then a is called a prime 
differential ideal (semiprime differential ideal). 
A subring S of R with ôS cS can be regarded 
as a differential ring with respect to the differ- 
entiations 6,,...,6,,. We call S a differential 
subring of R and R a differential extension ring 
of S. 

Let X,,...,X, be elements of a differential 
extension field of a differential field K with the 
differentiations 6,,...,6,, and let ôẸ 62... 
ôm X, (s1 20, ...,5_, 229, 1<i<n) be talgebra- 
ically independent over K. The totality of their 
polynomials over K, which forms a differential 
ring, is called the ring of differential polyno- 
mials of the differential variables X,,...,X, 
over K, and is denoted by K{X,,...,X,}. Its 
elements are called differential polynomials. 
For this ring of differential polynomials we 
have an analog of *Hilbert’s basis theorem in 
the ring of ordinary polynomials, Ritt’s basis 
theorem: If we are given any set Nt of differen- 
tial polynomials of X,,..., X, over K, we can 
choose a finite number of differential polyno- 
mials P,,..., P, from IR such that each element 
Q of Mi has an integral power Q? equal to a 
linear combination of P,,..., P, and their de- 
rivatives, where the coefficients of the linear 
combination are elements of K{X,,...,X,}- 
This theorem implies that in the ring of differ- 
ential polynomials, every semiprime differential 
ideal can be expressed as the intersection of a 
finite number of prime differential ideals; if this 
expression is tirredundant, it is unique (— 67 
Commutative Rings). 

The equation obtained by equating a dif- 


A 
( 3 i 544,X; (iv) box =x. Two higher dif- 
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ferential polynomial to zero ts called an alge- 
braic differential equation. Concerning these 
equations, we are able to use methods similar 
to those used in talgebraic geometry in study- 
ing the usual algebraic equations. J. Ritt made 
interesting studies on solutions of algebraic 
differential equations by such methods, prin- 
cipally in the case when the ground field K 
consists of fmeromorphic functions. 

Since that time, basic study of differential 
rings and fields has been fairly well organized 
and has developed into theories such as the 
following two: 

(1) Picard-Vessiot theory. This is a classical 
theory of flinear homogeneous differential 
equations originated by E. Picard and E. 
Vessiot; it resembles the tGalois theory con- 
cerning algebraic equations. The Galois group 
in this case is a linear group, and its structure 
characterizes the solution of the differential 
equation. E. Kolchin introduced the general 
concept of the Picard-Vessiot extension field of 
a differential field and studied in detail the 
group of differential automorphisms (i.e., the 
group of all those automorphisms that com- 
mute with the differentiations and fix elements 
of the ground field), thus making the classical 
theory more precise and more general. 

(2) Galois theory of differential fields. Gen- 
eralizing the concept of the Picard-Vesstot 
extension, Kolchin introduced the notion of 
the strongly normal extension field and es- 
tablished the Galois theory for such exten- 
sions. In this theory, we see that the Galois 
group is an ‘algebraic group relative to a 
tuniversal domain over the field of constants 
of the ground field. Conversely, every algebraic 
group is the Galois group of a strongly normal 
extension field. We also see that a strongly 
normal extension can be decomposed, in a 
certain sense, into a Picard-Vessiot extension 
and an Abelian extension (i.e., an extension 
whose Galois group is an tAbelian variety) 

(— Kolchin [2-5], Okugawa [6]}). 
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A. General Remarks 


Differential topology can be defined as the 
study of those properties of tdifferentiable 
manifolds that are invariant under tdiffeo- 
morphisms. The basic objects studied in this 
field are the topological, combinatorial, and 
differentiable structures of manifolds and the 
relationships among them. Some of the re- 
markable contributions, such as H. Whitney’s 
embedding theorem [1], the triangulation theo- 
rems of J. H. C. Whitehead and S. S. Cairnes 
[2,3], and Morse theory [4], were made in 

the 1930s. In the late 1950s, outstanding re- 
sults were obtained by R. Thom, J. W. Milnor, 
S. Smale, M. Kervaire, and F. Hirzebruch, 
among others. Differential topology thus be- 
came a new, fascinating branch of mathematics. 


B. Differentiable Structures 


We assume that all manifolds are tparacom- 
pact. Let M be a ttopological manifold. A 
C’-equivalence class of tatlases of class C” 
(1<r<oo) on M is called a C’-structure on M. 
Any C’-structure on M contains an atlas of 
class C” on M, and its C®-equivalence class is 
uniquely determined (Whitney [1]). This C”- 
structure is called a differentiable structure on 
M compatible with the given C’-structure. 
Moreover, any C”-structure admits a treal 
analytic structure compatible with it in this 
sense (Whitney [1]). A C*-manifold is also 
called a smooth manifold, and a differentiable 
structure a smooth structure. Let Do, D, be 
differentiable structures on M. If two C®- 
manifolds (M, Do) and (M, D,) are not tdif- 
feomorphic, we say that the differentiable 
structures Do, D, are distinct (— 105 Differen- 
tiable Manifolds). Milnor defined a certain 
invariant of differentiable structure using the 
Hirzebruch index theorem (— 56 Character- 
istic Classes) and proved that there are several 
distinct differentiable structures on the 7- 
dimensional sphere S’ (Milnor [5]). Milnor’s 
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example of such structures was: Let f,:S°> 
SO(4) be the mapping defined by f,(o)t = 
o"to/, where k is an odd integer, h, j are inte- 
gers determined by h+j=1, h—j=k, and 
o, t are tquaternions of norm 1. Let M, be 
the total space of the $3-bundle over S* cor- 
responding to the mapping f,. This is an 
oriented closed manifold with the naturally 
defined differentiable structure. Moreover, 
for each k, M, is homeomorphic to the 7- 
dimensional sphere S7. But if k, l are odd and 
k? +I? (mod 7), M, is not diffeomorphic to 
M? . Differentiable manifolds (such as Mẹ) that 
are homeomorphic, but not diffeomorphic, to 
the natural sphere are called exotic spheres. 
After the discovery made by Milnor, many 
topological manifolds other than the 7- 
dimensional sphere and possessing several 
distinct differentiable structures have been 
found (N. Shimada, I. Tamura). Moreover, 
topological manifolds admitting no differen- 
tiable structures have been constructed (M. 
Kervaire [6], Smale, Tamura [7], J. Eells and 
N. Kuiper, C. T. C. Wall [8]). On the other 
hand, manifolds of dimension <3 admit 
unique differentiable structures. 

We can introduce a fRiemannian metric 
on a C’-manifold. Let M{ and M3 be C®- 
manifolds (k-dimensional and n-dimensional, 
respectively) and f: M{— M3 be an timmer- 
sion. We fix a Riemannian metric on M3. For 
each point p of M$, let N,(f) be the linear 
space of all tangent vectors of M3 at f(p) that 
are orthogonal to the ttangent space of f(U(p)), 
where U(p) is a small open neighborhood of p 
in Mj. Then {N,(f)|peM{} forms an (n—k)- 
dimensional tvector bundle v; over M{, called 
the normal bundle of the immersion f. The 
equivalence class of v, is independent of the 
choice of Riemannian metric on M3. When f 
is an fembedding, we denote the submanifold 
f(M*) by L*, Let N,(L*) be the set of all points 
whose distances (with respect to the Riemann- 
ian metric) from L* are <e. If L* is compact 
and e is sufficiently small, N,(L*) is an n- 
dimensional submanifold of M3 with bound- 
ary and is uniquely determined independently 
of the choice of up to diffeomorphism. This 
submanifold is called a tubular neighborhood of 
L in M3 and is denoted by N(L*). The interior 
of N(L*) is called an open tubular neighbor- 
hood. The total space E(v,) of the normal 
bundle v; of the embedding f is diffeomorphic 
to the open tubular neighborhood of L* in MZ. 


C. C’-Triangulation and the Smoothing 
Problem 


Let (K, f) be a ttriangulation of the C’- 
manifold M” (1<r<oo)(f:|K|->M") satisfying 
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the following conditions: (i) For any closed n- 
simplex o of K, f|¢:0-M” is a C’-mapping; 
(ii) for any point p of c, the tJacobian matrix 
of f |c has rank n at p. Then we say that (K, f) 
is a C’-triangulation of the C’-manifold M”, 
and the C’-structure of M” is compatible with 
the triangulation (K, f). Concerning C’- 
triangulations of C’-manifolds, we have the 
following results (S. Cairns [2], J. H. C. White- 
head [3]): (i) any C’-manifold (1 <r < oo) has a 
C’-triangulation, and any C’-triangulation of 
the boundary can be extended to the whole 
manifold; (ii) for a C’-triangulation (K, f) of 

a C’-manifold M”, the triangulated space 

(K, f; M*) is a combinatorial manifold; (iii) for 
two C’-triangulations (K,, f1) and (K3, f2) of 

a C’-manifold M”, the triangulated spaces 
(Ki, fı; M") and (K,, f2; M") are combinatori- 
ally equivalent. 

Conversely, for a combinatorial manifold 
(K, f; M", a differentiable structure on M that 
is compatible with the triangulation (K, f) is 
called a smoothing of M. The question of 
whether a smoothing of a combinatorial mani- 
fold exists is called the smoothing problem. We 
have some criteria for its solvability. One is 
expressed in terms of transverse fields, and 
another in terms of tmicrobundles (— 147 
Fiber Bundles P). We also have the following 
method using the theory of tobstruction (J. 
Munkres [9]; M. W. Hirsch and B. Mazur 
[10]). Let I, be the group of oriented differen- 
tiable structures on the tcombinatorial k- 
sphere (an exact definition of this group is 
given in Section I). Let KŻ be the i-fskeleton 
of a combinatorial n-manifold (K, f; M”) and 
x a smoothing of a neighborhood U of f(K‘) 
in M”. Then there exists an obstruction cocycle 
C€ Z'™ (K,T, satisfying the following con- 
ditions: (i) if C,(a'*')=0 for an (i+ 1)-simplex 
o'*? of K, the smoothing « can be extended 
over a neighborhood of f(K'Ua‘*}), and vice 
versa; (ii) if there exists another smoothing «’ 
of a neighborhood of f(K') that coincides with 
the smoothing « of a neighborhood of f(K'~'), 
then C, ~C, is a tcoboundary; conversely, for 
any (i+ 1)-coboundary b with coefficients in 
I, there exists a smoothing «' of a neighbor- 
hood of {(K') such that b=C,,—C,. Using 
this method we can prove that for n<7 all 
combinatorial manifolds of dimension n are 
smoothable. 


D. Embedding and Immersion Theorems 


In the following, M” and X” are C”-manifolds 
of dimensions n and m, respectively. Two 
timmersions fo, fı: M">X™ are said to be 
regularly homotopic if there exists a thomotopy 
fi, M">X™, 0<t <1, such that f, is an immer- 
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sion for each t and the induced mapping (tdif- 
ferential of f,) df: T(M")— T(X™”) on the tan- 
gent spaces naturally gives rise to a continuous 
mapping over T(M") x I. Two tembeddings are 
isotopic if they are regularly homotopic and 
the homotopy f, is an embedding for each t. 
Given M” and X”, a fundamental problem of 
embedding (immersion) theory is to classify the 
embeddings (immersions) of M” in X” accord- 
ing to their isotopy (regular homotopy) classes. 
Whitney proved that a continuous mapping 
f:M"—X”™ can be approximated by an immer- 
sion if m> 2n and by an embedding if m 22n + 
1 [1]. The following results are also due to 
Whitney [1]: Any two immersions of M” in X” 
that are homotopic are regularly homotopic if 
m>2n+2, and any two embeddings of M” in 
X” that are homotopic are isotopic if m> 2n 
+3. The range m> 2n +3 is called the stable 
range of embeddings. 

In [11, 12] (1944) Whitney improved his 
classical theorems and showed that M” can 
always be immersed in R?""! for n> 1 and M” 
can always be embedded in R?”. The methods 
used in his proof have played an important 
role in the subsequent development. Classifi- 
cation of immersions of the n-sphere S” in R” 
was determined by Smale [13]. Let p: T(M”)—> 
M” and p': T(X™)—> X" be the projections of 
the tangent bundles of M” and X”, respec- 
tively. A mapping g: T(M")—>T(X”) is a linear 
fiber mapping (linear fiber map) if, for each 
xe M”, ø(p~!(x)) is contained in a fiber of 
T(X”) and ọ|p~'(x)is a linear mapping of 
rank n. A linear homotopy ¢,: 7(M")> T(X”) 
(0<t<1) is a homotopy such that each ọ, is a 
linear fiber mapping. The following theorem 
of Hirsch [14] is fundamental to immersion 
theory: Assume n <m. If 9: T(M")> T(X”) is 
a linear fiber mapping, the mapping @: M"—> 
X™ induced by ¢ can be approximated by an 
immersion f:M"—>X™ such that df and ¢ are 
linearly homotopic. Two immersions f, g: 
M"~>xX™ are regularly homotopic if and only if 
df and dg are linearly homotopic. If M” is im- 
mersible in R"*’, where m>n with r linearly 
independent fields of tnormal vectors, then M” 
is immersible in R”. In particular, if M” is a z- 
manifold (— Section I), M” is immersible in 
R"! 

Let &(M") be the set of isomorphism classes 
of real tvector bundles over M”, and consider 
0: 6(M")—> KO(M") (— 237 K-Theory). An 
element če KO(M") is said to be positive if ¢ 
is in the image of 0. If Eye KO(M ”), the geo- 
metric dimension of €), written g(čo), is the 
least integer k such that čo +k is positive. Then 
Hirsch’s theorem [14] can be expressed as 
follows: M” is immersible in R"** (k >0) if and 
only if g(n—t(M"))<k, where t(M") is the 
tangent bundle of M”. 
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A. Haefliger [15] obtained the following 
important result: Let M” be compact and 
*(k — 1)-connected, and let X™ be k-connected. 
Then any continuous mapping of M” in X" is 
homotopic to an embedding if 2k <n, m>2n— 
k+1, and any two homotopic embeddings 
of M” in X” are isotopic if 2k<n+1,m>2n— 
k+2. Thus if m>3(n+ 1)/2, any two em- 
beddings of S” in R” are isotopic. The range 
m> 3(n+ 1)/2 is called the metastable range. 
Haefliger further classified the embeddings of 
S*"~! in R" and showed the existence of em- 
beddings of S-t in R™ that are not isotopic 
to the natural one. More complete results for 
the classification of embeddings of thomotopy 
n-spheres in S” were obtained by J. Levine 
[16]. Furthermore, Levine proved the follow- 
ing unknotting theorem in higher-dimensional 
knot theory [16]: Let f:S""?>S" be a C”- 
embedding for n> 6; then f(S"~) is unknotted 
if and only if S"— f(S"7?) is homotopy equiva- 
lent to St (— 235 Knot Theory G; also 65 
Combinatorial Manifolds D). 

We list some results about embeddings and 
immersions. If M” is noncompact, M” can 
always be embedded in R2” 1; if M” is a non- 
compact z-manifold, M” can always be im- 
mersed in R”; if M” is compact and orientable 
and n>4, M” can always be embedded in 
R2"-1. 


E. Nonembedding and Nonimmersion 
Theorems 


We denote the ‘total Stiefel-Whitney class 

of M” by w(M") and the ttotal Pontryagin 
class of M” by p(M") (— 56 Characteristic 
Classes). Then (w(M”)) ~! (e H*(M"; Z,)) and 
(p(M")) | (e H*(M"; Z)) can be written as 
W(M”)= E w,(M") (W;€ H'(M"; Z,)) and p(M") = 
> pM") (p:e H*(M"; Z)). Then the property 
of characteristic classes for the Whitney sum 
implies the following theorem: If M” can be 
immersed in R"**, then w,(M")=0 for i>k and 
p(M")=0 for i> [k/2]. Furthermore, if M" can 
be embedded in R”**, then w,(M")=0. As an 
application, these results yield the nonembed- 
ding (nonimmersion) theorem for projective 
spaces (— Appendix A, Table 6.VII). Sharper 
theorems were obtained subsequently. In 
particular, Atiyah proved the following: Let 

Ai (i=0, 1,...) be texterior power operations 
(— 237 K-Theory), and let y' be the operations 
defined by the formal power series 0/2, ytti = 
(D9 4't')(1—t)?. Then »'(n—1(M"))=0 for 
i>k (i>k) if M" can be immersed (embedded) 
in R"**, Furthermore, we have an interesting 
result for the differentiable case. For any posi- 
tive integer q, there exists a differentiable 
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manifold M” such that M” is immersible in R* 
but not embeddable in R**4. 


F. Handlebodies 


Let W be a C*®-manifold with boundary ôW. 
Then the boundary dW has a neighborhood U 
that is homeomorphic to OW x R,, where R , 
=[0, œ). Let W,, W, be C®-manifolds with 
boundaries and f:0W, +¢éW, be a diffeomor- 
phism. Then the quotient space M of W, UW, 
obtained by identifying points corresponding 
under f has a natural differentiable structure. 
This construction of the C®-manifold M from 
the C”-manifolds W,, W, is called pasting 
together the boundaries. 

Now we consider the ttopological product 
W, x W, of manifolds W,, Wz with boundaries 
ôW, OW,. W, x W,—0W, x OW, has a natural 
atlas of class C™. By introducing a suitable 
atlas of class C” in a neighborhood of dW, x 
OW, in W, x W, we obtain a C”-manifold 
with boundary homeomorphic to W, x W3. 
More generally, let M be an n-manifold with 
boundary, let N be a finite union of submani- 
folds of dimension <n—2 in 0M, and suppose 
that M has a corner along N. Extension of the 
C”-structure of M —N over M as in this para- 
graph is called straightening the angle. 

Let D” be the oriented unit tn-disk in the 
n-dimensional Euclidean space R", My and 
M3 be oriented compact C”-manifolds, and 
f,:D"~ M? (i= 1, 2) be tC®-embeddings, fi 
torientation-preserving and f, forientation- 
reversing. Then, pasting together the bound- 
aries of Mj — Int f,(D") and M3 — Int f,(D”) by 
the mapping f, 0 fy +, we obtain an oriented 
C”-manifold, called the connected sum of M? 
and M3 and denoted by M? # M$. The con- 
nected sum Mf # MÌ has the orientation in- 
duced from those of Mf and M3, and its dif- 
ferentiable structure is uniquely determined 
independently of the mappings f;. Let S" be 
the natural n-dimensional sphere. Then we 
have M” # S" œ M” (here = means diffeomor- 
phic), (M, # M,)# M, ~ M, #(M,#M,), M, # 
M,=M,#M,. 

Let M" be a manifold with boundary and 
f:{0D°) x D" *-+0M" be a C®-embedding. 
Then we call the quotient space X(M”; f;s) of 
M"U(D* x D"”‘) obtained by the identification 
of corresponding points under f the manifold 
with a handle attached by /. Also, we call the 
construction of X(M"; f; s) from M” attaching 
a handle and call D° x D”~* an s-handle. After 
straightening the angle, X(M"; f; s) is consid- 
ered naturally a C*-manifold with boundary. 
Let f;:0D} x DP? *—>6M" (i=1,...,k) be em- 
beddings whose images are mutually disjoint. 
Then similarly, using embeddings f,,..., fp, we 
can define a C”-manifold with handles X(M"; 
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Jis- Juis). In particular, X(X(...(X(X(D"; 
A135 1)3 £25 82) ---)s fj s;)) is called a handlebody. 

Let M"! be an oriented (n — 1)-mani- 
fold and f:6D* x D”*3(M""'—6M""") an 
orientation-preserving C”®-embedding. Then, 
by straightening the angle, the quotient space 
of (M""! ~Int f(dD* x D"~*))U(D* x 6D" S) 
obtained by the identification of points cor- 
responding under f|éD* x dD"* is a C”- 
manifold y(M""'; f). We say this manifold is 
obtained by a spherical modification (or sur- 
gery) of type (s,n—s) from M”~!. The manifold 
y(M"~’; f) has a natural orientation, and 
dy(M"~!; f)=0M" }. The process of spherical 
modification has the following relation to 
that of attaching a handle: Let W be an n- 
dimensional manifold and f:éD* x D" *+0dW 
an embedding. Then 0X(W; f; s)=x(0W; f). 
When W=M""'! x [0,1] and f:dDS x D" 53 
M x {1}, 6X(W; f;s)=x(M x {1}; f)UM x {0}, 
and therefore M is cobordant (— Section H) 
to y(M ; f). Conversely, let M, be cobordant 
to M,. Then we can obtain M, from M, by . 
a finite sequence of spherical modifications 
(A. Wallace [17], Milnor). Let M” be an n- 
dimensional C®-manifold and f:M"~—R! bea 
C*-function. If f satisfies the following con- 
ditions, then it is called a nice function: (i) All 
‘critical points of f are tnondegenerate; (ii) for 
any critical point p of f, the findex (— 279 
Morse Theory) of f at p is equal to f(p). We 
have the following theorems. 

1. Let M be a compact C”-manifoid. Then 
there exists a nice function on M (M. Morse, 
Smale [18], Wallace [17]). 

2. Let M be a compact C®-manifold and 
f:M->R’ a C”-function all of whose critical 
points are nondegenerate. Suppose that the 
number of critical points on f ~'[—¢,¢] is k 
and that they are all contained in f~'(0). Sup- 
pose further that the indices of these critical 
points are all equal to s. Then f~1(—00, e] is 
diffeomorphic to the manifold with handles 
X(f (—œ, ~e]; fis- J s) (Morse, Thom, 
Smale [18]). 

3. tGeneralized Poincaré conjecture. Let M" 
be an n-dimensional thomotopy sphere of class 
C” (n> 5). Then M" can be obtained by past- 
ing together the boundaries of two n-disks. 
Consequently, M” is homeomorphic to the n- 
dimensional sphere S" (Smale [18], H. Yama- 
suge [19]). 

4. Let M” be a tcontractible compact n- 
dimensional manifold (n> 5), with 6M con- 
nected and simply connected. Then M” is dif- 
feomorphic to the n-disk D" (Smale [18]). 

S. Let MY, M3 be oriented, compact, simply 
connected n-dimensional manifolds (n > 4). If 
M, is h-cobordant (— Section I) to M,, then 
M, is diffeomorphic to M, (h-cobordism theo- 
rem; Smale [20]). 
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6. If Mi x R* is diffeomorphic to M3 x R*, we 
say that Mf is k-equivalent to M}. Let M,, M, 
be compact n-dimensional manifolds of the 
same thomotopy type. Then M, is k-equivalent 
to M, if and only if for a thomotopy equiva- 
lence f: M, > M,, t(M,) @ g is equivalent to 
f*t(M2) ® ex, where t(M;) is the ttangent 
bundle of M; and g, the ‘trivial vector bundle 
of dimension k (Mazur). 

7. In some special cases the classification of 
manifolds by diffeomorphism is completely 
determined. (i) The classification of simply 
connected 5-dimensional manifolds M with 
vanishing second tStiefel-Whitney class w,(M) 
is determined by H,(M). (ii) A 2-connected 
compact 6-dimensional manifold is dif- 
feomorphic to either $ô or the connected sum 
of a finite number of copies of S* x S°? (Smale 
[20]). Besides these, the classifications of 
(n—1)-connected 2n-dimensional manifolds 
(Wall [8]) and (n — 1)-connected (2n + 1)- 
dimensional manifolds (Tamura [21], Wall 
[22]) have been obtained. 


G. Thom Complexes 


Let € be a real n-dimensional vector bundle 
over a paracompact space X, A; be the total 
space of the associated bundle of € with fiber 
the closed n-disk D", and A, be the total space 
of the associated bundle of č with fiber the 
(n—1)-sphere cD". Then the quotient space 
X= AzlĀs, obtained from A; by contracting 
A, to a point, is called the Thom space of the 
vector bundle č. If X is a tCW-complex, then 
the Thom space X; of ¢ has the homotopy 
type of a CW-complex and is called a Thom 
complex. The Thom space X, has the canon- 
ical base point p corresponding to A & 

For a coefficient group G, denote by G; the 
tlocal system of coefficient groups with stalk G 
associated with the torientation sheaf of an R”- 
bundle č. Then we have the Thom-Gysin 
isomorphism: 


H(X; G) S H"*1(X;, Pe; G), 
H(X; G)= Hn+a(X e Pes G). 


Let G be a closed subgroup of the ortho- 
gonal group O(n) and BG be the base space of 
the universal n-dimensional vector bundle yg 
with structure group G. Then we can take a 
connected CW-complex as BG. We denote the 
Thom complex of the vector bundle yg by MG 
and call it the Thom complex associated with 
(G,n). The Thom complex associated with 
(G, n) is (n—1)-tconnected. If G is connected, 
then we have z,(MG)=Z, z,(MO(n)) = Z,, 
H"(MG; Z)=Z, and H"(MO(n); Z,)=Z, (Z, 
stands for the quotient group Z/2Z). The 
generator U of H"(MG; Z) is called the funda- 
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mental (cohomology) class of the Thom com- 
plex MG. For a general Thom space X,, we 
can also define the fundamental class of X; 
using the Thom-Gysin isomorphism. 

A C®-submanifold W”? of a compact mani- 
fold V” is a tsupport of a singular cycle and 
represents a homology class ze H,(V"; G) 
(G=Z, or Z). In this case we say that the 
homology class z of V” is realizable by a sub- 
manifold. A homology class ze H,,_,(V"; G) is 
realizable by a submanifold if and only if there 
exists a mapping f: V"— MO(k) (or MSO(k)) 
such that f*(U) is the dual cohomology class 
of z (Thom [23]). 

The homotopy group z,,,,(MO(n)) (resp. 
Tn+4(MSO(n)) (n> k) is determined only by 
k for any n up to isomorphism. It ts called 
the k-dimensional stable homotopy group of 
the Thom spectrum MO = {MO(n)} (MSO 
= {MSO(n)}) and is denoted by z,(MO) 
(z,(MSO)). For the tunitary group U (n) and 
the symplectic group Sp(n), we define Thom 
complexes MU (n) and MSp(n) as the Thom 
complexes associated with (U(n), 2n) and 
(Sp(n), 4n) by the canonical inclusions U(n)< 
O(2n) and Sp(n) c O(4n), respectively. The 
k-dimensional stable homotopy groups 
7,(M U)=lim 19,4,(MU(n)), 2,(MSp)= 
lim 714,44(M Sp(n)) of Thom spectra MU ={..., 
MU(n),SMU(n), MU(n+ 1),...}, MSp= 
{..., MSp(n), SMSp(n), S?MSp(n), S>MSp(n), 
MSp(n+ 1),...} can be defined similarly to the 
case of 2,(MO), where SMU (n) denotes the 
treduced suspension of MU (n). The stable 
homotopy groups of Thom spectra are cal- 
culated in connection with the cohomology 
groups utilizing the Thom-Gysin isomorphism 
(— 202 Homotopy Theory T) (Thom [23], 
Milnor [24]). 


H. Cobordism 


Cobordism theory is a theory of classification 
of differentiable manifolds initiated by L. S. 
Pontryagin and V. A. Rokhlin, who called it 
intrinsic homology. The theory was brought to 
maturity by Thom [23]. Its fundamental prob- 
lem is to determine whether a given compact 
manifold is the boundary of another manifold. 
Corresponding cobordism theories for com- 
binatorial and topological manifolds are being 
developed (Wall, R. Williamson). 

We consider only compact C”-manifolds 
that are not necessarily connected. Let D 
be the set of all diffeomorphism classes of 
C”-manifolds, and let Dy be the set of all 
orientation-preserving diffeomorphism classes 
of oriented C”-manifolds. For an oriented 
manifold V, we write — V for the manifold 
with reversed orientation. 
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For two compact k-manifolds V, WeDo, we 
say that V is cobordant to W if there exists a 
compact (k + 1)-manifold X e Do such that 0X 
= VU(— W). In this definition, considering D 
instead of Do, we say that V is cobordant to W 
mod 2. The equivalence class of V* with respect 
to the cobordism relation (the cobordism 
relation mod 2) ts called an oriented (un- 
oriented) cobordism class and is denoted by 
[V*] ({V*],). The set of all oriented (un- 
oriented) cobordism classes [V*] ({V*],) of k- 
manifolds forms an Abelian group Q, (,) by 
the natural addition [V*] +[W*] =[V*U W*] 
([V*].+[W*],=[V*U W*],). This is called 
the k-dimensional oriented (unoriented) cobor- 
dism group. Define the product [V*] x [W°] = 
[V* x W°] (LV*], x [W"], =[V* x W"]),). 

Then the direct sum Q=2Q, (N= N,) forms 
an anticommutative (commutative) tgraded 
algebra, which is called the cobordism ring 

or Thom algebra. We have the following 
theorems. 

1. Q,, R, are isomorphic to the stable homo- 
topy groups 2,(MSO), 2,(MO) of the Thom 
spectra MSO, MO, respectively (Thom’s funda- 
mental theorem) [23]. 

2. For a natural number k not of the form 
2'— 1, there exists a compact manifold P(k), 
and ÒK is a polynomial ring over Z, with gen- 
erators {[P(k)],|k#2'—1}.Q2@ Q (Qis the 
field of rational numbers) is a polynomial ring 
over Q with generators {[PC?”]|m=> 1}, where 
PC?" is the complex 2m-dimensional projec- 
tive space (Thom [23]). Moreover, Milnor 
[24] proved that the p-component of Q, is 
zero for an odd prime p. Wall proved that 
the 2-component of Q, contains no element 
of order 4, using a certain exact sequence 
which contains the natural homomorphism 
Q, 3%, [25]. 

3. Let T be the ideal of Q consisting of all 
torsion elements in Q. Then Q/T is a poly- 
nomial ring over Z with generators {[Y,,]| 
k21}, where we can take for Y}, a complex 
2k-dimensional nonsingular algebraic variety 
(Milnor). 

tCharacteristic numbers are invariants of 
cobordism classes. Combining the results 
stated in 2, we have the following theorem: 

4. Let V, W be manifolds. V is cobordant to 
W mod 2 if and only if they have the same 
corresponding *Stiefel-Whitney numbers. Let 
V, W be oriented manifolds. V is cobordant to 
W if and only if they have the same corre- 
sponding Stiefel- Whitney numbers and tPon- 
tryagin numbers. The findex of an oriented 4k- 
dimensional compact manifold is a cobordism 
class invariant and can be expressed as a linear 
combination with rational coefficients of fPon- 
tryagin numbers (— 56 Characteristic Classes). 

A manifold whose tstable tangent bundle 
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has a complex structure is called a stably (or 
weakly) almost complex manifold. Let V, W be 
2n-dimensional compact stably almost com- 
plex manifolds (n> 1). We say that V is C- 
equivalent to W if they have the same tChern 
numbers. The set of all C-equivalence classes 
[V]- forms the complex cobordism group X, 
as in the (real) case of cobordism groups (the 
existence of the inverse of an element is not 
trivial). Introducing multiplication into the 
direct sum U, =È H, by the product of mani- 
folds, we obtain the complex cobordism ring. 
We have U,,=72,(MU) and U, is a poly- 
nomial ring over Z with generators {[Y3,]¢| 
k>1}, where we can take for Y;, a complex 
k-dimensional nonsingular algebraic variety 
(Milnor [24]). 


I. h-Cobordism Groups of Homotopy Spheres 


A manifold M is said to be parallelizable if its 
tangent bundle t(M) is trivial, and almost 
parallelizable if there exists a finite number of 
points x; in M such that M — | ) {x;} is paral- 
lelizable. A manifold M is called stably paral- 
lelizable (or s-parallelizable) if the Whitney 
sum t ®é, of its tangent bundle t(M) and the 
trivial line bundle ¢, is trivial. A manifold M” 
is called a z-manifold if M” has a trivial nor- 
mal bundle when it 1s embedded into a Eucli- 
dean space R“ (N >2n). A manifold M” is a z- 
manifold if and only if M” is s-parallelizable. 
The concepts defined in this paragraph are 
related by inclusions as follows: parallelizable 
& s-parallelizable & almost parallelizable. 
For a connected manifold with boundary, 
these three concepts are equivalent. tGroup 
manifolds are parallelizable. An n-dimensional 
manifold homeomorphic to the n-dimensional 
sphere is parallelizable if and only if n= 1, 3, 7 
(Milnor [26]). Suppose that we are given an 
(n—1)-dimensional sphere S"~! (n even). We 
can consider the problem of determining the 
maximal number r such that there exists a 
tangent r-frame field over S”'. J. F. Adams 
solved this problem using tK-theory (— 237 
K-Theory) as follows: Let n=(2a+1)2°, b= 
c+4d, where a, b, c, d are integers and 0<c< 
3. Put p(n)=2°+ 8d. Then r= p(n)—1. On the 
other hand, homotopy spheres are z-manifolds 
[27]. 

Let V,, V, be oriented compact manifolds. 
Then we say that V, is h-cobordant to V, if 
there exists an oriented manifold W with 
boundary 0W= V, U(— V,) such that V; (i= 1, 2) 
is a deformation retract of W. The set of all h- 
cobordism classes of oriented homotopy n- 
spheres forms an Abelian group 0, with con- 
nected sum as addition. This is called the (h- 
cobordism) group of homotopy n-spheres. We 
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denote by 6,(éz) the subgroup of homotopy n- 
spheres that are boundaries of z-manifolds. 
Kervaire and Milnor gave certain exact se- 
quences that contain the groups 6,, the stable 
homotopy groups G, of spheres, and the stable 
homotopy groups z,(SO). They clarified rela- 
tions among these groups and proved that 
0,(6x) =0 for n even, 0,(0z)=finite group for n 
odd #3, 0,/0,(éx) < Coker J,, etc. [27], where 
J,:%,(SO)—G, is the *J-homomorphism. From 
these results it follows that the 0, (n 43) are 
finite Abelian groups, 0,=0 for n<7, 43, 

-= Zg, etc. (— Appendix A, Table 6.1). 

By pasting together the boundaries of two 
n-disks Df and D3, we obtain an oriented 
manifold which is considered as a smoothing 
of the combinatorial n-sphere. The set of all 
orientation-preserving diffeomorphism classes 
of smoothings of the combinatorial n-sphere 
forms an Abelian group T, with connected 
sum as addition. This is called the group of 
oriented differentiable structures on the com- 
binatorial sphere. By the generalized Poincaré 
conjecture and the h-cobordism theorem, we 
obtain I, = 6, (n 43, 4). Furthermore, I’, =0 
and I, =0 (J. Cerf [28]). The group I, can also 
be defined as follows: Let Diff * D”, Diff * S”~1 
be the groups of orientation-preserving diffeo- 
morphisms of D”, S"”', respectively, where 
multiplication is defined by composition. Let 
r: Diff * D* Diff * S”! be the homomor- 
phism induced by the restriction D"—>S""!. 
Then the image of r is a normal subgroup, and 
I, = Diff * $"-!/r (Diff * D”). 


J. Surgery Theory 


A process of modifying a manifold into an- 
other by a sequence of spherical modifications 
is called surgery on the manifold (— Section 
F). The technique of surgery proved to be a 
powerful tool for the development of differen- 
tial topology in the 1960s. Kervaire and Mil- 
nor exploited this technique in their study of 
homotopy spheres [27]. W. Browder [29] and 
S. P. Novikov [30] used surgery to construct 
differentiable manifolds with the same homo- 
topy type as that of a given Poincaré complex 
in dimension greater than 4. 

To explain the main points of surgery the- 
ory, we introduce some terminology. A pair 
of finite *CW complexes (X, Y) is called a 
Poincaré pair of formal dimension n if there 
exists a class we H,(X, Y; Z) called the funda- 
mental class such that the fcap product u^: 
A(X; Z)>H,_,(X, Y; Z) is an isomorphism 
for each q. When Y= Ø, X is called a Poin- 
caré complex. Let M be an n-dimensional 
smooth manifold. Consider an embedding g 
of M into a Euclidean space R"**, where k is 
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large enough (i.e., k >n+ 2). Then the isomor- 
phism class of the tnormal bundle of g is inde- 
pendent of the choice of embedding, and it 
depends only on M. Any representative of the 
isomorphism class is called the normal k-vector 
bundle of M and is denoted by v. Let (X, Y) 
be a Poincaré pair of formal dimension n, 

and let č be a real k-vector bundle over X. A 
normal mapping (normal map) ( f, b) consists of 
a degree 1 mapping f:(M,dM)>(X, Y) and a 
*bundle mapping b: v¥, >& which covers f. 

A normal mapping (f, b):(M,0M)->(X, Y) is 
normally cobordant to a normal mapping 
(f',b):(M’,0M)>(X, Y) if ôM =eM' and 
there exist a smooth (n+ 1)-manifold W and 
a mapping F: WX such that ĝW =M U 
(— M’)U0M x I, F is covered by a bundle 
mapping B: vy —€, and (F, B)|(M,éM)=(f,b), 
(F, B)|(M’,0M’)=(f',5’). 

The fundamental theorem of surgery theory 
is formulated as follows: Let (X, Y) be a Poin- 
caré pair of formal dimension n. Suppose that 
X is simply connected and n>5. Let (fb): 
(M,0M)>(X, Y) be a normal mapping that 
restricts to a homotopy equivalence on the 
boundary f|0M:0M >Y. Then (f,b) is nor- 
mally cobordant to a normal mapping ( f’, b’): 
(M',0M’)>(X, Y) with f': M’>X a homotopy 
equivalence if and only if a well-defined ob- 
struction a( f, b) vanishes. o( f, b) is called the 
surgery obstruction. When n is odd, a( f, b) 
always vanishes. When n=0 mod 4, a( f, b) is 
an integer. If Y= @, it is given by (1(M)— 
I(X))/8, where I( ) denotes the tindex of the 
manifold or of the Poincaré complex. If n=2 
mod 4, o( f, b) is an integer mod 2, called the 
Arf-Kervaire invariant. For a thorough devel- 
opment of simply connected surgery — [31]. 

In the PL or even in the topological cate- 
gories, surgery theory can be developed simi- 
larly (Browder and Hirsch, R. C. Kirby and 
L. C. Siebenmann [32]). 

In his study of Hauptvermutung for simply 
connected manifolds, D. Sullivan reformu- 
lated surgery in terms of the “surgery exact se- 
quences” involving the classifying spaces G/PL 
or G/O. (These spaces are “homotopy theoretic 
fibers” of BPL—BG or BO— BG respectively.) 
In the special case where X is a closed simply 
connected PL n-manifold, the surgery exact 
sequence can be formulated as follows (n> 5): 


Z (n=0mod4), 
0>AT(X)S[X,G/PL] >< Z, (n=2 mod4), 
0 (nodd). 


The set hT(X) is the totality of equivalence 
classes of pairs (M, f), where M is a closed PL 
n-manifold and f is a homotopy equivalence 
M-X. Two such pairs (M’, f^) and (M”, f^) 
are defined to be equivalent if there exists a PL 
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homeomorphism h: M'—>M” such that f” oh is 
homotopic to f’. The set of homotopy classes 
[X,G/PL] of mappings X >G/PL is appro- 
priately identified with the set of all normal 
cobordism classes of normal mappings with 
target X or with the set of all PL reductions 
of the Spivak normal fiber space of X as a 
Poincaré complex [31]. The mapping y can 
be defined naturally, and the mapping 0 cor- 
responds to the assignment of the surgery 
obstruction. The image y(M, f)e[X, G/PL] 
is sometimes called the normal invariant of 
(M, f). Sullivan reduced the classification 
problem of manifolds within a given homo- 
topy type to a homotopy theoretic problem 
of the classifying spaces G/PL, G/O [33] (also 
— [31, 34]). 

In the latter half of the 1960s, surgery theory 
was extended by Wall to cover all compact 
non—simply connected manifolds which are 
not necessarily orientable. He introduced a 
certain Abelian group L,,(z; w), now called the 
Wall group, which functorially depends on the 
fundamental group x=72,(X), with the orienta- 
tion character w:2,{(X)—Z,, and which is of 
period 4 with respect to the formal dimension 
n of X. In Wall’s theory, the surgery obstruc- 
tion o(f, b) takes its value in this group. The 
group structure of Wall groups has been cal- 
culated in many cases. Sullivan’s exact se- 
quences are extended to the non—simply con- 
nected cases [34]. Making use of the extended 
exact sequences, Wall and W. C. Hsiang and 
J. L. Shaneson classifield homotopy tori. The 
result played an important role in the work of 
Kirby and Siebenmann on the tannulus con- 
jecture and stable homeomorphisms which led 
to the solution of tHauptvermutung and trian- 
gulation problems on topological manifolds in 
1969 (— 65 Combinatorial Manifolds). Sur- 
gery theory has many applications to other 
geometric problems. Among them are find- 
ing missing boundaries for open manifolds, 
equivariant surgery, homology surgery [35], 
and surgery on codimension two submanifolds 
[36, 37]. 


K. 4-Dimensional Manifolds 


The results in differential topology discussed 
above are mainly concerned with manifolds of 
dimension > 5. For 4-manifolds, however, 
because of their peculiar nature which is not 
observed in other dimensions, many funda- 
mental problems had remained unsolved until 
M. H. Freedman’s epoch-making paper [54] 
appeared in 1982. His paper, together with 

S. K. Donaldson’s theorem [55] which was 
published a little later, was a breakthrough in 
the theory of 4-manifolds. 
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It was Rokhlin who first discovered a 
strange property of 4-manifolds [38]. Ro- 
khlin’s theorem states: Let M be a closed 
oriented smooth 4-manifold. If M is almost 
parallelizable (or equivalently, if M is a spin 4- 
manifold), then the index of M is divisible by 
16. Milnor and Kervaire gave an alternative 
proof of this theorem from the differential- 
topological point of view [39]. Freedman and 
Kirby and Y. Matsumoto gave geometric or 
elementary proofs. 

Until 1981, Rokhlin’s theorem had been a 
constant source of many curious phenomena 
in 4-dimensional topology. The list contains, 
for example: (1) the class (4m +2, 4n + 2)e 
H,(S? x S*)=Z@ Z cannot be represented by a 
smoothly embedded 2-sphere in S? x S? (Ker- 
vaire and Milnor). This result was improved 
by W.-C. Hsiang and R. H. Szezarba, A. G. 
Tristram, and Rokhlin. (2) The h-cobordism 
theorem fails to hold in 4 or 3 dimensions 
(T. Matumoto, Siebenmann [40]). (3) There 
exists a closed smooth 4-manifold M that is 
homotopy equivalent to the real projective 
space RP*, but M #RP* (Cappell and Shane- 
son). (4) There exists an open 4-manifold W4 
properly homotopy equivalent to S* x R but 
distinct (Freedman). 

Closed, connected, simply connected 4- 
manifolds M and N are homotopy equivalent 
if and only if the intersection forms defined on 
the 2-dimensional (co)homology groups are 
equivalent as inner product spaces over Z 
[41]. Wall [42] proved that if closed simply 
connected smooth 4-manifolds M and N 
are homotopy equivalent, then they are h- 
cobordant. Moreover, if M and N are h- 
cobordant, then there exists an integer k>0 
such that M #k(S? x $°) is diffeomorphic to 
N #k(S? x S?). 

About 1973, using a certain infinite repeti- 
tion process, A. Casson constructed a family 
of noncompact smooth 4-manifolds which are 
properly homotopy equivalent to the open 2- 
handle D? x R?. A 4-manifold belonging to the 
family is called a Casson handle. He observed 
that if all the Casson handles are diffeomor- 
phic to D? x R?, then theories analogous to 
surgery and the h-cobordism theorem in 
higher dimensions can also be developed in 
dimension 4. 

In his 1982 paper [54], Freedman proved 
that each Casson handle is homeomorphic to 
D? x R?. (It was proved later that Casson 
handles are not, in general, diffeomorphic 
to D? x R?.) This result and the proper h- 
cobordism theorem, also due to Freedman, 
proved many fundamental results on the topo- 
logical structure of 4-manifolds: (1) If closed 
simply connected smooth 4-manifolds M, N 
are h-cobordant, they are homeomorphic. In 
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particular, a 4-dimensional homotopy 4-sphere 
is homeomorphic to the 4-sphere S*. (Proof of 
the 4-dimensional Poincaré conjecture.) (2) A 
topological 4-manifold properly homotopy 
equivalent to Rt is homeomorphic to R*. (3) 
Given a nonsingular symmetric bilinear form 
œw over Z, there exists a closed, connected, 
simply connected topological 4-manifold 
whose intersection form is equivalent to w. 
(Therefore Rokhlin’s theorem cannot be ex- 
tended to topological 4-manifolds. From this, 
it follows that there exist simply connected 
topological 4-manifolds which are nonsmooth- 
able or even nontriangulable.) (4) The homeo- 
morphism class of a closed connected simply 
connected 4-manifold is determined by the 
intersection form and the Kirby-Siebenmann 
class. (This statement is an improved one by 
F. Quinn [56].) 

Donaldson [55] revealed a sharp contrast 
existing between smooth and topological 4- 
manifolds. Donaldson’s theorem states: If the 
intersection form of a closed, connected, sim- 
ply connected smooth 4-manifold is positive 
definite, then the form is equivalent to the 
standard one <1) ®<1> @... ® <1). This 
theorem, together with Casson’s theory and 
Freedman’s result (2) stated above, implies 
that there exists an exotic differential struc- 
ture on a 4-dimensional Euclidean space R4. 
Moreover, as an application of Donaldson’s 
theorem, the problem of representing a 2- 
dimensional homology class of S* x S? by a 
smoothly embedded 2-sphere was completely 
solved (K. Kuga [57]): the class (p, g)¢ H(S? x 
S?) is represented by a smoothly embedded 
S? if and only if |p| <1 or |g|<1. 

Many problems concerning differential 
structures on 4-manifolds remain unsolved. It 
is not known whether an exotic smooth 4- 
sphere exists. 

Among other results, the proof of the 4- 
dimensional annulus conjecture by Quinn [56] 
is remarkable. 


L. Miscellaneous Results 


The Browder-Livesay invariant. Let 2” be a 
homotopy n-sphere, T an involution on 2", 
that is, a smooth mapping T: 2" ” with 
To T=idsn. Assume that T is free from fixed 
points. The involution T is said to desuspend 
if there exists a homotopy (n — 1)-sphere 2” 
smoothly embedded in £” which is invariant 
under the action of T. Browder and G. R. 
Livesay defined an obstruction a(T) in the 
group O(n: even), Z (n=3 mod 4), Z, (n= 

1 mod 4) such that o(7)=0 if and only if T 
desuspends (provided that n> 6) [43]. The 
invariant o(T) is now called the Browder- 
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Livesay invariant. The technique used there 
was equivariant surgery under the action of T. 

De Rham homotopy theory. A new approach 
to topology of manifolds was invented by 
Sullivan [44], who considered the exterior 
algebra of differential forms (with polynomial 
coefficients) on a simplicial complex. Through 
the construction of the minimal model for the 
algebra of differential forms, he recovers the 
rational homotopy type of the complex under 
some reasonable condition on the fundamental 
group. Following upon the classical de Rham 
theorem which recovers the (real) cohomology 
algebra from the algebra of differential forms, 
this approach is called the de Rham homotopy 
theory. This method has proved to be useful 
for the explicit calculation of the algebraic 
topological invariants of fKahler manifolds, 
tloop spaces, cross-section spaces, talgebraic 
varieties, etc. 

Kirby calculus. Kirby [45] initiated a link- 
theoretic approach to 3- and 4-manifolds. Let 
L be a link in S°. Suppose that each compo- 
nent of L is given a framing, which means a 
trivialization of a tubular neighborhood. Such 
a link is called a framed link. A framing of 
a component is determined by the ‘linking 
number between the component and its re- 
placement along the framing. By attaching 
2-handles to the 4-disk D* along a framed link 
in S*=€D*, we obtain a handlebody; here we 
denote it by W,. It is known that each closed 
connected oriented 3-manifold is obtained as 
the boundary of such a handlebody (W. B. R. 
Lickorish, Wallace). Kirby proved that for 
framed links L and L’, the boundaries dW, 
and ðW, belong to the same orientation- 
preserving diffeomorphism class if and only if 
L is transformed into L’ by a sequence of the 
following two kinds of elementary operations: 
(i) adding or deleting a trivial knot (separated 
from L) with framing +1, (ii) band summation 
of components corresponding to “handle 
sliding.” Kirby’s approach is called Kirby 
calculus on framed links. For applications — 
[45, 53]. 
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Diffusion Processes 


A. General Remarks 


Let (Q, B, P) be a tprobability space. A tMar- 
kov process {X,}o << ON a ftopological space 
S with fcontinuous time parameter t is called a 
diffusion process if the tsample function X,(w) 
is continuous in t with probability 1 until a 
random time {(q), called the tterminal time. 
After the terminal time, X,(q) stays at the 
terminal point ô. Such a process is said to be 
1-dimensional or multidimensional according 
as S is an interval or a manifold (possibly with 
boundary) with dimension >2. Brownian ` 
motion is a typical diffusion process (— 45 
Brownian Motion). 

To give conditions for a Markov process to 
be a diffusion process, let us assume that S is a 
‘complete metric space with metric p. Let {X,} 
be a Markov process on S with time para- 
meter t ranging over a finite interval [t], t3] 


and satisfying 
sup P(s, x,t, S—U,(x))=O(h) (h}0) (1) 


for every ¢>0, where U,(x) is the e-neighbor- 
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hood of x and sup is taken over all xe S, and 
s, te[t,,t,] such that 0<t—s<h. Then {X,} 
is a diffusion process if and only if 


ii P(p(X,, Xr44)>e)dt=o(h) — (h1 0) 


for every ¢>0 [5]. This includes the following 
result due to E. B. Dynkin (1952) and J. R. 
Kinney (1953) as a special case: If in (1) we can 
replace O(h) with o(h), then {X,} will be a 
diffusion process. Specifically, when {X,} is a 
1-dimensional tstrong Markov process, the 
latter condition is also necessary for the pro- 
cess to be a diffusion process. 

Diffusion processes are intimately related to 
a certain class of tpartial differential equations. 
Let S be the real line. Assume that the ttran- 
sition probability P(s, x,t, E) (s <t) of the pro- 
cess {Xj} <,<, satisfies 
1—P(s,x,s+h,(x—e,x+8))=o(h)  (AĻ0) 

(2a) 


for every ¢>0 and that the following limits 
exist: 


1 xt+e 
: a DORES 
jim a (y—x)’P(s, x, +h, dy) 
=2a(s,x)>0, (2b) 


. 1 xte 
Jim Al (y—x)P(s,x,s+h,dy)=b(s,x), (2c) 


1 
lim —(P(s,x,s +h, S)—1)=c(s, x) <0. (2d) 
h>0+ h 


Assume further that the transition probability 
is tabsolutely continuous with respect to Le- 
besgue measure: P(s, x, t, dy)= p(s, x, t, y)dy. 
Then under some suitable additional con- 
ditions, p(s, x, t, y) satisfies 


0 8? ð 

ag als) atls) els), 

p(t —0, x,t, y)=ô(x— y), (3) 
and 

op 8? ô 

VAG a? > ae Ta b ’ ` > 

ar Tai y)p) JA (t, y)p)+ c(t, y)p 

p(s, x,s+0, y)=d(y—x), (4) 


where ô is the tDirac delta function. The co- 
efficient c vanishes if { X,} is tconservative. 
Equations (3) and (4) are called Kolmogorov’s 
backward equation and forward equation, re- 
spectively. They are also called the Fokker- 
Planck partial differential equations. 

A. N. Kolmogorov [1] derived equations (3) 
and (4) in 1931, and W. Feller proved that (3) 
(or (4)) has a unique solution under certain 
regularity conditions on the coefficients a, b, 
and c, and that the solution p(s, x, t, y) is non- 
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negative, has an integral with respect to y that 
does not exceed 1, and satisfies the *Chapman- 
Kolmogorov equation 


| x, t, y)p(t, y, u, z)dy = p(s, x, u, Z) 


for every 0<s<t<u. Hence p(s, x, t, y) deter- 
mines a tMarkov process analytically. How- 
ever, rigorous proof establishing the sufficiency 
of condition (2) for the Markov process {X,} to 
be a diffusion process did not appear until the 
1950s. 

In the temporally homogeneous case, 
p(s, x, s +h, y) does not depend on s and can be 
written as p(h, x, y). Then a(s, x), b(s, x), and 
c(s, x) are also independent of s, and p(t, x, y) 
satisfies 


Op 6? p ap 
as a(x) + bihot c(x)p, 
p(O0+,x, y)=d(x—y). (5) 


Feller made an intensive study of this case and 
completely solved the problem of existence 
and uniqueness of the solution of (5) assuming 
that p(t, x, y) is nonnegative and that its inte- 
gral with respect to y does not exceed 1 [7]. 
In particular, when t varies in [0, +00) and S 
is an interval [r,,r,], Feller used the Hille- 
Yosida theory of tsemigroups of operators to 
determine the conditions that should be satis- 
fied by r, and r, in order that the differential 
equation (5) (with the initial condition and his 
additional assumptions) yield one and only 
one solution. Feller also introduced the notion 
of generalized differential operators, which 
expresses the differential operator in the right- 
hand side of (5) in the most general form [8]. 
The probabilistic meaning of his results was 
clarified by Dynkin, H. P. McKean, K. Its, 

D. B. Ray, and others, and all 1-dimensional 
diffusion processes with the tstrong Mar- 

kov property have now been completely 
determined. 

Since not much research on temporally 
nonhomogeneous diffusion processes has 
been done so far, we restrict our explanation 
to temporally homogeneous ones. Let M = 
(X,, W, P,.|xeS) be a Markov process, where 
S is a tstate space, W is the tpath space con- 
sisting of all paths w:[0, +00 ]>SU{é} which 
are continuous in t for O0<t<€(w) (w(t)=6 for 
t>{(w) while w(t)eS for 0<t<(w)), and P, is 
a probability measure on W under the con- 
dition that the process starts from x at t=0 
(— 261 Markov Processes). We can actually 
identify W with the tbasic space Q and set 
X,(w) = w(t) for weQ. Assume that W has the 
*strong Markov property. It follows from 
*Dynkin’s formula for tinfinitesimal generators 
(— 261 Markov Processes) that the infini- 
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tesimal generator © of a diffusion process has 
the local property stating that if u and v be- 
long to the domain of © and coincide in a 
neighborhood of xo, then Gu(x_) = Gv(x,). 


B. 1-Dimensional Diffusion Processes 


Let S be a straight line. A point xeS is called 
a right singular point if X,(w)>x for all te 
[0, €(w)) with P,-probability 1. A left singu- 
lar point is defined analogously, with > re- 
placed by <. A right and left singular point 
is called a trap, while a right singular point 
which is not left singular is called a right shunt 
(a left shunt is defined analogously). A point is 
called a regular point if it is neither right nor 
left singular. 

The set of all regular points is open. Let 
(ri, r2) be a connected component of this open 
set. One of the most important results con- 
cerning this situation is the proof of the exis- 
tence of a strictly increasing function s(x) de- 
fined on (r,,r,) and two measures m and k 
on (r,,’,) such that the infinitesimal generator 
© of M is represented as 
Gulx) _u! (dx)—u(x)k(dx) l 6) 

m(dx) 

where u*(dx) is the measure du*(x) induced by 
the fright derivative u` (x) of u(x) with respect 
to s(x) (i.e., u* (x) slimy, . +o {u(x + Ax)— 
u(x)}/{s(x + Ax) —s(x)}). Equation (6) gives a 
generalization of second-order ‘differential 
operator au” + bw + cu (a>0,c <0) [12]. Here 
m is positive for nonempty open sets, both m 
and k are finite for compact sets in (r,,r,), and 
s, m, and k are unique in the following sense: If 
there are two sets of values of s;, m;, and k; (i= 
1, 2), then s,(x)=cs,(x)+ constant, m, (dx) = 
c 'm,(dx), and k,(dx)=c~'k, (dx) for some 
positive constant c. We call s, m, and k, respec- 
tively, the canonical scale, canonical measure 
(or speed measure), and killing measure for Wi. 
They determine the behavior of X,(w) belong- 
ing to M inside the interval (r,,7,). Conversely, 
given any such set of s, m, and k, we can find a 
1-dimensional diffusion process Wè such that s, 
m, and k are, respectively, the canonical scale, 
canonical measure, and killing measure of W. 
If X,(w) is nonvanishing in (r,,7r,) with proba- 
bility 1, the killing measure k is identically 
zero, and the canonical scale s satisfies the 
equation 


s(x) —s(x,) 


PlOx, < Ox) ysl) 


for x; <x <x, where g, is the thitting time of 
the point y. 

The motion X,{w) belonging to the process 
M and contained in (r,,r,) can be constructed 
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from the Brownian motion by means of a 
topological transformation of the state space 
(interval) based on s, a ttime change based on 
m, and a tkilling based on k. More precisely, 
we first transform the interval (r,,r,) by x3 
s(x) into the interval (s(r, +0), s(r, —0)) so 
that the diffusion process on this new interval 
has a canonical scale coincident with x. The 
speed and killing measures are transformed 
accordingly. We can, therefore, assume that 
the canonical scale is x. Let us consider the 
case (r,,1,)=(—00, +00) for simplicity. Let 
t(t, x) be the flocal time of Brownian motion at 
x (— 45 Brownian Motion). Next, we apply 
the ftime change to the Brownian motion by 
means of the tadditive functional 


TEN ta, x)m(dx), 


and finally tkill the latter process by means of 
the tmultiplicative functional 


+% 
a(t)= exp( — | t(p~'(t), 79 f 
Thus we obtain the process M in (—oo, 00). In 
particular, if m(dx)=a(x)~'dx and k(dx)= 
|c(x)| dx, we have 


alt= | aX," de 


ie) 


a(t)=exp (| c(X g-i) ic) : 
19) 


and ®©u=au"+cu. 

At a shunt the infinitesimal generator © has 
a form that is a generalization of the first- 
order differential operator bu’ + cu (c <0), 
with b>0 or b<0 according as it is a right 
shunt or a left shunt. At a trap we have ©u(x) 
= —u(x)/E,(). 

When S is an interval with endpoints r, and 
r, and all interior points are regular, the left 
endpoint r, is classified into the following 4 
types, according to the bebavior of W near r,: 
Take an arbitrary fixed point re(r;, r2), and set 
n(dx)=m/(dx)+ k(dx) and 


a= | (s(r) ~ s(x))n(dx), 
(rir) 


p= | n((x, r))s(dx). 
(1r) 


Then r, is a regular boundary if x < œ, p < 

oo; an entrance boundary if «< oo, $ = œ; an 
exit boundary if «= œ, f < œ; and a natural 
boundary if x= 00, $ = œ. This classification 

is independent of the choice of r. A similar 
classification of r, can be established. X,(w) 
approaches r, in finite time with positive or 
null probability according as £ is finite or infi- 
nite. If «= œ, it never happens that X,(w) 
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starts from r, and reaches the interior of the 
interval S even if r; € S, whereas if x < œ we 
can construct (adjoining r, to S if necessary) a 
diffusion process that enters the interior from 
r, and whose motion in the interior coincides 
with that of X,(w). 

If r, is a regular boundary for W and r, €S, 
then there are various possibilities for the 
behavior of X,(w) at r. They are expressed by 
the boundary conditions satisfied by the func- 
tions u belonging to the domain of the in- 
finitesimal generator ©. The condition is in 
general of the form 


yu(r,) + 6Gu(r,)+ put (r,)=0, 


where y, ô, and u are constants, y, d<0, p>0, 
and |6|+>0. If y=d6=0, then r; is said to be 
a reflecting barrier. If r, is regular for W and 
does not belong to S, then X,(w) vanishes 
exactly as X,(w) reaches r,, and r, is called an 
absorbing barrier. This case corresponds to the 
boundary condition u(r,)=0. Whatever the 
boundary condition may be, W is constructed 
from the Brownian motion with reflecting 
barrier by topological transformation of the 
state space, time change, and killing. Here if 
y #0, then killing may occur at r,; if 640, the 
set of visiting times of r} has positive Lebesgue 
measure; and if 40, the trace of the motion 
may go beyond the point r, and reach the 
interior points of S [2, 7-9]. 

If we weaken the assumption of continuity 
of paths and admit jumps from r,, the general 
boundary condition becomes 


yu(r,) + dGu(r,)+ pu’ (ry) 
-| (u(x) —u(r,)) v(dx) =0, 
(riral 


where v is a measure with respect to which 
min(1, s(x)—s(r, +0)) is integrable. 

When S =(r,,7,), the transition probability 
is absolutely continuous with respect to the 
canonical measure, the density p(t, x, y) has an 
teigenfunction expansion 


0 
p(t, x, y)= | e“e(dd; x, y), 
-© 
and p(t, x, y) is positive, jointly continuous in 3 
variables, and symmetric in x and y. A similar 
result is also known when S is half-open or 
closed [9]. 

If M is trecurrent, i.e., P.(o,< +00)=1 for 
every x and y in S, then there exists a unique 
(up to a multiplicative constant) tinvariant 
measure for S that is finite for all closed inter- 
vals in the interior of S. If Mt is conservative 
and all the interior points of S are regular, 
then the canonical measure is an invariant 
measure, provided that the endpoints are 
either entrance, natural, or regular reflecting. 
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C. Multidimensional Diffusion Processes 


Let the state space S be a domain or the 
closure of a domain in the n-dimensional 
Euclidean space R”. Consider a temporally 
homogeneous diffusion process {X,}5<;<., ON 
S. Under suitable regularity conditions the 
infinitesimal generator © coincides, for suffi- 
ciently smooth functions u in its domain, with 
the following telliptic partial differential oper- 
ator A: 


2 


A= Ş aÏ{(x)— 


i, j=1 dx'dx! 


at} b' wA ae, 
c<0; (7) 


where S has a boundary, u satisfies a boundary 
condition of the form 


n oe) 
2 OO) aa 


i,j=1 


ue 


+5 ps 


Ou o 


+ y(x)u(x) + ô(x)Au(x)+ B(x) =0, (8) 
where, for simplicity, we assume that S is the 
closed half-space defined by x" >0, (a4) is a 
symmetric nonnegative definite matrix, y <0, 
6<0, u20, and ĝ/ôn is the inward-directed 
tconormal derivative associated with a”. This 
boundary condition was discovered by A. D. 
Venttsel’ [13]. Conversely, given an operator A 
such as (7) and a boundary condition (8) (if S 
has a boundary), the existence and uniqueness 
of the corresponding diffusion process are 
known for several special cases. If S =R" and 
A has continuous coefficients, there exists at 
least one diffusion process corresponding to A 
[14-16]. 

A probabilistic approach to constructing 
diffusion processes corresponding to the oper- 
ator A with c=0 is given by Itô’s method 
of tstochastic differential equations (— 406 
Stochastic Differential Equations E). A some- 
what different approach was introduced by D. 
W. Stroock and S. R. S. Varadhan [20] under 
the name of martingale problems (— 261 Mar- 
kov Processes C). Let W” = C([0, œ)—> R”) 
be the space of all continuous functions w: 

[0, 00) R” endowed with compact uniform 
topology and 8(W") be the topological o- 
field. Given xe R”, a solution to the martingale 
problem for the operator A with c=0 start- 
ing from x is a probability measure P, on 
(W*, or satisfying P,.(w(0)=x)=1 such 
that f(w(t))— fó Af(w(s))ds is a P,-martingale 
for all fe ae (R"), where CS (R”) denotes the set 
of all C®-functions on R” having compact 
support. If a=(a") is uniformly positive de- 
finite, bounded, and continuous and if b=(b') 
is bounded and Borel measurable, the martin- 
gale problem for the operator A with c=0 is 
well posed, i.e., for each xe R", there is exactly 
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one solution starting from x. (For further 
information related to the theory of martingale 
problems — [15,20].) 

Suppose next that S is the closure of a 
bounded domain with a sufficiently smooth 
boundary and A is given with sufficiently 
smooth coefficients. If the boundary condition 
is yu + pOu/6n=0, 40, then there exists a 
unique diffusion process on S corresponding 
to this situation. Moreover, if y=0, then the 
process is said to have a reflecting barrier. S. 
Watanabe [22] gave a probabilistic condition 
which characterizes the reflecting diffusion 
processes in the normal direction among all 
reflecting diffusion processes in oblique direc- 
tions. Suppose that a general boundary con- 
dition (8) is given. We write the left-hand side 
of (8) as Lu. Write u= Hf for the solution of 
Au=O with boundary value f. Under some 
natural additional conditions, T. Ueno proved 
that if LH tgenerates a Markov process on 
the boundary, then there exists a diffusion 
process for A with boundary condition (8) 
[21]. If {X,} has a reflecting barrier, the Mar- 
kov process on the boundary is, conversely, 
obtained from {X,} through time change by a 
nonnegative continuous tadditive functional 
which increases only when the value of X, is 
on the boundary. Stochastic differential equa- 
tions are also used in constructing diffusion 
processes with boundary condition (8) [19] 
(— 406 Stochastic Differential Equations). An 
elegant method for constructing diffusion 
processes with boundary condition (8) has 
been introduced by Watanabe [22]. It consists 
of piecing together excursions from the bound- 
ary to the boundary. In this construction, 
the stochastic integrals for Poisson point 
processes play an important role. There are 
various other results on multidimensional 
diffusion processes with general boundary 
conditions which cannot be covered by the 
method mentioned above (— [23] and M. 
Motoo, Proc. Intern. Symp. Stochastic Dif- 
ferential Equations, K yoto, 1976). 

When we have a diffusion process on S$ 
with infinitesimal generator of the form A, we 
can obtain a probabilistic expression for the 
solutions of various partial differential equa- 
tions involving A. Let a be the hitting time of 
the boundary of S. Then Hf(x) can be ex- 
pressed as E,( f(X,_))). The solution of Au = 
— f with boundary value 0 is given by u(x) = 
E (f6 f(X,) dt), while the solution u(t, x) of 
ĉu/ðt = Au with boundary value 0 and initial 
value u(0, x)= f is E,.(f(X,); t<o). The first 
case gives the solution of a tDirichlet problem; 
and in this case, the condition for a boundary 
point to be tregular relative to the Dirichlet 
problem can also be expressed probabilisti- 
cally (— 45 Brownian Motion, 261 Markov 
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Processes). eee if f(X,) is replaced by 
f(X)exp(\6 k(X,) ds) in the expressions for u(x) 
and u(t, x), A is teplacéd by A +k (M. Kac 
[24]). When k <0, this replacement gives rise 
to a killing of the process (— 261 Markov 
Processes F). 

By using the theory of tDirichlet forms we 
can investigate a general class of tsymmetric 
multidimensional diffusion processes which are 
not in the framework of the classical diffusion 
processes, i.e., diffusion processes whose in- 
finitesimal generators are not necessarily dif- 
ferential operators (M. Fukushima, Dirichlet 
Forms and Markov Processes, 1980)). 


D. Diffusion Processes on Manifolds 


Let M be a tconnected toriented to-compact 
C” tmanifold of dimension n. Let M be M or 
M U {0} (= the one-point fcompactification 
of M) according as M is tcompact or non- 
compact. Suppose that we are given a system 
of C” —tvector fields Ag, A,,..., A, on M. 

We consider the following stochastic differen- 
tial equation on M: 


dX,=¥. A,(Xodw*(t)+ Ag(X,dt (9) 
k=1 


(— 406 Stochastic Differential Equations), 
where w(t)=(w!(t), w° (t), ..., w"(t)) denotes an 
r-dimensional Brownian motion and the first 
term of the right-hand side is understood in 
the sense of the *Stratonovich stochastic dif- 
ferential. Let X(t, x, w) be the solution of (9) 
with X)=xeM defined on the r-dimensional 
Wiener space (Wj, P”) (— 406 Stochastic 
Differential Equations) and P, be the proba- 
bility law on W(M) of (X(t, x, w))} >o, Where 
W(M) is the space of all continuous map- 
pings w:[0, 00) > M with ô as a trap. Then 
{P,.|xeM} defines a diffusion process W on 
M which is generated by the second-order 
differential operator )7_, A? /2+ Apo, ie. a 
diffusion process W with the infinitesimal 
generator © such that 


(Sf (x)= D Az f(x) + Ao f(x) for fec (M), 
where C5 (M) denotes the space of all C®- 
functions on M with compact support. By 
appealing to the analytical theory of partial 
differential equations we can discuss regularity 
properties of the transition probability of M 
({15]; L. Hormander, Acta Math., 119 (1967)). 
Recently, P. Malliavin [25] also suggested a 
probabilistic method for proving elliptic regu- 
larity results (see also [19]). 

We now assume that all linear sums of 
An, A;,-.-,A, are ftcomplete. Then the terminal 
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time of W is infinite a.s. Let W(M) be the space 
of all continuous mappings w:[0, 00) >M 
endowed with compact uniform topology, and 
set W.(M)={w|we W(M), w(0)= x}. We de- 
scribe the ttopological support “(P,) of the 
probability P, on W,(M), i.e., the smallest 
closed subset of W,(M) that carries the mea- 
sure P,. Let Z be the set of all piecewise con- 
stant mappings u:[0, 0o)->+R”. For a given 
u=(u'(t),u?(t),...,u"(t)) of F, we consider a 
system of ordinary differential equations 


d r 
PA ae Alo) + È A,(p(t))u*(2), (10) 


i.e., for every C”-function f on M with com- 
pact support, 


d r 
a) (Ot) 4S0) P (Afo )u* (0). 


Then for every ue F and xe M, we obtain a 
curve o = (x, u)=(¢,(x, u)) on M by solving 
(10) with p(0)=x. Set Y*={ep(x,ulueF}. 
Then we have 


P(P)=F* for every xeM. 


Let ¥ be the *Lie algebra generated by A,, A3, 
.,A,, and set P(x) ={V,|VeL}. If dim (x)= 
n for every xe M, then Y(P,) = WAM) for every 
xe M. For further information — Stroock and 
Varadhan (Proc. 6th Berkeley Symp. Math. 
Statist. Prob. III, 1972) and H. Kunita (Proc. 
Int. Symp. Stochastic Differential Equations, 
K yoto, 1976). 
Let A be a smooth tnondegenerate second- 
order telliptic differential operator on M which 
is expressed in local coordinates as 


n fag 


1 
Amt a S 


where (a‘(x)) is symmetric and tstrictly posi- 
tive definite. Then there exist a tRiemannian 
metric g and a C”-—tvector field b on M such 
that 


1 
A= Ay +b, (11) 


where Ay is the tLaplace-Beltrami operator 
on the Riemannian manifold (M, g). We now 
construct the diffusion process generated by 
the operator A, introducing a stochastic dif- 
ferential equation on the tbundle O(M) of the 
‘orthonormal frames. There exists an taffine 
connection V fcompatible with the Riemann- 
ian metric g such that for every C” -function f 
on M, 


n 


1 
zat k(fon)(r)= (4 +b) Maton 


reO(M), 
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where (L,,L,...,L,) is the system of tcanon- 
ical horizontal vector fields (tbasic vector 
fields) on O(M) corresponding to the affine 
connection V and z:0(M)—M is the natural 
projection [19]. We now consider the follow- 
ing stochastic differential equation on O(M): 


dr,= 3 L,(r,)odw*(0). (12) 
k=1 


Let r(t,r, w) be the solution of (12) with rp = 
r€O(M) defined on the n-dimensional Wiener 
space (Wg, P”). Now a stochastic curve 

X (t,r, w) on M is defined by X(t,r, w)= 

n(r(t,r, w)). Set X(t,r, w)=0 for t>C, where ¢ 

is the texplosion time of r(t,r, w). Then the 
probability law of (X(t, r, w)),>9 on W(M) 
depends only on x=2(r), and it defines a diffu- 
sion process Jt on M which is generated by 
the operator A of (11). (For details — [19, 26].) 
When A=A,,/2, i.e., b=0 in (11), the diffusion 
process Wis called the Brownian motion on 
the Riemannian manifold M (— 45 Brownian 
Motion). 

Next consider the case when (L4, Ly,..., L,) 
in (12) is a system of canonical horizontal 
vector fields corresponding to the *Riemann- 
ian connection, and let b be a C®—vector 
field on M. Let b be the scalalization of b, 

b=(b', b?,...,b"), b(n) =D, bi(x) fi, and 
(f =(e))7? for r=(x,e1,€2, -€ ERAT 
where b=}; b'0/0x' and e; = E! ef 0/dx/ 
in local coordinates. Suppose nae bi(r) is 
bounded on O(M) for i=1,2,...,n. Then 


mi=exn| È [a r,)dw'(s) 
g p 2d 
ae? K (r,)) s] 


is an texponential martingale. The diffusion 
process generated by the operator A,,/2 +5 is 
obtained from the Brownian motion on M 
constructed above by the ttransformation of 
drift, i.e., the transformation by the tmultiplica- 
tive functional M(t) (— 261 Markov Processes). 
(G. Maruyama (Nat. Sci. Rep. Ochanomizu 
Univ., 15 (1954)) studied this for 1-dimensional 
processes, I. V. Girsanov [27] and M. Motoo 
(Ann. Inst. Statist. Math., 12 (1960—1961)) for 
multidimensional cases.) 

Consider a diffusion process W= {P,|xe M} 
on M generated by the operator A of (11). Let 
œ, be the tdifferential 1-form defined from the 
vector field b by 


O, V) = g.(by, V3) 


for every xe M and every C” -vector field V 

on M. Then Mt is symmetrizable if and only if 
@, is texact, i.e., if there exists a function F on 
M such that w,=dF. The tinvariant measures 
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are then of the form const. x exp[2F(x)]m(dx), 
where m(dx) is the fRiemannian volume (E. 
Nelson, Duke Math. J., 25 (1958); Kolmogorov 
Math. Ann., 113 (1937)). The diffusion process 
Mè is said to be locally symmetrizable if for 
every tsimply connected domain Dc M there 
exists a Borel measure v?(dx) on D such that 


> 


| Tf (x)g(x)v? ay=f F(x) Tg (xv? (ax) 
D D 


for all bounded continuous functions f and g, 
where 


Tf (x)= fiw 


W(M) 
Tp(w) =inf{t| w()¢D}. 


Then Mt is locally symmetrizable if and only if 
œ, is tclosed, 1e., dw,=0. R. Z. Khas’minskii 
[28] proved a pair of useful tests for explo- 
sions of diffusion processes on M =R", similar 
to Feller’s test for n= 1 mentioned in Section B 
(— [18]). In case of a general manifold we can 
investigate the possibility of explosions for W 
by appealing to the comparison theorems for 
tcurvatures in the theory of differential geome- 
try [19,29]; S. T. Yau, J. Math. Pures Appl., 57 
(1978)). The diffusion process Mt on M is said 
to be trecurrent if P,[X,¢U for some t>0]=1 
for any open subset U of M; otherwise it is 
called ‘transient. It is well known that an n- 
dimensional Brownian motion on R” is recur- 
rent if n<2 and transient if n>3 [9]. There 
also are some results for the criterion which 
determines whether W is recurrent or transient 
({19, 28]; A. Friedman, Stochastic Differential 
Equations and Applications I, Il, 1975, and K. 
Ichihara, Publ. Res. Inst. Math. Sci., 14 (1978)). 
Explicit formulas for the transition probability 
of the Brownian motion on a hyperbolic space 
are given in [29]. For information related to 
the asymptotic behavior of diffusion processes 
as t|0 we refer the reader to Varadhan (Comm. 
Pure Appl. Math., 20 (1967)) and S. A. Mol- 
chanov (Russian Math. Surveys, 30 (1975)). 
Diffusion approximations for suitably nor- 
malized random sequences have been studied 
extensively beginning with the work of A. J. 
Khinchin [30]. The theory of tlimiting distri- 
butions of sums of tindependent (or weakly 
dependent) random variables is among the 
best-known examples (— 250 Limit Theorems 
in Probability Theory). For information re- 
lated to the theory of diffusion approxima- 
tions, see Yu. V. Prokhorov (Theory of Prob. 
Appl., 1 (1956)), A. V. Skorokhod [16], and 
Stroock, Varadhan, and G. Papanicolaou 
(Proc. 1976 Duke Turbulence Conference). 
Recently, many interesting examples of multi- 
dimensional diffusion processes have also been 


HH wwr P (dw), 
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introduced to describe probabilistic models in 
physics, biology, etc. (e.g., R. L. Stratonovich, 
Topics in the Theory of Random Noise I, Il, 
1963; J. F. Crow and M. Kimura, An Introduc- 
tion to Population Genetics Theory, 1970; and 
K. Sato, Proc. Int. Symp. Stochastic Differential 
Equations, K yoto, 1976). 

In general, if a diffusion process { X,} is 
given on a noncompact space S and X, has 
no limit points in S as t7¢, then some natural 
compactification should be induced by {X;}. 
The notion of a *Martin boundary for Markov 
processes is introduced in this connection. 
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116 (XX.2) 
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The system of units for physical quantities is 
derived from a certain set of fundamental 

units. If the fundamental units are denoted 

by 0, 9, W, etc., any other unit « (called a de- 
rived unit) can always be expressed in the form 
a=cOlo™y” ...(c,l,m,n, ... are constants), by 
definition or by physical laws. The exponents l, 
m,n,... are called the dimensions of «, and the 
content of the previous statement is expressed 
as [x] =[6'p"w”...], which is called the di- 
mensional formula. The usual practice is to 
take as fundamental units length, time, mass, 
temperature, and energy, which are denoted by 
L, T, M, 0, and H, respectively. Dimensional 
analysis investigates the relation between 
physical quantities by use of the z theorem 
and the law of similitude given below. 


A. The z Theorem 


If a relationship f(a, B,...)=0 holds among n 
physical quantities a, B,... independently of 
the choice of fundamental units, the equation 
f(a, B,...)=0 can always be transformed into 
F(m,,%,...)=0, where the z; are n—m dimen- 
sionless quantities (m is the number of funda- 
mental units) of the form 2,;=af*.... If we 
choose the x; so that n; =f "1y 71...and 73, 
73, etc. do not contain a, then f=0 implies 
a=fy"... O(72,,73,...), which clearly shows 
the manner in which the quantity « is related 
to other quantities f, y,.... 


B. The Law of Similitude 


In general, if two physical systems of the same 
kind have the same values of the z;, then the 
physical states of the systems are similar. If 
we are given a family of mutually similar sys- 
tems, it is sufficient to observe a particular one 
among them (a “model”) in order to estimate 
physical values attached to any one of the 
given systems. 

Consider, for example, the case of the drag 
D acting on geometrically similar bodies 
placed in the flow of a viscous incompressible 
fluid. If v is the velocity, | the representative 
length of the body, p the density of the fluid, 
and y the viscosity (which has the dimensional 
formula ML 'T~‘), then the z theorem gives 
D/pv? I? = f(pvl/u). Hence the drag coefficient 
as given by the left-hand side can be obtained 
by the experiments performed on a geometri- 
cally similar model. The dimensionless quan- 
tity R=vl/v (v= p/p) is called the Reynolds 
number. If the wave resistance due to gravity 
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as well as the effect of compressibility are 
taken into account, gravitational acceleration 
g and sound velocity a must be included, so 
that we have D/pv? I? = f(vl/v, v? /lg, v/a, C,, 
C,,...), where C,, C,,... are other dimension- 
less quantities depending on the physical 
properties of the fluid. Fr=v7/Ig is called the 
Froude number, and M =v/a the Mach number. 
Next, consider the case of heat transfer 
between a solid surface and a flowing fluid. 
Let the area of the solid surface be denoted 
by S, the heat transferred per unit time by 
Q (HT~"), the thermal conductivity of the 
fluid by k (HL~!T~'67'), the specific heat by 
C (HM 16°), the two representative tempera- 
tures by Tọ and T,, and the representative 
length by l, where expressions in parentheses 
represent dimensional formulas. Then we have, 
as dimensionless quantities, the Nusselt num- 
ber Nu=Q/(kS(T, — To)/1), the Prandtl number 
Pr=v/« (k=k/pc), the Grashoff number Gr = 
d39(T, — To)/v? To, and R, so that from the z 
theorem we have the relation Nu = f (R, Pr, Gr, 
C,,C,,...). Furthermore, Pe =vl/k = Pr R is 
called the Péclet number. 
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A. Introduction 


Toward the end of the 19th century, G. Cantor 
discovered that there exists a one-to-one corre- 
spondence between the set of points on a line 
segment and the set of points on a square; and 
also, G. Peano discovered the existence of a 
tcontinuous mapping from the segment onto 
the square. Soon, the progress of the theory of 
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point-set topology led to the consideration of 
sets which are more complicated than familiar 
sets, such as polygons and polyhedra. Thus it 
became necessary to give a precise definition 
to dimension, a concept which had previously 
been used only vaguely. In 1913, L. E. J. 
Brouwer [9] gave a definition of dimension 
based on an idea of H. Poincaré. In 1922, the 
foundations of dimension theory for separable 
metric spaces were established by K. Menger 
[11] and P. Uryson [10]. Subsequently, P. S. 
Aleksandrov and W. Hurewicz contributed 
much to the development of the theory. The 
foundations of dimension theory for general 
metric spaces were established independently 
by M. Katétov amd K. Morita. More general 
theory for normal spaces has also been inves- 
tigated; the same results as in metric spaces, 
however, do not always hold. 


B. Definition of Dimension 


Let X be a normal space. If any finite open 
tcovering of X has an open covering of forder 
<n + 1 as its refinement (— 425 Topological 
Spaces R) (i.e., if for any open sets G, (i= 1, 
...,8) such that X =G, U...UG,, there exist 
open sets H; (i=1,...,s) such that H,<G,, X = 
H,U...UH, and any n+2 of the H; have no 
point in common), then we write dim X <n. If 
dim X <n but dim X <n—1 does not hold, 
then we define X to be n-dimensional and 
write dim X =n. We call dim X the cover- 

ing dimension, (or Lebesgue dimension) of 

X. The idea behind this definition is due to 
H. Lebesgue. 

There are other definitions of dimension 
that are given inductively. Let us define 
Ind X = —1 if X is empty. If for any pair 
consisting of a closed set F and an open set G 
with FG in X there exists an open set V 
such that Fc VcG and Ind(V—V)<n-—1, 
then we define Ind X <n. Next, we define 
ind Ø = —1. For any point p of X and any 
neighborhood G of p, suppose that there exists 
an open neighborhood V of p such that Vc 
G and ind(V—V)=n—1. Then we define 
ind X <n. As before, we set Ind X =n (ind X 
=n) if Ind X <n (ind X <n) but Ind X <n 
— 1 (ind X <n—1) does not hold. (The defi- 
nition of ind X is due to Menger.) We call 
Ind X (ind X) the large inductive dimension of 
X (the small inductive dimension of X). 

If dim X <n does not hold for any n, then 
X is called infinite-dimensional, written 
dim X = œ; we define Ind X = œ and ind X = 
œ similarly. These dimensions are invariant 
under thomeomorphisms. 

The set of irrational points in a Euclidean 
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space, the tCantor discontinuum, and tBaire 
zero-dimensional spaces are all 0-dimensional. 
The set of rational points in a separable 
tHilbert space is 1-dimensional. 


C. Dimension of Metric Spaces 


The following theorems hold for the dimension 
of metric spaces (M. Katétov, Czechoslovak 
Math. J., 2 (1952); K. Morita, Math. Ann., 128 
(1954)). Let X and Y be metric spaces. The 
equality dim X =Ind X holds. If Yc X, then 
dim Y<dim X. If X is a union of a countable 
number of closed sets F, (i=1,2,...), then 
dim X = max(dim F,) (sum theorem for dimen- 
sion). The inequality dim(X U Y)<dim X + 
dim Y+ 1 holds. If dim X =n, then X isa 
union of n+ 1 0-dimensional subsets (decom- 
position theorem for dimension). We have 
dim(X x Y)<dim X + dim Y, where X 4 Ø 
(product theorem for dimension). 

Each of the following is a necessary and 
sufficient condition for dim X <n: (i) There 
exists a subspace A of a Baire zero- 
dimensional space B (t) and a continuous 
closed mapping f of A onto X such that f(x) 
consists of at most n+ 1 points for each point 
x of X (K. Morita, Sci. Rep. Tokyo K yoiku 
Daigaku, 5 (1955)); (ii) there exists a metric of 
X which gives the same topology on X such 
that for any positive number e, any point x of 
X,and any n+2 points x; (i=1,...,n4+2) 
at a distance less than € from the (¢/2)- 
neighborhood of x, there are at least two 
points x; and x; (i4j) with distance <e (J. 
Nagata, Fund. Math., 45 (1958)). 

Hurewicz’s problem asked whether the 
equality dim X =n +m (m>0) implies the 
existence of an m-dimensional space A and a 
mapping f of A onto X having property (i). It 
was solved affirmatively for separable metric 
spaces by J. H. Roberts and for general metric 
spaces by K. Nagami (Japan. J. Math., 30 
(1960)). 

If X is the union of a countable number of 
closed tstrongly paracompact subspaces, in 
particular if X is separable, then Ind X = 
ind X [1,2]. However, it was shown by P. 
Roy (Bull. Amer. Math. Soc., 68 (1962)) that 
this equality does not hold in general. 


D. Euclidean Spaces and Dimension 


The n-dimensional tEuclidean space R” is 
exactly n-dimensional in the sense mentioned 
above; thus this concept of dimension agrees 
with our intuition. The proof of dim R" >n 
comes from Lebesgue’s theorem: If cach mem- 
ber of a finite closed covering of an n-cube has 
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sufficiently small diameter, then the order of 
the covering is not less than n+ 1. (The proof 
of dim R” <n is easy.) Let X be a subset of R” 
and f a homeomorphism from X onto a sub- 
set f(X) of R”. If x is an interior point of X, 
then f(x) is an interior point of f(X). Also, if 
an open set A of R” is homeomorphic to a 
subset B of R”, then B is open in R” (Brouwer’s 
theorem on the invariance of domain [8 }). 
This theorem holds for any manifold but not 
for general separable metric spaces. By the 
theorem of tnvariance of domain it can be 
shown that R” and R”, m#n, are not homeo- 
morphic (theorem on invariance of dimension 
of Euclidean spaces). Any n-dimensional 
separable metric space is embedded in a 
Euclidean space R?"*', or, more precisely, in 
the subset of R*"*! consisting of all points x 
of which at most n coordinates are rational 
(Menger-Noébeling embedding theorem, G. 
Nobeling, Math. Ann., 104 (1930)). Thus, from 
the topological point of view, any finite- 
dimensional separable metric space can be 
identified with a subset of a Euclidean space. 
Moreover, it is known that any n-dimensional 
separable metric space is homeomorphic to a 
subset of some n-dimensional compact metric 
space. 

If F is a bounded closed subset of R”, then 
dim F <n if and only if for any positive num- 
ber e, there exists a continuous mapping f 
from F into an n-dimensional polyhedron in 
R” such that the distance between x and f(x) 
is less than e for each point x of F. 


E. Dimension of Normal Spaces 


Let X be a normal space. Then Ind X > dim X 
and Ind X >ind X, but the equalities do not 
necessarily hold here. The following theorems 
were obtained by E. Cech, Aleksandrov, C. H. 
Dowker, E. Hemmingsen, and Morita [1]. If 
dim X <n, then any locally finite open cover- 
ing of X has an open covering of order <n+ 1 
as its refinement; if A is an tF, subset of X 

or A is strongly paracompact, then dim A < 
dim X; if X has a ta-locally finite closed cover- 
ing {F,\, then dim X = max(dim F,). 

In order that dim X <n, it is necessary and 
sufficient that any continuous mapping from a 
closed subset of X into an n-sphere S” can be 
extended continuously to X. If X and Y are 
tparacompact and X is tlocally compact, 
or if X x Y is strongly paracompact, then 
dim(X x Y)<dim X + dim Y, where X # Ø; if 
X isa tCW complex, then the equality holds 
[14]. M. L. Wage (Proc. Nat. Acad. Sci. US, 
75 (1978)) proved under the continuum hypo- 
thesis (CH) that dim(X x Y)<dim X + dim Y 
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does not hold in general even if X x Y is local- 
ly compact and normal, and dim X = dim Y= 
0; T. Przymusinski (Proc. Amer. Math. Soc., 
76 (1979)) noted that CH can be avoided by a 
modification of Wage’s construction. Katétov 
(Casopis Pést. Mat. Fys., 75 (1950)) proved 
that dim X is determined by the ring C*(X) of 
bounded real-valued continuous functions 

on X. 


F. Homological Dimension 


Aleksandrov contributed much to the develop- 
ment of dimension theory in introducing the 
concept of homological dimension (Math. 
Ann., 106 (1932)). The homological dimension 
of a compact Hausdorff space X with respect 
to an Abelian group G is the largest integer n 
such that the n-dimensional tCech homology 
group H,(X, A; G) is nonzero for some closed 
subset A of X. The cohomological dimension 
D(X; G) is defined similarly by using the tCech 
cohomology group H"(X, A; G). If dim X < æ, 
then dim X = D(X; Z) (Z is the additive group 
of integers). The cohomological dimension of 
X with respect to an arbitrary Abelian group 
is determined by the cohomological dimension 
with respect to some specified groups, and the 
cohomological dimension of the product space 
X x Y is expressed in terms of those of X and 
Y (M. F. Bokshtein, [5]). A compact Hausdorff 
space X has the property that dim(X x Y)= 
dim X + dim Y for any compact Hausdorff 
space Y if and only if dim X = D(X; Q(p)), 
where Q(p) is the additive group of rationals 
mod 1 of the form m/p* for any prime number 
p (V. Boltyanskii, [5]); this result holds also 
when X is paracompact (Y. Kodama, J. Math. 
Soc. Japan, 18 (1966)). 


G. Dimension and Measure 


Let X be a separable metric space. Then 
dim X <n if and only if X is homeomorphic 
to a subset of a Euclidean space R*"*! whose 
(n+ 1)-dimensional tHausdorff measure is 0 
(E. Szpilrajn. Fund. Math., 28 (1937); also — 
[1,2]). The infimum of «>0 such that the 
Hausdorff measure A,(X) of dimension « 
vanishes is called the Hausdorff dimension of 
X. 


H. Dimension Type (Fréchet’s Definition) 
In analogy to the theory of tcardinal numbers 


in set theory, M. Fréchet (1909) defined the 
dimension type of topological spaces as fol- 
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lows: Two spaces X and Y are said to have the 
same dimension type if X is homeomorphic to 
a subset of Y and Y is homeomorphic to a 
subset of X. 


I. Infinite-Dimensional Spaces 


If X is a metric space with 0 <dim X < œ, 
then for each positive integer m with m< 

dim X, X contains a (closed) subset S with 
dim S =m. Tumarkin asked the following 
question: For an infinite-dimensional com- 
pact metric space X and for each positive 
integer m, does X contain a closed subset 

S with dim S =m? D. Henderson (Amer. J. 
Math., 89 (1967)) answered this question in the 
negative. Furthermore, J. Walsh (Bull. Amer. 
Math. Soc., 84 (1978)) constructed an infinite- 
dimensional compact metric space X such 
that if S is an arbitrary subset of X with 

dim $>0 then dim $= œ. 
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A. General Remarks 


A Diophantine equation is an talgebraic equa- 
tion whose coefficients lie in the ring Z of 
rational integers and whose solutions are 
sought in that ring. The name comes from 
Diophantus, an Alexandrian mathematician of 
the third century a.D., who proposed many 
Diophantine problems; but such equations 
have a very long history, extending back to 
ancient Egypt, Babylonia, and Greece. As 
early as the sixth century B.C., Pythagoras is 
said to have partially solved the equation x? + 
y?=27 by x=2n+ 1, y=2n? +2n, z=y+l. 

A general solution is given by the Pythagorean 
numbers x =m? —n?, y=2mn, z=m? +n’. 
*Fermat’s problem also concerns a Diophan- 
tine equation. 

Systematic studies of Diophantine equations 
over Z have been made for the linear equation 
DL, a,x; =a (a; a€ Z) and for the quadratic 
equation ax? +bxy+cy* =k (a,b,c, keZ) in 
two unknowns. The latter forms a principal 
topic of C. F. Gauss’s Disquisitiones arithme- 
ticae and can be regarded as a starting point of 
modern algebraic number theory. The special 
quadratic equation t° — Du? = +4 (De Z) is 
called Pell’s equation. If D <0, then Pell’s 
equation has only a finite number of solutions. 
If D >0, then all solutions t,, u„ of Pell’s equa- 
tion are given by +((t, +u JD)/2y"=(t, + 
u,./D)/2, provided that the pair t,.u, isa 
solution with the smallest t; +u, J/D> 1 [15]. 
Using continued fractions (= 83 Continued 
Fractions), we can determine t,, u, explicitly. 
A general quadratic Diophantine equation 
ax? + bxy+cy?=k with two unknowns can 
be solved completely if we use solutions of 
Pell’s equation; this is an application of the 
arithmetic of quadratic fields (— 347 Qua- 
dratic Fields) [1]. On quadratic Diophantine 
equations of several unknowns, thére are deep 
studies by C. L. Siegel (— 348 Quadratic 
Forms). 

Diophantine problems consist of giving 
criteria for the existence of solutions of alge- 
braic equations in rings and fields and even- 
tually determining the number of such solu- 
tions. The fundamental ring of interest is 
Z and the fundamental field of interest is Q. 
One discovers rapidly, however, that to have 
all the technical maneuverability necessary for 
handling general problems, one must consider 
rings and fields of finite type over Z and Q. 
Furthermore, one is led to consider finite fields 
and local fields when one deals with a locali- 
zation of the problems under consideration. 
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Techniques from various fields of mathematics, 
e.g., algebraic number theory, algebraic geom- 
etry, analysis, Diophantine approximation, 
etc., have been successfully applied to solve 
Diophantine problems. However, much re- 
mains unsolved today. Yu. V. Matiyasevich 
(1970) showed that Hilbert’s tenth problem is 
unsolvable; there is no general method of 
telling whether a Diophantine equation has a 
solution. This theorem in a sense indicates the 
complexity of Diophantine problems. For 
many centuries, no other topic has engaged 
the attentions of so many mathematicians, 
both professional and amateur, or resulted 

in so many published papers. For these mis- 
cellaneous results — Dickson [1] and Mordell 


[2]. 


B. Equations over Finite Fields 


Let k be a finite field of characteristic p con- 
sisting of q (= p”) elements. Chevalley’s theo- 
rem: Let f be a tform of degree d in n vari- 
ables with coefficients in k such that d<n. 
Then f=0 has a nontrivial solution in k. A 
generalization is Warning’s theorem: Let 
Jis- fi be polynomials with coefficients in k in 
n variables of degrees d,, ...,d,, respectively, 
and suppose that d=d,+...+d,<n. Then the 
number N of common zeros of f,,..., f; satis- 
fies N =0 (mod p). Warning’s second theorem 
asserts that if N >0 then N>q"~*. Warning’s 
theorem was also improved by J. Ax (1964) to 
the effect that N =0 (mod q’) for any integer 
b<n/d [3]. For equations over finite fields, 
counting the number of solutions is important. 
Let f(x, y) be an tabsolutely irreducible poly- 
nomial in x and y over k. Let N denote the 
number of zeros in k of f(x, y). A. Weil proved 
|N—~q| <2g./qt c(d), where g is the genus of 
the curve f(x, y)=0 and c(d) is a constant 
depending on d. Weil’s proof requires the 

use of deep results from algebraic geometry. 
This theorem is equivalent to the tRiemann 
hypothesis for algebraic curves over finite fields 
[4]. Later, using Stepanov’s method, W. M. 
Schmidt and E. Bombieri (1973) independently 
gave new proofs which do not depend on 
algebraic geometry [3]. P. Deligne [5] proved 
a far-reaching generalization of Weil’s theorem 
to tnonsingular absolutely irreducible equa- 
tions in n variables. He showed | N —q" "|= 
O(q"~'?). This is a part of the tWeil conjec- 
ture for zeta functions of algebraic varieties 
over finite fields (= Section E; 450 Zeta Func- 
tions Q). Schmidt obtained in an elementary 
manner a weaker estimate |N —q" ‘|= O(q" 37) 
but without the assumption of nonsingularity 
(— Section F). 
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C. Equations over Local Fields 


A method of solving problems in number 
theory by use of embeddings of the ground 
field into its tcompletions is called a local 
method. Such methods have important conse- 
quences when applied to Diophantine equa- 
tions. Let f be a polynomial in n variables 
with rational integer coefficients. The con- 
gruence f =0 (mod p*) is solvable for all k>1 if 
and only if f =0 is solvable in tp-adic integers. 
This is an easy consequence of the com- 
pactness of the ring of p-adic integers. We 

can solve f =0 in p-adic integers provided 
that we can solve an infinite sequence of con- 
gruences. It is generally difficult to tell when 
we may limit our consideration to only a finite 
number of these. In this respect, the following 
lemma is most useful. Hensel’s lemma: Let 

I (X,,.--,X,) be a polynomial whose coeffi- 
cients are p-adic integers. Let y,,...,), be p- 
adic integers such that for some i (1 <i<n) and 
an integer 6>0, we have f(7,,...,7,)=0 

(mod p?**") and df/dx;(),, ..-. Yn) =9 (mod p°), 
+0 (mod p?*!). Then there exist p-adic integers 
0,,...,9,, such that f(0,,...,0,)=0 and 6,=); 
(mod p**") for i=1,...,n. The case 6=0 is 
often useful; it implies that a nonsingular 
solution mod p can be extended to a p-adic 
solution. Generalization to simultaneous equa- 
tions is also known [6]. Skolem’s method is 
sometimes useful when we investigate certain 
types of equations over flocal fields. This 
method is based on some simple properties of 
local analytic manifolds over local fields [7]. If 
a quadratic form has zeros in each local field, 
then it has a rational zero (‘Minkowski-Hasse 
theorem). When a theorem of this type holds, 
we say that the tHasse principle holds (— 348 
Quadratic Forms). For forms of higher degree, 
the Hasse principle no longer holds even if the 
forms are absolutely irreducible and nonsin- 
gular. Counterexamples were first found for 
cubic (E. S. Selmer, 1951) and quintic (M. 
Fujiwara, 1972) forms. Asymptotic formulas in 
Waring’s problem (— 4 Additive Number 
Theory E) can be regarded as an analytic form 
of the Hasse principle. As to the quantitative 
formulation of the Hasse principle, there are 
deep results of Siegel for quadratic forms 

and their generalization by T. Tamagawa. R. 
Brauer (1945) showed that forms in sufficiently 
many variables represent zero in all p-adic 
fields. Forms of odd degree represent zero in 
the field of rational numbers if the number of 
variables is sufficiently large compared with 
the degree (B. J. Birch, 1957). Let f be a poly- 
nomial with p-adic integer coefficients and Cm 
(m20) be the number of solutions to the con- 
gruence f =0 (mod p”). The series y(t) = 

=o Cmt™ is called the Poincaré series of f. 
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J. Igusa (1975) proved, by using his theory of 
asymptotic expansions together with Hiro- 
naka’s tresolution theorem, that (t) is a 
rational function of t [8]. 


D. Integral Solutions of Some Diophantine 
Equations 


In this section we are concerned with those 
equations for which some “theory” exists. For 
isolated results > [1,2]. 


(1) Binary Forms. Thue’s theorem (1908): If 
f(x) = X89 a,x"(a,eZ, n> 2) has distinct roots, 
then the number of rational integral solutions 
of Shp a,x’ y" "=a (Zəa #0) is finite. This 
theorem is a direct consequence of Thue’s 
theorem on Diophantine approximation, 
which says that there are only a finite number 
of rational numbers p/q (p,qeZ,q>0) with 
|a—p/q|<1/q”*’ for a given algebraic num- 
ber « of degree n (n> 2) [9, p. 122]. K. F. Roth 
proved that (n/2)+ 1 in this formula can be 
replaced by 2 +e (e is an arbitrary positive 
number independent of n) (Mathematika, 2 
(1955), 1-20). Roth’s theorem was generalized 
to some cases of number fields and function 
fields (— 182 Geometry of Numbers) and is 
applied to Diophantine equations [9, 10]. 

A. Baker (1968), using a completely different 
method, has given explicit upper bounds for 
the solutions of Thue’s equations, thus enabl- 
ing one to compute effectively all the solu- 
tions. More precisely, if f in Thue’s equation 
is irreducible over the rationals, then every 
integer solution (x, y) of the equation satisfies 
max(|x|,|y|) <exp((nH)""°”” + (log a)*"**), 
where H is the theight of f. The proof of this 
remarkable theorem is based on his deep result 
concerning the lower bound for the linear 
forms in the logarithm of algebraic numbers 
(Mathematica, 13 (1966); 14 (1967)). Baker’s 
method has been applied to elliptic, hyperellip- 
tic, and other curves (Baker, H. Stark, J. 
Coates, V. G. Sprindzhuk, etc.). 


(2) Higher-Degree Forms. A natural generali- 
zation of binary forms is a norm form. Let K 
be an algebraic number field of degree t >3 
and «,,...,a, be elements of K. Then the norm 
N(x +... 4+0,%,) = M- (oP x, +... +ax,), 
where «® denotes a conjugate of «, is a form of 
degree t with rational coefficients. It is easy to 
see that every form which has rational coeffi- 
cients and is irreducible over Q but which is a 
product of linear forms with algebraic coeffi- 
cients is a constant multiple of a norm form 
[7]. A module M in an algebraic number field 
K is called degenerate if M has a submodule N 
such that, for some ae K, aN is a full module 
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in some subfield K’ of K, where K’ is neither Q 
nor an imaginary quadratic field. The most 
basic in the theory of norm forms is (W. M.) 
Schmidt’s theorem: Let «,,...,«%, be linearly 
independent over Q and suppose that the 
module generated by x,,...,%, is nonde- 
generate; then N(a,x,+...+4%,X,)=c (CEQ) 
has only finitely many solutions in integers 

Xis -<s Xn. (Math. Ann., 191 (1971)). The proof 
is based on his remarkable result on *simulta- 
neous approximation which generalizes Roth’s 
theorem (— 182 Geometry of Numbers G). 
There are investigations on special norm forms 
(T. Skolem, N. I. Fel’dman, K. Ramanathan, 
K. Gyory, M. Fujiwara, etc.). For general 
forms of higher degree, not much is yet known 
except for the additive forms (— 4 Additive 
Number Theory E). H. Davenport [11] proved 
that if f(x) is a cubic form with rational in- 
teger coefficients in n variables, then f(x)=0 
has a nontrivial integral solution, provided 
that n> 16. This theorem was proved by 
means of an exquisite application of the tcircle 
method together with some geometry of num- 
bers. A well-known conjecture is that n> 10 
instead of n> 16. It is known that over local 
fields, any cubic form in 10 variables has a non- 
trivial zero. There are various results of this 
type for stmultaneous additive, quadratic, and 
cubic forms (Davenport, D. Lewis, R. Cook, 
Schmidt, etc.) [17]. A satisfactory theory of 
forms of higher degree, like that of quadratic 
forms, is not yet known but is quite desirable. 
In this vein, Igusa has obtained some new 
results of considerable interest, e.g., a tPoisson 
summation formula for higher-degree forms, 
using his theory of asymptotic expansions [8]. 


(3) Algebraic Curves. The fundamental re- 

sult is Siegel’s theorem (1929): Assume that 

the equations /;(X,,...,X,)=0(1<i<m) 
determine an algebraic curve with a positive 
*genus in an taffine space of dimension n. 

Then the number of rational integral solutions 
of f,(X,,...,X,)=0 (1 <i<m) is finite. This 
theorem was generalized by S. Lang in the fol- 
lowing form: Let K be a finitely generated field 
over Q and J a subring of K that ts finitely 
generated over Z. Furthermore, let C be a 
nonsingular projective algebraic curve with a 
positive genus defined over K, and let ọ bea 
rational function on C defined over K. Then 
there are only a finite number of points P on C 
with g(P)e! [10]. The proof of this theorem is 
based on a generalization of Roth’s theorem in 
the above sense and on the weak Mordell- Weil 
theorem (— Section C). A. Robinson and P. 
Roquette gave another approach to Siegel’s 
theorem from the standpoint of nonstandard 
arithmetic (J. Number Theory 7 (1975)). On the 
other hand, a necessary condition for the exis- 
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tence of infinitely many solutions of f (X, Y)= 
0 with rational integral coefficients was given 
by C. Runge (J. Reine Angew. Math., 100 
(1887)). 


(4) Elliptic Curves. An elliptic curve E is an 
tAbelian variety of dimension 1, or what is the 
same, an irreducible nonsingular tprojective 
algebraic curve of tgenus 1 furnished with 

a point O as origin. The *Riemann-Roch 
theorem defines a group law on the set of 
tdivisor classes of E. Actually, if P, P’ are 
points of E, then there exists a unique point 
P” such that (P)+(P’)~(P”)+(O), where ~ 
means linear equivalence, i.e., the left-hand 
side minus the right-hand side is the divisor of 
a rational function on the curve. The group 
law on E is then defined by P+ P' = P”. If the 
characteristic #2 or 3, using the Riemann- 
Roch theorem one finds that the curve E can 
be defined by a Weierstrass equation y? =x? + 
ax +b with a, b in the ground field over which 
the curve is defined. Conversely, any homo- 
geneous nonsingular cubic equation has genus 
1 and defines an elliptic curve in the projective 
plane once the origin has been selected. If both 
the curve and the origin are defined over a 
field k, then the group law is also defined over 
k, and it becomes a 1-dimensional Abelian 
variety defined over k. If the ground field k is 
the field of complex numbers, the group law 

is the same as that given by the taddition 
theorem of the tWeierstrass g-function with 
invariants g, = —4a and g,= — 4b through the 
parametrization x= p (u), y=} (u). Much of 
the Diophantine theorems on elliptic curves 
are generalized to Abelian varieties. Here we 
shall deal mainly with elliptic curves defined 
by Weierstrass equations over Q. Extension to 
algebraic number fields usually causes no trou- 
ble. For more general elliptic curves — [19]. 
The Lutz-Mattuck theorem (— Section E) 
obviously implies that the points of finite order 
in E, (k=Q,) form a finite group. This torsion 
group is computable. In case a and b are in Z 
then any point of finite order in Eg has coordi- 
nates (x, y) in Z and, if y40, y?|4a* +27b? 
(Lutz-Nagell). The WC group (Weil-Chatelet 
group) of E over k is the birational class of 
principal homogeneous spaces over k. The 
extent of validity of the Hasse principle for 
elliptic curves can be measured by the Tate- 
Shafarevich group, which is defined as the set 
of elements of the WC group that are every- 
where locally trivial. This group is conjectured 
to be a finite group. For other results and 
interesting conjectures — [12-14]. The num- 
ber of integral points on elliptic curves is finite 
according to Siegel’s theorem on algebraic 
curves. Explicit bounds for the size of these 
points have been given for several types of 
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elliptic curves by using Baker’s method. For 
example, if f(x, y) is an absolutely irreducible 
polynomial with coefficients in Z such that the 
curve f =0 has genus 1, then max(|x|, |y|)< 
expexpexp((2H)"), where m= 10°, d=deg f, 
and H is the height of f (Baker and Coates, 
1970. The method of proof was to reduce it to 
the Weierstrass equation case, which had been 
treated earlier by Baker, with a better bound.) 
By the tMordell-Weil theorem (— Section E), 
AgS Z x finite torsion group. Here r is called 
the rank of E over Q. There is a rather doubt- 
ful conjecture to the effect that the rank r is 
bounded. The rank r is conjectured to be equal 
to the order of the zero of L(s, E) at s = 1 (Birch- 
Swinnerton-Dyer conjecture). Much numerical 
and theoretical evidence supports this famous 
conjecture [13]. 


E. Rational Points of Algebraic Varieties 


Let V be an tabstract algebraic variety defined 
over a field k, and let P be a point of V. Then 
P is called a rational point over k of V if the 
coordinates of the trepresentative P, contained 
in an taffine open set V, of V are in k (— 16 
Algebraic Varieties D). This definition is inde- 
pendent of the choice of the representative P,. 
In particular, if V is a tprojective variety, the 
point P given by the thomogeneous coordi- 
nates (Xo, X1,-.-,X,) is rational if and only if 
X;/XpEk (O<i<n, x, #0). In the following we 
state main results concerning rational points 
of algebraic varieties, especially results con- 
cerning ‘Abelian varieties, restricting k to be 
either an talgebraic number field of finite 
degree, a tp-adic number field, or a (finite field. 


Mordell-Weil Theorem. Let A be an Abelian 
variety of dimension n defined over an alge- 
braic number field k of finite degree. Then 

the group A, of all k-rational points on A is 
finitely generated. This theorem was proved 
by L. J. Mordell (1922) for the case of n=1 
and by Weil (1928) for the general case [10]. 
The assertion that A,/mA, is a finite group 

for any rational integer m is called the weak 
Mordell-Weil theorem; this theorem is basic in 
the proof of the Mordell-Weil theorem and is 
used in the proof of Siegel’s theorem, too. A 
generalization of the Mordell-Weil theorem is 
obtained when k is a field (of arbitrary charac- 
teristic) finitely generated over the tprime field 
[10]. 

If A is defined over a finite algebraic number 
field k, we have the following conjectures of 
Birch, Swinnerton-Dyer, and Tate on the rank 
of A,. Let p be a prime ideal of k at which A 
has a good reduction, and denote by A, the 
reduced variety. Let 2°)’, ..., 7” be the eigen- 


118 F 
Diophantine Equations 


values of the N(p)th power endomorphism of 
A, with respect to an /-adic representation, 
where N(p) denotes the norm of p (— 3 Abelian 
Varieties E, N), and put L,(s, A)= 172, (1 — 

ut N(p) *)”'. The L-function of A defined by 
L(s, A)= [T L,(s, A), where the product ranges 
over all good primes, is the principal part of 
the zeta function of A (— 450 Zeta Functions 
S). Birch and Swinnerton-Dyer conjectured 
that if k=Q and A is of dimension 1, then 
there exists a constant C #0 such that L(s, A) 
~ C(s—1)% as s>1. Tate generalized this con- 
jecture to any A and k. Moreover, the constant 
C, appropriately modified by factors corre- 
sponding to the bad primes and the infinite 
primes, is thought to be expressible in terms of 
certain arithmetic invariants of A [13]. 


Lutz-Mattuck Theorem. The group of rational 
points of an Abelian variety A of dimension n 
over a tp-adic number field k contains a sub- 
group of finite index isomorphic to the direct 
sum of n copies of the tring o of p-adic integers 
in k (E. Lutz, J. Reine Angew. Math., 177 
(1937); A. Mattuck, Ann. Math, 62 (1955)). 


Mordell’s Conjecture. In his 1922 paper, in 
which the above theorem on the set of rational 
fields on 1-dimensional Abelian varieties (1.e., 
on elliptic curves) was established, Mordell 
stated the conjecture: Any algebraic curve of 
genus g >2 defined over Q has only a finite 
number of rational points. The same can be 
conjectured for such curves defined over any 
algebraic number field k of finite degree. This 
had remained as a conjecture until 1983. In 
1961 I. R. Shafarevich conjectured: Let k be 
any algebraic number field of finite degree, S a 
finite set of finite prime spots of k, and g any 
natural number >2. Then there are, up to 
k-isomorphism, only a finite number of non- 
singular algebraic curves of genus g defined 
over k having good reduction at every finite 
prime spot outside S. 

In 1973, A. N. Parshin showed that Mor- 
dell’s conjecture followed from this conjec- 
ture, which was finally proved in 1983 by 
G. Fallings [7]. Mordell’s conjecture was thus 
settled in the affirmative. Analogs of these 
conjectures on algebraic function fields over 
finite fields are easier than the original ones 
and had been proved in the 1960s for Mordell’s 
conjecture by Yu. I. Manin, H. Grauert, and 
M. Miwa, and for Shafarevich’s conjecture by 
S. Alakelov, A. N. Parshin, and L. Szpiro. 


F. C,-Fields 


Let F be a field, and let i=>0, d>1 be in- 
tegers. Let f be a homogeneous polynomial 
of n variables of degree d with coefficients 
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in F. If the equation f =0 has a solution 
(x,,..-5X,)#(0,...,0)in F for any f with n>d°, 
then F ts called a C,(d)-field. If F is a C,(d)-field 
for any d>1, then F is called a C,-field. In 
order for F to be a C-field, it is necessary and 
sufficient that F be an talgebraically closed 
field. A C,-field is sometimes called a quasi- 
algebraically closed field. There exists no non- 
commutative algebra over a C,-field F. A 
finite field is C, (C. Chevalley (1936)). If Fo is 
algebraically closed, then F = F(X ) (rational 
function field of one variable) is a C,-field 
(Tsen’s theorem). A homogeneous polynomial 


f of n=d' variables of degree d with coeffi- 


cients in F such that f =0 has no solution in F 
except (0, ...,0) is called a normic form of 
order i in F. If a C,-field Fy has at least one 
normic form of order i, then (1) Fo(X,,..., Xx) 
is a C,,,-field; and (ii) an extension of Fo of 
finite degree is a C,-field. A complete field F 
with respect to an fexponential valuation is a 
C,-field whenever its residue field Fo is alge- 
braically closed. The field F of power series of 
one variable over a finite field Fo is a C,-field 
(Lang). E. Artin conjectured that a p-adic field 
Q, is a C,-field. It was proved by H. Hasse 
(1923) that Q, is a C,(2)-field and by D. Lewis 
(1952) that Q, is a C,(3)-field. However, G. 
Terjanian (1966) [17] gave a counterexample 
to Artin’s conjecture; that is, he gave a quartic 
form of 18 variables with coefficients in Q, 
having only trivial zero in Q,. Ax and S. 
Kochen (1965) [18] proved that for any inte- 
ger d>1 there exists an integer po(d) such 
that Q, is a C,(d)-field for p> po(d) (— 276 
Model Theory E). 
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Dirichlet, Peter Gustav 
Lejeune 


Peter Gustav Lejeune Dirichlet (February 13, 
1805—May 5, 1859) was born of a French 
family in Düren, Germany. From 1822 to 1827 
he was in Paris, where he became a friend of 
J.-B. tFourier. In 1827, he was appointed 
lecturer at the University of Breslau; in 1829, 
lecturer at the University of Berlin; and in 
1839, professor at the University of Berlin. In 
1855, he was invited to succeed C. F. *Gauss 
at the University of Göttingen, where he spent 
his last four years as a professor. 

His works cover many aspects of mathemat- 
ics; however, those on number theory, analy- 
sis, and potential theory are the most famous. 
He greatly admired Gauss and is said to have 
kept Gauss’s Disquisitiones arithmeticae at his 
side even when traveling. 

In number theory, he created the tDirichlet 
series and proved that a sequence in arithmetic 
progression contains infinitely many prime 
numbers, provided that the first term and the 
common difference are relatively prime. Also, 
using his “drawer principle,” which states that 
if there are n+ 1 objects in n drawers then at 
least one drawer contains at least 2 objects, he 
clarified the structure of tunit groups of falge- 
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braic number fields. In tpotential theory he 
dealt with the *Dirichlet problem concerning 
the existence of fharmonic functions. He also 
gave tDirichlet’s condition for the convergence 
of trigonometric series. 
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A. The Classical Dirichlet Problem 


Let D be a bounded or unbounded tdomain in 
R” (n22) with compact boundary S. The class- 
ical Dirichlet problem is the problem of finding 
a tharmonic function in D that assumes the 
values of a prescribed continuous function on S. 
This problem is also called the tfirst boundary 
value problem (— 193 Harmonic Functions 
and Subharmonic Functions). In this article f 
always stands for a boundary function given 
on S. The problem is called an interior problem 
if D is bounded and an exterior problem if D 
is unbounded. In an exterior problem, it is 
further required that when an tinversion 
with center at an exterior point Py of D is per- 
formed on D and a tKelvin transformation 
is performed on the solution in D (when the 
solution exists), the function thus obtained on 
the inverted image of D be harmonic at P, 
(n> 3). When n=2, the solution in D, which 
is already regarded as a function on the in- 
verted image of D, is required to be harmonic 
at P,. Thus an interior problem can be trans- 
formed to an exterior problem, and vice versa. 
We now explain the history of the classical 
problem. 

Let D be a bounded domain with boundary 
S in R?. G. Green (1828) asserted that if S is 
sufficiently smooth, 


OG(P, Q) | 
u(P)= -El SQ) dol) (1) 
Ong 
is the solution for the Dirichlet problem, where 
f is prescribed on S, G(P, Q) is tGreen’s func- 
tion with the pole at Q in D, ng is the outward- 
drawn normal to S at Q, and dø is the tsur- 
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face element on S. He took for granted the 
existence of Green’s function from physical 
consideration of the problem. Thus his dis- 
cussion was not quite rigorous. This defect was 
corrected by A. M. Lyapunov (1898) under a 
certain condition on S. Denote by u,, the New- 
tonian potential of a measure with density 
m=>0 on S. Assume that a continuous function 
f on S and a positive constant a are given. In 
1840, C. F. Gauss investigated the existence of 
a density m, >0 on S of total mass a which 
satisfies |;(u,,,—2f )m,do = minn f s(Up — 

2f )mdo, where the total mass of m is equal 

to a. He asserted also that u,,,—f is equal to a 
constant b on S. If f=0, then um, Must be 
equal to a positive constant c on S, and hence 
Um, — DC Um, Must be a solution of the ex- 
terior problem for the boundary function f on 
S. However, his discussion was incomplete 
because we cannot always ensure the existence 
of a density that gives a measure minimizing 
the integral. Moreover, even in the case where 
D is a ball, there exists a continuous function 
{20 on S such that there is no Newtonian 
potential that is equal to f on S up to a con- 
stant (M. Ohtsuka, 1961). Gauss (1840), W. 
Thomson (Lord Kelvin) (1847), and G. L. 
Dirichlet solved the Dirichlet problem by 
making use of the so-called Dirichlet principle, 
which is explained in detail in Section F. After 
K. Weierstrass (1870) pointed out that there is 
a case where no minimizing function exists, D. 
Hilbert (1899) gave a rigorous proof of the 
Dirichlet principle under a certain condition. 
Meanwhile, C. G. Neumann (1870) solved 

the Dirichlet problem rigorously for the first 
time, although he assumed that D is a con- 
vex domain with a smooth boundary. First, 
he considered the potential W, =(1/2z)- 

fs f(ér~1/6n)do of a double layer in D; 

then he formed the potential W, =(1/27)- 
\sf,(Or-1/On)do of a double layer with the 
values fı of W, on S and defined W,, W4,... 
similarly. The series W, -W,+W,—W,+... 
plus a suitable constant gives a solution to 

the exterior Dirichlet problem for the bound- 
ary function f. In 1887, H. Poincaré also used 
(1) to solve the Dirichlet problem. He ob- 
tained Green’s function with the pole at O 

in a bounded domain D in the following man- 
ner: Let D’ be the image of D by an inversion 
with center O, Sọ be a spherical surface sur- 
rounding the boundary ôD’ of D’, and u be a 
uniform measure on Sọ such that the potential 
of u is equal to 1 inside Sọ. By ‘sweeping out u 
to €D’, the solution in D’ of the exterior prob- 
lem for the boundary function t is obtained. 
A tKelvin transformation of this solution 
yields the solution h in D of the interior prob- 
lem for the boundary function 1/OP. Now 
1/OP —h(P) is Green’s function in D. In 1899 
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Poincaré used another method (without utiliz- 
ing (1)) to solve the Dirichlet problem [8]. He 
observed that it is sufficient to consider the 
case where f is equal to the restriction to S of 
a polynomial g and that g is expressed in D as 
the sum of the Newtonian potential of a signed 
measure t with density — Ag/(4x) and a func- 
tion that is harmonic in D and continuous on . 
DUS. If it is possible to sweep out t to 6D, 
then the solution is obtained. He showed that 
this is in fact the case if at every point P of S 
there exists a cone that is disjoint from D and 
has its vertex at P. This condition is called 
Poincaré’s condition. In 1900, I. Fredholm 
discussed the Dirichlet problem by reducing it 
to a problem of tintegral equations. A domain 
D is called a Dirichlet domain if the (classical) 
Dirichlet problem is always solvable in D. H. 
Lebesgue (1912) showed that a solution is 
obtained by the method of iterative averaging 
in every Dirichlet domain. 


B. The Dirichlet Problem in a General Domain 


It has been believed that the classical Dirichlet 
problem is always solvable in every domain 
until S. Zaremba observed in 1909 that the 
problem is not always solvable for a punctured 
ball. In 1913, Lebesgue gave a decisive exam- 
ple in which the domain is homeomorphic to 
a ball and bounded by a surface sufficiently 
smooth except at one point. Thus the central 
interest shifted to finding a harmonic function 
in D that depends only on a continuous func- 
tion f given on GD and coincides with the 
classical solution when D is a Dirichlet do- 
main. Extend f to a continuous function in 
the whole space, and denote it by fo. Approxi- 
mate D by an increasing sequence {D,} of 
Dirichlet domains, and denote by u, the solu- 
tion in D, of the Dirichlet problem for the 
boundary function fọ. N. Wiener proved in 
1924 that u„ coverges to a harmonic function 
that is independent of the choice of the exten- 
sion of f and {D,}. The problem of deciding 
where on CD the general solution assumes the 
given boundary values is treated in Section D. 
O. D. Kellogg (1928) found a general method 
that includes Poincaré’s method of sweeping 
out, Schwarz’s alternating method, and the 
result of Wiener. Both Poincaré’s method of 
sweeping out and Lebesgue’s method of itera- 
tive averaging yield Wiener’s general solution. 


C. Perron’s Method 
We explain O. Perron’s method (1923) by 


considering the improved method of M. Brelot 
[1]. For simplicity we assume that the domain 
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D is bounded in R*. Let U be the family of 
subharmonic functions u bounded above and 
satisfying lim supp- y u(P)< f(M) for any 
boundary point M. Define H,(P) as sup, u(P), 
where u runs through U, if this family is not 
empty; otherwise, set Hy = —oo. Call H, a 
hypofunction. Define H; by — H_, and call it a 
hyperfunction. If H, = H,, the common func- 
tion is denoted by H,; if H,(P)< œ, then H; is 
harmonic. This ts called a Perron-Brelot solu- 
tion (Perron-Wiener-Brelot solution or simply 
PWB solution). The method of defining H, is 
called Perron’s method (the Perron-Brelot 
method or the Perron-Wiener-Brelot method). 

Wiener showed in 1923 that the tDaniell- 
Stone integral can be regarded as a general 
solution if a Daniell-Stone integrable function 
f is given on the boundary of a Dirichlet 
domain; in 1925, he showed that the same is 
true for a general domain (not necessarily 
Dirichlet). He showed also that his solution 
coincides with the Perron-Brelot solution H, if 
f is continuous. Unfortunately, however, from 
a wrong example he concluded that H, +H, 
can hold even for a simple discontinuous f, 
and so he lost interest in Perron’s method. 
Brelot (1939) corrected Wiener’s erroneous 
conclusion and proved that the Daniell upper 
and lower integrals are equal to H , and H,, 
respectively. To any continuous f there corre- 
sponds an H,, and there exists a Radon mea- 
sure pip Satisfying H,(P)=| fdup. This measure 
is called a harmonic measure or harmonic 
measure function. Brelot showed that H, = H f 
if and only if f is zp-integrable for one (or 
every) P. In particular, If D is a Dirichlet 
domain and £ is a closed set on the boundary 
oD, then the harmonic measure function up(E) 
takes the value 1 at an inner point (in the 
space CD) of E and vanishes on ôD — E. We 
note that pp is equal to the measure obtained 
by sweeping out the unit mass at P to ðD. 


D. Regular Boundary Points 


If H,(P)—> f(M) as P—>M eD for any con- 
tinuous function f on 6D, then M is called 

- regular. The regularity of a point is a local 
property. A boundary point that is not regu- 
lar is called irregular. The regularity of M is 
equivalent to the convergence of zp as PM 
to the unit mass at M with respect to the 
tvague topology. There are many sufficient 
conditions and necessary conditions for a 
boundary point to be regular. The existence 
of a barrier is a qualitative condition that is 
necessary and sufficient for a boundary point 
to be regular. It was used by Poincaré and so 
named and used effectively by Lebesgue. A 
barrier is a continuous superharmonic func- 
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tion in D that assumes the boundary value 0 
at M and has a positive lower bound outside 
every ball with center at M. A positive super- 
harmonic function defined in the intersection 
of D and a neighborhood of M and taking the 
boundary value 0 can be used as a barrier. A 
necessary and sufficient condition for a bound- 
ary point M to be regular is the existence of a 
Green’s function in D assuming the value 0 at 
M. This condition was given by G. Bouligand 
(1925), and it follows from the existence of a 
barrier. Another necessary and sufficient con- 
dition of a quantitative nature was obtained 
by Wiener. It is equivalent to the requirement 
that the complement of D is not tthin at M 
(— 338 Potential Theory G). Kellogg conjec- 
tured that the set of irregular boundary points 
is of capacity zero and verified this in R? 
(1928). The conjecture was proved first by G. 
C. Evans (1933) in R?, and different proofs 
were given by F. Vasilesco (1935) and O. 
Frostman (1935). The conjecture is also true 
in R" for n>4. 


E. The More General Dirichlet Problem 


So far, we have been concerned with R”. More 
generally, Brelot and G. Choquet [3] obtained 
the following result in a Green space & (— 193 
Harmonic Functions and Subharmonic Func- 
tions): Consider a metric space that contains & 
and in which @ is everywhere dense, and de- 
note by A the complement of & with respect to 
the space. Let {F} be a family of ffilters on & 
such that each F converges to a certain point 
of A. Suppose that u <0 whenever u is sub- 
harmonic and bounded above on & and 
limsupu <0 along every F. Assume the exis- 
tence of a barrier v in a neighborhood in & 

of the limit point Q of every F; that ts, v is to 
be positive superharmonic, to tend to 0 along 
F, and to have a positive lower bound outside 
every neighborhood of Q. Under these as- 
sumptions, we obtain the PWB solution on & 
as in R°. There are various examples of A and 
F that satisfy these conditions. In particular, L. 
Naim [6] investigated in detail the case where 
A is the *Martin boundary. More generally, 

it is possible to treat the Dirichlet problem 
axiomatically (= 193 Harmonic Functions 
and Subharmonic Functions). 


F. The Dirichlet Principle 


Let D be a bounded domain with a sufficiently 
smooth boundary in R” and f be a piecewise 
C'-function on D with finite Dirichlet integral 
Il fl? =Jolgrad f|? dt, where dt is the volume 
element. Suppose that f has a continuous 
boundary value ọ on 0D. The classical Dirich- 
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let principle asserts that the solution of the 
Dirichlet problem for g has the smallest Dir- 
ichlet integral among the functions that are 
piecewise of class C' in D and assume the 
boundary value ø. In a general domain, H, 
minimizes ||u— f || among harmonic functions 
u in D. Brelot [2] discussed the principle for a 
family of competing functions that are defined 
in a domain in & and whose boundary values 
cannot be defined in the classical manner. 
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A. Dirichlet Series 


For z=x +iy, 4, >0, and „f+ 00, the series of 
the form 


f= Š, ayexp(— 22) (1) 


is called a Dirichlet series (more precisely, a 
Dirichlet series of the type {/,}). If 4 „=n, then 
(1) is a power series with respect to e 7. If 4,= 
logn, the series (1) becomes 


Ș a,/n?, (2) 


which is called an ordinary Dirichlet series. If 
a, = 1, then (2) is the Riemann zeta function. 
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Series of the form (2) were introduced by 

P. G. L. Dirichlet in 1839 and utilized in an 
investigation of the problems of fanalytic 
number theory. Later J. Jensen (1884) and 
E. Cohen (1894) extended the variable z to 
complex numbers. The Dirichlet series is not 
only a useful tool in analytic number theory, 
but is also investigated as a generalization of 
power series. The tLaplace transform is the 
generalization of the Dirichlet series to the 
integral, and similar formulas often hold for 
both cases. 


B. Convergence Regions 


If the series (1) converges at z= Zo, then it 
converges in the half-plane Rez > Rezo. There- 
fore there is a uniquely determined real num- 
ber S such that (1) converges in Rez>S and 
diverges in Rez<S. If (1) always converges 
(diverges), we put S= —œ (+00). We call S the 
abscissa of convergence (or abscissa of simple 
convergence). Similarly, there is a uniquely 
determined real number A such that (1) con- 
verges absolutely in Rez> A and is not ab- 
solutely convergent in Rez<A. We call A the 
abscissa of absolute convergence. Furthermore, 
there is a uniquely determined real number U 
such that (1) converges uniformly in Rez> U’ 
for every U’ >U and does not converge uni- 
formly in Rez>U" for every U” < U. The 
number U is called the abscissa of uniform 
convergence. Among these abscissas we always 
have the relations 


-o IKSU SA+, 


l 
A-S<limsup =. 


n> oO n 


The latter was proved by Cohen (1894). The 
numbers S, A, U are determined from a,, À„ by 
means of the formulas 


1 

S=limsup—log} > a,|. (3) 
x00 [x] <A, <x 
i 1 

4 =limsup—og( X al), (4) 
x» X [x] <A, <x 
f 1 

U =limsup—log T,, 
X> wO x 

T,= sup È 4,exp(—id,y)], (5) 
-~a<y<too |[x] <A,<x 


where [ ] is the tGauss symbol. Formulas (3) 
and (4) were proved by T. Kojima (1914) and 
(5) by M. Kunieda (1916). In particular, when 
lim,» (logn)/A, =0, we have 


log|a,,| 


S=U=A=limsup 


n> oO 


(6) 


n 


(O. Szasz, 1922; G. Valiron, 1924). 
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The series (1) converges uniformly in the 
angular domain {z| |arg(z—zo)| <a < 7/2}, 
where the vertex Z, lies on the line Rezo = S. 
Hence it represents a holomorphic function in 
the domain Rez > S, but it is possible that 
there is no singularity on the line Rez =S. For 
example, if a,=(—1)”, then the series (2) has 
S =O, but the sum is an tentire function (2'~* 
— 1) (z). Taking the analytic continuation f(z) 
of the series (1), the infimum R of p such that 
f(z) is holomorphic in Rez> p is called the 
abscissa of regularity. It is still possible that 
there is no singularity on the line Rez=R. We 
always have R<S, and R is given by 
R= sup jim (loglog*| g(x + iy)| +x), 

—wo<y<0 X>- 0 
2 a,exp(—A,z) 


Tn ° 


where log* a=max(loga, 0) (C. Tanaka, 1951). 
The infimum B of p such that f(z) is bounded 
in Rez> p is called the abscissa of boundedness. 
We always have R< B< A. H. Bohr proved 
the following three theorems concerning these 
values: (1) If {/,} are linearly independent over 
the ring of integers, then A = B (1911). (2) If 
(Ans: —4n) * = O(expe**) for every e>0, then 

U =B (1913). (3) If lumsup(logn)/4, =90, then S$ 
= U = A =B (1913). In the final case, the values 
are given by (6). 


C. Properties of Functions Given by Dirichlet 
Series 


The coefficients a, in (1) are given in terms of 
the function f(z) by 


n 1 etic wz 
È a= | fle)—dz, (8) 


v=1 2ni c~ico 


where c > max(S,0), 4,<@<A,4,, and the 
integration contour does not pass through 
{4,}. If © = ån, then the term a, in the sum of 
the left-hand side of (8) is replaced by a,,/2 (O. 
Perron, 1908). Furthermore, if S <x, then we 
have 


ao+T 
a, = lim zl © fx+iyjexp[A x +i)]dy ©) 
Too T isp 
(J. Hadamard, 1908; C. Tanaka, 1952). 

If x=Rez>S, then f(z)=o(|y|) (y|- oo). In 
order to investigate its behavior more pre- 
cisely, Bohr introduced 


log| f(x +iy)| 

log|y| 
in his thesis (1910) and called it the order over 
Rez=x. The function u(x) is nonnegative, 
monotone decreasing, convex, and continuous 
with respect to x. Bohr later found that there ts 


(x) =lim sup 
lyl+ +00 
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a kind of periodicity for the values of f(z) over 
Rez=x; this was the origin of the theory of 
talmost periodic functions. 

As for the zeros of the function f(z), the 
following theorems are known: If f(z) is not 
identically zero, it has only a finite number of 
zeros inx>S+e,e “*<y<e™* for arbitrary 
positive numbers ¢, M (Perron, 1908). If we 
denote by N(T) the number of zeros in x > S$ 
+e, T<y<T+26logT, then limsup,;_,, N(T)/ 
(log T)? <65/e (E. Landau, 1927). 

There have been many investigations into 
the connection between the singularities of f(z) 
and the coefficients a,. If the a, are real and 
positive, the point z=S is always a singular- 
ity of f(z). Moreover, if S=0, Rea, >0, and 
lim,,_,,. (cos(arga,))"/4"=1, then z=0 is a sin- 
gularity of f(z) (C. Biggeri, 1939). Furthermore, 
if 4,,/n-> œ, liminf, , (4,41 — An) > 0, then the 
line Rez=S is the natural boundary of f(z) 
(F. Carleson and Landau, 1921; A. Ostrowski, 
1923). If S=0, liminf,_,.,(A,4; —4,)=q > 0, then 
there always exist singularities on every inter- 
val on the imaginary axis with the length 2z/q 
(G. Polya, 1923). S. Mandelbrojt (1954, 1963) 
gave some interesting results concerning the 
relations between the singularities of (1) and 
the Fourier transform of an entire function. 

If U = —oo, the function f(z) is an entire 
function. Its torder (in the sense of entire func- 
tion) p is given by 

log* log* M(x) 


p=limsup ; 
x>- |x| 


[f(x +iy)l. 


M(x)= sup 
—ao<p<a@ 
There have been many investigations into the 
tJulia direction of f(z) and related topics by 
Mandelbrojt, Valiron, and Tanaka. 


D. Tauberian Theorems 


As in the case of power series, if the series 

i a, converges to s, then f(+0)=s (Abel’s 
continuity theorem). The converse is not neces- 
sarily true. The converse theorems, with ad- 
ditional conditions on a, and 4,, are called 
Tauberian theorems, as in the case of power 
series. Many theorems are known about this 
field. The most famous additional conditions 
are lIMy o AnGn/(An — 44-1) = 9 (Landau, 1926) 
and a, =O((A, —A,-1)/4,) (K. Ananda-Rau, 
1928). 

For the summation of Dirichlet series (es- 
pecially tRiesz’s method of summation) — 379 
Series R. For Tauberian theorems (especially 
the Wiener-Ikehara-Landau theorem) of the 
ordinary Dirichlet series — 123 Distribution 
of Prime Numbers B. 
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E. Series Related to Dirichlet Series 
A series of the form 


3 


nai 2(Z+ 1)(z+2)...(z+n)’ 


nia, 


z#0, —1, —2,... 
is called a factorial series with the coefficients 
{a,,}. It converges or diverges simultaneously 
with the ordinary Dirichlet series È a,,/n7 
except at z =0 and negative integers. The series 


$ S E L al) 


n! 


is called a binomial coefficient series. It con- 
verges or diverges simultaneously with the 
ordinary Dirichlet series }(—1)"a,/n? except 
at z=0 and positive integers. 
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A. Definitions [1-4] 
Suppose that a group T acts continuously on a 
tHausdorff space X, that is, for every yer and 
xex, an element yx of X is assigned in such 
a way that the mapping x—>yx is a homeo- 
morphism of X onto itself and that we have 
yi (y2x)=(%1y2)x, Lx =x, where 1 is the identity 
element of I. Two points x, ye X are said to 
be I -equivalent if there exists a ye I such 
that y=yx. (T-equivalence for subsets of X is 
defined similarly.) 

We consider the following conditions of 
discontinuity of T. (1) For every xe X and any 
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infinite sequence {y;} consisting of distinct 
elements of T, the sequence {y;x} has no telus- 
ter point in X. (ii) For every xe X, there exists 
a neighborhood U, such that yU, N U, = @ for 
all but finitely many yer. (ir) If x, ye X are 
not I’-equivalent, there exist neighborhoods 
U,, U, of x, y, respectively, such that yU, U, 
= @ for all yeT. (iit) For any compact subset 
M of X, yMOM=@ for all but finitely many 
yer. 

It is easy to see that (ii) = (i), (i)+ (ii^) = (iii); 
if, moreover, X is tlocally compact, we also 
have (iii) = (ii), (ii). When (i) holds, T is called 
a discontinuous transformation group of X, 
and when (11) holds, I is called a properly 
discontinuous transformation group. In partic- 
ular, when X can be identified with a thomoge- 
neous space G/K of a locally compact group 
G by a compact subgroup K, the conditions 
(i), (ii), and (iti) for a subgroup I of G are all 
equivalent, and they are also equivalent to the 
condition that F is a tdiscrete subgroup of G. 

For a discontinuous group T acting on X, 
the fstabilizer r= {yeT |yx x x} of xe X is 
always a finite subgroup. When I’, = {1} for 
all xe X, T is said to be free (or to act freely 
on X). If fy=()xex = {1}, I is said to act 
teffectively on X. A point xe X is called a fixed 
point of F if r, #T x. In the following, we 
assume for simplicity that I’ acts effectively on 
X, unless otherwise specified. 

Since Į -equivalence is clearly an fequiva- 
lence relation, we can decompose X into I- 
equivalence classes, or I’-torbits. The space of 
all I’-orbits, called the quotient space of X by 
T, is denoted by T\X. When T satisfies the 
conditions (it) and (ti), the space T\ X becomes 
a tHausdorff space with respect to the topol- 
ogy of the quotient space. If, moreover, IT is 
free, X is an (unramified) tcovering space of 
T\X with the tcovering transformation group 
T. (Conversely, a covering transformation 
group is always a free, properly discontinuous 
transformation group.) In general, X may be 
viewed as a covering space of F\X with rami- 
fications, and the ramifying points (in X) are 
nothing but the fixed points of T. 


B. Fundamental Regions 


A complete set of representatives F of T\X in 
X (that ts, a subset F of X such that [DF = 
X,yFOF=©@ for yel,y#1) is called a fun- 
damental region of I in X if it further satisfies 
suitable topological or geometrical require- 
ments. Here we assume that F, the closure 

of F, is the closure of its interior F'. (In this 
case, F or F' is sometimes called the funda- 
mental region of T instead of F itself.) Such 

a fundamental region exists if I’ satisfies the 
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conditions (ii), {ii’), and the set of fixed points 
is fnowhere dense in X (R. Baer, F. W. Levi, 
1931). A fundamental region F is called normal 
if the set {yF} (yeT) is locally finite, that is, if, 
for every xe X, there exists a neighborhood 
U, such that yF N U, = Ø for all but a finite 
number of yer. If X is tconnected and F is 
normal, then T is generated by the set of ye F 
such that yF N F # Ø. Thus it is useful to have 
a fundamental region in order to find a set of 
generators of T and a set of tfundamental 
relations for them. When X has a T-invariant 
*Borel measure u and I is countable, then the 
measure (F) of F is independent of the choice 
of F. Hence it is legitimate to put n(T\X)= 
u(F); T is called a discontinuous group of the 
first kind (C. L. Siegel [2]) if T is a discontinu- 
ous transformation group which has a normal 
fundamental region F such that {y|yF NF = 
Ø} is finite and (F) < oo. For instance, if X is 
locally compact and F is compact (e T\ X: 
compact), then I is of the first kind. 

When we are concerned only with the quali- 
tative properties of T, it is sometimes conve- 
nient to relax the conditions for a fundamental 
region, replacing it by a fundamental (open) set 
Q of T, that is, an (open) subset Q of X such 
that TO= X and yQNQ= Ø for all but a finite 
number of yer [5-9]. 


C. The Case of a Riemann Surface 


Let F be a discontinuous group of analytic 
automorphisms of a tRiemann surface X. In 
virtue of tuniformization theory, it is enough, 
in principle, to consider the case where X is 
tsimply connected. Thus we have the following 
three cases: 

(1) X =CU {oo} (Riemann sphere). I is a 
finite group. Since I can also be considered as 
a group of motions of the sphere, it is either a 
cyclic, ‘dihedral, or tregular polyhedral group 
[10]. 

(2) X =C (complex plane). F is contained in 
the group of motions of the plane. The sub- 
group consisting of all parallel translations 
contained in T is a tfree Abelian group of rank 
v<2. If v=0, then F is a finite cyclic group. 
When v>0, T consists of the transformations 
of the following form: 


When v=1, z=>ęřz+mo (k,meZ), 


When v=2,  ze*z+m,a,+m,@, 


(k, M; E Z), 


where w, w,, œ, are nonzero complex numbers 
with Im(w,/w,)>0, and £= +1 in general, 
except in special cases when v=2 and w,/a, = 
C4 (resp. ¢; or ¢,, where €¢,=exp(2zi/1)), in 


' point 
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which cases we may put ¢=€, (resp. €, or Ce). 
For the fundamental regions corresponding to 
these values of £e, see Fig. l. In the cases v= 1 
and 2, the tautomorphic functions with respect 
to T are essentially given by exponential func- 
tions and elliptic functions, respectively (— 
134 Elliptic Functions). 


a 


(c) (d) 


Fig. 1 
(a) v=2, £=1. (b) w, =1, œ =i, £=i. (c) w = 1, 
O = %3, e=C3. (d) w il @ = 63, E= Co 


(3) X = {|z| <1} (unit disk) [3, 10, 11]. By a 
tCayley transformation, the unit disk can be 
transformed to the upper half-plane = {z = 
x+iy|y>0}. Any analytic automorphism of § 
is given by a real tlinear fractional transforma- 
tion (Möbius transformation) z—(az + b) (cz + 
d) (a,b,c,deR, ad —bc=1). The totality of 
real linear fractional transformations acts tran- 
sitively on §. Hence § can be identified with 
the fhomogeneous space G/K of G=SL(2,R) 
by K =SO(2) (which is the stabilizer of the 
—1). Hence discontinuous groups I 
of analytic automorphisms of $ are obtained 
as discrete subgroups of G. Actually, every 
element of G defines an analytic automor- 
phism of the whole Riemann sphere, which 
leaves the real axis RU {oo} invariant. For any 
zeCU {oo} and a sequence {y;} consisting of 
distinct elements of T, a cluster point of the 
sequence {y;z} in CU {0} is called a limit 
point of IT. When only one or two limit points 
exist, can easily be transformed to one of 
the groups given in (2). Otherwise, the set £ of 
all limit points of T is infinite, and either 2 = 
RU {oo} or £ is a tperfect, tnowhere dense 
subset of RU {00}. When £ is infinite, T is 
called a Fuchsoid group. 

Since $ has a G-invariant *Riemannian 
metric ds? = y7 *(dx? + dy”) (called the tPoin- 
caré metric), by which § becomes a hyperbolic 
plane (‘non-Euclidean plane with negative 
curvature), we can construct a fundamental 
region F of T which is a normal polygon 
bounded by geodesics, that is, the arcs of cir- 
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cles orthogonal to the real axis. A set of gen- 
erators of I and the fundamental relations for 
them are easily obtained by observing the 
correspondence of the equivalent sides of the 
fundamental polygon. (Conversely, starting 
from a normal polygon satisfying a suitable 
condition, one can construct a discontinuous 
group F having F as a fundamental region. In 
this manner, we generally obtain a (nontrivial) 
continuous family of discrete subgroups of G.) 
A Fuchsoid group I is finitely generated if 
and only if the fundamental polygon F has a 
finite number of sides, and in that case T is 
called a Fuchsian group. (More generally, a 
finitely generated discontinuous group of 
linear fractional transformations acting on 
a domain in the complex plane is called a 
Kleinian group.) A Fuchsian group T is of the 
first kind if and only if 2=RU {%0}; otherwise, 
it is of the second kind. It is also known that a 
discontinuous group F is a Fuchsian group of 
the first kind if and only if n(T-\) < œ [12]. 
For a real point xe RU {œ}, we also denote by 
I’, the stabilizer of x (in T). The point x is 
called a (parabolic) cusp of T if T, is a free 
cyclic group generated by a fparabolic trans- 
formation (4 +1). Cusps of T are represented 
by vertices of the fundamental polygon on the 
real axis. On the other hand, if a fixed point z 
of T lies in $, then the stabilizer T, is always a 
finite cyclic group generated by an felliptic 
transformation. Hence such a point z is also 
called an elliptic point of I. For a Fuchsian 
group T of the first kind, let {z,,...,z,} be 
a complete set of representatives of the F- 
equivalence classes of the elliptic points of I” 
(which can also be chosen from among the 
vertices of the fundamental polygon), and let e; 
be the order of I’,,; furthermore, let ¢ be the 
number of the T’-equivalence classes of para- 
bolic cusps of I. Then the quotient space ['\ 
can be compactified by adjoining t points at 
infinity, and the resulting space becomes a 
compact Riemann surface R, if we define an 
analytic structure on it in a suitable manner. 
The area u(R,) measured by the Poincaré 
metric is given by the tGauss-Bonnet formula: 


dxdy 
(Ry) = 2 
F Y 


5 1 
= 2n| 29-24 > (: -Heel 
i=1 ei 


where g is the tgenus of the Riemann sur- 
face Rp. It is known that there exists a lower 
bound (= 7/21) for (Rp) [12]. Automorphic 
functions (or Fuchsian functions) with respect 
to a Fuchsian group T, which are essentially 
the same thing as algebraic functions on the 
Riemann surface Ry, have been objects of 
extensive study since Poincaré (1882). 
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D. Modular Groups [10,13] 


The group 
F=5SL(2, Z) 


Ae) 


(or the corresponding group of linear frac- 
tional transformations) is called the (elliptic) 
modular group. The modular group T is a 
Fuchsian group of the first kind acting on $, 
and its fundamental region together with the 
correspondence of the equivalent sides is 
shown in Fig. 2. Fig. 3 illustrates the trans- 
formations under T of the fundamental tri- 
angle, where I is regarded as acting on the 
unit disk. From Fig. 2 we obtain the gen- 


a,b,c,de Z, ad — bc = i 


1 0 
erators of T (mod{+1,}) (L A D and the 


fundamental relations: 


03 =(0,0;,) = =h. 


There are two [’-equivalence classes of elliptic 
points of T, which are represented by ¢4 =i 
and ¢3, with [I;:{ +1} J=2, 1, :{+h}]=3; 
and only one I’-equivalence class of parabolic 
cusps, which coincides with QU {co}. The 
corresponding Riemann surface Rp is analyti- 
cally equivalent to the Riemann sphere. 


Fig. 2 


Fig. 3 


For a positive integer N, the totality T(N) 
of elements in T satisfying the condition 


a b\ /1 Oye dN)E | 
ee ee mo orms a norma 


subgroup of I, called a principal congruence 
subgroup of level N. (For the case N =2, see 
Fig. 4.) In general, a subgroup I’ of I contain- 
ing I (N) for some N ts called a congruence 
subgroup of I. (It is known that there actually 
exists a subgroup I” of I with a finite index, 
which is not a congruence subgroup.) For 
N>3, —L€T(N), so that T(N) is effective. 
(For N=1, 2, we have T(N)g={+4}.) If 

N >2, T(N) has no elliptic point. The number 
t(N) of the equivalence classes of cusps of I (N) 
and the genus g(N) of the corresponding Rie- 
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mann surface Ry are given as follows: 


t(1)=1, t(2)=3, 
t(N)=(1/2N)[E:T(N)] (N23), 
g(1)=g(2)=0, 


g(N)=1+((N—6)/24N)[T:T(N)] (N>3), 


where [T:T(N)]= N° TI, a(l — 1/p°). Auto- 
morphic functions with respect to T(N) are 
called tmodular functions of level N. 


sos 


p, 
4 
p. 
y 
7 


Fig. 4 
A fundamental region of T (2) that consists of six 
fundamental regions of T (1). 


E. The Case of Many Variables 


Up to the present time, discontinuous groups 
T and the corresponding automorphic func- 
tions have been studied only in the following 
cases: (2^) X = C", T = Z” (the free Abelian 
group of rank 2n, consisting of parallel trans- 
lations) [14] (— 3 Abelian Varieties); (3°) X is 
a bounded domain in C” and T is a discon- 
tinuous group of analytic automorphisms of 
X. (In this case, conditions (i), (ii), (iii) are 
equivalent.) 

In the case (3’), the group W(X) of all (com- 
plex) analytic automorphisms of X, endowed 
with its natural (tcompact-open) topology, 
becomes a tLie group, which has F as a 
discrete subgroup. When T\X is compact, 
it is known by the theory of automorphic 
functions (or by a theorem of Kodaira) that 
T\X becomes a tprojective variety, which is a 
tminimal model [3, 14]. In particular, when X 
is a tsymmetric bounded domain, i.e., when X 
becomes a tsymmetric Riemannian space with 
respect to its Bergman metric, the connected 
component G of the identity element of G’= 
A(X) (which incidentally coincides with that 
of the group I(X) of all tisometries of X) is a 
tsemisimple Lie group of noncompact type 
(i.e., without compact simple factors), and X 
can be identified with the homogeneous space 
of G by a maximal compact subgroup K of G. 
The theory of discontinuous groups of this 
type, initiated by Siegel (especially in the case 
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where X = §, = Sp(n, R)/K, *Siegel’s upper half- 
space; = Sp(n, Z), *Siegel’s modular group of 
degree n), O. Blumenthal, H. Braun, and L.-K. 
Hua, and continued by those in the German 
school such as M. Koecher, H. Maass, and 
others, has undergone substantial develop- 
ment in recent years under the influence of the 
theory of algebraic groups [5, 8, 14-17] (— 32 
Automorphic Functions). 

On the other hand, for a symmetric Rieman- 
nian space X of negative curvature, the group 
of isometries G=1(X) is a ‘semisimple Lie 
group of noncompact type with a finite num- 
ber of connected components and with a finite 
center, and X can be identified with the homo- 
geneous space of G by a maximal compact 
subgroup. Therefore the study of discontinu- 
ous groups of isometries of X can be reduced 
to that of discrete subgroups of a Lie group 
G of this type. A typical example is the case 
where X is the space of all real positive def- 
inite symmetric matrices of degree n with 
determinant 1; this space can be identified with 
the quotient space SL(n, R)/SO(n) (Ae SL(n, R) 
acts on X by X3S-'ASA). The unimodular 
group r =SL/(n, Z) is a discontinuous group of 
the first kind acting on this space X, and a 
method of constructing a fundamental region 
of T in X is provided by the Minkowski reduc- 
tion theory [6,7]. 


F. Discrete Subgroups of a Semisimple Lie 
Group 


Two subgroups T, I’ of a group G are called 
commensurable if I NT” is of finite index in 
both I and I. For a real ‘linear algebraic 
group GC GL(n, R) defined over Q, a subgroup 
TY commensurable with G} = GN GL({n, Z) is 
called an arithmetic subgroup (in the original 
sense) of G (examples: SL(n, Z), Sp(n, Z)). An 
arithmetic subgroup F is always discrete, and 
when G is semisimple, the quotient space \G 
is of finite volume (u(T\G)< œ) with respect 
to an invariant measure u. Moreover, F\G is 
compact if and only if G is *Q-compact (or tQ- 
anisotropic), that is, if Gg or Gz consists of 
only ‘semisimple elements (A. Borel, Harish- 
Chandra, G. D. Mostow, and T. Tamagawa 
[ 6, 7]); the same results remain true if G is 
tZariski connected and has no ‘character 
defined over Q. The proofs of these facts (and 
the compactification of the quotient space 
T\X for the noncompact case) depend on a 
construction of fundamental open sets that 
generalizes the reduction theory of Minkowski 
and Siegel [5, 8, 15-18]. 

For a connected semisimple Lie group G 
of noncompact type and a discrete subgroup 
IT with u(I\G) < œ, the following density 
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theorem holds (Borel, Ann Math., (2) 72 (1960)): 


(i) For any linear representation p of G, the 
linear closure of p(T) coincides with that of 
p(G); (ii) if G is algebraic, I is tZariski dense 
in G. Furthermore, suppose that G is a direct 
product of simple groups G; and the center of 
G is finite; F is called irreducible if its projec- 
tion on any (proper) partial product of {G;} is 
not dicrete. For instance, if G is a *Q-simple 
algebraic group then T = Gz is irreducible. In 
general, there exists a partition of the set of 
indices {i} such that F is commensurable with 
a direct product of irreducible discrete sub- 
groups of the partial products corresponding 
to this partition, and these irreducible compo- 
nents are unique up to commensurability. 

The method of constructing a discrete sub- 
group F of G=SL(2,R) in a geometric manner 
using the upper half-plane can be generalized 
to some extent to the construction of discrete 
subgroups of certain groups using hyperbolic 
spaces of low dimensions (E. B. Vinberg). 
Except for these few cases, today it is known 
that a discrete subgroup T of a semisimple Lie 
group G (of R-rank > 2) with u(T\G)< œ is 
arithmetic in a certain sense (— Section G). 
This implies there are only very few discrete 
subgroups for a semisimple Lie group of 
higher rank. Actually a number of facts sug- 
gesting this result were already known in the 
1960s. First, the only subgroups of SL(n, Z) 
(n> 3), Sp(n, Z) (n> 2) with finite index are 
congruence subgroups (H. Bass, M. Lazard, 
and J.-P. Serre; this result has been generalized 
to the case of an arbitrary tChevalley group 
over an algebraic number field by C. C. Moore 
and H. Matsumoto). Second, Gz = SL(n, Z), 
Sp(n, Z) are maximal in G [8]. Finally, it is 
known (rigidity theorem) that if a connected 
semisimple Lie group G with a finite center 
does not contain a simple factor which is 
locally isomorphic to SL(2, R), then any dis- 
crete subgroup T of G with compact quotient 
T\G has no nontrivial ‘deformation (i.e., all 
deformations are obtained from inner auto- 
morphisms of G) (A. Selberg, E. Calabi and E. 
Vesentini, and A. Weil [19]). This last result 
amounts to the vanishing of the cohomology 
group H'(T, X,ad), and in this connection an 
extensive study has been made by Y. Mat- 
sushima, S. Murakami, G. Shimura, and K. G. 
Raguanathan to determine the *Betti numbers 
of T\X, and more generally the cohomology 
groups of the type H(T, X, p) with an arbi- 
trary representation p of G. These cohomol- 
ogy groups are closely related to automor- 
phic forms with respect to I’ [8, 20].) For a 
tnilpotent or tsolvable Lie group, a general 
method of constructing discrete subgroups is 
known; see, for example, M. Saito, Amer. J. 
Math., 83 (1961).) 
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G. Rigidity and Arithmeticity 


A discrete subgroup T of a Lie group G with 
u(T\G)< œ is usually called a lattice of G. A 
lattice I` of G is said to be uniform if I'\G is 
compact. By a theorem of Borel and Harish- 
Chandra [7], an arithmetic subgroup of a real 
linear group (defined over Q) is a lattice. Using 
this result, Borel showed further, by a con- 
structive method, that any semisimple Lie 
group has a lattice, especially a uniform one 
(Borel, Topology, 2 (1963)). 

For a long time there were no known ex- 
amples of nonarithmetic irreducible lattices 
in semisimple Lie groups other than those 
locally isomorphic to $L,(R). This naturally 
led to Selberg’s conjecture that any irreducible 
(nonuniform) lattice in a semisimple Lie group 
G not locally isomorphic to SL,(R) ts arith- 
metic (Selberg, International Colloquium on 
Function Theory, Bombay, 1960). The con- 
jecture seemed to be well-grounded by the 
rigidity theorem of Weil and Selberg [19]. 

However, in 1966—1967, V. S. Makarov 
and Vinberg constructed nonarithmetic non- 
uniform lattices in SO(n, 1) (n =3, 4,5) by a 
geometric method; the lattices are generated 
by freflections [21]. Thus rigidity and arith- 
meticity do not necessarily coincide, and the 
conjecture should be considered under stronger 
conditions. 

As for rigidity of uniform lattices, Mostow 
established in 1970 the following strong rigid- 
ity theorem [22]: If G, G’ are semisimple Lie 
groups with trivial center and without com- 
pact factors, and are not locally isomorphic 
to SL,(R), and if I, F’ are irreducible uni- 
form lattices, then any isomorphism 0: > 
F” extends to an analytic isomorphism ĝ: G> 
G’ (namely, Ir =). The previous rigidity 
theorem is now implied by Mostow’s. If X is a 
simply connected symmetric space, then a 
tlocally symmetric space M covered by X is 
expressed as a quotient of X by a fixed-point- 
free properly discontinuous group I in the 
group F(X) of total isometries (when the Lie 
algebra of I(X) does not have a compact fac- 
tor, this condition for F is equivalent to saying 
that it is a torsion-free discrete subgroup of the 
semisimple Lie group I(X)), and the funda- 
mental group z,(M) of M =T\X ts isomorphic 
to T. The strong rigidity theorem implies that 
compact locally symmetric spaces M, and 
M, (of higher dimensions) are isomorphic as 
Riemannian spaces if and only if z,(M,) and 
m,(M,) are isomorphic as abstract groups. 

On the other hand, in 1973, G. A. Margulis 
proved the arithmeticity of irreducible non- 
uniform lattices in semisimple real linear 
groups of R-rank greater than 1 (Russian 
Math. Surveys, 29 (1974) (original in Russian, 
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1974); Functional Anal. Appl. 9 (1975) (original 
in Russian, 1975). M. S. Raghunathan also 
proved independently the same fact under 

a slightly stronger condition. Their results, to- 
gether with the rigidity of nonuniform lattices 
in (higher-dimensional) semisimple Lie groups 
of R-rank 1 established by H. Garland, Rag- 
hunathan, and G. Prasad (Inventiones Math., 
21 (1973)), imply that the strong rigidity theo- 
rem holds similarly for nonuniform lattices. 

The results of Margulis and Raghunathan 
show that the Selberg conjecture in the orig- 
inal sense is affirmative for the case of R-rank 
greater than 1. But neither argument is appli- 
cable to uniform lattices, for they depend deeply 
on the fact (proved by D. A. Kazdan and 
Margulis) that a nonuniform lattice contains a 
nonidentity unipotent element. Previously, in a 
lecture at the international congress of mathe- 
maticians at Moscow, 1966, I. I. Pyatetskii- 
Shapiro generalized the definition of arith- 
meticity and suggested that arithmeticity of 
lattices should be investigated without the 
distinction of whether they are uniform or 
nonuniform. His definition is equivalent to the 
following [9, 24]: For a connected semisimple 
algebraic group G defined over R, a subgroup 
T c G= Gr is an arithmetic subgroup (of G) 
if there is an algebraic group H defined over 
Q and a surjective homomorphism @:H> 
Ad G defined over R such that the Lie group 
(Ker @)p is compact and that ọ(Hz) and Ad T 
are commensurable. The uniform lattice in G 
that is constructed by the method of Borel is 
arithmetic in this sense. In 1974, Margulis 
finally established the following arithmeticity 
theorem [23, 24]: If the R-rank of G is not 
less than 2, an irreducible lattice T in G is an 
arithmetic subgroup of G (even if it is uni- 
form). In the same lecture, Pyatetskii-Shapiro 
also extended the Selberg conjecture to such 
“semisimple Lie groups” as those containing 
p-adic Lie groups as factors. Margulis proved 
this Pyatetskii-Shapiro conjecture affirmatively 
by showing that an analog of the strong rigid- 
ity theorem holds for such groups. 

As for semisimple Lie groups of R-rank 1, 
besides the lattices constructed by Makarov 
and Vinberg there are only the few examples 
of nonarithmetic lattices in SU (2, 1) presented 
by Mostow (Proc. Nat. Acad. Sci. US, 75 
(1978); Pacific J. Math., 86 (1980)). The prob- 
lem of arithmeticity still remains open for 
groups locally isomorphic to SO(n, 1) (n> 6), 
SU (n, 1) (n> 3), Sp(n, 1), or F4. 


H. Geometric Discontinuous Groups [1,25] 


The study of discontinuous groups T acting on 
a Euclidean or projective space X as a trans- 
formation group of a given structure is a class- 
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ical problem. All possibilities for such T have 
been enumerated in low-dimensional cases. 
For instance, there are 230 kinds of discontinu- 
ous groups of tEuclidean motions acting on 
3-dimensional Euclidean space without fixed 
subspaces, which are classified into 32 tcrystal 
classes (A. Schénflies and E. S. Fedorov, 1891- 
1892; — 92 Crystallographic Groups). All 
discontinuous groups of a Euclidean space 
generated by ‘reflections have also been enu- 
merated (H. S. M. Coxeter, 1934 [25, 26]). 


I. Kleinian Groups 


The last decade has seen considerable research 
on (finitely generated) Kletnian groups. This 
research is closely related to the theory of 
tquasiconformal mappings and the tmoduli of 
Riemann surfaces. 

Making use of Eichler cohomology and 
tpotentials, L. V. Ahlfors established his finite- 
ness theorem and L. Bers his area theorem. 
Bers and B. Maskit investigated the bound- 
aries of *Teichmiiller spaces and discovered 
Kleinian groups with the property that the 
complement of the set £ of limit points is 
connected and simply connected. 

Numerous mathematicians have subse- 
quently discussed the classification, defor- 
mation, and stability properties of the set Q, 
uniformization and deformation of Riemann 
surfaces with or without nodes, and other 
geometric properties. In their discussions, the 
theory of quasiconformal mappings has played 
an important role. The discontinuous groups 
of motions of hyperbolic 3-space have also 
been studied. 
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A. General Remarks 


Given a real number x, we denote by z(x} the 
number of primes not exceeding x. A. M. 

Legendre (1808) obtained empirically the for- 
mula 2(x)=x/(log x — B) for some constant B, 
and C. F. Gauss (1849) obtained the formula 


f du 
x)= , 
2 logu 


assuming the average density of primes to be 
1/log x. The Bertrand conjecture, which asserts 
the existence of at least one prime between x 
and 2x, was proved by P. L. Chebyshev (1848), 
who introduced the functions 


A(x) = y $ logp 


m=1 psx 


and 


W(x)= ð logp 
pu sx 


= 0(x) + O0(,/x)+ 09x) +o... 
(In this section, p represents a prime number.) 
He thereby proved 


Ax + O(./x) <O(x) < W(x) < (6/5) Ax + O(,/x), 


where A =log(2??731/35"° 3071/9). G. F. B. 
Riemann (1858) considered the function {(s) 
(where s=o-+ it is a complex variable), ex- 
pressed by the Dirichlet series Xpan °, which 
is convergent for g > 1. He found relations 
between the zeros of (s) (— 450 Zeta Func- 
tions) and (x). F. Mertens (1874) obtained the 
formulas 


l 
OLP Llogx+0(), 
p 


psx 


se loglogx+B+0O 
—=loglo —— J 
Boer logx 


psx 


1 eo" 1 
1—- }=-——_[1+0 : 
ae (ies) 


where c is the Euler constant and B is some 
constant. 
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B. Prime Number Theorem 


The prime number theorem 


_ mx)logx 
lim ———— = 


x>w x 


1, or 2(x)~ 


logx’ 


was proved almost simultaneously (1896) by J. 
Hadamard and C. J. de La Vallée-Poussin. 
Without using the theory of tentire functions, 
E. Landau (1908) established the formula 


n(x) = Lix + O(xe~*V 8%), 


where 


1-6 x 
Lix= lim (| -f ) a 
teo i+5/ logu 


is the ‘logarithmic integral. It can be shown by 
integration by parts that 


Li x F 1!x 
Top x log? x 


(k—1)!x o( 
log**! x E 


log* x 
For example, by taking x = 10’, we get n(x)= 
664,579, Lix = 664,918, and x/log x = 620,417. 

If the Dirichlet series f(s)\=X~,a,n 5 
satisfies the condition Xp <,a,~cx, then its 
abscissa of convergence is 1, and we have 
lim,..,+9(s— 1) f{s)=c. The converse is known 
as the tTauberian theorem. If F(s)= Èr 4,n7 
(a, 20) converges absolutely for ¢>1 and F(s) 
—c/(s—1) is analytic for c 2 1, then we obtain 
Lin<x4,~ ex (Wiener-Ikehara-Landau theorem, 
1932). Specifically, if we put —C'(s)/C(s)= 

1 A(n)n‘, then the conditions of the theo- 
rem are satisfied, and we obtain ©, < ,A(n)= 
W(x)~x. 

It is easily seen that the prime number 
theorem is equivalent to y(x)~x or O(x)~x. 
The *number-theoretic function A(n) (Man- 
goldt’s function) introduced above satisfies 
Lain Ald) =logn. It follows from the *Mébius 


S 


inversion formula that A(n)= Łan (d) log(n/d). 


Hence A(n)=log p (n= p") and =0 other- 
wise. Thus we obtain y (x)= È, <,A()= 

O(x). When f(x) = (x) or y(x), it is easy 

to show that [}(f(t)/t?)dt =logx + O(1) and 
lim inf,.,, f (x)/x <1<limsup,..,, f(x)/x. How- 
ever, it is not easy to prove f(x)~x. To do 

so, introduce the number-theoretic function 
M(n), which satisfies £4, M(d)=log? n. As 
before, we have M(n)= Liq), uld) log? (n/d); 
hence 


(21-1logp  (n=p', I> 1), 
M(n)=< 2logplogqg = (n=p'q",!>1,m>1), 
0 (otherwise). 


Thus we obtain Y,,<,M(n)=2xlogx + O(x). 
This leads to A. Selberg’s well-known formula 
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(1949): 
O(x)logx+ >) O(x/p)log p= 2x log x + O(x), 


psx 
which enabled him to prove 0(x)~ x. Thus 
he obtained for the first time a proof of the 
prime number theorem that does not use com- 
plex analytic methods. The simple formulas 
Èra H(n)/n=0 and &,, <, u(n) =0(x), obtained 
by H. von Mangoldt (1897), were revealed by 
Landau to have a deep meaning concerning 
the prime number theorem. Let z,(x) denote 
the number of integers not exceeding x that 
can be expressed as the product of r distinct 
primes. In generalizing the prime number 
theorem, Landau (1911) proved that 


1 x(loglog xj"! 
(r—1)! log x l 


m(x) ~ 


Let us write &(x)= E2; e" ™™, Riemann 
proved that 


(ijro 


ji (00 
= +| SO aa dx 
SUS — 


and obtained the well-known functional equa- 
tion for the zeta function (— 450 Zeta Func- 
tions B) 


n PT (s/2) Els) =n tT (1/2 — 8/2)C(1 — s8). 


This enables us to extend ¢(s) as a meromor- 
phic function to the whole complex plane. 
Utilizing this extended ¢(s) and the following 
result of O. Perron on Dirichlet series, we can 
estimate y(x). Let co (4 00) be the abscissa of 
convergence of F(s)= Xp- f(n)n *, and let 
a>0O,a>o,, and x>0. If 


atiT x’ 


lim — F(s)— ds 

T>% 2ni Jair S 

exists, then the limit is equal to 2), <x f(n), 
where >’ means that in the summation the last 
term f(x) is replaced by f(x)/2 if x is an in- 
teger. In many cases, F(s) has a pole at s=1, 
and the principal part of the sum is obtained 
from the residue at s=1, whereas the residual 
part is given by a certain contour integral. To 
estimate w(x), we use —¢'(s)/C(s) as F(s); hence 
the problem arises of determining the zeros of 
€(s). Riemann conjectured that all the zeros 
of €(s) in the strip 0<o@<1 must be situated 
on the vertical line ø= 1/2. If this so-called 
tRiemann hypothesis (— 450 Zeta Func- 
tions) is true, then it follows that n(x) = Li x 

+ O(,/x log x). The ultimate validity of 
Riemann’s hypothesis remains in doubt. 
Concerning this, the most recent major 

result is the following formula, obtained 

by I. M. Vinogradov (1958): 2(x)=Lix+ 
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O(xexp(—clog?*x/loglog!*x)). Without 
using Riemann’s hypothesis, J. E. Littlewood 
(1918) proved that 


: m(x)—Lix 
lim sup ———————_ > 0 
eae (./x /log x)logloglog x 
as 
lim ae A <0. 


ae (/x /log x)log log log x 
If we denote by N(T) the number of zeros of 
€(s) in the domain 0<a0<1,0<t<T, then we 
have 


1 
N(T)=— TlogT— T + O(log T). 


1+log2z 

20 
Let No(T) denote the number of zeros of ¢(s) 
on the interval o = 1/2,0<t< T. Selberg (1942) 
obtained the impressive result 


N(T)>cT log T. 


E. C. Titchmarsh (1936) showed that there 
exist 1041 zeros of {(s) in the domain 0<¢<1, 
0<t< 1468 and that all lie on the line o = 1/2. 
Computers have provided further results that 
seem to justify the Riemann hypothesis. For 
example, it has been calculated that there are 
75,000,000 zeros of f(s) in the domain 0<o<1, 
0 <t<32,585,736.4 and that all are simple 
zeros and lie on the line o= 1/2 (R. P. Brent, 
Math. Comp., 33 (1979)). N. Levinson proved 
in 1974 by another method that at least one- 
third of the zeros of the Riemann zeta function 
are on the line o = 1/2. The minimum of the 
modulus of the imaginary part of the zeros 
with o=1/2 is t=14.13.... 


C. Twin Primes 


Let p,, be the nth prime. We know from the 
prime number theorem that p, ~nlogn; more 
precisely, p, =nlogn+nloglogn+ O(n). A pair 
of primes differing only by 2 are called twin 
primes. It is still unknown whether there exist 
infinitely many twin primes. There exist in- 
finitely many n satisfying p,., —p, < clog p, (P. 
Erdés, 1940). Suppose that ¢(1/2 + it)=O(|t|*). 
A. E. Ingham (1937) proved that 


Putt —Pn<Pr> O=(1+4e)/(2+4c)+¢, 


by using the following density theorem related 
to the zeros of f(s): If we denote by N(a, T) 
the number of zeros of ¢(s) in the domain 
a<o<l,0<t<T (1/2<«<1}), then there is 

a positive constant c such that N(a, T)= 
O(T70 +290" log> T). The Lindelöf hypothe- 
sis asserts that the constant c can be made 
arbitrarily small. If the Riemann hypothesis 
holds, then the Lindelöf hypothesis also holds. 
It is clear that we can substitute 1/6 +e (e>0) 
for c, e being arbitrarily small. W. Haneke 
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(1963) showed that c can be replaced by 6/37 
+e, and consequently that © <61/98 +e. D. R. 
Heath-Brown and H. Iwaniec (1979) proved 
that © < 11/20 +e by using the tsieve method 
and the zero density theorem of L-functions 
(Section E). R. A. Rankin (1935) proved that 


Pn+1 — Pa > Clogp,loglogp,loglogloglog p, 
x (logloglog p„,) 7 


holds for infinitely many n. If we denote by 
n(x) the number of primes p< x such that p 
+2 is also a prime, then it has been conjec- 
tured by Hardy and Littlewood (Acta Math., 
44 (1922)) that 


i du 
m(x) ~C] —— 


> log? u 


as x00, where 


1 
C al fi TE ah P3203 2 cic. 
The numerical evidence provided by the com- 
putation 7,(10°) = 3424506 (Brent, Math. 
Comp., 28 (1974)) tends to indicate the truth of 
this conjecture. At present, 76: 3!°°+1 seems 
to be the largest known pair of twin prime 
numbers (H. C. Williams and C. R. Zarnke, 
Math. Comp., 26 (1972)). 


D. Prime Numbers in Arithmetic Progressions 


Let k be a positive integer, y(n) be a residue 
character modulo k (— 295 Number-Theoretic 
Functions D), and L(s,y)= 22, x(n)n * (o> 1) 
be the *Dirichlet L-function. The function 
L(s, x) of s defined by this series can be ex- 
tended to an analytic function in the whole 
complex plane in the same way as the Rie- 
mann zeta function. In particular, when y is 
the principal character, then L(s, x), thus ex- 
tended, is a meromorphic function whose only 
pole is situated at s=1 and is simple; otherwise 
the function L(s, x) is holomorphic on C. 
Using this function L(s, x) and in connection 
with his research concerning the tclass num- 
bers of quadratic forms, P. G. L. Dirichlet 
(1837) proved that there exist infinitely many 
primes in the arithmetical progression l, l+ k, 
1+2k,..., where l is the initial term and k a 
common difference relatively prime to l. This 
result is called the Dirichlet theorem (or prime 
number theorem for arithmetic progressions). 
Suppose that a runs over all integra! ideals 
in a tquadratic number field K of discriminant 
d. Then the tDedekind zeta function ¢,(s) of K 
is defined by (Na) ° for o> 1. By virture of 
the decomposition law of prime ideals (— 347 
Quadratic Fields C), we have €,(s)=€(s)L(s, x), 
where y(n) =(d/n) is the ‘Kronecker symbol. 
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Utilizing ¢,(s), we obtain formulas concerning 
the tclass number h(d) of the field K. If d>0, 
then h(d)=(/d/2loge)L(1, 7), where e is the 
tfundamental unit of K. On the other hand, if 
d<0, then h(d)=(w./—d/2n)L(1, 7), where w 
denotes the number of the roots of unity 
contained in K. It follows that L(1, y) > 
2log((1 +./5)/2)/./\dl. Let y be a character 
modulo k induced by a primitive character 

x°. Since we have L(s, 7)=L(s, x°) TT (1 — 
7°(p)p *), it can be shown that L(1, 7) 40 for 

a real character y. It is easy to prove that 

L(1, 4) 40 for a complex character y. These 
statements then lead to the Dirichlet theorem. 
The proof was simplifted by H. N. Shapiro 
(1951). Besides Landau’s three proofs for 

L(1, x) 40 for real character y (1908), there 

are elegant proofs by T. Estermann (1952), 
Selberg (1949), and others. For a character 
ymod k, we always have L(1, y)= O(logk), while 
L(1, x)’ = O(log k) with one possible excep- 
tion, which may occur only if y is a real char- 
acter. Even in this case, we have L(1, 7)! 

= O(k*) (where £>0 is arbitrary, but O depends 
on £). This result was obtained by K. L. Siegel 
(1934) from his study concerning class num- 
bers of imaginary quadratic number fields. His 
proof was simplified by Estermann (1948) and 
S. D. Chowla (1950). The importance of the 
prime number theorem for arithmetic pro- 
gressions was revealed when it was applied to 
the Goldbach problem (— 4 Additive Number 
Theory C). Concerning this problem, the man- 
ner in which the remainder term depends on 
the modulus k became an object of investiga- 
tion. The Page-Siegel-Walfisz theorem is 
convenient to use: Denote by x(x; k, l) the 
number of primes not exceeding x and of the 
form ky +1, where (k, 1)=1. If x > exp(k*) 
(where ¢>0 is arbitrary), then we have 


ere ug — 
sk, D=——+ 0( —__ }. 
DET olk) 


Further research on the distribution of zeros 
of L(s, y) is necessary for the study of 2(x; k, J) 
when x takes smaller values. If y is a nonprin- 
cipal real character, then L(s, y) may have at 
most one real zero f, around 1; this is called 
Siegel’s zero. Because of this fact, when x is 
small we are unable to obtain any formula to 
indicate the uniform distribution of primes. 
However, we have the following deep result, 
obtained by E. Fogels (1962). For a given 
positive ¢, there exist cọ(e) and c(é) such that 
n(x; k,1)>c(e)x/p(k)k* log x, provided that 

x 2k, On the other hand, Titchmarsh (1930), 
using the tsieve method, obtained x(x; k,l) 

= 0(x/p(k)log x) for x >k°. A theorem of this 
type is called the Burn-Titchmarsh theorem (— 
Section E). Fogels’s theorem is based on the 
following theorem by Yu. V. Linnik (1947) and 
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K. Prachar (1957), which is an extension of 
Page’s theorem (1935): We let 6 be the function 
defined by ô= 1 — $, if L(s, y) has an excep- 
tional real zero f,, and 6=c,/log(k(|t|+2)) 
otherwise (where c, is a suitably small num- 
ber). If we denote by £ the real part of any zero 
(#B,) of L(s, x), then the theorem states that 


z Ci i ( cje ) 
log(k(\t|+2)) S \Slogk(t]+2)/ 


provided that dlog(k(|t}+ 2))<c,. Linnik 
called this result the Deuring-Heilbronn phe- 
nomenon. Using this theorem and the zero 
density theorem, Linnik (1944) proved that 
pík, )<«k*, where p(k, 1) is the least prime in 
the arithmetic progression l, l+ k, 1+2k,... and 
Lis a constant. We call L Linnik’s constant. M. 
Jutila (1977) and Chen Jing-Run (1979) proved 
that L<80 and L<17, respectively. 

Let s be positive, b;, z; (1 <j <s) be complex, 
and l, m be real numbers satisfying max |z,| > 1, 
l> s, and m>0. Under these conditions P. 
Turan (1953) obtained the following funda- 
mental theorem, which ts called the power sum 
theorem: 


B<l 


max |bjz,+...+b,2% 
m<r<li+m 


l n 
>) min |, +... +bjl. 


This theorem is effective in research on the 
distribution of zeros of zeta functions. Based 
on this new method, Turán (1961), S. Knapow- 
ski (1962), and Fogels (1965) reached the re- 
sults cited above. 

Another method of research, considered to 
be a new sieve method, on the distribution of 
primes was introduced by Selberg and A. I. 
Vinogradov. This method was followed by W. 
B. Jurkat and H. E. Richert (1965). Linnik, A. 
Rényi (1950), and E. Bombieri (1965) founded 
still another method, called the large sieve 
method, by which P. T. Batemann, Chowla, 
and P. Erdos studied the value of L(1, y). 


E. Sieve Method 


Let A be a set of integers, and P a set of 
primes. For each peP, let Q, be a set of resi- 
dues mod p, and w(p) the number of residues 
belonging to Q,. The sieve method is a device 
for estimating (from above or below) the 
number of integers n belonging to the set 
S(A, P,Q,)= {n|ne A,nmod p€Q, for peP}. 
The combinatorial methods of the Brun, Bus- 
tab, and Richert sieves are interesting and 
efficient but quite complicated. Here, we shall 
briefly describe the Selberg sieve. As an ex- 
ample, denote by S(x; q,/) the number of n 
satisfying n=/ modq, n <x, (n, D)=1, where q 
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is a prime not exceeding x,z<x, D=[] 
and (/,q)=1; then 


S034.) = 2; en? 
n=imodg m 


2 
< } ( 2, ia) 
nsx d\(n, D) 

n=imod 


odq 


p<zPs 


where A, =1 and 4, is arbitrary for d>1. Thus 
the problem reduces to the optimization of 2,. 
Proceeding in this manner, C. Hooley (1967) 
and Y. Motohashi (1975), using analytic 
methods, obtained certain deep results relating 
to the Brun-Titchmarsh theorem. Using the 
large sieve, H. L. Montgomery and R. C. 
Vaughan (1973) expressed this theorem in a 
precise form: 2(x, g,1)<2x/@(q)log(x/q) for all 
q<x. 

Let n,,n,,...,n, be Z natural numbers not 
exceeding N, and Z(p, a) the number of n,’s 
such that n;=amod p. Rényi (1950) proved 
that 


pal Z}? 
>.> pe [zma -2} <2NZ. 
p<s%Nn3 a= p 


Set a,=1 for n=n, and a,=0 otherwise, and 
set S(x)= È „< ná exp (2ming); then 


a 2 
s(=) | 
p 
In view of this simple fact, Bombieri (1965) and 


P. X. Gallagher (1968) extended the problem 
and proved that, in general, 


a 
ya, exp (2x4 n) 
M<n<M+N q 


<(N+20*) $ fa,|?. 


M<n<M+N 


pat 


pol Z? 
PEO 


a=1 


2 


q<Q a=1 
(a,q)=1 


Similar results can be obtained for character 
sums Dy cncm+n4nX(n). In this connection 
Montgomery proved that 


S({n; M<n<M+N};P,Q,) 


woo") a 


a<0 pia P—@(p) 


q| P(z) 
where 
P(z)= I] p. 
peP 
pz 


Using these methods, the following estimate 
was obtained by Bombier: 


my, 4,1) 


max max 
a <x 
loge)? get 


-ali 
ela) Ja logu 


where A is arbitrary and B is a certain func- 


qsx"?( 


«x(logx) í, 
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tion of A (Vaughan). These methods, known 
collectively as the large sieve, were first di- 
rected toward proving the Rényi theorem 
stating that every sufficiently large even integer 
can be represented as the sum of a prime and 
an almost prime integer (M. B. Barban). After- 
ward, combining Richert’s sieve with this 
large sieve, Chen Jing-Run (1973) improved 
this result to a remarkable degree (— 4 Addi- 
tive Number Theory C). Several applications 
of Bombieri’s theorem have been demon- 
strated by P. D. T. A. Elliot and H. Halber- 
stam (1966): e.g., the estimation of the num- 
ber of representations of n as p+ x? + y? 
(Hooley, Linnik) and the estimation of 
Lp<n4(n—p) (Linnik, B. M. Bredihin). 

Let N(x, T, x) denote the number of zeros of 
L(s, xX) in the rectangle a <o <1, |t| < T. Com- 
bining the large sieve with new Fourier in- 
tegral techniques and the Turán tpower-sum 
method, Gallagher (1970) proved that there 
exists a positive constant c satisfying 


2 Næ T, 9 «QTE, 


xmodQ 


Y È} Nw yT”. 

q<Qxmodq 
Similar results were also obtained by G. Ha- 
lasz and Montgomery (1969) using another 
method, which was further exploited by Jutila, 
M. H. Huxley, and Iwaniec. In particular, 
Heath-Brown and Iwaniec (1979) deduced that 
n(x+y)—2(x)=cy(logx)! if y 2x! 170+e, 
Combined with the Deuring-Heilbronn 
phenomenon, the zero density theorem above 
not only establishes the Linnik theory, but 
also yields the following result, due to K. A. 
Rodoskii, T. Tatuzawa, and A. I. Vinogradov: 


1 * du 
n(x; q,1)=—— 
(gq) J2 logu 


l x 8-1 1 é 
pul | is iu40( x) 
olaq) Ja logu (4) 
if x >exp(logqgloglogq), where E=1 or 0 


according as Siegel’s zero f exists or not, and 
where 


A= Max(log q, (log x)” (log log x)"°). 


Throughout these researches, estimates of 
the type 


5 (34x) 


xmodq 
«ela (lti + 2)log*{q(|t| + 2)} 


and 
L Lug 
a) If, 
7 x 


T 
A 
zxmodq J -T 


<« p(q)(T + 2)log’ q(T + 2) 


4 


4 


dt 
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are of great importance and have been studied 
by A. F. Lavrik (1968), Linnik (1961), Huxley 
(1972), and K. Ramachandra (1975). 


F. The Prime Ideal Theorem in Algebraic 
Number Fields 


In an algebraic number field of finite degree, 
the prime number theorem is replaced by the 
prime ideal theorem (T. Mitsui, 1956, Fogels, 
1962), which is based on the theory of the 
Hecke tL-function (E. Hecke, 1917; Landau, 
1918). Let K be a finite Galois extension over 
an algebraic number field k of finite degree. 
Suppose that p is a prime ideal of k and is not 
ramified in K. The tFrobenius automorphism 
of a prime divisor of p in K determines a 
conjugate class C of the Galois group of K/k. 
Let n(x; C) denote the number of prime ideals 
in k associated with the class C in the above 
sense and whose norm does not exceed x. 
Then we have 


h(C) 
[K:k] 


n(x, C)= Lix+ O(xe ~e/logxy, 

where h(C) is the number of elements con- 
tained in C and c is a positive constant de- 
pending on K/k. This is an extension of Che- 
botarev’s theorem (E. Artin, 1923). 
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124 (XI.8) 

Distribution of Values of 
Functions of a Complex 
Variable 


A. General Remarks 


Suppose that we are given a function f: AB 
and that the variables z, w take on values in A, 
B, respectively. A value distribution of f(z) is a 
set of points z where f(z) takes on a certain 
value w (called w-points of f(z)). Value distri- 
bution theory is usually concerned with the 
study of value distributions of complex tana- 
lytic functions. Value distribution theory has 
been developed extensively and deeply for the 
case where A is the finite plane |z|< œ or the 
unit disk |z|<1 and B is the extended complex 
plane [w| < œ, and there are many interest- 
ing results in this case (— 272 Meromorphic 
Functions). Value distributions for analytic 
functions on general domains or on Riemann 
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surfaces or of several complex variables have 
also been studied. 


B. The Case of |z| < R < œ 


For a transcendental entire function f(z), 
every value (including o0) is a value of the 
tcluster set of f(z) at the point at infinity 
(tWeierstrass’s theorem). This theorem was 
improved in the following way by E. Picard in 
1879: A transcendental entire function f(z) has 
an infinite number of w-points for any finite 
value w except for at most one finite value 
(*Picard’s theorem). A value w for which the w- 
points are at most finite is said to be a Picard’s 
exceptional value. E. Borel gave a precise form 
of this theorem, taking into consideration the 
order of a function, and G. Julia proved the 
existence of Julia’s directions (= 272 Mero- 
morphic Functions; 429 Transcendental En- 
tire Functions). After other results in value 
distribution theory had been obtained by J. 
Hadamard, G. Valiron, and others, R. Nevan- 
linna published an important work in 1925 in 
which he established the so-called Nevanlinna 
theory of meromorphic functions in |z|<R< 
œ, unifying results obtained up until that 
time, and which became the starting point of 
the subsequent value distribution theory (— 
272 Meromorphic Functions). T. Shimizu and 
L. V. Ahlfors gave a geometric meaning to the 
Nevanlinna tcharacteristic function T(r). Ahl- 
fors established the theory of covering surfaces 
by metricotopological methods in 1935, and 
applied it to obtain the Nevanlinna theory 
and many other results on meromorphic func- 
tions. This theory revealed that the topological 
meaning of the number 2 of Picard’s excep- 
tional values is closely related to the *Euler 
characteristic 2 of the sphere. H. Selberg es- 
tablished the value distribution theory of 
talgebroidal functions and gave a precise form 
of G. Rémoundos’s theorem, which corre- 
sponds to Picard’s theorem for algebroidal 
functions. 

Moreover, as an extension of the value 
distribution theory of meromorphic functions, 
there is the theory of holomorphic curves or of 
systems of entire functions. Let fo, fis- Ja 
(n> 1) be entire functions without common 
zeros for all and for which fo: f,:...:f, is not 
constant. Put f=( fo, fis -<-> fa). This is a re- 
duced representation of a nonconstant holo- 
morphic curve f :C—>P"(C). For a =(&o, a, 

,a,)€C"*! — {0}, considering the zeros of 


(x, f) =&oJo t fi + -+ Anfa (40), 


we can extend Picard’s theorem and the Ne- 
vanlinna theory for meromorphic functions to 
holomorphic curves. 
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We a the characteristic function 
T(r) (= T(r, f)) as 


r 


>’ 


1 2n 
= log| f|d0 


0 


ro 


A= HAT Ft, 


where rọ is a fixed positive number and A (r){fo 
= A(r)— A(ro). Using a bivector Le =(fif; 
—ffi) of f and f’=(fo. fi. ---, Ja), we have 


another representation of i 


rin=2 | of [ete tdt do. 
T Jro S JoJo Ifl 


f is transcendental when lim T(r)/logr = œ 
and the number 


CEC" cofo +c fit 
+O tn= 


is the degeneracy index of f. It holds that 0< 
A<n—1. As an extension of Picard’s theorem, 
J. Dufresnoy stated that, for transcendental 
holomorphic curves f and among « in gen- 
eral position, the zeros of (a, f) are infinite 
except for at most n+å+ 1. X (<C"*!—{0}) 
is in general position when any p (<n+ 1) vec- 
tors in X are linearly independent. In connec- 
tion with this result, there are many Picard- 
type theorems, and when 2>0, there are some 
particular results for holomorphic curves. H. 
Cartan, H. and J. Weyl, and Ahlfors extended 
the Nevanlinna theory to holomorphic curves 
as follows. For «=(%,0,,.--,%,)€C"*! — {0}, 
we put 


A=dim{(co,¢1, ban 


|~| =the length of a, 
Hofl=Ma AWS) (a f) 49, 


and define the proximity function 


1 f7 1 
mnd Bri 


and the counting function 


1 2n r 
Nirad=z- | log (o, f)| d0] 
T Jo ro 
Then we have the first main theorem: For any 
a for which (a, f) #0, 


T(r) +m(ro, 0) =N (r, a) + m(r, æ); 


and the second main theorem: When 4 =0, for 
any %,,%2,...,%, in general position, 


(q—n—-lA)T(r)< 5 N(r, ;)+ S(r), 


where S(r)= O(logrT(r)) except for r in a set e 
of finite logarithmic measure, (i.e., fedlogr < 
oo). For A>0, it is conjectured that “qg —n — 
1” can be changed into “q—n—A—1.” This 

is unsolved except for some special cases. 


0} 
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Some results show relations between excep- 
tional values and the order as in the case of 
meromorphic functions. Nevanlinna theory 
has been extended to the associated curves f? 
(p=1,2,...,m f'=/f) for A=0, and there have 
been attempts to extend this theory of holo- 
morphic curves even further by generalizing 
domains or ranges. 


C. The Case of General Domains 


The value distributions of meromorphic func- 
tions defined in a general domain or an open 
Riemann surface depend on the function- 
theoretic “size” of the set of singularities (— 
169 Function-Theoretic Null Sets) or the type 
of the Riemann surface (— 367 Riemann Sur- 
faces). For instance, we have the following 
theorem of Picard type: A single-valued mero- 
morphic function with a set of singularities of 
tlogarithmic capacity zero takes on every 
value infinitely often in any neighborhood 
of each singularity except for at most an F,- 
set of values of logarithmic capacity zero 
(Hallstrém-Kametani theorem). For the study 
of value distribution at general singularities, it 
is useful to investigate cluster sets (— 62 Clus- 
ter Sets). In order to generalize the Nevan- 
linna theory to the case of general domains or 
Riemann surfaces, we take their exhaustions 
depending on a real parameter r and define the 
*counting function and so on (— 272 Meromor- 
phic Functions). G. J. Hällström established 
the value distribution theory of meromor- 
phic functions defined in the complementary 
domain D of a compact set E of logarithmic 
capacity zero by taking D, = {z|v(z)<r} as the 
exhaustion of D, where v(z) denotes the Evans 
potential for E, 1.e., v(z) is the potential corre- 
sponding to a positive mass distribution 4 
on E of total mass 1 which tends to +00 as 
z tends to any point of E. Thus the number 
of Picard’s exceptional values is not greater 
than 2+ č, where €=limsup,_,,, F(r)/T(r) 
with —n(r)=the Euler characteristic of D, and 
F(r)= f; n(r)dr (Hallstrom-Tsuji theorem). J. 
Tamura, L. Sario, and others studied the value 
distributions of meromorphic functions de- 
fined on Riemann surfaces. Sario succeeded in 
extending the Nevanlinna theory to analytic 
mappings of a Riemann surface R into another 
Riemann surface S by introducing a suitable 
metric in S to define the tproximity function. 
In the Nevanlinna theory on general do- 
mains, we must sometimes impose condi- 
tions that the functions must satisfy in order 
to obtain a concrete conclusion, for instance, 
the condition that € be finite in the Hällström- 
Tsuji theorem. But it is also important to 
determine the domains where some result can 
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be obtained without imposing any additional 
conditions on the functions. The Hällström- 
Kametani theorem is an example. Although 
the set of exceptional values in this theorem 
cannot be replaced generally by a smaller set 
than an F,-set of logarithmic capacity zero, we 
have the following theorem: Let E be a tCan- 
tor set with successive ratios €,=21,,/l,—1, 
where l, denotes the length of the segments 
that remain after repeating n times the process 
of deleting an open segment from the middle 
of another segment. Then any single-valued 
meromorphic function with E as the set of 
singularities has at most 3 Picard’s excep- 
tional values if lim, ,,, €,=0 and at most 2 if 
€,+1 =0(&?) (L. Carleson, K. Matsumoto). By 
weakening the conditions on E, one can get 
several improvements of this result. 
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125 (XII.7) 
Distributions and 
Hyperfunctions 


A. History 


The advancement of analysis, particularly in 
the field of partial differential equations and 
harmonic analysis, necessitated the generali- 
zation of the notion of functions and deriva- 
tives. For instance, “functions” such as Dirac’s 
tdelta function and tHeaviside’s function were 
used by physicists and engineering scientists 
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even though the former is not a function and 
the latter is not a differentiable function in the 
classical sense. The finite parts of divergent 
integrals, used by J. Hadamard to investigate 
the fundamental solutions of wave equations 
(1932), and the Riemann-Liouville integrals 
due to M. Riesz (1938) were the notions that 
eventually led to the theory of generalized 
functions. The rudiments of the idea of distri- 
bution, however, can also be found in other 
earlier works. S. Bochner (1932) and T. Carle- 
man (1944) discussed the Fourier transforms of 
locally integrable functions on the reals with 
growth as large as a polynomial. S. L. Sobolev 
introduced the notion of generalized derivative 
and also of generalized solution of differential 
equations by means of integration by parts in 
studying the Cauchy problem for hyperbolic 
equations (1936); J. Leray (1934), K. O. Fried- 
richs (1939), and C. B. Morrey, Jr. (1940) also 
discussed generalized derivatives. On the other 
hand, L. Fantappié (1943) investigated analytic 
functionals that are elements of the dual of the 
space of analytic functions and applied them 
to the theory of partial differential equations. 
Based on a systematic generalization of these 
investigations, L. Schwartz [1] established the 
theory of distributions (1945), which not only 
provided a mathematical foundation for a 
number of formal methods that had been used 
in mathematical physics but also gave new and 
powerful tools for the theories of differential 
equations (L. H6rmander [2]) and tFourier 
transforms. Furthermore, it has been applied 
to trepresentation theory of locally compact 
groups, the theory of probability, and the 
theory of manifolds (G. de Rham [3]). As will 
be seen in Section B, distributions are defined 
as continuous functionals on a certain func- 
tion space, and it is essential to select a func- 
tion space appropriate to the problems con- 
cerned. For this reason, I. M. Gel’fand and G. 
E. Shilov defined various classes of general- 
ized functions [4] as a natural extension of 
Schwartz’s theory. In this direction there are 
also various classes of ultradistributions intro- 
duced by C. Roumieu [5] and A. Beurling. 
Another but completely different approach 
was given by M. Sato [6, 15] in the form of the 
theory of hyperfunctions (1958). Intuitively a 
hyperfunction is the sum of (ideal) boundary 
values of holomorphic functions at the real 
axis. We obtain in this way an ultimate class 
of localizable generalized functions. Sato em- 
ployed the relative (or local) cohomology 
theory to define the “boundary values” and to 
prove their localizing property. Such an alge- 
braic approach to generalized functions led 
naturally to an algebraic treatment of systems 
of linear partial differential operators [7], 
called algebraic analysis by comparison with 
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algebraic geometry. Independently, Horman- 
der established a similar theory for distri- 
butions using Fourier analysis (— 274 Micro- 
local Analysis). 


B. Definition of Distributions 


Let (x) be a complex-valued function of x = 
(x,,...,X,) defined on an open set Q in the n- 
dimensional Euclidean space R”. By the sup- 
port (or carrier) of o, denoted by supp ¢, we 
mean the ‘closure of {x| p(x) 40} in Q. For 
multi-indices p, i.e., n-tuples p=(p,,..., p,) of 
nonnegative integers, we set |p| =p; +... + Pn- 
For a function ọ(x) of class C'”', we write 


Gl?! p(x) 


DP p =———__—_. 
Ox?! ... Oxf" 


(1) 
In particular, D- = ø. To indicate the 
variables x we shall adopt the notation D?. 

P(Q) denotes the set of all complex-valued 
functions of class C® defined on Q with tcom- 
pact support, which is a ‘linear space under 
the usual addition and scalar multiplication in 
function spaces. A sequence {¢,,} in 2(Q) is 
said to converge to 0 (the function identically 
equal to zero) as m> œ, denoted by Pm = 0, if 
there exists a compact set E in Q such that E 
contains supp Pm for every m, and for every p, 
{DP Pm} converges uniformly to 0 as moo. 

We sometimes abbreviate AQ) either as Y 
or, when we want to indicate the variables x, 
as 2y- 

A complex-valued ‘linear functional T de- 
fined on Z(Q) is called a distribution on Q if it 
is continuous on AQ), i.e., Pm => 0 implies 
T(Q,)—9. The set of all distributions on Q is 
denoted by #(Q) (or &’). For distributions S 
and T, the sum $+ T and scalar multiple «T 
are defined by (S+ T)(g) =S(@)+ T(@) and 
(«T)(~)=aT(¢), respectively, which are also 
distributions. Hence 2'(Q) is a flinear space. 


C. Examples of Distributions 


(1) Let f(x) be a tmeasurable and flocally 
integrable function on Q. Then a distribution 
T; is defined by T,(~) = @(x)f(x)dx. Here dx 
is the Lebesgue measure and the domain of 
integration is Q (in fact, supp @). If T, = T}, 
then f(x)=g(x) talmost everywhere. Thus we 
can identify the distribution T, with the cor- 
responding function f, and sometimes T, will 
be denoted simply by f. 

(2) Let u be a Radon measure on Q, i.e., 
a complex-valued tregular completely ad- 
ditive set function on the ‘Borel sets in Q. 
Then a distribution T, is defined by T,(~)= 
f p(x) u(dx). Example (1) is a special case where 
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pdx) = f(x)dx. Two Radon measures yt and v 
coincide if T, = T,. If the measure is concen- 
trated at the origin, then T,(¢~)=c@(0), denoted 
by cô, and ô is called Dirac’s distribution. 
Sometimes 6 is denoted by 6,, ôx» or 6(x) to 
indicate that it operates on functions of x. A 
distribution Te 4 (Q) is a measure if and only 
if T(¢,,)70 whenever the supports supp Pm of 
a sequence ¢,,€ YQ) are in a fixed compact 
set in Q and g,, converges uniformly to zero. 
A distribution T is called a positive distribu- 
tion if T(@)>0 for any pe 2(Q) that has non- 
negative values at all points x in Q. Every 
positive distribution is equal to a T, corre- 
sponding to a positive measure p. 

(3) For given p the distribution 5‘) is defined 
by ôP (@) =( —1)?'D? @(0) » §(0> 0) = § 

(4) Let g(x) be a function defined but not 
integrable on an interval (a, b), and assume 
that for any positive number ¢ it is integrable 
on (a +e, b). Moreover, assume that 


gtx)= Axa) >h), 


where Re/, > 1, A, is not an integer, and h(x) ts 
integrable on (a, b). Then 


b 
| g(x) dx —¥, A,A, — 1) ~'e! >= F(e) 
ate 

tends to a finite value as —>0. This limit is 
called the finite part (in French partie finie) of 
the integral f? g(x)dx, denoted by Pf f? g(x) dx: 


b 
pr g(x)dx 


a 


b 


=-y Ai b-ais | h(x)dx. 


7 Ay— I a 


In the same way, for every pe A(R’), T(g)= 
Pf f? g(x) p(x) dx is defined, and T is a distribu- 
tion which will be denoted by Pfg, and which 
is frequently called a pseudofunction. This 
notion is extended to the n-dimensional case 
and used to express the fundamental! solutions 
(— Section EE; Appendix A, Table 15.V) of 
thyperbolic partial differential equations. 


D. Localization of Distributions 


Let,’ be an open subset of Q. Every function 
pe BQ’) can be extended to a function pe 
AQ) by setting @(x)=0 for x EQ’. Thus if 

Te FQ), then a distribution SEF (Q’ is de- 
fined by S(g) = T(@) for every pe FQ), and S 
is called the restriction of T to Q’. Two distri- 
butions T and S are said to be equal in Q if 
their restrictions to Q are equal. If every point 
in Q has a neighborhood where T= S, then 
T=S. In this sense a distribution is determined 
completely by its local data, although the 
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notion of pointwise value, having exact mean- 
ing for functions, has no meaning for distri- 
butions. Moreover, we can construct a distri- 
bution with given local data in the following 
way: Suppose that an topen covering {Q,} of Q 
and a set of distributions Te Z(Q,) are given 
and that 7; and T, are equal in QN Q, for any 
jand k. Then there exists a unique distribu- 
tion Te Y'(Q) such that T= T; in Q; for each j. 
(Define T(~) = T(x) for a *partition of unity 
{a;} subordinate to {Q,;}). Namely, the distri- 
butions & form a tsheaf of linear spaces over 
R”. It is not fflabby but is soft, i.e., for any 
distribution T defined on a neighborhood of a 
closed set F in R”, there is a distribution on R” 
that coincides with T on a neighborhood of F. 

For each distribution Te 2'(Q) there is a 
largest open subset Q of Q on which T van- 
ishes. Its complement is called the support (or 
carrier) of T and is denoted by supp T. The 
support of the distribution given in example (1) 
coincides with that of the function f. The sup- 
port of 6” is the origin. 


E. Derivatives of Distributions 


In example (1) in Section C, if f(x) is a func- 
tion of class C*, then by integration by parts 
Tye (p) =(—1)" TD” p) for |p| <k. The right- 
hand side defines a distribution even if f is not 
differentiable. In view of this example, we 
define derivatives D?” T of any distribution T by 


(D°T)(y)=(—1)"'T(D’9), PEZ, (2) 


Any distribution is infinitely differentiable. 
Any locally integrable function is infinitely 
differentiable in the sense of distributions, and 
its derivatives D” T, are called distribution 
derivatives (or generalized derivatives or weak 
derivatives). 

For example: (1) D?6 = ô”; (2) (1-dimen- 
sional case) dx, /dx =1, d1/dx =ô, where x, = 
max(x,0) and I(x) is Heaviside’s function, 
which is equal to 1 for x >0 and to 0 for x <0. 


F. The Operation of Linear Differential 
Operators on Distributions 


Let T be a distribution and g(x) a C”-function 
on Q. Then we can define the product xT by 
(xT)(@)= T(a@), where the right-hand side is a 
continuous linear functional on AQ) since the 
multiplication g@t-a¢@ is continuous in Z(Q). 
We define the dual operator (or conjugate 
operator) P'(x, D) of a linear differential opera- 
tor P(x, D)= È a,(x)D” by 


P(x, D)o =} (—1)"D?(a,(x) g(x), (3) 


where the a,(x) are C*-functions on Q. Com- 
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bining differentiation of distributions, multi- 
plication by functions, and addition, we can 
apply partial differential operators to distribu- 
tions, and we have (P(x, D)T)(~) = T(P'(x, D)o). 

Linear differential operators commute with 
restriction mappings. In particular, we have 
supp P(x, D)T c supp T. In other words, linear 
differential operators are ‘sheaf homomor- 
phisms on the sheaf & of distributions over Q. 
On the other hand, every continuous sheaf 
homomorphism on 2 is a linear differential 
operator whose order is finite on each com- 
pact set in Q. Even when no continuity is 
assumed, a sheaf homomorphism on 9 is a 
linear differential operator except on a discrete 
subset of Q (J. Peetre). 


G. The Topology of 2 and 2’ 


Let @=K,€K,€... be a sequence of com- 

pact sets in Q that exhausts Q. For every 

nondecreasing sequence of positive numbers 

{a} = {dp, a ,...} and nondecreasing sequence 

of nonnegative integers {k} = {ko,k,,...} we 

set 

Piaj (P)=sup sup supa,|D? p(x)| (4) 
j20 |pl<k; x¢K; 

for pe AQ). We define a flocally convex topol- 

ogy of the space Z(Q) by taking the totality 

of the tseminorms pra}, {x as a fundamental 

system of continuous seminorms. Then &(Q) is 

a ‘nuclear *(LF)-space, and the convergence 

Pm => Q of a sequence Pp is identical to the 

convergence ,,—0 in this topology. A sub- 

set Bc BQ) is thounded in the topology 

if and only if there exist a compact set EcQ 

such that supp ọ € E for every ge B, and 

positive numbers M, for each p such that 

sup|D?@(x)|< M, for every ge B. 

The topology of the space 2(Q) is the 
strong topology of the tdual of A(Q): the 
ttopology of uniform convergence on every 
bounded set in Y(Q). Under this topology 
YQ) is a tnuclear freflexive linear topo- 
logical space. With respect to this topology, 
the linear differential operators of Section F 
are continuous in &, 

By virtue of the following convergence 
theorems, various limiting processes concern- 
ing distributions can be treated easily. If the 
limit lim 7;,(~) = T(¢) exists for every pe 2, 
where {7;} is a sequence of distributions, then 
Te and {T} is convergent to the distribu- 
tion T (convergence theorem). Moreover, for 
any p, D’? T; is convergent to D” T (theorem of 
termwise differentiation). Any bounded set in 
2 or & is totally bounded. Thus weak conver- 
gence of a sequence {T} implies strong conver- 
gence (convergence in the topology of the 
space 2’) (strong convergence theorem). 
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H. Distributions Depending on a Parameter 


Consider distributions T, depending on a 
parameter A, where / ranges over the real line, 
the complex plane, or more generally an open 
set in Euclidean space. The convergence theo- 
rem and the strong convergence theorem also 
hold in the case of a continuous parameter A. 
Thus T, is continuous (differentiable) with 
respect to the parameter / if T,(@) is continu- 
ous (differentiable) with respect to 4 for any 
geQ. If T, is continuous with respect to À, 
then D?T, is also continuous with respect to 
2. If T, is differentiable with respect to a real 
variable 4, then D? T, is also differentiable, and 
OD? T, /6A = D?(0T,/22). The same facts hold for 
the case of several real variables. For a com- 
plex parameter A, T, is holomorphic with re- 
spect to 4 if T,(@) is holomorphic for any pe 2. 
The fundamental properties of holomorphic 
functions also hold in this case. 

If T, 1s defined and continuous in an interval 
[a,b], then the integral T= f T, då with respect 
to 7 exists in Z and we have 


ro=| T,(p)ds, pe. 


a 


I. Distributions with Compact Support 


We denote by &(Q) (abbreviated to &) the 
space of all complex valued C%-functions 

on Q. &(Q) is a nuclear tFréchet space with 

the locally convex topology defined by the 
seminorms 

Pv K(Q)= sup | D? p(x)| (5) 
as p ranges over all multi-indices and K ranges 
over all compact sets in Q. 

We denote by 6’(Q) (abbreviated to &’) the 
tstrong dual of &(Q), i.e., the set of all continu- 
ous linear functionals on 6(Q) equipped with 
the topology of uniform convergence on 
bounded sets in 6(Q). &(Q) is a nuclear *(DF)- 
space. If Te &’(Q), then its restriction to A(Q) is 
a distribution with compact support. Con- 
versely if T is a distribution with compact 
support in Q, then choosing «e AQ) which is 
identically equal to 1 on a neighborhood of 
the support of T, we define a linear functional 
S on &(Q) by S(~)= T(ag). Then SE &(Q), and 
S is independent of the choice of «. In this 
sense, we can consider that S is the extension 
of T to &(Q) and identify S and T. thus &(Q) 
coincides with the set of all distributions with 
compact support in Q. 


J. Structure Theorems for Distributions 


A distribution Te@'(Q) is said to be of order at 
most k if | T(o)| S Piaj io lP) for {k} =fk, k, k, discs } 
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and for some {a}. A distribution of order 0 is a 
measure. Every distribution of finite order can 
be represented as a finite linear combination 
of derivatives (in the sense of distributions) of 
measures or locally integrable functions. The 
restriction of any distribution to a relatively 
compact open subset 9’ can be represented 
in this way because it is of finite order. There- 
fore the distributions form the smallest class 
of generalized functions that contains all 
locally integrable functions, is stable under 
differentiation, and forms a sheaf. Further- 
more, for any distribution T there exists a 
sequence {y,} < 2 that converges to T in the 
distribution sense. 

A closed set F is said to be regular if for 
every point a in F we have a neighborhood U 
of a and constants w and 1 >«>0 such that 
every pair of points x and yin FNU is con- 
nected by a curve contained in F with length 
less than or equal to w|x —y|*. If the support 
of a distribution T is contained in a regular 
closed set F, then 


T=} D"T,, 


(the expression is not necessarily unique, and 
the sum is locally finite), where the x, are 
complex-valued measures with support con- 
tained in F. In particular, if the support of a 
distribution T contains only one point a, then 
T can be represented uniquely as a finite linear 
combination 
T= Ý 4,D"6 


|lpl<m 


(xa) 


of derivatives of the distribution 6 defined 


by dx—a)(P) = Pla). 


(x-a) 


K. Tensor Products of Distributions 


Let Q, cR” and Q, c R” be open sets. If Te 
P'(Q,) and Se Y'(Q,), then there is a unique 
distribution TO Se @(Q,. x Q,), called the 
tensor product (or direct product), such that 
T® S(o(x)H(y)) = T()S(W) for any pe WQ,) 
and ye 2(9,). More directly we have Fubini’s 
theorem: T ® S(e(x, y)) = T.(S,(9(x, y))) = 
S,(T(e(x, y))) for any pe BQ, x Q,). If F and 
G are linear spaces of distributions on Q, and 
Q, respectively, then the ‘tensor product 

F ® Gis identified with the linear combina- 
tions of tensor products T® S of Te F and 
SeG, which form a linear space of distribu- 
tions on Q, x Q,. When F and G have locally 
convex topologies, the tcompleted tensor 
products F ® G and F QG are also usually 
identified with subspaces of 9’(Q, x Q,). For 
example, we have Z(Q,) © F'(Q,) = BQ, x 
Q,),6'Q,) ® 6'(Q,)=6'Q, x Q,) and 6Q,)® 
&(Q,) = 6(Q, x Q,) (including the topologies). 
Similarly, we have A(Q,) ® AQ,) = AQ, x Q,), 
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but the left-hand side has a strictly weaker 
topology than the right-hand side. Spaces of 
distributions with parameters (— Section H) 
are often identified with completed tensor 
products of spaces of distributions [8-11]. 


L. The Kernel Theorem 


Let Q, and Q, be as above. Every distribu- 
tion K on Q, x Q, induces the continuous 
linear mapping Lg: A(Q,)> F'(Q,) defined by 
(L(x) (W(y) = Klo Y), Y E2). 
Conversely every continuous linear mapping 
L: 2(Q,)> B(Q,) is equal to Lg fora Ke 
P(Q, x Q,) and the correspondence K> Ly is 
a topological isomorphism of the locally con- 
vex space Y(Q,. x Q,) onto L(F(Q,), J(Q,)) 
equipped with the topology of uniform conver- 
gence on the bounded sets (L. Schwartz’s 
kernel theorem, [8-11]). K is called the kernel 
(distribution) of the mapping Ly. 

Lx is a continuous linear mapping Z(Q,)> 
&(Q,) (resp. F(Q,)— &'(Q,)) if and only if Ke 
BQ.) È E(Q,) (resp. J(Q,) ® €(Q,)), in which 
case K is said to be regular (resp. compact) in 
y. Lg can be extended to a continuous linear 
mapping &(Q,)>9'(Q,) (resp. &(Q,) > B(Q,)) 
if and only if Ke&(Q,) È H'(Q,) (resp. &(Q,) Ô 
Y'(Q,)), in which case K is said to be regular 
(resp. compact) in x. K is said to be regular 
(resp. compact) if it is regular (resp. compact) 
both in x and y. Lg can be extended to a con- 
tinuous linear mapping 6(Q,)>6(Q,) (resp. 
6(Q,)> €(Q,)) if and only if Ke é(Q, x Q,) 
(resp. &(Q, x Q,)). Then K is said to be regu- 
larizing (resp. compactifying) [8, 10]. 


M. Convolution 


For distributions S and T on R”, assume that 
either S or T has a compact support or more 
generally that supp TN ({x} — supp 5) is locally 
uniformly bounded with respect to x. Then the 
correspondence 


pE P> Six Ty (p(x +y)) 


defines a distribution called the convolution of 
S and T and is denoted by S T. In particular, 
T, * T,= Tg where f*g is the tconvolution of 
functions f and g, and T;,, T;, and T;,, denote 
the distributions corresponding to f, g, and 
f*g, respectively. For example: T+6=6* T= 
T, D? Tx S= T*(D?S)=D?(T«S). 

Thus a solution of the partial differential 
equation P(D)T =S with constant coefficients 
is given by the convolution S» E of S with a 
tfundamental solution E of P(D), whenever 
the convolution exists. 

If Te@ and we, the convolution Tx @ 
is equal to a function f of class C® (or the 
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distribution corresponding to f), and f(y)= 
Ty (p(y —x)). f is called the regularization of 
T. For distributions S and T, assume that 

| f(x)g(y—x)| is integrable on R” for any regu- 
larization f =S x and g= Ty. Then there 
exists a unique distribution V such that 


Vž(o* D= froo» dx. 


This distribution V is called the general- 
ized convolution and is denoted by S* T 
(C. Chevalley). 


N. Tempered Distributions /’ 


We denote by (R") (abbreviated to S) the 
space of all trapidly decreasing functions of 
class C” on R". ¥ is a nuclear Fréchet space 
with the locally convex topology defined by 
the seminorms 


Pp, q(P) = Sup |x’ D4 p(x), (6) 


as p and q range over all multi-indices, where 
xP= xP... xP". We denote by #’(R") (abbre- 
viated to S’) the strong dual of Y(R"). Its 
elements are called tempered distributions (or 
slowly increasing distributions) and are identi- 
fied with their restrictions to A(R"). A distri- 
bution Te’ can be continuously extended to 
YF if and only if any regularization f =T*@ 
of T is a slowly increasing continuous func- 
tion (i.e., there is a polynomial P(x) such that 
f(x) <|P(x)}). F(R”) is a nuclear (DF)-space. 
Let 1<p< œ, and let q be the tconjugate ex- 
ponent. The strong dual of the function space 
2, R") is denoted by G(R”). Since Z is dense 
in 2, H, is identified with a linear subspace 
of F’. The strong dual of 2, (R") is equal to 
D,(R"). Let A(R") be the closed linear sub- 
space of 4(R") consisting of all functions p 
such that D” e C,(R”) for all p (— 168 Func- 
tion Spaces). It is also the closure of Y in 2. 
The strong dual of # is denoted by 2; (R"). 
Its elements are called integrable distributions 
because the strong dual of 2, (R") is equal to 
4(R") so that the integral T(1) has a meaning. 
Here 1 is the function identically equal to 1. 
tSobolev spaces W,(R") and *Besov spaces 
By, 4(R") are also linear subspaces of Y’(R"). 


O. Fourier Transforms 
The Fourier transform 
F(x)=(V/ T | oei- ixg)d¢ (7) 


is an isomorphism of .~ onto Sp, where x¢ 
=X,¢,+...+x,¢,. The Fourier transform 
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FTES of Te F is defined by 


(FT)(p)=T(FQ), pef. (8) 


The inverse Fourier transform is defined simi- 
larly, except that —iis replaced by i. For 
example: (1) F1 =(./2n)"6, where | is the 
distribution corresponding to the function 1. 
(2) F(D?T) =i" x? FT. 

A function ¢ of class C® is called a slowly 
increasing C®-function if @ and its derivatives 
of any order are slowly increasing continuous 
functions. The space Oy is the set of all such 
functions. Its image under the Fourier trans- 
form F (Cu), denoted by Og, coincides with 
the set of all distributions T such that any 
regularization f =T » ọ is a rapidly decreas- 
ing C”-function. A member of the space O 
is called a rapidly decreasing distribution. If 
xE Oy (PEO) and Te S’, then aTe F’ and 
F(aT)=( 20)" Fax FT (B*TeF' and 
F(B*T)=( 21 FBFT).IfT is a distri- 
bution with compact support in R”, then its 
Fourier transform is the tentire function f({)= 
(./2n)-" T(exp(—ix)), where {= č + iņe C". 
For a convex compact set K in R”, define its 
supporting function H, by Hx(y)=sup,.x XN. 

Paley-Wiener theorem. An entire function 
f(¢) on C” is the Fourier transform of a distri- 
bution (resp. a function of class C°) with sup- 
port in K if and only if (i) for any ¢>0 there is 
a constant C, such that | f(| < C,exp(H,(y)+ 
e|C|) on C”, and (ii) there are constants C and 
N such that | f(€)|<C(1+|€|)* (resp. for any 
N there is a constant C such that | f(| < 
C(1+ |é|)~%) on R”. 


P. Fourier Series and Distributions on Tori 


The n-dimensional ttorus T” is the quotient 
space of R” with respect to the equivalence 
relation x;=y,;mod Z (j=1,...,n). The space 
&(T") of distributions on T” is defined to be 
the strong dual of 2(T”) = &(T"). The volume 
element dx of T” is defined from that of R”. 


_ Thus, for an integrable function f, we can 


define a distribution T, by 


T;(~)= | f(x)o(x)dx, pe G(T"). 
T 


Consider the family of functions f,(x)= 
exp(2zipx), where p ranges over all n-tuples 
of integers. Then any distribution T on T” has 
the Fourier series expansion T = ic, f,(x), 
where the Fourier coefficients c, = T(f_,) are 
slowly increasing, i.e., |c,| <C(1+|p|?)* for 
some k and C. Conversely if {c,} is a slowly 
increasing sequence, then È c, f, converges to 
a distribution T on T”. 
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Q. Substitution 


Let f=(f;,...,f,,) be a C*-mapping from an 
open set Q in R” into an open set © in R”, and 
assume that the trank of the Jacobian matrix 
(6f,/0x,) i=1,...,mj=1,...,n) is equal to m in 
a neighborhood of the tinverse image E of the 
support of a distribution S = S€ 2'(®). Ina 
neighborhood U of E, we can choose u, = 
G(X), -.-, Up—m = Jn—m(X) SO that the trans- 
formation (y, u)=(f(x), g(x)) has the inverse 
transformation x = Ņ(y, u) of class C”. If the 
support of pe AQ) is contained in U, then 
defining @ by 


y= | p(y, u))J(y, u) du, 
ae 


we have ĝe AQ), where J(y, u) is the abso- 
lute value of the tJacobian of W(y, u), and @ 

is independent of the choice of g,...,g9,—m- 
Now we define T(~)= S(@) if supp@ € U, and 
T(g~)=0 if supp @ does not intersect E. Since a 
distribution can be determined by its local 
behavior, we have a distribution Te 2'(Q) with 
support E, which is denoted by T=Sof=S(f) 
= Sw and is called the substituted distribution 
of Sy by y= f(x). It is also called the pullback 
of S by f and is denoted by f*S. The chain 
rule for derivatives of composites 


6 m Of, (0S 
—(Sof)= —{ —— 9 
ax; of) 2 HEr) (9) 
also holds. 
For example: (1) If S = ô®} (ye R”), then E is 
the surface fi (x)= f (x)=... = fa(x)=0. Assum- 


ing that (fi, ---, fm) satisfies the condition in the 

previous paragraph, we obtain the distribution 
(fi, -.., fa) with support contained in a sur- 

face. From the fact 6 = D18, we can write 


_ lS, f, vice tia) 
iC Age ie 


(Gel’fand-Shilov notation). (2) For the map- 
ping f(x)=Ax+b, where A is ann xn regular 
matrix and b is an n-vector, we can define the 
substituted distribution of S by f= f(x) for any 
Se QR"). For instance, ôx- (P)= (b). d..2—2) 
= (2c) t(x- + x+) (C>0, xER'). 


Sfi, ars 


R. Distributions and Currents on a 
Differentiable Manifold 


Let M be an n-dimensional tdifferentiable 
manifold of class C”. Let {(U,, W,)} be an 
tatlas. Then a distribution T on M is defined to 
be a collection of distributions T, on w,(U,)< 
R” such that 7, on W,(U,/ Uz) is equal to T,0 
(Wo Wg) for any x and £. Since the distribu- 
tions obey the same transformation law as 
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functions under coordinate transformation, 
this definition has an invariant meaning, and 
every locally integrable function on M is re- 
garded as a distribution. More generally, a 
distribution tcross section of a tvector bundle 
of class C” is defined in the same way. The 
most important is the case of the tp-fold ex- 
terior power of the tcotangent bundle. Its 
distribution cross sections, i.e., texterior dif- 
ferential forms of order p with distribution 
coefficients, are called currents of degree p. If 
M is foriented, then the space of currents of 
degree p on M is identified with the dual space 
of the locally convex space Z”~?)(M) of all 
differential forms of degree n— p, of class C” 
and with compact support in M. For example, 
if C is an (n—p)-dimensional fsingular chain, a 
current To of degree p is defined by T-(a) = fca. 
If M is not torientable, we have to consider 
either the double covering M of M or the cross 
sections of the tensor product of the bundle of 
(n — p)-covectors and the orientation bundle 
[1,3]. Currents were introduced by de Rham 
to prove his celebrated isomorphism theo- 
rem of the tde Rham cohomology groups de- 
fined by differential forms and the tsingular 
cohomology groups defined by singular co- 
chains (— 105 Differentiable Manifolds). 


S. Gel’fand-Shilov Generalized Functions 


Let E and F be tlocally convex spaces for 
which a continuous linear injection i: E>F 
with dense range is defined. Then the dual 7’: 
FE’ is a continuous linear injection with 
weak*-dense range. Therefore if we shrink the 
function space E, we obtain a larger space E’ 
of continuous linear functionals. Gelfand and 
Shilov [4] introduced many function spaces 
E, called fundamental spaces or test function 
spaces, and defined corresponding spaces E’ of 
generalized functions. Their motivations were 
in applications to the theory of partial dif- 
ferential equations, where Fourier transforms 
play an essential role and Schwartz’s frame- 
work of tempered distributions is too restric- 
tive. They considered a pair of function spaces 
E and E such that the Fourier transformation 
F :E-E is an isomorphism of locally convex 
spaces. Then the Fourier transformation de- 
fined to be the dual of F gives an isomor- 
phism E'— E’ of spaces of generalized func- 
tions. The function spaces E and E are often 
spaces of entire functions, so that the Gel’fand- 
Shilov generalized functions are not neces- 
sarily localizable in R”. As typical examples of 
spaces E and Ë we shall mention only spaces 
of type S in the next section. 

L. Ehrenpreis’s analytically uniform spaces 
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[12] are also a class of generalized function 
spaces introduced from a similar point of view. 


T. Spaces of Type S 


Let a, $, A, B, A, and B be n-dimensional 
vectors. By A> B we mean A,;> B,(j=1,...,n). 
We use notations A? = AM... ABr, p = phi... 
pp. (i) For «20, A>0 the space S, 4 consists 
of all C”-functions ø such that seminorms 


|x? D*p(x)| 
Pa, (P) =sup sup = — (10) 


p Ap” 

are finite for all A > A and q. For example, 
So, a is the space of all functions of class C” 
with support in {x||x,|< Aj, j=1,...,n}. (ii) 
For f >0, B>0 the space S®? consists of all 
C”-functions ọ such that the seminorms 


|x? D1o(x)| 
Pp.ä(P)=SUPSUP ap (11) 
are finite for all B >B and q. (iii) For x, B>0 
and A, B>0, the space SË? consists of all C”- 
functions such that the seminorms 

|x? D*p(x)| 


P4,3()=sup sup sup 


x p a APBIprtgt 


are finite for all A > A and B>B. The topol- 
ogies for these spaces are given in terms of 
the seminorms (10), (11), and (12), respectively. 
These spaces are generically called spaces of 
type S. 


FS, a) =S% (&>0), 
F(SP*=Sp s  (B>0), 
F(SA=Si5 (a+ B>1). 
F (So, s)=S*, 

F (so) = Sop, 

F(a =e (a+ B=1), 


where A’ = Aexp(1/A) and B’ = Bexp(1/B). 
We set S= |] Sa, 4, Sf = |_] SPP, and SË = 
(J S22, where A and B range over all positive 
n-dimensional vectors. S$ + {0} if and only if 
one of the following conditions is satisfied: (i) 
x+ß 21, «>0, B>0; (ii) «> 1, B =0; (iti) «=0, 
B>1.In such a case the space S*’ of general- 
ized functions contains the tempered distribu- 
tions F(R"). 


U. Ultradistributions 


The localizing property of distributions is 
proved mainly from the existence of fpartitions 
of unity by functions in 2. Therefore if a test 
function space admits partitions of unity, the 
corresponding class of generalized functions 
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forms a sheaf. Roumieu [5] took the space 

2 m (Q) of tultradifferentiable functions of 
class {M,} with compact support. We can also 
use the space 2m (Q) of class (M,). The cor- 
responding generalized functions are called 
ultradistributions of class {M,} and of class 
(M,), respectively. Here M, is a sequence of 
positive numbers satisfying the logarithmic 
convexity M?<M,_,M,., and the Denjoy- 
Carleman condition X M,/M,+, < ©. Dry (O) 
(resp. Ay,)(Q)) is the space of all functions 

oe 2(Q) such that there are constants k and 

C (resp. for any k>0 there ts a C) for which 
we have |D*p(x)| < Ck"IM q. In particular, 

the Gevrey classes {s} = { p!*} and (s) =(p!‘) 
defined for s>1 are important, and they ap- 
pear often in theory of differential equations. 
Almost all results for distributions have been 
extended to ultradistributions under appropri- 
ate conditions on M,, which are satisfied by 
Gevrey sequences p!* (— [14] in particular). 
Closely connected are A. Beurling’s test func- 
tion spaces F,,(Q) (— [13]), where œw is a func- 
tion on R” continuous at the origin, satisfying 
0=0(0) <o +4) <@(¢)+ «(n) for all č and y 
in R” and such that f o(&)(1 +1¢|) "7! dë < œ. 
G.,(Q) is defined to be the space of all pe 2(Q) 
such that the Fourier transform ¢ satisfies 
FIGE) exp(Am(é))dé < œ for any A>0. If w(é) 
=log(1+|č|), then 2, = 2; if o(č)=|č|!, then 
Ba =Z. Let @(C)= a — é). The space 2,9) 
of Beurling’s generalized distributions is defined 
to be the strong dual of 2(9). 


V. Hyperfunctions 


In this and the following sections we mean by 
a cone [c R” a convex open cone with vertex 
at 0. For two cones T, A, we write ACT if 
ANS” ' is relatively compact in -S""!, where 
S"! = {|x|=1}. By a wedge we mean an open 
subset of C” of the form Q + il, where Qc R” is 
an open subset and T c R” is a cone. F is called 
the opening of the wedge and Q its edge. By an 
infinitesimal wedge (0-wedge for short) or a 
tuboid of opening T and edge Q, we mean a 
complex open set U such that U cQ +iľ and 
that for any AET, U contains the part of 
Q+ iA which is contained in some complex 
neighborhood of the edge Q. The symbol Q + 
ilO will represent any one of such open sets, 
and ©(Q + iTO) (the inductive limit of) the total- 
ity of functions holomorphic on some of them. 
A hyperfunction f(x) on an open set Qc R” 
is an equivalence class of formal expressions of 
the form 


foo=¥ Re+, (13) 


where F(z)e O(Q + iT,0). {F;(z)} is called a set 
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of defining functions of f(x). Here the equiva- 
lence relation is given as 


F(x +110) + F(x +i, 0) =(F,+ F) (x+ iE NT,0) 
(14) 


if OT, #2, that is, we can contract two terms 
into a single term as above and, conversely, 
can decompose, if possible, a term in the 
inverse way. These are considered to be modi- 
fications of the expression of the same hyper- 
function. The totality of hyperfunctions on Q 
is denoted by A(Q). It is a linear space by 
virtue of the linear structure naturally induced 
from that of holomorphic functions, combined 
with the above equivalence relation. 

The symbol F,(x + i150), which represents by 
itself a hyperfunction, is called the boundary 
value of F(z) to the real axis. It is merely 
formal and does not imply any topological 
limit, though there is some justification for the 
terminology as will be seen in Section Z. 

In the case of one variable, we have only 
two kinds of wedges Q+iR*, hence a hyper- 
function can be expressed by two terms: 


F(x + i0) — F_(x —i0). (15) 


Some examples of hyperfunctions of one vari- 
able are 6(x)= —(2ni)'((x + i0)! —(x—i0)7"); 
1(x)= — (27i) !(log(— x —i0) —log(— x + i0)); 
Pfx 1 =((x+ i0) ! +(x —i0)71)/2. 


W. Localization of Hyperfunctions 


If QO’ Q is an open subset, the restriction 
mapping BQ) AO’) is induced from that for 
holomorphic functions. With this structure the 
correspondence Q- (9) becomes a *presheaf. 
It is in fact a sheaf, because it can be expressed 
by the terminology of relative (or local) co- 
homology as follows: Let H&(C", F) denote 
the kth relative cohomology group of the pair 
(C", C"~Q) (also called the kth local coho- 
mology group with support in Q) with coeffi- 
cients in a sheaf F on C”. (It is by definition 
the kth tderived functor of FrT,(C", F)= 
the totality of sections of F defined on a 
neighborhood of Q and with support in Q and 
is calculated as the kth cohomology group of 
the tcomplex I)(C”, Z), where Z denotes any 
flabby resolution (1.e., tresolution by tflabby 
sheaves) of F.) Let #kn(F) denote the kth 
derived sheaf of F to R”. (It is by definition the 
sheaf on R” associated with the presheaf 

Q HE(C", F ).) Then the cohomological 
definition of the sheaf of hyperfunctions is # 
= HAC) (the orientation being neglected). A 
fundamental theorem by Sato says that R” 

c C” is purely n-codimensional with respect to 
© (i.e, AK(O)=0 for k#n), and moreover - 
H§(C", ©)=0 for k#n for any open set Qc R”. 
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Then by the general theory it can be shown 
that the remaining H{(C", ©) agrees with the 
section module A(Q) of HR.(@). Since H"(V, ©) 
=0 for any open set V c C" (B. Malgrange), it 
follows further that the sheaf 2 is flabby, i.e., 
its sections on any open set can always be 
extended to the whole space. 

If U is a tStein neighborhood of Q, then 
H¢(C", ©) can be expressed using the covering 
cohomology as the quotient space 


ouso] O(U #;Q), (16) 


where 
U4Q={zeU|pr,(z)¢pr,(Q) forall k}, 


U #,Q={zeU|pr,(2)¢pr,(Q) fork#j} (17) 


and pr, is the projection from C” to the kth 
coordinate. Then the isomorphism Hg(C", @) = 
BQ) is induced by the correspondence 


OU €Q)3 F(z) > [F(z)] 
=} sgn oF(x+il, Ne AQ), (18) 


where T, is the g-orthant {ye R"|a,y,>0,j= 
1,...,n} and sgnao=o, ...a,. F(z) is called a 
defining function of the corresponding hyper- 
function. For one variable, any complex neigh- 
borhood U >92 is Stein, and the above isomor- 
phism reads O(U\.Q)/O(U) = AQ), from which 
the naturality of the sign in (15) follows. 

Thus the notion of support is also legitimate 
for hyperfunctions. The sheaf of hyperfunc- 
tions # does not admit partitions of unity as 
for &. It is, however, flabby. Consequently, 
given a decomposition of a closed set into 
locally finite closed subsets E=|_),-, E, and 
a hyperfunction f with support in E, we can 
always find hyperfunctions f, with support in 
E, such that f=%,., f}. For distributions this 
property holds only under some regularity 
assumption for the decomposition. 

There are several practical criteria to deter- 
mine whether or not a hyperfunction is zero in 
some open set Q. These are called the edge of 
the wedge theorem. A hyperfunction F(x + iI 0) 
with single expression is zero if and only if F(z) 
itself is zero. F,(x+i1,0)= F(x +iI°,0) if and 
only if they stick together to a function in O(Q 
+i(T, +T,)0) (Epstein type). (Note that I; +T, 
is equal to the convex hull of T; UI,, e.g., I + 
(—T)=R" (Bogolyubov type).) £}, F(x + i150) 
=0 if and only if there exist G,(z)e@(Q + i(I; + 
T,,)0), j, k=1,..., N, such that G;(z)= G,,(z) 
and F= Di, Gy, j=1,...,N (A. Martineau 
[19]). These are interpretations of cohomology 
in terms of coverings and have global variants 
concerning the envelope of holomorphy. 

The real analytic functions o €. (Q) on Q 
are naturally included in A(Q) via the ex- 
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pression (x + iF0) for any F or with r =R”. 
They form a subsheaf. Let f be a hyperfunc- 
tion on Q. The complement in Q of the largest 
open subset Q’ <Q, where f(x) is real analytic, 
is called the singular support of f(x) and is 
denoted by singsupp f. If Q is bounded and 
sing supp f € K, then we can choose an ex- 
pression of the form (13), where Fj(z) can be 
continued analytically to ON K. If suppfcK, 
then these F;(z) satisfy X F,(x)=0 on QNK in 
the usual sense. 


X. Operations on Hyperfunctions 


The derivatives of a hyperfunction f(x) with 
the expression (13) are defined through the 
defining functions as D?” f(x) = L(D? F,)(x + 
i] ;0). The product by a real analytic function 
w(x) is defined by W(x) f(x) = L(WF)(x + iT;0). 
Combining these, we have the operation of a 
linear partial differential operator with real 
analytic coefficients P(x, D) on hyperfunctions. 
It is a tsheaf homomorphism. 

Let fe BQ) and D cQ be a compact 
set with piecewise smooth boundary. If 
singsupp f N ôD = @, then by means of the 
special expression (13) mentioned at the end of 
Section W the definite integral is defined as 


| fooydx= 5 | Fz), 


where D, is a path deformed from D in such a 
way that 0D,;=¢D and F;(z) is holomorphic on 
D,. The result is independent of the choice of 
deformations or of the boundary value ex- 
pression employed. 

If supp f€ D, the result is also indepen- 
dent of D; hence it can be written as fg: f(x)dx. 
If f(x, t) is a hyperfunction of the two groups 
of variables (x, t)EQ x V such that singsupp fN 
oD x V= Ø, then the integral fp f(x, thdxe 
BV) is defined by the same method. It com- 
mutes with differentiation er integration with 
respect to t. 

If f(x)=X F(x + 50), g(x) = 1 G,(x + iA, 9), 
then we can define the product by f(x)g(x)= 
X(F,G,)(x + 1; A,0) under the assumption 
that I; A, # Ø for every pair j, k. Especially, 
the product f(x)g(t) is always legitimate for 
two hyperfunctions depending on different 
groups of vartables. (It can also be interpreted 
as the tensor product.) 

A real analytic coordinate transformation 
x = @(%):Q-9 extends naturally to a holo- 
morphic coordinate transformation z= ®(Z) 
on a complex neighborhood. A 0-wedge Q + 
ir0 is transformed by ®~' to a twisted wedge 
containing, for each xeQ and ACT, a 0- 
wedge &,+ i(D®~'), AO with some (real) neigh- 
borhood ©, of =~ !(x), where (DO~'), is the 
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derivative or the Jacobian matrix of the map- 
ping ®~' at x. Thus the pullback ®* f(x) = 


f(®(%))e AQ) of f(x)e AQ) by the transforma- 


tion x = ©(xX) can be defined by substituting 

z= (2) into the defining functions. It is consis- 
tent with the definition of the coordinate trans- 
formation for real analytic functions. Also, the 
law of the change of variables for the definite 
integral is the same as that for real analytic 
functions. This can be extended to the general 
operation of substitution as mentioned in Sec- 
tion Q for distributions or even to much more 
general operations (— Section CC). 

The convolution (f * g)(x)= Jae f(x —tg(t)dt 
is defined under the same assumption on sup- 
port as in the case of distributions. It can be 
defined either literally as the composition of 
the above operations, or directly by way of the 
defining functions: For example, if g has com- 
pact support € D and È G,(x+iA,0) is an 
expression as mentioned at the end of Section 
W, then by choosing a suitable deformation D, 
of D, we have 


owy] | Fle—G,e) | 


zx+i(T j;+A,)0 


Y. Hyperfunctions and Analytic Functionals 


The totality ALK] of hyperfunctions with sup- 
port in a compact set K becomes a nuclear 
Fréchet space. It is the dual of the nuclear 
(DF)-space .(K) of real analytic functions de- 
fined on a neighborhood of K. Thus [K] is 
the space of analytic functionals with tporter 
in the real compact set K. The duality is given 
by the definite integral <f; p> = Jp» f(x) p(x) dx 
for fe BLK] and pe.s(K). A sequence { f,(x)} 
in ALK] converges if and only if it admits an 
expression (13) for a common fixed set of 0- 
wedges Q+ iJ;0, j=1,...,N, such that the 
defining functions F, ;¢ O(Q + iI;0) are also 
holomorphic on a fixed complex neighbor- 
hood U of QNK and converges locally uni- 
formly in Q+ iT;0U U, where Q is a (real) 
neighborhood of K. 

BLK] is isomorphic to HR(C", ©), and 
the above duality is a special case of the 
Martineau-Harvey duality Hg (C”, ©) = O(K)’ 
for a Stein compact set K c C". In the 1- 
dimensional case this is due to G. Köthe. If 
K=[a,,b,]x... x [a,,5,], then Hg(C”, ©) can 
be represented, including the topology, by the 
quotient space 


J 


ou#K)| OU #;K), 

=1 

where U is a Stein neighborhood of K and 

U # K, U#;K are defined in the same way as 
(17). If we choose U=U, x... x U,, then by 


483 


way of a defining function F(z)e O(U # K) for 
f¢ BK] the inner product is given by the 
contour integral 


| f(x)e(x)dx 
“= 


=d -d F(z)p(z)dz,...dz,, 


where y,< U; is a closed path surrounding 

[a;, b;] once in the positive sense. Similar inte- 
gral formulas are known for some special K 
of various types. 

Starting from analytic functionals we can 
reconstruct the sheaf of hyperfunctions. For 
example, we can put @(Q) =the totality of 
locally finite sums of analytic functionals with 
porter in Q modulo the rearrangement of 
supports by decomposition (Martineau [17 ]). 
If Q is bounded, we can also put AB(Q)= B[Q]/ 
BN] (Schapira [18]). The proof of local- 
izability and/or flabbiness is based on the 
decomposability of support (which is the dual 
of the exact sequence 0D>.A/(K UL)> A (K) 
AL) AKN L)—>0) and the denseness of 
BL K|- BLL] (which is the dual of the unique 
continuation property 0—.4(L)> A (K)) for a 
pair K CL with the same family of connected 
components. Note that in no way is the topol- 
ogy of @[K] localizable, or equivalently, A(Q) 
does not admit a reasonable topology. 


Z. Embedding of Distributions 


As the dual of the natural mapping ./(K)—> 
&(K) we have the topological embedding 
&'(K)o.A(K)' = BK]. This embedding con- 
serves the support and hence gives rise to an 
embedding of sheaf 2' c8 (R. Harvey). 

For a distribution T with compact support 
a set of its defining functions as a hyperfunc- 
tion is given by F,(z)= T,(W(z—x,T,)), o being 
the multisignature, where W(z, Ip) = [ge-tar, 
W(z,w)dw and W(z, w) is the component of a 
Radon decomposition of 6(x) (— Section CC). 
If supp Te K =[a,,b,] x... x [a,, ba], then 
as a hyperfunction it is represented by the 
following element of O(C" # K}: 


F(z)= 1 - : ) 

(27i) (x1 — 21) -< (Xn Zn) 
It is in fact in O((P')" # K) and vanishes at 
infinity, where P’ =C'U {œ} is the Riemann 
sphere. These formulas are valid also for hyper- 
functions, and they give defining functions of 
some canonical types. Especially, the one given 
by (19) is called the standard defining function 
and is characterized by the foregoing prop- 
erties. 

For the above-mentioned defining functions 


(19) 
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for a distribution T, we have the following 
convergence in the sense of distributions on 
R”: 


T, =) sgno lim Fi(x+iey,) for y,Er,. 
o è 
More generally if the limit 
È lim F(x+iey) for yer, (20) 


exists in 2'(Q), then the distribution defined as 
this limit admits {F;(z)} as a set of defining 
functions when it is considered as a hyper- 
function. However, for an arbitrary set of 
defining functions for a distribution, (20) does 
not necessarily converge in Z’(Q). (This is be- 
cause we can add terms with any bad behavior 
which cancel each other formally.) If a distri- 
bution admist the boundary value expression 
with only one term, or if the dimension is one, 
then the convergence of (20) in @’(Q) neces- 
sarily holds. 

The convergence of (20) in Z’(Q) is equiva- 
lent to the locally uniform estimate for the 
defining functions of the type F,(z)=O(|y|™) 
for some M > 0. These assertions can be gener- 
alized to ultradistributions. For 2; m, (Q) 
(resp. Zim (Q)) the last growth condition for 
the defining functions reads as follows: F,(z)= 
O(exp M*(L/ly|)) for some L (resp. every L>0), 
where M*(p)=sup, log(p? p!M,)/M,); espe- 
cially for M,=p!* we have M*(p)~ pS". 


AA. Hyperfunctions on a Real Analytic 
Manifold 


Sticking the hyperfunctions on coordinate 
patches by the transformation law mentioned 
in Section X, we can define the sheaf of hyper- 
functions on a real analytic manifold. More 
generally, for a real analytic vector bundle 
over a real analytic manifold, we can consider 
the sheaf of its hyperfunction local cross sec- 
tions, which is also flabby. Thus, especially 
on a real analytic manifold M, we can obtain 
a concrete flabby resolution of the constant 
sheaf C,, of length dim M by the sheaves of 
differential forms with hyperfunction coeffi- 
cients: 0>Cy BO 4B... + Bein™) 0, 
With this sequence we can calculate the rela- 
tive cohomology groups of open pairs with 
coefficients in C by an analytic method. This 
is an extension of the de Rham theory for 
distributions [16]. 

If M is acompact manifold equipped with a 
nowhere vanishing real analytic density glob- 
ally defined on M, then we have the topolog- 
ical duality @(M)=.s/(MY. The inner product 
is given by the definite integral with respect to 
the density. 
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The Fourier series is an example of hyper- 
functions on real analytic manifolds. The series 
XC, exp(2zipx) converges in #(T") and defines 
a periodic hyperfunction if and only if c, is of 
infra-exponential growth, i.e., c, = O(e"?!) for 
any ¢>0. T” has the global complex neighbor- 
hood (P’)", and f(x)e A(T") has the corre- 
sponding boundary value expression 


F,(2)€O({|z4 |" <1, ...,[2gl<1}), 


which represents the terms in the Fourier 
series such that a;p;>0. 


BB. Fourier Hyperfunctions 


In place of Y we take as the basic space the 
space Y, of exponentially decreasing real ana- 
lytic functions in the sense of M. Sato [15]: 
S(x)eY,, if and only if there exist 6 >0 and 
é>0 such that for any ð <ô and ¢é’ <e, f(x) 
extends holomorphically to the neighborhood 
{Imz| <ô} of the real axis and has order 

O(e *'R*#!) there. A, is endowed with the struc- 
ture of nuclear (DF)-space via the inductive 
limit for 6>0, ¢>0. The classical Fourier 
transform F acts isomorphically on &,. (In 
fact, ô and ¢ change their roles under F.) The 
strong dual of #2, is called the space of Fourier 
hyperfunctions and is denoted by 2. It is a 
nuclear Fréchet space. It contains S’ as a 
dense subspace in view of the continuous 

and dense inclusion Z, S F. It also contains 
classical locally integrable functions of infra- 
exponential growth, i.e., of order e**! for any 
é>0. Thus by the duality we obtain a wider 
extension of Fourier transformation on 2. 

In the following a 0-wedge of the form R”+ 
ilO will be called a 0-wedge of the form D” + 
ilO at the same time if it is a tubular domain 
(i.e., with fixed imaginary part 0). Then an 
element f(x)e2 can be expressed in the form 
(13), where each F,(z) is holomorphic in a 0- 
wedge D” + iI;0 and is of infra-exponential 
growth there locally uniformly in Imz. The 
inner product of such f(x) with pe, is given 
by the definite integral 


N 

f (x)p(x)dx = 3 F(z) p(z)dz, 

J= Imz=y, 
where the y,¢ 150 are fixed. Given a cone A we 
define its dual cone by A° = {ye R"| <, y) S0 
for all ye A}. If F(z) are all of exponential 
decrease in Rez locally uniformly with respect 
to ImzeI,0 and Rez/|Rez|¢A’, then the de- 
finite integral 


Gira)" | eH fodx 
N 
-Zum | 


j Imz=y; 


e "S F(z)dz (21) 
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converges locally uniformly for ¢ in some 0- 
wedge D” — iA0, and defines there a holo- 
morphic function of infra-exponential growth. 
Thus we obtain a Fourier hyperfunction 

G(č —i^0) that agrees with Ff calculated by 
the duality. For a general f(x)e 2, the Fourier 
transform in the manner of Sato is calculated 
as follows: First we decompose f(x) into the 
sum È f(x) for which the defining functions 
of f,(x) decrease exponentially outside A?. 
Then we calculate G,(¢) by (21) and put Zf = 
È G,(€—iA,0). An example of such decom- 
position is given by multiplication by 7,(x)= 
j=; 11 +expo;x;), which decreases expo- 
nentially outside I’, = {a,x,>0}. 

The relation between 2 and # is more com- 
plicated than the relation S’ c 2'(R"). The 
growth condition for 2 is interpreted as a 
condition concerning germs at infinity. Thus 2 
can be considered to be a sheaf on the direc- 
tional compactification D” =R” US"! such 
that 2|p.=Z. Just as the sheaf Z is obtained 
from ©, the sheaf 2 is obtained as the nth 
derived sheaf #%.(@) from the sheaf @ on 
D” +iR" consisting of germs of holomorphic 
functions of infra-exponential growth with 
respect to Rez. We have H4 (D" + iR", =0 
for k#n for any open set Qc D”. Especially, 2 
is flabby, and the decomposition of support is 
available to calculate the Fourier transform. 
The symbol 2 employed at the beginning to 
express the global Fourier hyperfunctions 
corresponds to 2(D"), and 2(R")= A(R”) by 
definition. The canonical restriction mapping 
2(D")— A(R") is surjective but not injective. 

As for tempered distributions s”, we can 
introduce various subclasses of Fourier hyper- 
functions, e.g., exponentially decreasing Fourier 
hyperfunctions | );..exp(—6,/x? +1) 2, real 
analytic functions of infra-exponential growth 
@(D"), etc. We can also consider operations 
such as convolution and multiplication be- 
tween adequate pairs, and apply differential 
operators with suitable coefficients. Concern- 
ing these we can avail ourselves of the same 
formulas as given in Section O. 

A hyperfunction with compact support ts 
naturally considered as a Fourier hyperfunc- 
tion, and its Fourier transform agrees with the 
inner product < f(x), (\/2) "exp(—ixé)), 
which gives an entire function of exponential 
type. 

Paley-Wiener theorem. An entire function 
f(E) is the Fourier transform of a hyperfunc- 
tion with support in a compact convex set K 
in R" if and only if it satisfies condition (1) of 
Section O. 

The theory of Fourier hyperfunctions de- 
scribed above is mainly due to Sato and T. 
Kawai [20]. They are not the largest class of 
generalized functions stable under the Fourier 
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transformation. Following a suggestion of 
Sato, S. Nagamachi and N. Mugibayashi, Y. 
Saburi, and Y. Ito have extended them to the 
modified Fourier hyperfunctions in which the 
radial compactification D?” of C” is employed 
instead of the horizontal compactification 
D"+iR" in the above theory. If we discard 
the localizing property, they can be extended 
further to the Fourier ultrahyperfunctions or 
ultradistributions of J. Sebastiad e Silva (1958), 
M. Hasumi (1961), and M. Morimoto (1973). 


CC. Micro-Analyticity of Hyperfunctions 


The boundary value expression (13) for a 
hyperfunction f(x) can be interpreted con- 
versely as the description of the state of ana- 
lytic continuation of f(x) to the complex do- 
main. Thus we say that f is micro-analytic at 
(Xo, čo) if on a neighborhood of x, it admits 
the analytic continuation into the half-space 
<Imz, čo <0 in the sense that it admits an ex- 
pression (13) satisfying I, {<Imz, čo <0} 4 
@ for every j. The set of points (Xo, &))EQ x 
S’!, where fe A(Q) is not micro-analytic, is 
called the singularity spectrum or singular 
spectrum of f and is denoted by S.S. f. We have 
by definition S.S.F(x + i0)cQ x (Fen s""?). 
Micro-analyticity can be characterized by the 
Fourier transformation as follows: f is micro- 
analytic at (Xo, čo) if and only if there exists 

a Fourier hyperfunction g(x) such that the 
Fourier transform ĝ(č) decreases exponentially 
on a fconical neighborhood of čo and that 

f—g is real analytic on a neighborhood of 

Xo- A hyperfunction f(x) is real analytic in a 
neighborhood of x, if and only if it is micro- 
analytic at (xo, č) for any če S"! 

With this notion we can clarify the oper- 
ations on hyperfunctions. (In the following 
S""! is identified with (R”\{0})/R* and + 
denotes the sum in the latter.) We have 


S.S.( fg) < {(x, 24E +0 —A)n) (x, HES.S.F, 
(x, neS.S.g,0 <4 <1} US.S.fUS.S.g, 
S.S.f(®(X)) < {(3, DOE) | (O(X), éE S.S.S }, 


and these operations are legitimate if and only 
if 0 does not appear in the direction compo- 
nent of the right-hand side. We also have 


S.S. | fix, Ddx f(x, E(x, t, & OeSS.f fora t}, 


SS.(feghe{xty, OO, NESS Ff (y, eSS.g} 


under suitable conditions for support. 
The classical decomposition formula of 
Radon (or the plane wave decompostion of ô) 


2 (n—1)! dw 
oN) ~ (—2ni)" Ike (xæ +10)" eA 
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can be interpreted as the decomposition of 

6 into hyperfunctions with S.S. in the single 
direction œ. By the convolution, this formula 
supplies similar decomposition for a general 
hyperfunction (called the singular spectrum 
decomposition). (22) can be generalized to 


aa det(grad,, V(x, œ)) 
s”! 


000) = ni (D(x, o) +10)" 


where the twisted phase D(x, œ) satisfies (i) 
M(x, @) 1s a real analytic function of positive 
type (Le., Re D(x, @)=0 implies Im ®(x, w) > 0) 
and (ii) P(x, w) is homogeneous of order 1 in œw 
and ®(0, w)=0, grad, O(0, w) =a; and the 
vector ‘¥(x,q@) 1s such that <Y¥ (x, œ), x> = 

(x, w). If B(x, œw) further satisfies O(x, œ) #0 
for x £0, then the component becomes a 
hyperfunction (even a distribution) of x whose 
S.S. is precisely one point (0, w), and this fact is 
useful in theoretical applications [7,21] (— 274 
Microlocal Analysis). 


DD. Structure Theorems of Hyperfunctions 


A hyperfunction whose support is concen- 
trated at the origin is expressed as the infi- 
nite derivative J(D)d(x) = a,D?0(x) of the 
Dirac measure, with the coefficients satisfying 
lim(|a,|p!)’/”=0. Such an operator J(D) is 
called a local operator with constant coeffi- 
cients and acts on 2 as a sheaf homomor- 
phism. Its total symbol J({) ts an entire func- 
tion of infra-exponential growth or of type 
[1,0]. By way of such an operator, a general 
(Fourier) hyperfunction can be written in the 
form J(D)g(x) with a continuous function on 
R” (of infra-exponential growth). 

If Oesupp fc {<v, x> >0}, then SS f3(0, +v) 
(Holmgren type theorem of Kashiwara and 
Kawai), and furthermore, the direction compo- 
nent of S.S.f at 0 has the form of {+v}Up7'(E) 
with some EcS" ?, where p:S""'\{+v}-> 
S"-? is the projection to the equator (the 
watermelon-slicing theorem of Morimoto, 
Kashiwara, and K. Kataoka). E is called the 
reduced S.S. of f at 0. These theorems have 
many applications in the theory of linear par- 
tial differential equations and also in physics. 

A hyperfunction f(x) with support in the 
hyperplane x, =0 has several further remark- 
able properties in x,. It admits a formal ex- 
pansion of the form Eo f(x) (x), where 
M0 sc Xa) and F.0)—) 59 Oda, /k dey: 
The sum converges in the sense of the topol- 
ogy if f has compact support. It reduces lo- 
cally to a finite sum if f is a distribution, and 
the kth term represents the k-ple layer distri- 
bution of mass, electric charge, etc. Fora 
general f the coefficients { f,(x’)} do not neces- 


dw, (23) 
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sarily determine f, though they are determined 
by J. 


EE. Complex Powers of Polynomials 


Among examples of special generalized func- 
tions the most important are those of the form 
Jê, where f, =max{ f(x), 0} (or more generally 
it can be replaced by zero on some connected 
components of { f(x)>0}), and 2eC denotes a 
holomorphic parameter. (The discussion is the 
same for f_ =max{—f(x),0}.) The simplest 
example, x2, is defined as the analytic con- 
tinuation of the locally integrable function x4 
for ReA> —1 by repeated use of the formula 
x} =(A+ 1) 'D,x4*!, and becomes meromor- 
phic in 4 with simple poles at A= —1, —2,.... 
As a hyperfunction we have x7 = {(—x + i0} — 
(—x—i0)*}/2isin rå. At a negative integer À = 
—n, x4 has residue (—1)"7'6""(x)/(n— 1)! 
and finite part [— (27i) tz ”log(—2)]. The 
latter is often denoted by x". In general, for 

a germ of a real-valued real analytic func- 

tion f(x) we can find a differential operator 
P(A, x, D,) with polynomial coefficients in å 
and a monic polynomial b(å) of minimum de- 
gree such that 


P(A, x, D) S$ =b) SP (24) 


(Sato, I. N. Bernshtein, Kashiwara, J.-E. Bjork 
[22]). This formula gives the analytic con- 
tinuation of f? just as for x. The polyno- 
mial b(A) is called the b-function or the Sato- 
Bernshtein polynomial and contains valuable 
information regarding the singularity of f. It 
has only negative rational roots (Kashiwara). 

We have f- f?=f?*'; hence f7! — f=! (suit- 
ably interpreted as above) gives a solution of 
the division problem u: f= 1. Thus if f is a 
polynomial, its inverse Fourier transform gives 
a tempered fundamental solution of f(—iD). 
Furthermore, when f is the relative invariant 
of a tprehomogeneous vector space, we can 
calculate b(A) explicitly by way of the holo- 
nomy diagram. Also, the Fourier transform of 
f? can be calculated explicitly by way of the 
real holonomy diagram as a linear combina- 
tion of the corresponding objects for the dual 
prehomogeneous vector space with coefficients 
similar to the Maslov index. The simplest 
example is 


F xt =( 2r) (Fije RT (A+ (EFI)! 
=(,/22) (ENP A+) fetes 
a emai), 
Among practical examples are the following 
classical formulas: Let P(x) be a nondegener- 


ate real quadratic form and Q(€) its tdual 
form, and let q denote the number of negative 
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eigenvalues of P(x). Then 
FP} = —( 2r) "22a + 1A + n/2) 
x |det P|"? {sin n(å+q/)Q0 72 "7 
~sinn((n —q)/2)Q 47"2}, 


Here the arguments in the T -factor (A+ 1)(4 + 
n/2) give the b-function of P(x). If g=n—1, 
we further have, letting P% = P21(+ <x, vy) 
for an eigenvector v corresponding to the 
unique positive eigenvalue, 


F Ph, = (27) "2224+ gn- A41) 
x P(A +n/2)|det P|! {eF"iA+n2gz}=m2 
FERRANDO ALT ak, 


From these formulas (taking the finite part if 
necessary) we obtain the fundamental solution 
of the wave equation, the Laplacian, and their 
iterations. These are exactly the distributions 
introduced by Hadamard, M. Riesz, and 
others, as mentioned in Section A (— Appen- 
dix A, Table 15.V). 
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A. History 


The theory of dynamical systems began with 
the investigation of the motion of planets in 
ancient astronomy. Qualitative investigation 
of mechanics in antiquity and the Middle Ages 
culminated tn the work of J. Kepler and G. 
Galilei in 17th century. At the end of that 
century, I. Newton founded his celebrated 
Newtonian mechanics, by means of which 
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Kepler’s law on the motion of planets and 
Galileo’s observations of movement can be 
explained theoretically. Following this, L. 
Euler, J. L. Lagrange, P. S. Laplace, W. R. 
Hamilton, C. G. J. Jacobi, and others devel- 
oped the theory using analytical methods, and 
founded analytical dynamics. From the end of 
the 18th century through the 19th century, the 
tthree-body problem attracted the attention of 
many mathematicians. At the end of the 19th 
century, H. Bruns and H. Poincaré found that 
general solutions of the three-body problem 
could not be obtained by tquadrature, and this 
gave rise to a crisis of analytical dynamics. But 
this was resolved by Poincaré himself. He 
pointed out the importance of the qualitative 
theory based on topological methods, and 
obtained many fundamental results. A. M. 
Lyapunov with his theory of stability and 

G. D. Birkhoff with his many important con- 
cepts of topological dynamics established 
foundations of the new qualitative theory. 

In 1937 A. A. Andronov and L. S. Pontryagin 
introduced the concept of structural stability, 
which attracted the attention of S. Lefschetz. 
Lefschetz’s school investigated structural sta- 
bility and tnonlinear oscillations, and obtained 
many important results (H. F. de Baggis, L. 
Markus, M. M. Peixoto, and others). In about 
1960, S. Smale initiated study of differentiable 
dynamical systems under the influence of Lef- 
schetz’s school. Smale and his school founded 
a new theory of differentiable dynamical 
systems using ‘differential topology. D. V. 
Anosov generalized the work of E. Hopf and 
G. A. Hedlund on tgeodesic flows of closed 
surfaces of tconstant negative curvature and 
established the concept of Anosov systems, 
which played an important role in Smale’s 
theory. The work of Hopf, Hedlund, and 
Anosov is closely related to tergodic theory. 
Ya. G. Sinai and R. Bowen obtained impor- 
tant results in ergodic theory. The concept of 
structural stability and its generalization are 
essential in the tcatastrophe theory of R. Thom 
(— 51 Catastrophe Theory); the theory of bi- 
furcation of dynamical systems is another 
essential part of catastrophe theory. D. Ruelle 
and F. Takens proposed a new mathematical 
mechanism for the generation of turbulence 
using Smale’s theory and Hopf bifurcation. 
The new theory of dynamical systems devel- 
oped by Smale and others is now applied to 
the mathematical explanation of chaotic phe- 
nomena in many branches of science. Finally, 
we mention that in the 1960s A. N. Kolmo- 
gorov, V. I. Arnold, and J. Moser obtained 
remarkable results on the existence of quasi- 
periodic solutions for the n-body problem, 
which turned out to solve the long-standing 
problem of the stability of the solar system. 
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B. Definitions of Dynamical Systems 


In the study of the evolution of physical, bio- 
logical, and other systems, we construct math- 
ematical models of the systems. Usually, the 
state of a given system is completely described 
by a collection of continuous parameters, 
which may be related in some cases. Thus the 
space X of all possible states of the system can 
be regarded as a Euclidean space or a subset 
of a Euclidean space defined by some equa- 
tions. In general, we assume that the space X 
of all possible states of the system forms a 
ttopological space, and we call it a state space 
or a phase space. Second, we assume that the 
law of evolution of states in time is given, by 
which we can tell the state x, at any time r, if 
we know the state x, at time tọ. Assigning x, 
to Xo, we have a mapping z(t,,t)):X >X for 
any times tọ and t,, which satisfies the follow- 
ing conditions: (i) (tz, t;)O2(t,, tp) =2(ty, to); 
(ii) 2(tg, to) = 1x, the identity mapping of X. 
Finally, we assume that the mapping z(t,, to) 
depends only on t =t; — to. Writing z,= 
m(t,,to) if t=t, —to, we have the following 
conditions from (i) and (it) above: (i’) z,0 2,= 
T+ S, CER; (i) To =1y,. 

In general, the theory of topological dynam- 
ics can be regarded as the study of topological 
transformation groups (— 431 Transformation 
Groups) originating in the topological investi- 
gations of problems arising from classical 
mechanics. Here, we restrict our attention to 
some important special cases. 

(1) Let X be a topological space and R the 
additive topological group of real numbers. 
Let go: X x R-+X be a continuous mapping. 
For each teR, we define a mapping ọ,: X =X 
by (x)= (x, t), xe X. If the family of map- 
pings {P,} er satisfies the following conditions, 
we say that (X, ọ) is a (continuous) R-action, a 
(continuous) flow, or a (continuous) dynamical 
system on X, and that X is the phase space: 

(i) 9,0 @,=@,4, for all s, teR; (ii) py = 1x- 

Let (X, pọ) be a flow. Then o,:X > X isa 
thomeomorphism with (p)! =, for each 
teR. For each xe X, define a mapping y*:R~— 
X by o*(t)= (x, t), teR. The mapping ¢”" is 
called a motion through x, and its image C(x) 
= {*(t)|teR} is called the orbit or the trajec- 
tory through x. An orbit is nonempty, and two 
orbits are either identical or mutually disjoint. 
The family of orbits fills up the phase space X 
and is called the phase portrait. 

Let R, be the additive tsemigroup of all 
nonnegative real numbers. If we replace R by 
R., in the definition of a (continuous) flow, we 
obtain a definition of a (continuous) semiflow. 
For a semiflow (X, p), the mapping ¢,:X >X, 
teR, is in general not a homeomorphism but 
a continuous mapping. 


488 


Let (X, o) and (Y, y) be flows. A homeo- 
morphism h: X >Y is called a topological 
equivalence from (X, p) to (Y, y) if it maps 
each orbit of ọ onto an orbit of y preserving 
orientations of orbits (i.e., there exists an in- 
creasing homeomorphism a,:R->R for each 
xe X such that he(x, t)=(h(x), «,(t)) for all 
teR). Two flows are topologically equivalent if 
there exists a topological equivalence from one 
to the other. If two flows are topologically 
equivalent, their phase portraits have the same 
topological structure. Two flows (X, @) and 
(Y, y) are flow equivalent if there exist a c>0 
and a homeomorphism h: X > Y such that 
ho(x, t)=w (h(x), ct) for all teR. Such an h is a 
topological equivalence from (X, o) to (Y, y). 

(2) Let X be a topological space and Z the 
additive group of integers. If we replace R by 
Z in the definition of a continuous flow, we 
obtain a definition of a (continuous) Z-action, a 
discrete flow, or a discrete dynamical system on 
X. If (X, ọ) is a discrete flow, then f=@,:X > 
X is a homeomorphism and „= f” for all 
neZ. Conversely, for a given homeomorphism 
f:X—X, define a mapping p: X x ZX by 
p(x,n)=f"(x), xe X and neZ. Then (X, ¢) is 
a discrete flow such that g, =f" for neZ. So 
we identify a homeomorphism with a discrete 
flow. Thus the orbit of a homeomorphism 
f:X-+X through x is C(x)={ f"(x)|neZ}. 

Let Z , be the additive semigroup of all 
nonnegative integers. If we replace R by Z, in 
the definition of a continuous flow, we obtain 
a definition of a discrete semiflow on X. For a 
discrete semiflow (X, o), the mapping 9: X > 
X,neZ,, is in genéral not a homeomorphism 
but a continuous mapping. We can identify 
a continuous mapping f: X >X with a discrete 
semiflow (X, @) in a natural way as above. 

Let f:X +X and g: YY be two homeo- 
morphisms (continuous mappings). A homeo- 
morphism h: XY such that ho f=goh is 
called a topological conjugacy from f to g. 

And f and g are called topologically conjugate 
if there exists a topological conjugacy from 

f to g. Topologically conjugate homeomor- 
phisms have the same phase portrait in a topo- 
logical sense. 

(3) Let M be a tdifferentiable manifold of 
class C (1<r<o or r=). A continuous flow 
(M, ¢) is a flow of class C’, a C’-flow, a differ- 
entiable dynamical system of class C’, or a 
one-parameter group of transformations of 
class C’, if p: M x RM is of class C". A semi- 
flow of class C” is defined similarly. 

Let (M, o) and (N, y) be C’-flows. A topo- 
logical equivalence h: M >N from (M, ¢) to 
(N,w) ts called a C’-equivalence if it is a tC- 
diffeomorphism. Two flows are C’-equivalent if 
there is a C’-equivalence from one to the other. 
Classification of C’-flows by C’-equivalence is 
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dificult and sometimes too unwieldy to work 
with. On the other hand, there are many prob- 
lems which can be solved by the knowledge 

of the topological structure of the phase por- 
trait of C’-flows. 

(4) Let (M, n) be a discrete flow on a dif- 
ferentiable manifold M of class C”. If x: M x 
Z—M is of class C”, then (M, z) is called a Z- 
action of class C’, a discrete flow of class C’, a 
discrete C’-flow, or a discrete dynamical system 
of class C”. If (M, z) is a discrete C’-flow, then 
f=1,:M->M is a C’-diffeomorphism. Con- 
versely, a C’-diffeomorphism f: M— M defines 
a discrete C’-flow in a natural way on M, and 
we identify a C’-diffeomorphism f: M >M 
with the discrete C’-flow on M defined by f. 

A discrete semiflow of class C” is defined simi- 
larly, and a tC’-mapping f: M—M is identi- 
fied with a discrete semiflow of class C’ on M 
defined by f. 

Let f:M—>M and g: N >N be C- 
diffeomorphisms (C’-mappings) of differ- 
entiable manifolds M and N of class C". A 
topological conjugacy from f to g is called a 
C’-conjugacy if it is a C’-diffeomorphism. f and 
g are C’-conjugate if there is a C’-conjugacy 
from f tog. 


C. Examples and Remarks 


(1) Let D be an open set of R” and f:D>R”" a 
continuous mapping. Consider the tautono- 
mous system of ordinary differential equations 


dxjdt= f(x)  xeD. (1) 


We assume that for each xe D there exists a 
unique tnonextendable solution (x, t) with 
the initial condition g(x, 0)=x defined on a 
maximal interval (a,,b,), -œ <a, <0 <b, < © 
(— 316 Ordinary Differential Equations (Ini- 
tial Value Problems)). The set {@(x, t)|a,<t< 
b,} is called the trajectory through x. By the 
uniqueness assumption, we have @(@(x, t), s)= 
(x, t +s) whenever both sides of the equality 
are defined. When (a,, b,)=R for all xe D, 
then equation (1) is called complete. If (1) is 
complete, the mapping gy: D x R>D defined 
by the solutions @(x, t) determines a continu- 
ous flow (D, o). Furthermore, if f is of class 
C, then (D, ọ) is of class C”. If (1) is not com- 
plete, then there exists a continuous positive 
scalar function «:D-R such that 


dx/dt = a(x) fo) 


is complete. The trajectories of (1) and (2) 
through x coincide for all xe D, and thus the 
phase portraits of (1) and (2) are the same. 

(2) Let M be a differentiable manifold of 
class C”. A tvector field of class C” on M 
(1<r< oo) gives rise to an autonomous system 


xéeD, (2) 
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of ordinary differential equations in a coordi- 
nate neighborhood of each point of M, and it 
generates a tlocal one-parameter group of 
local transformations of class C’. If M is tcom- 
pact, this local one-parameter group of local 
transformations is uniquely extended to a one- 
parameter group of transformations of class 
C (— 105 Differentiable Manifolds). Thus 

a vector field of class C” on M generates a 
unique C’-flow on M if M is compact. There- 
fore, sometimes we identify a vector field of 
class C” with the C’-flow generated by it. 

(3) Let (M, p) be a C’-flow. Then g,:M—> 
M is a C’-diffeomorphism, which we call the 
time-one mapping (time-one map) of (M, 9). 
Thus every C’-flow (M, o) induces a C’- 
diffeomorphism as a time-one mapping. But 
the set of C!-diffeomorphisms that are time- 
one mappings of C!-flows is of the first cate- 
gory in the space of all C!-diffeomorphisms 
with C! topology (J. Palis). Thus most C!- 
diffeomorphisms are not expressed as time-one 
mappings of C}-flows. 

(4) Let M be a compact differentiable mani- 
fold of class C” with dimension m and (M,¢) a 
C’-flow (1 <r < œ). An (m— 1)-dimensional 
closed submanifold 2 of M is called a cross 
section of (M, ¢) if the following conditions are 
satisfied: (i) For any xe X, there exist t; >0 
and t,<0 such that 9, (x), 9,,(x)e2; (ti) Every 
orbit intersects X ttransversally whenever it 
meets X. Let Z be a cross-section of (M, p) and 
xe. Let t, be the least positive number with 
P, (x)E X. Such a t, exists for every xe X, and 
f: Xk defined by f(x)= Q, (x), xe X is a C- 
diffeomorphism. We call this diffeomorphism f 
the first-return mapping (first-return map) or 
the Poincare mapping (Poincaré map) for X. 

(5) Let N be a compact differentiable 
manifold of class C” and f: N>N a C’- 
diffeomorphism (1 <r < œ). Define an equiva- 
lence relation ~ on N x R generated by (x,t + 
1)~(f(x), £) for xe X, teR. Then the quo- 
tient space N(f)=N x R/~ has a natural 
tdifferentiable structure of class C”, and a C’- 
flow (N(f), W) is defined by (Lx, t], s)= [x,t + 
s] for xe N, t, seR, where [x,t]e N(f) is the 
equivalence class of (x, t). The flow (N( f), y) 
thus obtained is called the suspension of f. The 
suspension (N(f), y) has a cross section X = 
{Lx,0]|xeN}, and the Poincaré mapping 
for X is C’-conjugate to f. Conversely, if (M, o) 
has a cross section X and the Poincaré map- 
ping for J is f:—2, then the suspension 
(2(/), w) is C’-equivalent to (M, ¢). 

(6) Let U be an open set of R” and f: U >R" 
a continuous mapping. Consider the tdiffer- 


ence equation 
Xm+1 =f (Xm)s XmE U. (3) 


For each xe U, let @(x,m) be the solution of (3) 
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with (x, 0)=x. If f(U)c U, then @(x, m) exists 
for all meZ, and xeU, and ọ defines a dis- 
crete semiflow on U. If f: UU is a homeo- 
morphism, then (x, m) exists for all me Z 

and xe U, and ọ defines a discrete flow on U 
(— 104 Difference Equations). 


D. Basic Concepts 


For simplicity we assume that the phase 
spaces of dynamical systems are metric spaces, 
and we denote their metrics by d. 

(1) Let (X, p) be a continuous flow ona 
metric space X and xe X. A subset A of X is 
called invariant if @,(A)c A for all teR. A 
subset of X is invariant if and only if itis a 
union of orbits. A subset A of X is positively 
(resp. negatively) invariant if ~,(A)< A for all 
t>0 (resp. t<0). The subset C, (x)= {*(t)|t 2 
0} (resp. C_(x)= {p*(t)|t <0}) is called the 
positive (resp. negative) semiorbit or the posi- 
tive (resp. negative) half-trajectory starting 
fromex. A subset is positively (resp. negatively) 
invariant if and only if it is a union of positive 
(resp. negative) semiorbits. A subset is invari- 
ant if and only if it is both positively and nega- 
tively invariant. 

The union and the intersection of invariant 
sets are invariant. If A is invariant, then its 
closure A, its interior Int A, its boundary 0A, 
and its complement A‘ = X — A are invariant. 
If A is invariant, then @(A x R)c A and the 
restriction mapping ~|A x R defines a continu- 
ous flow (A, g| A x R) on A. The flow thus 
obtained is called the restriction of (X, o) on A. 

(2) A point xe X is a singular point, an equi- 
librium point, a critical point, a rest point, or 
a fixed point if C(x)= {x} (— 290 Nonlinear 
Oscillation). A point is regular or nonsingular if 
it is not a singular point. The set of all singular 
points is a closed invariant set, and the set of 
all nonsingular points is an open invariant 
set. If A is a positively invariant set which is 
homeomorphic to the closed unit ball in R”, 
then there exists a singular point in A (tBrou- 
wer fixed-point theorem). 

A point xe X is periodic if there exists a 
T #0 such that 


o(x, t+ T)= (x,t) (4) 


holds for all teR. If x is periodic, the motion 
ọ* and the orbit C(x) are said to be periodic. A 
point x is periodic if and only if there exists a 
T 40 with g(x, T)=x. A singular point is 
periodic. For a periodic point x, a number T 
satisfying (4) is called a period of x. If x is 
nonsingular and periodic, then there exists 

a smallest positive period T of x, and any 
period is an integral multiple of Tọ. An orbit of 
a nonsingular periodic point is called a closed 
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orbit. If C is a closed orbit, then all points of C 
are nonsingular periodic points, and their 
smallest positive periods coincide. A closed 
orbit is compact. 

(3) Let x be a point of X. A point ye X is 
called an w-limit (resp. a-limit) point or a posi- 
tive (resp. negative) limit point of x if there 
exists a sequence {t,} of real numbers such 
that (1) t, > 00 (resp. t,—> —00) as n> oo and 
(ii) p(x, t,)>y as n> 0. The set of all w-limit 
(resp. x-limit) points of x is denoted by w(x) 
(resp. «(x)) and is called the w-limit (resp. g- 
limit) set of x. For each xe X, w(x) and «(x) 
are closed invariant sets, and the following 
equalities hold: C, (x)= C, (x)U a(x), C_(x)= 
C_(x)Ua(x), and C(x) = C(x) U a(x) U w(x). 

If x is a periodic point, C(x) = C(x)=a(x)= 
(x). If C is a closed orbit, then C = C = C(x) = 
a(x) = (x) for all xe C. If A is a compact in- 
variant set and xe A, then «(x) and w(x) are 
nonempty. 

Assume that X is tlocally compact and 
xe X. Then w(x) is connected if it is compact, 
and none of the connected components of w(x) 
is compact if a(x) is not compact. 

(4) Let x be a point of X. Let J, (x) (resp. 
J_(x)) be the set of all points y satisfying the 
following condition: There exist a sequence 
{ta} of numbers and a sequence {x,, of points 
in X such that (i) t, œ (resp. t, > —o) as n> 
oo, (ii) x, xX as n> oo, and (iii) ~(x,,t,)>y 
as n> œ. The set J, (x) (resp. J_(x)) is a closed 
invariant set containing (x) (resp. «(x)), called 
the first positive (resp. negative) prolongational 
limit set of x. If X is locally compact, then 
J,(x) is connected if it is compact, and none of 
the connected components of J, (x) is compact 
when J,(x) is not compact. 

Notions of higher prolongations have been 
defined and investigated by T. Ura, J. Aus- 
lander, and P. Seibert. 

(5) For a discrete flow, we can similarly 
define basic notions such as an invariant set, 
fixed point, periodic point, and so on defined 
in Sections D(1)—D(4) by replacing R by Z. 
The propositions and theorems stated above 
hold for discrete flows, except those concern- 
ing connectedness. 


E. Recursive Concepts and Dispersive Concepts 


(1) Let (X, ọ) be a flow on a metric space X. 
Let xe X be a point such that there exist a 
neighborhood U of x and a T >0 satisfying the 
condition UN g,(U) = @ for t> T. Then x is 
called wandering. The set of all wandering 
points is an open invariant set. A point is 
nonwandering if it is not wandering. The set Q 
of all nonwandering points is a closed invar- 
iant set and is called the nonwandering set. 
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The following conditions are equivalent: (i) x 
is nonwandering, (ii) xe J, (x), (iti) xe J_(x). 
The nonwandering set Q contains all singular 
points, closed orbits, and (x) and a(x) for all 
xeXx, 

(2) A point xe X is positively (resp. nega- 
tively) Poisson stable if xe w(x) (resp. x €a(x)). 
It is Poisson stable if it is both positively and 
negatively Poisson stable. A positively Poisson 
stable point and a negatively Poisson stable 
point are nonwandering. The following con- 
ditions are equivalent: (i) x is positively Pois- 
son stable, (ii) C, (x) = (x), (iii) C(x) w(x), (iv) 
for any neighborhood U of x and T’>0, there 
exists a t> T with @(x, t)e U. If X is tcomplete 
and xe X is positively Poisson stable but not 
periodic, then œ(x)— C(x) is tdense in a(x). 
This implies that if X is complete, then C(x) is 
periodic if and only if C(x) = a(x). 

(3) If all the points of the phase space X are 
wandering, then (X, o) is called completely 
unstable. If all the points of X are nonwander- 
ing, then (X, ¢) is called regionally recurrent. If 
A is an invariant set such that the restriction 
of (X,@) on A is regionally recurrent, then 
(X, @) is said to be regionally recurrent on A. If 
(X, @) 1s regionally recurrent and X is locally 
compact, then the set of all Poisson stable 
points is dense in X. 

For a given flow (X, o), we obtain a se- 
quence of invariant sets {Q,,} and a sequence 
of restriction flows {(Q,, @,)} such that (i) 

(99, Po) =(X, p); (i) Q,,,, is the nonwandering 
set of (Q,, ¢,), n ÈO; and (iii) (QO,,,,,@,4,) is the 
restriction of (X, p) on Q,,,. Then X =Q,)>Q, 
3...99,>.... Put Q,=()},Q,. Then Q, is an 
invariant set of (X, @), and we denote the re- 
striction of (X, o) on Q, by (Qu, Pa). Starting 
from (Q,,, Po), we obtain similarly a sequence 
of invariant sets {Q,,,,} and a sequence of 
flows {(Q,445 Po+n)}- If we obtain an ordinal 
number y such that Q,=Q,,,#@ by continu- 
ing this process, then we call Q, the set of 
central motions. In this case, the flow (Q,, ¢,) is 
regionally recurrent, and every invariant sub- 
set of X on which (X, ¢) is regionally recurrent 
is contained in Q,. When X is tseparable and 
Q, is compact and nonempty, then the mini- 
mum of such ordinal y is an fordinal of at 
most the second number class. 

(4) Let & be the set of all xe X satisfying the 
following condition: For each e, T>0, there 
exist a sequence {x)= x,X,,...,X,=X} of 
points in X and a sequence {to,t,,...,t,-,} of 
numbers such that t;> T and d(@,,(x;,), Xj41)<é 
for 0<i<k—1. The set & is a closed invar- 
iant set containing the nonwandering set Q 
and is called the chain recurrent set. If X = 2, 
then (X, p) is called chain recurrent. The re- 
striction (2, |Z x R) of (X, o) on & is chain 
recurrent (C. C. Conley). 


126 E 
Dynamical Systems 


(5) A nonempty closed invariant set is called 
a minimal set if none of its nonempty proper 
closed subsets is invariant. For a nonempty 
compact subset A of X, the following con- 
ditions are equivalent: (i) A is minimal, (ii) C(x) 
= A for all xe A, (iii) C, (x)= A for all xe A, 
(iv) C_(x)=A for all xe A, (v) w(x)=A for all 
xe A, (vi) «(x)= A for all xe A. A point xe X is 
positively (resp. negatively) Lagrange stable if 
C, (x) (resp. C_(x)) is compact. If C(x) is com- 
pact, then x is called Lagrange stable. Every 
nonempty compact invariant set contains a 
compact minimal set. If xe X is positively 
(resp. negatively) Lagrange stable, then a(x) 
(resp. «(x)) contains a compact minimal set. 

A closed invariant set A is called a saddle set 
if there exists a neighborhood U of A such that 
every neighborhood of A contains a point x 
such that C(x) U and C_(x)¢U. Otherwise 
A is called a nonsaddle set. For a point x of X, 
let E(x) be the subset of X consisting of the 
points y satisfying the following condition: 
There exist a sequence {x,} of points in X and 
two sequences {t,}, {s,} of numbers such that 
(i) x „>x, t > ©, S> —00 as n>% and (ii) 
(Xps ta) Y, P(X, 5,) —> y as n> æ. For a subset 
B of X, put E(B)=( )xes E(x). Let {S,} be the 
family of all saddle minimal sets and {F,} the 
family of all nonsaddle minimal sets. If the 
phase space X is compact, then the nonwan- 
dering set Q=((), FAU) E(S.)) =U) Fp) U 
E(().S,) (T. Saito). 

(6) A point xe X is said to be recurrent if for 
any e>0 there exists a T >0 such that the tz- 
neighborhood U of g*([t,t+ T]) contains C(x) 
for all teR. If x is recurrent, the motion * 
and the orbit C(x) are said to be recurrent. 
Every orbit of a compact minimal set is recur- 
rent, and if the phase space is complete, then 
the closure of a recurrent orbit is a compact 
minimal set (Birkhoff). 

A set D of real numbers is called relatively 
dense if there exists a T>0 such that DN 
(t,t+T)# Ø for all teR. Assume that xe X is 
Lagrange stable, then x is recurrent if and only 
if for every ¢>0 the set {t|d(x, p(x, t))<e} is 
relatively dense. 

(7) A flow (X, ọ) is dispersive if for any x, 
yeX there exist neighborhoods U, of x and U, 
of y and a T>0 such that U, N p (U, = Ø for 
all t > T. The following conditions are equiva- 
lent: (i) (X, ọ) is dispersive; (ii) For any x, ye X, 
there exist neighborhoods U, of x and U, of y 
and a T>0 such that U.N (U) =Ø if |t|> T; 
(itt) J (x)= Ø for all xe X. 

A flow (X, ọ) is parallelizable if there exist 
a subset S of X and a homeomorphism h: 

X >S x R such that (i) g(S x R)=X and (ii) 
ho ~(x, t)=(x, t) for all xe S and teR. A flow 
(X, @) 1s parallelizable if and only if there is a 
subset S of X satisfying the following con- 
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ditions: (i) For each xe X there exists a unique 
t(x)ER with (x, t(x))ES; (ii) t:X >R is con- 
tinuous. A parallelizable flow is dispersive. A 
flow on a locally compact separable metric 
space is parallelizable if and only if it is disper- 
sive (V. V. Nemytskii, V. V. Stepanov). 

An open set U of X is called a tube if there 
exist a T>0 and a subset S of U satisfying the 
following conditions: (i) @(S x (— T, T)) =U, (ii) 
For each xe U there is a unique t(x)eR with 
|t(x)|< T such that p(x, t(x))e S. The set S in 
the above definition is called a local section. If 
xe X is a regular point, then there exists a tube 
containing x (M. Bebutov, H. Whitney). The 
notion of a local section is a generalization 
of Poincare’s “surface sans contact” [1] or 
Birkhoff’s “surface of section” [4], and it is 
related to the notion of the cross section. 

(8) For discrete flows (homeomorphisms), 
basic notions, such as a nonwandering set, 
Poisson stability, regional recurrence, central 
motion, and minimal set, are defined similarly, 
and many of the propositions and theorems in 
Sections E(1)—E(5) hold similarly for discrete 
flows (homeomorphisms). 


F. Stability 


(1) Let (X, o) be a continuous flow on a metric 
space X. A point xe X is called orbitally stable 
if for any ¢>0 there exists a ô >0 such that 
C,(y) is contained in the e-neighborhood of 
C(x) for all y with d(x, y)<6 (— 394 Sta- 
bility). A nonempty set A is called stable if 
every neighborhood of A contains a positively 
invariant neighborhood of A. If A is compact 
(in particular, if A is a periodic orbit), then 
orbital stability and stability for A are equiva- 
lent. A nonempty set A is called asymptotically 
stable if A is stable and there exists a neighbor- 
hood V of A such that w(x)c A for any xeV. 
If A is stable and w(x) cA for all xe X, then A 
is called globally asymptotically stable. A point 
xe X is Lyapunov stable if for any e>0 there 
exists a ô >Q such that d(¢,(x), 9,(y))<e for all 
t20 and y with d(x, y)<6. Lyapunov stability 
implies orbital stability. A point x is uniformly 
Lyapunov stable if for any ¢>0 there exists a ô 
>Q such that for ze C(x) and y with d(y, z)<6 
we have d(9,(z), 9,(y)) <e for all t>0. Uniform 
Lyapunov stability implies Lyapunov stability. 
For a singular point, the notions of uniform 
Lyapunov stability, Lyapunov stability, orbital 
stability, and stability are equivalent. Assume 
that the phase space X is locally compact and 
A is a nonempty compact subset of X. Then A 
is asymptotically stable if and only if there 
exist a neighborhood N of A and a continuous 
real-valued function L on N such that (i) L(x) 
=0 if xe A and L(x)>0 if x€ A; (ii) L(@(x, t)) 
< L(x) if x¢A, t>0, and p({x} x [0,t]) eN. 


492 


Such a function L is called a Lyapunov func- 
tion for A (— 394 Stability). Assume that X is 
locally compact and A is nonempty, stable, 
and invariant. Then A is asymptotically stable 
if and only if there exists a neighborhood U 
of A such that any invariant set in U is neces- 
sarily contained in A (Ura). Let A be a non- 
empty closed invariant set. A is called an at- 
tractor if it has an open neighborhood U satis- 
fying the following conditions: (i) U is posi- 
tively invariant; (ii) For each open neighbor- 
hood V of A, there is a T>0 such that o,(U)c 
V for all t> T. Condition (ii) implies that 
(\:209(U)=A and w(x) A for all xe U. 
Thus an attractor is asymptotically stable. If A 
is an attractor, the basin of A is the union of 
all open neighborhoods of A satisfying (i) and 
(ii). 

(2) Assume that the phase space X is com- 
plete. A motion o* (xe X) is called almost 
periodic if for any ¢>0 there exists a rela- 
tively dense subset {1,} in R such that d(g*(t), 
o*(t+t,))<e for all teR and t, (— 18 Almost 
Periodic Functions). If @* is almost periodic, 
then @” is almost periodic for all ye C(x). If A 
is a compact minimal set and if g* is almost 
periodic for some x in A, then every motion 
p”, yE Á, is almost periodic. An almost periodic 
motion is recurrent. The converse is not true. 
But if x is recurrent and Lyapunov stable in 
C(x) (i.e., in the restriction flow (C(x), o | C(x) 
x R)), then g* is almost periodic. If x is uni- 
formly Lyapunov stable in C(x) and negatively 
Lagrange stable, then o” is almost periodic 
(A. A. Markov). 


G. Singular Points and Closed Orbits 


In this section we assume that the phase space 
is a tparacompact differentiable manifold of 
class C” with metric d. 

(1) Let E be a finite-dimensional real vector 
space and L:E>E a linear automorphism. L 
is called hyperbolic if it has no teigenvalues of 
absolute value 1. If L: EE is hyperbolic, 
there are unique vector subspaces E* and E" 
satisfying the following conditions: (i) E= 
E5 @ E“, (it) L(ES)= E° and L(E"“) = E“, (iii) if 
-|| is a tnorm on E, then there exist constants 
c>0O and 0</A<1 such that, for any positive 
integer m, || L”(v)|| <cA™ ||v|| when ve E* and 
|L-™(v)|| <cA™|lv|| when ve £“. The zero 0 of E 
is the only fixed point of a hyperbolic linear 
automorphism. Put s=dim E* and u = dim E". 
Then s+u=dim E, and s (resp. u) is the num- 
ber of eigenvalues of absolute value <1 (resp. 
> 1) counted with multiplicity. A topological 
conjugacy class of a hyperbolic linear auto- 
morphism L: E-E is determined by s, u, and 
the signs of det(L| E*°) and det(L| E“), where 
det(L| E°) is the tdeterminant of the restriction 
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L| E”: E°>E° of L on E° (o=s, u). Further 
investigations of topological classification of 
linear automorphisms have been carried out 
by N. H. Kuiper and J. W. Robbin. 

(2) Let f:R”>R” be a C’-diffeomorphism 
(1<r< œ). Assume that the origin 0ER" is a 
fixed point of f. Then the tdifferential dfo: R” 
>R” of f at 0 is a linear automorphism. It is 
given by dfo(x)=Jo(f)x, xeR”, where Ja( f) is 
the tJacobian matrix of f at 0 and x is ex- 
pressed as a column vector. If dfọ is hyperbolic, 
then f is topologically conjugate to df, ina 
sufficiently small neighborhood of 0 (P. Hart- 
man). Assume that f is of class C”, and let 
Åi, -> An (possibly repeated) be the eigenvalues 
of dfo. If A; AA... åp" for all 1 <i<n and for 
all nonnegative integers m,,...,m, with 2< 
M, +... +M, then in a sufficiently small 
neighborhood of 0, f is C®-conjugate to dfo 
(S. Sternberg). 

(3) Let f: M->M be a C’-diffeomorphism 
(1<r<oo) ofa differentiable manifold M of 
class C”. Let pe M be a fixed point of f. Then 
the ‘differential df, of f at p is a linear auto- 
morphism of the ttangent space T,(M) of M at 
p. If df, is hyperbolic, then p is called a hyper- 
bolic fixed point of f. Let p be a hyperbolic 
fixed point of f and T,(M)= E* @ E” be the 
direct sum decomposition with respect to df,. 
Put W*(p)={xeM|f"(x)>p as n> oo} and 
W"(p)={xeM|f-"(x)p as n00}. W*(p) 
and W“(p) are invariant sets and images of 
tinjective immersions of class C” of vector 
spaces E° and E“, respectively. At the point p, 
W*(p) and W“(p) are tangent to E° and E", 
respectively [21]. We call W°(p) (resp. W“(p)) 
the stable (resp. unstable) manifold of f at p. In 
a sufficiently small neighborhood of a hyper- 
bolic fixed point p of f, f is topologically 
conjugate to its differential df,. Therefore a 
hyperbolic fixed point is an isolated fixed point 
(i.e., there are no fixed points in its sufficiently 
small neighborhood except itself). A hyperbo- 
lic fixed point p is called a source if dim W*(p) 
=Q and a sink if dim W“(p)=0. Otherwise it is 
a saddle point. The number of topological 
conjugacy classes of hyperbolic fixed points on 
an n-dimensional manifold is 4n. 

Let pe M bea periodic point of f and m the 
smallest positive period of p. Replacing f by 
f™, we obtain notions of hyperbolicity, stable 
manifold, and so on for the periodic point p. 
We also obtain propositions and theorems 
similar to those stated above for periodic 
points. 

(4) Let A be a real n x n matrix. Consider the 
system of linear differential equations 


dx/dt=Ax, XxeéER". (5) 


This equation generates a C”-flow (R”, o), and 
p, :R">R'" is given by @,(x)=e'4x, where e is 
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the fexponential of the matrix tA. Thus g, is a 
linear automorphism for all teR. The origin 
OeER" is a singular point of (R", ø). If none of 
the real parts of the eigenvalues of A are zero, 
we call 0e R” a hyperbolic singular point of (5). 
If OER" is a hyperbolic singular point of (5), 
then there exist two vector subspaces E° and 
E* of R” satisfying the following conditions: (i) 
R” = E; @ E“; (ii) A(E*)= E5 and A(E")= E"; 
(iii) E* and E” are invariant sets of {(R", @); (iv) 
there exist positive constants c and 4 such 
that for all t>0, |o (|| <ce7# |x| when xe E5 
and ||@_,(x)|| <ce 7” ixl] when xe E”, where |i- || 
is the norm on R”. The origin 0 is a hyperbolic 
singular point of (5) if and only if the time- 
one mapping p, =e4:R"—>R" of ọ is a hyper- 
bolic linear automorphism. If 0 is a hyperbolic 
singular point of (5), the above direct sum 
decomposition R” = E* @ E” coincides with the 
one with respect to g,. Put s=dim E* and u = 
dim E". Then s+u=n, and s (resp. u) is the 
number of all eigenvalues of A with negative 
(resp. positive) real parts counted with multi- 
plicity. Also we obtain the following proper- 
ties: (i) xe ES (resp. E") ~,(x) 0 as t> œ 
(resp. t—> —00), (ii) x¢ Æ (resp. E") = ||,(x)|| 
+00 as t>% (resp. t — oo), (iii) the origin 0 
is the only singular point. 

Let A and B be n xn real matrices. Let 
(R", g,) and (R", pp) be the flows determined 
by the equations dx/dt = Ax and dx/dt = Bx, 
respectively. Assume that 0€R" is a hyperbolic 
singular point for both of the equations. Let s 
and u (resp. s’ and u’) be the integers defined 
for p, (resp. @,) in the above paragraph. Then 
the following conditions are equivalent: (1) 
(R", o4) and (R”, pg) are flow equivalent, (ii) 
(R”, @,) and (R”, op) are topologically equiva- 
lent, (iti) s=s’, (iv) u=u', (v) (@,), and (pp); are 
topologically conjugate. Further investigations 
of the phase portrait of the equation dx/dt = 
Ax without the assumption of hyperbolicity 
were done by Kuiper. 

(5) Let D be an open set of R” and f: D—R" 
a continuous mapping. Consider the system of 
differential equations (1), which we write down 
again here: 


dx/dt= f(x), xeD. (6) 


A point peD is a singular point of (6) if the 
trajectory through p consists of a single point 
p. If (6) generates a flow (D, ọ), then p is a 
singular point of (6) if and only if p is a sin- 
gular point of (D, g). A point p is a singular 
point of (6) if and only if f(p)=0. 

Suppose further that f is a C’-mapping 
(1<r<oo). A singular point p of (6) is called 
hyperbolic if none of the real parts of the eigen- 
values of the Jacobian matrix J,(f) of f at p is 
zero. Let p be a singular point of (6). We as- 
sume for simplicity that p=0 and (6) generates 


126G 


Dynamical Systems 


a flow (D, o). Denote the Jacobian matrix of f 
at 0 by A, and let (R”, 4) be the flow generated 
by the equation dx/dt = Ax. If 0 is a hyperbolic 
singular point, then in a sufficiently small 
neighborhood of 0, the flows (D, p) and (R”, o4) 
are flow equivalent and hence topologically 
equivalent (Hartman, D. M. Grobman). Let 
Ži» <., Ån (possibly repeated) be the eigen- 
values of A. If f is of class C” and 4;4m,A,4+ 
...+m,/, for all 1 <i<n and for all nonnega- 
tive integers m,,...,m, with 2<m,+...+m,, 
then in a sufficiently small neighborhood of 0, 
two flows (D, @) and (R”, g,) are C”-equivalent 
(Sternberg). 

(6) Let M be a paracompact differentiable 
manifold of class C® and V a vector field of 
class C (1 <r < œ) on M. For simplicity we 
assume that V generates a C’-flow (M, @). For 
each point p of M, take a coordinate neighbor- 
hood (U, x) of class C” around p, where U is 
an open neighborhood of p in M and a: U—R" 
is a homeomorphism onto an open set D in R”. 
Using (U, x), we can express the vector field V 
as a system of ordinary differential equations 
of the form (6) in D. The eigenvalues /,,...,4, 
(possibly repeated) of the Jacobian matrix of f 
at a(p) are independent of the choice of local 
coordinates (U,«) around p. Thus /,,...,2, are 
invariants of the C’-equivalence, but they are 
not invariants of the topological equivalence. 

A singular point p of a flow (M, ọ) (ora 
vector field V) is called hyperbolic if «(p) is a 
hyperbolic singular point of the corresponding 
equation (6) (i.e., none of the real parts of the 
above eigenvalues 4,,...,2, are zero). A sin- 
gular point of (M, ~) is hyperbolic if and only 
if it is a hyperbolic fixed point of the time-one 
mapping ¢, of ọ. A hyperbolic singular point 
p is an isolated singular point. 

Let p be a hyperbolic singular point of 
(M, p) and T,(M) = E* @ E" the direct sum 
decomposition with respect to g,. Put s= 
dim E* and u=dim E". Then s (resp. u) is the 
number of all eigenvalues 4; of negative (resp. 
positive) real parts. Put W*(p)={xeM]|o,(x) 
pas too} and W“(p)={xeM|Q_(x)>p 
as t>00}. W*(p) and W“(p) are invariant sets 
and images of injective immersions of class C” 
of vector spaces E* and E", respectively. At 
the point p, W*(p) and W“(p) are tangent to 
E* and E“, respectively. We call W*(p) (resp. 
W"(p)) the stable (resp. unstable) manifold of f 
at p. The stable manifold and the unstable 
manifold of (M, @) at a hyperbolic singular 
point p coincide with those of the time-one 
mapping ¢, at p. A hyperbolic singular point 
p is called a source if dim W*(p)=0 and a sink 
if dim W“(p)=0. Otherwise it is called a saddle 
point. If p is a source (resp. a sink), then W“(p) 
(resp. W*(p)) is a neighborhood of p. A sin- 
gular point p is a source (resp. a sink) if and 
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only if all real parts of the corresponding eigen- 
values /,,...,4, are positive (resp. negative). 

(7) Let V be a vector field of class C” on a 
C*-differentiable manifold M of dimension n 
22 and (M, p) the C’-flow generated by V. Let 
C be a closed orbit of (M, o), p a point of C, 
and T>0 its smallest positive period. Then 
p is a fixed point of the C’-diffeomorphism 
Pr: M—>M. Let V(p)eT,(M) be the value of V 
at p and (d@_),: T,(M)— T,(M) the differen- 
tial of r at p. Then (dor) (V(p))= V(p), and 
V(p) #0. Therefore 1 is an eigenvalue of (d@z),. 
The other eigenvalues 2,,...,/,-1 (possibly 
repeated) of (dør), do not depend on a choice 
of peC and are called the characteristic multi- 
pliers of C. If none of the characteristic multi- 
pliers of C is of absolute value 1, we call Ca 
hyperbolic closed orbit. If C is a hyperbolic 
closed orbit, then there exist vector subspaces 
Ej, and E; for each peC satisfying the follow- 
ing conditions: (i) T,(M)= L(V(p)) ® Ep ® E*, 
where L(V(p)) is the 1-dimensional subspace 
generated by V(p); (1i) (do) (Ep) = Epp (0 = 
s, u) for all teR, where (dg,),: T,(M)— Toup (M). 
In particular, (dor), (Ep) = E} (o =s, u) (iii) 

dim E; (resp. dim E¥) is independent of pe C 
and is equal to the number of /; of absolute 
value <1 (resp. > 1). Put W*(C)={xeM| 
d(p,(x), C) 0 as t> 00} and W“(C)={xeM| 
d(p_,(x), C) +0 as t>00}. Then W*(C) (resp. 
W*(C)) is an invariant set and is an injectively 
immersed C’-submanifold of M which is tan- 
gent to L(V(p)) ® E$ (resp. L(V(p)) ® E%) for 
each peC. W*(C) (resp. W“(C)) is called the 
stable (resp. unstable) manifold for C. 

Let p be a point of a closed orbit C. An 
embedded (n— 1)-dimensional disk D of class 
C” in M containing p is called a cross section 
for a closed orbit C if V is transverse to D (ie., 
T,(M) = L(V(x)) ® T.(D) for each xe D) and 
D(C= {pt}. For a given cross section D for C, 
there exists a neighborhood U of p in D such 
that for any xe U there exists a t(x)>0 such 
that (x, t(x))eD and o(x, t)¢ D for 0<t< 
t(x). A mapping f: UD, defined by f(x)= 
(x, t(x)), xe U, is called a Poincaré mapping 
for D. The C’-conjugacy class of the tgerm of 
a Poincaré mapping f:V—D at p is indepen- 
dent of the choice of peC and the cross section 
D for the closed orbit C. The point p is a fixed 
point of f, and the eigenvalues of df, coincide 
with the characteristic multipliers of C includ- 
ing multiplicity. Therefore C is hyperbolic if 
and only if peC is a hyperbolic fixed point of 
J. Ina sufficiently small neighborhood of C, 
the flow is C’-equivalent to the suspension of f. 
The topological equivalence of the flow in a 
sufficiently small neighborhood of C is deter- 
mined by the topological conjugacy class of f. 
The number of topological equivalence classes 
of hyperbolic closed orbits which appear in a 
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flow on an n-dimensional manifolds is 4(n— 1) 
(M. C. Irwin). 

(8) Let V be a vector field of class C" (1 <r < 
oo) on a compact C”-manifold M. If M has 
a nonempty boundary, we assume that V 
points outward at all boundary points. Let p 
be an isolated singular point of V and denote 
by i(p) the tindex of the vector field V at the 
singular point P. If all singular points are 
isolated, then the sum $, i(p) is independent of 
V and is equal to the tEuler-Poincaré charac- 
teristic y(M) of M (Poincaré, H. Hopf) (— 153 
Fixed-Point Theorems). A vector field (or a 
flow) is called nonsingular if it has no singular 
points. If 7(M) #0, then any vector field (and 
hence any flow) on M has a singular point. If 
y(M)=0, then there exists a nonsingular vector 
field on M. If M is 2-dimensional and without 
boundary, then M admits a nonsingular vector 
field if and only if M is a ftorus or a tKlein 
bottle. 

There are many directions in which gen- 
eralizations of the Poincaré-Hopf theorem can 
be made. For example, an index of an isolated 
closed orbit of a flow has been defined and 
investigated by F. B. Fuller. 

The phase portrait near a singular point or 
a closed orbit which is not hyperbolic is com- 
plicated. Many results in general situations 
have been obtained for planer flows, but there 
are only a few for higher-dimensional flows. 


H. Generic Properties and Structural Stability 


(1) Let M be a compact C”-manifold without 
boundary. The set 27(M) of all vector fields of 
class C” (1 <r < œ) on M forms a real vector 
space in a natural way. We can give a norm 

IF ||, (called a C’-norm) for Ve 2"(M) using its 
expressions in a given suitable system of local 
coordinates of class C” on M and their par- 
tial derivatives up to order r. By virtue of 

this norm, 2”(M) is a Banach space with the 
topology of uniform C’-convergence (— 168 
Function Spaces). A subset of a topological 
space is called a residual set or a Baire set if it 
is the intersection of a countable number of 
dense open sets. A residual set in 2’(M) is a 
dense set. A proposition P concerning a vector 
field of class C” is called generic if the set {Ve 
X'(M)| P(V)} contains a residual set for any 
M. The following properties are generic prop- 
erties: (i) All singular points and all closed 
orbits are hyperbolic [21, 22]; (ii) any stable 
manifold (of a hyperbolic singular point or a 
hyperbolic closed orbit) meets transversally 
with any unstable manifold (of a hyperbolic 
singular point or a hyperbolic closed orbit) at 
any point of their intersection [21, 22]; (iii) the 
set of all periodic points (i.e., the union of all 
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singular points and all closed orbits) is dense 
in the nonwandering set Q for Ve X'(M) [23]; 
(iv) there is no regular first integral for ve 

4 1(M), where a regular first integral of a vec- 
tor field V on M is a C!-function f: M>R 
such that f is not constant on any open set of 
M and is constant along any orbit of (the flow 
generated by) V (R. C. Robinson). 

Let V (resp. V’) be an element of 27(M) 
(resp. 27(M’)), (M, ~) (resp. (M’, ¢’)) the flow 
generated by V (resp. V’), and Q (resp. Q’) the 
nonwandering set of (M, p) (resp. (M’, 9’)). 

V and V’ are called topologically equivalent 
(resp. Q-equivalent) if (M, o) and (M’, o^) (resp. 
the restrictions (Q, o |Q x R) and (0’, o'| Q x 
R)) are topologically equivalent. A vector 
field VE 2"(M) (or the flow generated by V) is 
called C’-structurally stable (resp. C’-Q-stable) 
if there exists a neighborhood of V in 
2"(M) such that any V’ in M is topologically 
equivalent (resp. Q-equivalent) to V. If Ve 
4"(M) is structurally stable (resp. Q-stable), 
then the topological structure of the phase 
portrait of (the flow generated by) V in the 
whole space (resp. the nonwandering set) re- 
mains invariant under a sufficiently small C’- 
perturbation of V. The generic properties (1)— 
(iv) above hold for C!-structurally stable 
vector fields (Markus, Thom, Peixoto, J. Ar- 
raut). The generic properties (1) and (iii) above 
hold for C'-Q-stable vector fields. 

(2) Let M be a compact C”-manifold with- 
out boundary. Let F’(M) be the set of all 
C’-mappings of M into itself with the topol- 
ogy of uniform C’-convergence. Let Diff’(M) 
be the subset of F’(M) consisting of all C’- 
diffeomorphisms of M onto itself. Then F’(M) 
is a complete metric space, and Diff’(M) ts 
open in F’(M). Thus Diff’(M) is a *Baire space 
and also a topological group with respect to 
this topology. A proposition P concerning 
fe Diff’(M) is called generic if the set { fe 
Diff’(M)| P(f/)} contains a residual set. The 
following properties are generic: (i) Every 
periodic point is hyperbolic [21]; (11) for each 
pair of hyperbolic periodic points p and q, the 
stable manifold W°(p) intersects the unstable 
manifold W“(q) transversally [21]; (iii) for a 
C'-diffeomorphism fe Diff'(M) the set of all 
periodic points is dense in the nonwandering 
set ([23], Robinson). 

Let f: M>M and g:NN be C’- 
diffeomorphisms and Q( f) and Q(g) their 
nonwandering sets. f and g are called Q- 
conjugate if f|Q(f):Q(f)rQ(f) and g]Q(g): 
Q(g)>Q(g) are topologically conjugate. A 
diffeomorphism f € Diff’(M) is called C’- 
structurally stable (resp. C’-Q-stable) if there 
exists a neighborhood ~ of f in Diff"(M) such 
that any g in M is topologically conjugate 
(resp. Q-conjugate) to f. Generic properties (i)- 
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(iii) above hold for C'-structurally stable dif- 
feomorphisms, and (i) and (iii) above hold for 
C'-Q-stable diffeomorphisms. 


I. Low-Dimensional Systems 


(1) Let S' = {zeC||z|=1} be the unit circle in 
the complex plane C and p:R-—S' the tcover- 
ing projection defined by p(x)=e?"", xeR. 
Let f:S!—St be an torientation-preserving 
homeomorphism. Then f can be lifted to a 
mapping F:R-R satisfying the following 
conditions: (i) po F = fo p; (ii) F is monotone 
increasing; (ili) F — lp is a periodic function of 
period 1, where Lp is the identity mapping of 
R. The limit p( f)=lim,,.,. F"(x)/n exists for all 
xeR, and its tresidue class modulo Z is inde- 
pendent of F and x. We call p(f) the rotation 
number of f. Let f, g:S'—S' be orientation- 
preserving homeomorphisms. If f and g are 
topologically conjugate by an orientation- 
preserving (resp. treversing) homeomorphism, 
then p(f)=p(g) (resp. p(f) = — p(g)) modulo 
Z. An orientation preserving homeomorphism 
f{:S!—S? has a periodic point of the smallest 
positive period s if and only if p( f)=r/s, where 
rand s are relatively prime integers. In this 
case all periodic points are of the smallest 
positive period s. If p(f) is irrational, then 
the w-limit set w(x) of xe S! is independent 

of x, and E=«(x) is either tperfect and tno- 
where dense or the whole space S'. If p(f) is 
irrational and E=q@(x)=S', then f is called 
transitive. If f is transitive, then it is topologi- 
cally conjugate to the rotation r,,,,:S'>S! 
defined by r,,,,(e77%) =e?" 1 PY, xER. Let 
f:S'—S' be of class C! with p(f) irrational. If 
its derivative f’ is of tbounded variation, then 
f is transitive (A. Denjoy). In particular, if f 
is of class C? with p(/) irrational, then f is 
topologically conjugate to the rotation ryp- 
However, there are C!-diffeomorphisms of $! 
onto itself whose nonwandering sets are not 
the whole space. Those C'-diffeomorphisms 
are never topologically conjugate to C?- 
diffeomorphisms. M. R. Herman gave a suffi- 
cient condition for a diffeomorphism of S! 
onto itself to be differentiably conjugate to a 
rotation. 

A C’-diffeomorphism f:S!—S! is structur- 
ally stable if and only if Q(/) is finite (hence 
Q( f) consists of a finite number of periodic 
points), and all periodic points are hyper- 
bolic. The set of all structurally stable C’- 
diffeomorphisms of S! onto itself is a dense 
open set in Diff’(S’) (Peixoto). 

(2) Let D be an open set in R? and f:D—R? 
a continuous mapping. We assume that for 
each xe D there exists a unique nonextend- 
able solution g(x, t) with the initial condition 
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(x, 0)=x defined on (a,, b,) for the equation 
dx/dt = f(x), xe D. Suppose that for a given 
point xe D there exists a compact set K in D 
containing the positive semiorbit C,(x)= 
{p(x,)|0<t<b,' of x. Then b, = œ. Further, 
we assume that the w-limit set w(x) of x con- 
tains no singular points. Then we have either 
(i) C(x) =c(x) and it is a closed orbit, or (ii) 
C,(x)#w(x) and w(x) is a closed orbit. In case 
(ii), (x) is a tsimple closed curve and C, (x) 

is a spiral which tends the closed orbit w(x) 
(Poincaré, I. Bendixson). We call such a closed 
orbit w(x) a limit cycle. Let f, and f, be poly- 
nomials of two variables and m the maximum 
of degrees of f, and fọ. Let f=(f;, f,):R?—-R? 
be the mapping defined by f, and fz. The 
equation dx/dt = f(x), xe R?, defined by such 
an f is called a polynomial system of degree 
m. The following is Hilbert’s 16th problem: Is 
there a number N(m) depending only on m 
such that the number of limit cycles for any 
polynomial system of degree m is bounded by 
N(m)? 

Let M be a closed (i.e., compact, without 
boundary) C*-manifold of dimension 2 and 
(M, @) a C?-flow on M. Then a minimal set of 
(M, ọ) is either (i) a singular point, (ti) a closed 
orbit, or (iti) the whole space M. For case (iii), 
M is a torus (Poincaré, Denjoy, C. L. Siegel, A. 
J. Schwartz). Let T be a 2-dimensional torus 
and (T, ~) a C’-flow. Suppose that (T, o) has a 
cross section X which is C!-diffeomorphic to 
S'. Let f:X—2 be the Poincaré mapping for 
2. Then (T, ọ) has a closed orbit if and only if 
the rotation number p(/) of f is rational. If 
p(f) is irrational and (T, @) is of class C?, then 
T is a minimal set. 

Let M be an orientable 2-dimensional 
closed C”-manifold and Ve 2'(M). Then V is 
structurally stable if and only if the following 
conditions are satisfied: (i) There are only a 
finite number of singular points, all hyperbolic; 
(ii) There are only a finite number of closed 
orbits, all hyperbolic; (iii) There are no orbits 
which connect saddle points; (iv) The «(x) and 
w(x) for any xe M are singular points or 
closed orbits (Peixoto). The above theorem 
was first proved by Andronov and Pontryagin 
for analytic vector fields on a 2-dimensional 
disk which are transverse to the boundary of 
the disk at any boundary point. The set of all 
structurally stable vector fields of class C! on 
M is a dense open set in %!(M) (Peixoto). 


J. Axiom A Systems 


In this section we assume that phase spaces 
are closed C*-manifolds with metric d and 
1<r<oo unless stated otherwise. 
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(1) A vector field VE2"(M) (resp. a C’-flow 
(M, ọ)) is called a Morse-Smale vector field 
(resp. a Morse-Smale flow) if the following 
conditions are satisfied: (i) The nonwandering 
set is the union of a finite number of singular 
points and a finite number of closed orbits; (ii) 
The singular points and closed orbits are all 
hyperbolic; (iit) The stable manifolds and the 
unstable manifolds of the singular points and 
closed orbits intersect each other transversely. 
If dim M = 2, the Morse-Smale vector fields on 
M are exactly the structurally stable ones 
discussed in Section I(2). A Morse-Smale 
vector field is structurally stable [25]. The set 
of all Morse-Smale vector fields on M is open 
but not dense in 2"(M) if dim M > 2 (Palis, 
Smale). However, it contains a dense open 
subset of the set of all tgradient vector fields 
with respect to a given ‘Riemannian metric 
(Smale). In particular, any closed manifold 
admits Morse-Smale vector fields and hence 
structurally stable vector fields. For a Morse- 
Smale vector field, the *Morse inequalities 
hold as they do in the tcalculus of variations 
in the large [24]. 

(2) A vector field Ve 2"(M) (or the C’-flow 
(M, ~) generated by V) is called an Anosov 
vector field (or an Anosov flow) if the following 
conditions are satisfied: (1) There is a direct 
sum decomposition T.(M)=L(V(x)) @ ES @ E! 
of the tangent space T.(M) for each xe M 
which depends continuously on xe M; (ii) 
(de,),(ES) = ES w and (dọ,), (E!) = E$ 4» for all 
xeM and teR; (iii) There are a Riemannian 
metric on M and constants c, 4 >0 such that, 
for all t>0 and xe M, ||(dg,),(w)|| <ce~** || w| 
when we EX, and ||(dg_,),(w)|| < ce™* |w] 
when we E%, where ||: || is the norm induced 
by the Riemannian metric. The suspensions of 
Anosov diffeomorphisms and the *geodesic 
flows on Riemannian manifolds of negative 
curvature are important examples of Anosov 
flows (J. Hadamard [8]). There are exam- 
ples of Anosov flows other than the ones 
stated above (M. Handel and W. P. Thurston, 
J. Franks and R. F. Williams). The follow- 
ing have been proved by Anosov [8]: (i) An 
Anosov flow is structurally stable; (it) there 
are countably many closed orbits for an 
Anosov flow; (itt) 1f there exists a smooth in- 
variant measure (i.c., an invariant measure 
which has a tsmooth density with respect to 
the measure associated with the Riemannian 
metric), then the set of all closed orbits 1s dense 
in M. If we assume further that the flow is of 
class C?, then it is tergodic; (iv) the set of all 
Anosov vector fields is open in Z(M), (v) {ES}, 
{Ex} (xe M) define tfoliations on M, which are 
called Anosov foliations. 

(3) A vector field Ve2"(M) (or the C’-flow 
(M, p) generated by V) is called an Axiom A 
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vector field (or an Axiom A flow) if the follow- 
ing conditions are satisfied: A(a) The non- 
wandering set consists of a finite set F of sin- 
gular points, all hyperbolic, and the closure A 
of the union of closed orbits, and FN A = Ø; 
A(b) The conditions (i)—(11i) in the definition of 
Anosov flow in which we replace the terms 
“xe M” by “ xeA.” For each xe M, put W*(x) 
={yeM|d(9,(x), p(y) 0 as t 0} and 
W"(x)={yeM |d(p_,(x), p-(y))—>0 as t 00}. 
We call W*(x) (resp. W“(x)) the stable (resp. 
unstable) manifold of V at x. For a subset A 
of M, put W7(A)=| Jie 4 W(x) (o =s, u). 

For xe M, put W”?(x)= W°(C(x)) (o=s, u), 
where C(x) is the orbit of x. If the flow satis- 
fies Axiom A, then W*(x), W“(x), W(x), and 
W*"(x) are injectively immersed submanifolds 
for all xe M (M. W. Hirsch and C. C. Pugh). 
If xe M is a hyperbolic singular point, then 
W*(x) and W“(x) defined above coincide with 
those defined before. If C(x) is a hyperbolic 
closed orbit, then W*(C(x)) and W“(C(x)) 
defined above coincide with those defined 
before. 

For an Axiom A flow, there is a decomposi- 
tion of the nonwandering set QN=Q, U...UQ,, 
(disjoint union), where each Q; is closed, invar- 
iant, and transitive (i.e., has a dense orbit), 
and M =|)", W(Q)=(J2, W*(Q,) (disjoint 
union) [7]. This decomposition is called the 
spectral decomposition of Q, and each Q; is 
called a basic set. Let Q=Q, U...UQ,, be the 
spectral decomposition of the nonwandering 
set for an Axiom A flow. Denote 0; <Q; if 
W=) N WQ) 4. A sequence of basic sets 
Qi Qi- Q (k> 1) is called a cycle if Q, < 
Q; S<. SQ Q; =Q; and otherwise Q, # 
Q, for p#q. An Axiom A flow which has no 
cycles in the above sense is said to satisfy the 
no cycle condition. The Q-stability theorem: 
An Axiom A flow with the no cycle condition 
is Q-stable [27]. Q-explosion: If an Axiom A 
flow has a cycle, then it is not Q-stable (Palis). 
An Axiom A flow is said to satisfy the strong 
transversality condition if W(x) and W(x) 
intersect transversely at any xe M. An Axiom 
A flow with the strong transversality condi- 
tion satisfies the no cycle condition [7]. The 
structural stability theorem: An Axiom A flow 
of class C! with the strong transversality con- 
dition is structurally stable [29, 31]. Morse- 
Smale flows and Anosov flows are Axiom A 
flows with the strong transversality condition. 
There are Axiom A flows other than Morse- 
Smale flows and Anosov flows that satisfy the 
strong transversality condition [7]. Stability 
conjecture: A C’-flow is structurally stable 
(resp. Q-stable) if and only if it ts an Axiom A 
flow with the strong transversality condition 
(resp. the no cycle condition). S. Newhouse, 
V. A. Pliss, Robinson, R. Mañé, S. D. Liao, 
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and A. Sannami have made important contri- 
butions to the study of the stability conjecture. 

Neither the set of all Axiom A flows nor the 
set of all Q-stable flows nor the set of structur- 
ally stable flows is dense in 2’(M) if dim M >2 
(R. Abraham and Smale, Newhouse). How- 
ever, the set of all structurally stable flows is 
dense in 2’(M) in the C? topology (M. Shub, 
M. M. C. de Oliverira). 

(4) A diffeomorphism fe Diff’(M) is called a 
Morse-Smale diffeomorphism if the following 
conditions are satisfied: (i) The nonwandering 
set Q is a finite set, and hence it consists of 
periodic points; (ii) all periodic points are 
hyperbolic; (iii) for each pair p, qeQ, W*(p) 
intersects W“(q) transversally. The Morse- 
Smale diffeomorphisms on the unit circle S! 
are exactly the structurally stable ones de- 
scribed in Section I(1). A Morse-Smale dif- 
feomorphism is structurally stable [25]. The 
set of all Morse-Smale diffeomorphisms on M 
is open but not dense in Diff’(M) if dim M >2 
(Palis, Smale). However, it contains a dense 
open subset of the set of all time-one mappings 
of the flows generated by gradient vector fields. 
In particular, any closed manifold admits 
Morse-Smale diffeomorphisms and hence 
structurally stable diffeomorphisms. For a 
Morse-Smale diffeomorphism, the Morse 
inequalities hold [24]. 

Let A be a closed invariant set of fe 
Diff’(M). A is called hyperbolic if the follow- 
ing conditions are satisfied: (i) There is a split- 
ting T,(M)= E$ ® EX of the tangent space 
T,(M) for each xe A, which depends continu- 
ously on xe A; (ii) df (EX) = ES and df,(EX) = 
E w for all xe A; (iti) There is a Riemannian 
metric on M and constants c>0,0<A<1 such 
that, for any integer m>0 and xe A, |d (w) 
<ch"||wl| when we Ef and |idf-(w) || < 
cA™||wl| when we EX. A diffeomorphism fe 
Diff”(M) is called an Anosov diffeomorphism 
if M itself is hyperbolic for f. Manifolds which 
admit Anosov diffeomorphisms are restricted 
(Hirsch, K. Shiraiwa). Examples of Anosov 
diffeomorphisms are given as follows: Let 
L:R">R’ be a hyperbolic linear automor- 
phism with L(Z")= Z", where Z” is the dis- 
crete subgroup of R” consisting of all elements 
with integral coordinates. Then L induces an 
automorphism f:T”— T" of the n-dimensional 
torus T”=R"/Z", which is an Anosov diffeo- 
morphism. There are similar constructions 
using hyperbolic automorphisms of simply 
connected ‘nilpotent Lie groups and their uni- 
form discrete subgroups (Smale). A homeo- 
morphism h: X +X of a metric space X is 
called expansive if there is a constant ¢>0 
such that x, ye X and d(h"(x), h"(y))<e for all 
neZ imply x=y. An Anosov diffeomorphism 
is expansive. For Anosov diffeomorphisms, 
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theorems similar to those stated in Section 
J(2) hold. Franks, Newhouse, A. Manning, 
and J. N. Mather obtained important results 
concerning Anosov diffeomorphisms. 

A diffeomorphism fe Diff (M) is called an 
Axiom A diffeomorphism if the following con- 
ditions are satisfied: A(a) The nonwandering 
set Q is hyperbolic; A(b) the set of all periodic 
points is dense in Q. There are examples that 
satisfy Axiom A(a) but not Axiom A(b) (A. 
Dankner, M. Kurata). For xe M, put W‘(x) 
={yeM|d(f"(0), f(y) 20 as n>} and 
W"(x)={yeM|d(f-"(0,f "(3)) 70 as n> 00}. 
We call W*(x) (resp. W“(x)) the stable (resp. 
unstable) manifold of f at x. For a subset A of 
M, put W°(A)=|),e4 W(x) (0 =s, u). If f is an 
Axiom A diffeomorphism, W*(x) and W“(x) 
are injectively immersed submanifolds of M. If 
x is a hyperbolic periodic point, W*(x) and 
W(x) defined here coincide with those defined 
before. For Axiom A diffeomorphisms, notions 
such as spectral decomposition, basic sets, the 
strong transversality condition, and the no 
cycle condition are defined similarly, and 
theorems similar to those stated in Section J(3) 
hold. 

Let h: X +X be a homeomorphism of a met- 
ric space X and a, f>0. A sequence {x;};.7 of 
points in X is an a-pseudo-orbit if d(h(x;), x;+1) 
<a for all ie Z. We say that {x;},.7 is f- 
shadowed (or f-traced) by a point xe X if 
d(h'(x), x) <$ for all ie Z. We say that h has 
the pseudo-orbit tracing property if for any f 
>0 there exists an « >0 so that any «-pseudo- 
orbit is B-shadowed by some point. If f is an 
Axiom A diffeomorphism, then f|Q(/):Q(/) 
—>Q(f) has the pseudo-orbit tracing property 
(Bowen). Axiom A diffeomorphisms with the 
strong transversality condition (in particular, 
Anosov diffeomorphisms) have the pseudo- 
orbit tracing property (Bowen, K. Sawada, K. 
Kato, A. Morimoto). 

(5) Let S be a discrete topological space with 
n elements (n> 2) and X(S)=[T;.7 S; the dou- 
bly infinite product of copies S; of S with the 
product topology. A point x =(x,);-7 of 2(S) is 
a doubly infinite sequence of points in S. X(S) 
is a ttotally disconnected, perfect, compact, 
tmetrizable space (i.e., a Cantor set). Let 
o:4(S)-+2(S) be a mapping defined by o(x)= 
y, x= (x), V=()s Y:=X (i€ Z). Then o is a 
homeomorphism of (S) and is called the shift 
automorphism with n symbols. 

Define a diffeomorphism f:S*—S? of the 2- 
dimensional sphere S? as follows. We consider 
S? as R?U {00}. Let U be a region of R? con- 
sisting of the rectangle R (= ABCD) and an 
upper and a lower cap as shown in Fig. 1. f 
maps U into itself as in Fig. 1 (f(A)= 4’, f(B) 
= B’, f(C)=C', f(D) = D’, and so on). Here, 

Rf f(R)=PUQ is the union of two rectangles, 
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P'=f~'(P) and Q'=f~'(Q) are rectangles, and 
f is taffine on each of P’ and Q’ (stretching in 
the vertical direction and contracting in the 
horizontal direction). The lower cap is con- 
tracted into itself, and every point in it tends to 
a sink x9. The upper cap is mapped into the 
lower one and stays there. The point œ% is the 
only source, and lim,» f ~ "(x)= œ for all 
xeR?—U. Thus the nonwandering set of f 
consists of a sink x), a source œ, and A= 
(\nez.f"(R). The mapping f constructed here 
is called the horseshoe diffeomorphism. It is 

an Axiom A diffeomorphism with the strong 
transversality condition (and hence structur- 
ally stable). The restriction f|A of f ona 

basic set A is topologically conjugate to the 
shift automorphism with two symbols. It is 
neither a Morse-Smale diffeomorphism nor an 
Anosov diffeomorphism [7]. 


Fig. 1 


A restriction of a shift automorphism on a 
closed invariant set is called a subshift. Let A 
=(A;,) be an n x n matrix with (i, j)-component 
A,,=0 or 1 for all i, j=1,...,n. It is irreducible 
if for each i, j there is a positive integer m such 
that A” has a nonzero (i, j)-component. As- 
sume that A is irreducible and S={1,...,n}. 
Put 24 =({x=(x,)€2(S)| A,,x,,,=1 for all 
ie Z}. Then X; is a closed invariant set of 2(S). 
The restriction ¢,=o0|2,:2,—-, is called a 
subshift of finite type or a Markov subshift, and 
A is called the transition matrix. A topological 
classification of the subshifts of finite type has 
been investigated by Williams. 

Let fe Diff"(M) be an Axiom A diffeomor- 
phism and A its basic set. Bowen constructed a 
*Markov partition for f| A by generalizing the 
Sinai construction for Anosov diffeomor- 
phisms. The Markov partition connects f|A 
with a suitable subshift of finite type and is 
applied to the study of Axiom A diffeomor- 
phisms, especially to the ergodic theory of 
Axiom A diffeomorphisms (Bowen, Ruelle). A 
similar theory for flows was developed by 
Bowen, Ruelle, and M. E. Ratner. 

(6) Let xe M be a hyperbolic fixed point of 
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f¢Diff’(M). A point pe W*(x)N W(x) (p#x) is 


called a homoclinic point. If W*(x) and W“(x) 
intersect transversally at a homoclinic point p, 
then p is called a transversal homoclinic point. 
In a neighborhood of a transversal homoclinic 
point, there is a closed invariant set A of f” 
for some positive integer m such that f™|A is 
topologically conjugate to the shift automor- 
phism with two symbols (Smale). There are 
generalizations of this theorem for semiflows 
by F. R. Marotto, Shiraiwa, and Kurata. 


K. Topological Entropy and Zeta Functions 


(1) The notion of topological entropy was first 
defined by R. L. Adler, A. G. Konheim, and 
M. H. McAndrew as an analog to measure- 
theoretic entropy. Let X be a compact topo- 
logical space and « an topen covering of X. 
Let N(a) be the minimum number of members 
of a tsubcovering of «. Let f: X > X be a con- 
tinuous mapping and a an open covering of X. 
Then lim,..,,(1/nlogN(avfav...vf ta) 
exists, where av f tav ...v f "tta is the open 
covering {Ao Nf HADNA... Af" (An) Ao, 
A,,...,A,-,€%}. We denote the above limit by 
h(f, «) and call it the topological entropy of f 
with respect to «. The topological entropy h(/) 
of f is defined as the sup h( f, «), where the 
supremum is taken over all open coverings « 
of X. Now assume that X is a compact metric 
space. For an open covering « of X, put diama 
=sup{diam A|Aeé«}, where diam A is the 
diameter of A. If {%,},5, is a sequence of open 
coverings of X such that diama,—-0 as n œ, 
then A( f, )>h(f)as n- 00. The topological 
entropy of the shift automorphism with n 
symbols is equal to logn. Let L:R”—>R” be a 
linear mapping with L(Z")c L(Z”) and 4,,..., 
A, its eigenvalues. Let f: T"— T" be the in- 
duced endomorphism of the n-dimensional 
torus T”=R"/Z”". Then the topological entropy 
is given by h( f)=%),,>, log|A(l- 

Let f:X >X be a continuous mapping 
and M(f) the set of all tf-invariant probabil- 
ity measures on the fBorel sets of X. For 
each pe M(f), denote by h,(f) the (measure- 
theoretic) entropy of f with respect to u. Then 
h(f)=sup{h,(f)| we M(f)} (E. I. Dinaburg, 
T. N. T. Goodman, L. W. Goodwyn). The 
following properties hold: (i) If f:X +X isa 
homeomorphism, then h( f")=|n|h(/) for all 
neZ. (ii) If (X, ọ) is a continuous flow, then 
h(g,)=|t|h(@,) for all teR. (iii) Let f: X;- X; 
(i= 1,2) be continuous. Then h(f, x f,)= 
h(f,)+h(/,). (iv) Let f: X; >X; (i= 1, 2) be con- 
tinuous. If there is a continuous mapping 
g:X,->X, such that g(X,)=X, and gof,= 
f, 0g, then h( f,)>h(f,). In particular, if fi and 
fa are topologically conjugate, then h( f,)= 
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hfa). (v) If f: X +X is a homeomorphism and 
Q is the nonwandering set of f, then h( f)= 

h( f\Q). In particular, Q-conjugate homeo- 
morphisms have the same topological entropy 
(Bowen). (vi) If f: M—>M is an Axiom A diffeo 
morphism of a closed C*-manifold M, then 

h( f)=limsup,,..,,,(1/n)log N,( f), where N,(/) is 
the number of all periodic points of period n. 
And h( f)=0 if and only if the nonwandering 
set 1s finite (Bowen). 

Bowen gave alternative definitions of the 
topological entropy and generalized it for 
uniformly continuous mappings of (not neces- 
sarily compact) metric spaces. If f: M >M 
is a C'-mapping of an n-dimensional Rie- 
mannian manifold M, then h(f)<max {0, 
nlogsup{ ||df,|||xéM}}, where |/df,|| is the 
norm of df,: T,(M)— T,,.)(M), and hence h(f) 
is finite (Bowen, S. Ito). 

(2) Let f: MM be a continuous mapping 
of a closed C”-manifold M and f}: H,(M)> 
H,,(M) (resp. f, ,:H,(M)—> H,(M)) the tinduced 
homomorphism if f on the fhomology group 
H,,(M) (resp. the ‘first homology group 
H,(M)) with coefficients in R. The spectral 
radius s(L) of a linear mapping L:E-E ofa 
real vector space E is the maximum of the 
absolute values of the eigenvalues of L. The 
following entropy conjecture is still open: 

If f: M—M is a C'-diffeomorphism (C!- 
mapping), then h( f)>logs(f,). Concerning the 
entropy conjecture, the following are known: 
(i) The conjecture holds for a dense open set of 
Diff"(M) in the C°-topology (Shub); (ii) for 

any continuous mapping f, h( f)>logs(f,1) 
(Manning); (tii) the conjecture fails for a 
homeomorphism (Pugh); (iv) the conjecture 
holds for an Axiom A diffeomorphism with 
the no cycle condition (Shub and Williams); 
(v) the conjecture holds for any continuous 
mapping of the n-dimensional torus (M. 
Misiurewicz, F. Przytycki); (vi) for a C?- 
mapping f:M—>M, h(f)>log|deg f|, where 
deg fis the ‘mapping degree of f (Misiurewicz, 
Przytycki). 

(3) M. Artin and B. Mazur first defined the 
zeta function of a diffeomorphism by analogy 
to *Weil’s zeta function. Let f:X —>X bea 
homeomorphism of a compact metric space 
X. Assume that the number N,, = N,,(/) of all 
periodic points of period m is finite for all m. 
Put ¢(t)=exp(X7_, N,,¢’"/m) and call it the 
zeta function of f. For any closed C®-manifold 
M, there is a dense set E of Diff"(M) such that 
NA f)<ck™ for feE, where N,,(f) is the 
number of isolated periodic points of period m 
and c and k are positive constants depending 
only on f (Artin, Mazur). Hence, for such fe E, 
the series exp(X%_, N,,( f)t™/m) has a positive 
radius of convergence. Originally, Artin and 
Mazur called this the zeta function of f. 
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Let o,:2,72, be a subshift of finite type 
with transition matrix A; then ¢(t)= 1/det(E — 
tA), where E is the unit matrix (Bowen and O. 
E. Lanford III). The zeta function of an Axiom 
A diffeomorphism has a positive radius of 
convergence and is a rational function (K. 
Meyer, J. Guckenheimer, Manning). 

Let (M, o) be a nonsingular C’-flow on a 
closed C*-manifold M. Let T be the set of 
all closed orbits and /(y) the smallest posi- 
tive period of yeI. Smale defined the zeta 
function of (M, p) by Z(s)= Iler Mol — 
[expl(y)] -* *). If (M, @) is the geodesic flow on 
a surface of constant negative curvature, Z(s) 
reduces to the tSelberg zeta function. 

There are generalizations and modifications 
of the notion of zeta function by Ruelle and 
Franks. 


L. Classical Dynamical Systems 


(1) Let M be a C”-manifold without bound- 
ary. A C’-flow (M, p) or a C’-diffeomorphism 
{:M-M with a smooth invariant measure is 
called a classical dynamical system. Most 
important classical dynamical systems are 
Hamiltonian or Lagrangian systems. In the 
modern formulation, a Hamiltonian system 
consists of M, a symplectic form w on M (i.e., a 
tnondegenerate ‘closed 2-form), and the vector 
field Xy on M defined by a C’*!-function 
H:M >R. We call X,, the Hamiltonian vector 
field with energy function H. Let (M, œ, Xy) be 
a Hamiltonian system. Then the following 
hold: (i) M is of even dimension and there ts a 
system of local coordinates (q', ...,q", Py, ---5 
Pa) such that w= È! dq! ^dp; (J. G. Darboux). 
In these coordinates X y is expressed by 
tHamilton’s equations dq‘/dt = 0H /ôp;, 
dp;/dt= —0H/dq', i=1, ...,n; (ii) the smooth 
measure defined by the tvolume element Q= 
((—1)"71/n!)co” is an invariant measure for 
the flow generated by X,, (J. Liouville); (iii) 
the energy function H is constant along any 
trajectory of Xy. Especially, H~'(e) (ee R) is 
an invariant set for each e and is called an 
energy surface. Energy surfaces are submani- 
folds of codimension one for almost all ee R. 
Important examples of Hamiltonian sys- 
tems are given as follows. Let Q be an n- 
dimensional manifold and T*(Q) the tcotan- 
gent bundle of Q. Then T*(Q) has a canonical 
symplectic form Wp). We call Q a configuration 
space and 7*(Q) a momentum phase space. For 
any C’*!-function H on T*(Q), we have a 
Hamiltonian system (7*(Q), wo, Xy) in which 
Xy is of class C". The Hamiltonian formalism 
is translated into Lagrangian formalism by 
using the tangent bundle 7(Q) instead of 
T*(Q). T(Q) is called a velocity phase space. 
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For a given differentiable function L on T(Q), 
the energy function E on T(Q) and the Lagran- 
gian vector field X; on 7(Q) are constructed. If 
L satisfies a certain condition, there is a system 
of local coordinates (qt, ...,q",4', ...,g”) such 
that Xz is expressed in these coordinates by 
the tEuler-Lagrange equations dq‘/dt = 4’, 
d/dt(0L/éq') = 6L/éq', i=1,...,n. The projec- 
tion of an integral curve (i.e., an orbit) of Xg 
into Q is called a base integral curve. Under a 
suitable condition, the base integral curve of a 
Lagrangian (or a Hamiltonian) system is the 
geodesic of the Jacobi metric on Q up toa 
reparametrization, and the restriction of the 
flow generated by X, on an energy surface is 
the geodesic flow described below. 

(2) Let M be a fcomplete Riemannian mani- 
fold of class C” and S(M)= {ve T(M)| |v] =1}. 
Let n:S(M)—>M be the projection defined by 
n(v)=x for ve T,(M). Then S(M) is a tsphere 
bundle over M, which is called the unit tangent 
sphere bundle over M. Each ve S(M) deter- 
mines a unique tgeodesic C,:R—M such that 
C,,(0)=x(v) and the ftangent vector C,(0) to C, 
at t=0 is equal to v. Let v, be the tangent 
vector C,(t) to C, at t for te R. Then ||v,|| =1 
and x(v,)=C,(t) for all teR. Define a map- 
ping @:S(M) x R>S(M) by ¢(v, t)=0,. Then 
(S(M), o) is a C®-flow, which is called the 
geodesic flow on M. By the classical Liouville 
theorem, a geodesic flow has a smooth invar- 
lant measure. 

(3) Let T"=R"/Z” be the n-dimensional 


torus and w,,...,@, real constants. Define a 
mapping 9: T" x R>T" by g([X1,---5 XL t) = 
[x,+@,t,...,x,+@,t], where [x,,...,x,]€ 


R"/Z" = T" is the residue class of (x,,...,X,)€ 
R” modulo Z”. Then (T”, o) is a C®-flow. We 
call it the translation flow with frequencies 
Ois On If @,,...,@, are linearly indepen- 
dent over Z, they are called independent. Every 
orbit of the translation flow is dense if and 
only if its frequencies are independent (Poin- 
caré, H. Weyl). A translational flow with inde- 
pendent frequencies is called quasiperiodic. 

Let (M, œ, Xy) be a Hamiltonian system on 
a 2n-dimensional manifold M. Under a certain 
condition, there exist an open set U of M and 
a diffeomorphism f: U — T” x R” such that the 
following holds: Identify U with T” x R” by 
fand let (q',...,q",P1,---5P,) be the coordi- 
nates of T” x R”. Then the energy function 
H is independent of qg=(q', ...,q") so that 
Hamilton’s equations becomes dq'/dt = dH/ép,, 
dp,/dt = —dH/éq'=0 for i=1,...,n. Therefore 
the solutions are given by q‘(t)=(0H/ép,(c))t + 
q'(0) modulo Z, p,(t)=p,(0)=c;, i=1,...,7, 
where c=(c,,...,C,). Therefore N.= f +T" x 
{c\) is an invariant torus (i.e., an invariant 
set diffeomorphic to T”) of (the flow generated 
by) Xy for all ce R”, and the restriction flow on 
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N. 1s the translational flow with frequencies 

w, =0H/ép,(c), ...,@,=0H/dp,(c) (Arnold). As- 
sume further that the Hessian det(d? H/ép,0p,) 
does not vanish at c and @,,...,@, are inde- 
pendent. Let A be an energy function obtained 
by adding a sufficiently small perturbation to 
H. Then for almost all c’ near c, there exists an 
invariant torus Ñ of Xg near N, such that the 
restriction flow on N is differentiably equiva- 
lent to the translational flow with the same 
frequencies (Kolmogrov, Arnold, Moser). 

(4) Generic properties for Hamiltonian 
systems were investigated by M. Buchner, 
Markus, Meyer, Pugh, Robinson, Takens, and 
Newhouse. 


M. Bifurcation 


(1) Consider a differential equation with a 
parameter. For example, let X be a domain 
of R”, J=(-1, 1), and f:J x X>R'" a C- 
mapping. For each pe J, define f,:X +R" by 
SAX) = f(u x), xe X. Consider the differential 
equation 


dx/dt = f,(x), 


As u varies, the topological structure of the 
phase portrait of (7) may change. Suppose that 
there exists 49éJ such that the topological 
structure of the phase portrait of (7) changes at 
L= Ho but remains the same when pọ—£< u< 
Ho OF Ho < H< Ho +E for some e>0. Then Ho 

is called a bifurcation point of (7). 

Hopf bifurcation: Assume that X = R? and 
the origin Oe R? is a singular point of (7) for 
all we J. Assume further that the Jacobian 
matrix of f, at OER? has two distinct complex 
conjugate eigenvalues (x) and A(z) such 
that the real part Re A(,2) of A(4) is positive 
when p> 0, zero when x =0, and negative 
when u <0. Then 0€R? is a sink for u <0 and 
a source for zp >0. Now assume further that 
d/du({Re A(u))| „=o is positive and NER? is a 
“vague attractor.” Then 0eJ is a bifurcation 
point, and there exists an asymptotically stable 
closed orbit for (7) near and around 0eR? 
which depends continuously on yu for y>0 
[37]. Thus a sink of (7) (u<0) changes to a 
source and an asymptotically stable closed 
orbit (u >0) when u changes its sign. The Hopf 
bifurcation theorem can be generalized to a 
higher-dimensional case, and there is a dif- 
feomorphism version of the theorem. 

(2) Bifurcations in more general settings 
have been investigated by many mathema- 
ticians, including Thom, Arnold, R. J. Sacker, 
G. R. Sell, D. H. Sattinger, G. Iooss, Ruelle, 
and Takens. Generic bifurcations of dynamical 
systems have been investigated by J. Soto- 


xeX and peJ. (7) 
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mayer, Meyer, P. Brunovsky, and others; 
bifurcations of Morse-Smale systems by New- 
house, Palis, Peixoto, and S. Matsumoto; 

and bifurcations of Axiom A diffeomorphisms 
by Newhouse and Palis. 


N. Miscellaneous Topics 


(1) Let S* be the 3-dimensional sphere and 
VeX'(S°*). H. Seifert proved that if V is suffi- 
ciently close (in C? topology) to a nonsingular 
vector field tangent to the fibers of the tHopf 
fibration S?—S?, then V has a closed orbit. He 
conjectured that every nonsingular vector field 
VeX'(S?) had a closed orbit. (Seifert conjec- 
ture; — 154 Foliations D). If a nonsingular 
vector field Ve %1(S*) is transverse to a codi- 
mension 1 foliation of class C?, then it has a 
closed orbit (S. P. Novikov). Let M be a 3- 
dimensional C*”-manifold. Then there exists a 
nonsingular vector field Ve 21(M) with no 
closed orbit in any thomotopy class of a non- 
singular vector field on M (P. A. Schweitzer). 
Thus the Seifert conjecture fails for a vector 
field of class C}, but the conjecture for a vector 
field of class C” (r > 2) is an open problem. 
Related work has been done by Fuller, H. 
Chu, and A. Weinstein. 

(2) Let M be a closed connected C®- 
manifold. A C’-flow (M, ọ) (resp. f € Diff”(M)) 
is a minimal flow (resp. a minimal diffeomor- 
phism) if M itself is a minimal set. If M admits 
a tlocally free S'-action of class C”, then it 
admits a minimal C”-diffeomorphism, and if 
M admits a locally free special (in particular, a 
tfree) T?-action of class C”, then it admits a 
minimal C”-flow (A. Fathi, Herman, A. B. 
Katok). Open problems: What are the topo- 
logical properties of the manifolds admitting 
minimal flows? Does $? admit a minimal flow? 

(3) E. N. Lorenz studied numerical solutions 
of the following nonlinear equations in R? 
which arose from the convection equation: 
dx/dt = — ox + oy, dy/dt = —xz +rx— y, dz/dt= 
xy/bz. When o = 10, r=28, and b=8/3, he 
found irregular behavior in this dynamical 
system. R. M. May studied numerical solu- 
tions of the following difference equation in 
connection with the growth of biological popu- 
lations with nonoverlapping generations: 

Xy41 = 4X (1 — Xp), x E LO, 1] (1 <a <4). He 
found that the dynamica! structure of the 
above difference equation was delicate and 
complicated. Y. Ueda and H. Kawakami also 
found similar phenomena in their numerical 
study of Duffing’s equation of the type d? x/dt? 
+kdx/dt + x? = Bcost. The phenomena ob- 
served in these investigations were called 
chaos, which exhibits strange attractors. 

These investigations have attracted the 
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attention of many mathematicians and scien- 
tists. For 1-dimensional semidynamical sys- 
tems such as May’s equation, T. Y. Li, J. A. 
Yorke, A. N. Sharkovskii, J. W. Milnor, Thur- 
ston, and many others have obtained notable 
results, while for Lorenz’s equation we have 
results by Ruelle, Guckenheimer, Williams, 
Sinai, and many others. Chaos arising from 
discretization of differential equations has been 
studied by M. Yamaguti, S. Ushik1, and others 
(— 433 Turbulence and Chaos). 
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A. General Remarks 


There are two types of multistage tdecision 
processes. In one of them, an outcome of the 
whole process is determined at the final stage 
without any consideration of the outcome for 
each intermediate stage. The textensive form 
of a game is of this type. In the other type, 
an outcome is assigned at each stage of a 
multistage decision process, The theory of 
dynamic programming, dealing with this latter 
type, has been developed by R. Bellman and 
others since 1950 and is now one of the fun- 
damental branches of mathematical program- 
ming, along with the theories of linear and 
nonlinear programming. The following exam- 
ples illustrate some features of multistage deci- 
sion processes. 

Multistage allocation process. We are given 
a quantity x >Q that can be divided into two 
parts y and (x—y). From y we obtain a return 
g(y), and from (x— y) a return h(x —y). In so 
doing, we expend a certain amount of our 
original resources and are left with a new 
quantity, ay + b(x—y), 0<a, b<1, with which 
the process is continued. How do we proceed 
so as to maximize the total return obtained in 
a finite or unbounded number of stages? 

Multistage choice process. Suppose that we 
possess two gold mines A and B, the first of 
which contains an amount x of gold, while the 
second contains an amount y. In addition, we 
have a single gold mining machine with the 
property that if used to mine gold in the mine 
A, there is a probability P, that it will mine 
a fraction r, of the gold there and remain in 
working order, and a probability 1 — P, that it 
will mine no gold and be damaged beyond 
repair. Similarly, the mine B has associated 
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with it the corresponding probabilities P, and 
1 — P, and fraction r,. How do we proceed in 
order to maximize the total amount of gold 
before the machine is defunct? 

These two processes have the following 
features in common: (1) In each case we have a 
physical system characterized in any state by a 
small set of parameters, the state variables. (2) 
In each state of either process we have a choice 
of a number of decisions. (3) The effect of a 
decision is a transformation of the state vari- 
ables. (4) The past history of the system is of 
no importance in determining future actions 
(Markov property). (5) The purpose of the 
process is to maximize some function of the 
state variables. 

A policy is a rule for making decisions that 
yields an allowable sequence of decisions; an 
optimal policy is a policy that maximizes a 
preassigned function of the final state vari- 
ables. A convenient term for this preassigned 
function of the final state variables is criterion 
function. One of the characteristic features of 
Bellman’s methodology of dynamic program- 
ming is the appeal to the principle of optimal- 
ity: An optimal policy has the property that 
whatever the initial state and initial decision 
are, the remaining decisions must constitute 
an optimal policy with regard to the state 
resulting from the first decision. 

In the multistage allocation process the state 
variables are x (the quantity of resources) and 
z (the return obtained up to the current stage). 
The decision at any stage consists of an alloca- 
tion of a quantity 0< y <x. This decision has 
the effect of transforming x into ay+ b(x— y) 
and z into z+ g(y)+h(x—y). The purpose of 
the process is to maximize the final value of z. 
Denote by f,(x) the N-stage return obtained 
starting from an initial state x and using an 
optimal policy. Then we have 


fi(x)= max {g(y)+hLx— y)], 
falx)= max [g(y) + h(x — y) 


+ f,-1(ay+b(x—y))],  nB2. 


This recurrence relation yields a method for 
obtaining the sequence { f,(x)} inductively. 

In the stochastic gold-mining process, the 
state variables are x and y (the present level 
of the two mines) and z (the amount of gold 
mined to date). The decision at any stage con- 
sists of a choice of A and B. If A is chosen, 

(x, y) goes into ((1—r,)x, y) and z into z+r,x, 
and if B is chosen, (x, y) goes into (x, (1 —r3)}y) 
and z into z+r,y. The purpose of the process 
is to maximize the expected value of z ob- 
tained before the machine becomes defunct. 

Denote by f(x, y) the expected amount of 
gold obtained using an optimal sequence of 


fs, »)=mnax} 
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choice. Then we have 


piirix+ f( ~ry)x, Hi 
Pa{ray+f(x,(l—ry)y)} | 


The optimal policy can be described in the 
following way. We choose A or B according as 
pir, x/(1—p,) is greater or less than p,r,y/(1— 
p2). We can choose either A or B if equality 
holds. After an operation according to such a 
choice, the machine may become defunct and 
terminate the process. If the machine is usable, 
then we can apply our policy to a new com- 
bination of the amounts of gold in A and B. 


B. Discrete Deterministic Processes 


By a deterministic process we mean a process 
in which the outcome of a decision is uniquely 
determined by the decision. We assume that 
the state of the system, apart from time de- 
pendence, is described in any stage by an M- 
dimensional vector p constrained to lie within 
some region D. Let T={T,} (where q runs over 
a set S) be a set of transformations with the 
property that pe D implies that T,(p)eD for all 
qéS, i.e., any transformation T, carries D into 
itself. The term “discrete” signifies here that we 
have a process consisting of a finite or de- 
numerably infinite number of stages. A policy, 
for the finite process which we consider first, 
consists of a selection of N transformations 

in order, P=(T,, T3, ..., Ty), yielding succes- 
sively the sequence of states p;= T,{p;-,) (i= 
2,3,...,N) with p, = T,(p). These transfor- 
mations are to be chosen to maximize a given 
function R of the final state py. Observe that 
the maximum value of R(p,y), as determined 
by an optimal policy, will be a function of the 
initial vector p and the number N of stages 
only. Let us then define our basic auxiliary 
functions f,({p)=max R(py)=the N-stage re- 
turn obtained starting from an initial state p 
and using an optimal policy. This sequence is 
defined for N=1,2,...,and pe D. The essential 
uses of the principle of optimality can be ob- 
served from the following two features. The 
first is the use of the embedding principle. The 
original process is embedded in a family of 
similar processes. In place of attempting to 
determine the characteristics of an optimal 
policy for an isolated process, we attempt to 
deduce the common properties of the set of 
optimal policies possessed by the members of 
the family. The second feature is the derivation 
of recurrence relations by which the functional 
equations connecting the members of the se- 
quence { f,(p)} are established. Assume that 
we choose some transformation T, as a result 
of our first decision, obtaining in this way a 
new state vector T,(p). The maximum return 
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from the following k —1 stages is, by definition, 
Se-1(T,(p)). It follows that, if we wish to maxi- 
mize the total k-stage return, g must now be 
chosen to maximize this (k — 1)-stage return. 
The result is the basic recurrence relation 
f,.(p)=Max,es fy_1(T,(p)), for k> 2, with f,(p)= 
max,.5R(T,(p)). For the case of an unbounded 
process, the sequence { f,(p)} is replaced by a 
single function f(p), the total return obtained 
by using an optimal policy starting from state 
p, and the recurrence relation is replaced by 
the functional equation f(p)=max, f(7,(p)). 


C. Discrete Stochastic Processes 


We again consider a discrete process, but one 
in which the transformations are stochastic 
rather than deterministic. The initial vector p 
is transformed into a stochastic vector z with 
an associated distribution function dG,(p, z) 
dependent on p and the choice of q. We as- 
sume that z is known after the decision has 
been made and before the next decision is to 
be made. We agree to measure the value of 

a policy in terms of some average value of 
the function of the final state. Let us call this 
expected value the return. Beginning with 

the case of a finite process, we define f,(p) as 
before. The expected return as a result of the 
initial choice of T} is therefore 


| Jr-ı(2)dG,(p, 2). 


Consequently, the recurrence relation for the 
sequence { f,(p)} is 


fp) =max | f,-1(2)dG,(p, 2), k22, 


with fi (p) =MaXges Jeep R(z) dG,(p, z). Con- 
sidering the unbounded process, we obtain the 
functional relation 


foma | f(z)dG,(p, z). 


D. Continuous Deterministic Processes 


A number of interesting processes require that 
decisions be made at each point of a con- 
tinuum, such as a time interval. The simplest 
examples of processes of this character are 
provided by the tcalculus of variations. Let 

us denote by f(p; T) the return obtained over 
a time interval [0, T] starting from the ini- 
tial state p and employing an optimal policy. 
Although we consider the process as one con- 
sisting of choices made at each point t on 

(0, T], it is better to begin with the concept of 
choosing policies (functions) over intervals, 
and then pass to the limit as these intervals 
shrink to points. Let d be an allowable deci- 
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sion made over the interval [0, S], and let p; be 
the state at S starting from the initial state p 
and employing d. The application of the prin- 
ciple of optimality suggests that 


I(p.S+ Nesp iad) (1) 


where the supremum is taken over the set D of 
all allowable decisions d. 

The limiting form of (1) as S0 is a non- 
linear partial differential equation (Bellman 
partial differential equation). This expression 
is important for use in actual analysis. For 
numerical purposes, S is kept nonzero but 
small. R. Bellman showed that it is possible 
to avoid many of the quite difficult rigorous 
details involved in this limiting procedure if 
we are interested only in the computational 
solution of variational processes. 


E. Markovian Decision Processes 


Consider a physical system which at any of 
the times t=0, A, 2A, ... must lie in one of the 
states S,,S,,...,S,. Let y;(n) be the probability 
that the system is in S; at times nA, and let P, 
be the probability that the system is in state S, 
at t+A if it is in state S; at time t. We suppose 
that the transition probabilities P,; are inde- 
pendent of t. We assume that the P, depend 
on a parameter q, which may be a vector, and 
that at each stage of the process q is to be 
chosen so as to maximize the probability that 
the system is in the state S,. We obtain the 
nonlinear system 


yı(n+1)= max ¥ pyl )y(n) 


y(n) pis (4"); 


II 
it 


2 


y(n + 1)= > PilQ*) y(n), 


i=2,3,...,N 


where g* = q*(n) in the remaining N — 1 equa- 
tions is one of the values of q that maximize 
y,(n+ 1). There are similar processes that can 
be considered as continuous analogs of this 
type of decision process. These are called Mar- 
kovian decision processes and were discussed 
by Bellman. There is, however, another type 
of Markovian decision process in which a re- 
ward is given at each stage. For each state S; 
of the system there are k alternatives 1, 2, 3, 
.,k. If we choose the alternative h among 
these k alternatives, then the transition prob- 
abilities pi! (j=1,2,...,n) are determined, and 
a reward r$ is associated with each state S. 
Let us denote by v;{n) the total expected re- 
turn obtained at the nth stage by appealing 
to an optimal policy when the initial state is S;. 
Then the principle of optimality in the theory 
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of dynamic programming yields 
N 
v(n+1)= max ( x piri; + 7) 
E 


A policy-iteration method invołving a 
value-determination operation with a policy- 
improvement routine was given by R. A. 
Howard [4]. 


F. Dynamic Programming and the Calculus of 
Variations 


Problems in the calculus of variations can be 
viewed as multistage decision problems of a 
continuous type. It can be shown that the 
dynamic-programming approach yields formal 
derivations of classical necessary conditions 
for the calculus of variations. Let us consider 
the problem of minimizing the functional 


b 
J(y)= | L(x, y(x), y(x))dx, 
where the function y is subject to y(a)= 

We embed this problem within the family of 
problems generated by allowing a and c to be 
parameters with the ranges of variation —c < 
a<b, —œ <c< œ. Now we define the optimal 
value function S(a, c)=min, J(y). Then the 
principle of optimality yields the functional 
equation 


S{a, c) 


a 


=min (|. fisy, y)ds+ Sla+-A, cl), 


where the minimization is taken over all func- 
tions defined over [a,a+A] with y(a)=c and 
c(y)= yla + A). Then, writing v= y’'(a), we get 


os os 
spe n (re Cs nto). 
This yields the Euler equation, the Legendre 
condition, the Weierstrass condition, and 
the Erdman corner conditions. Furthermore, 
it can be shown that the functional equa- 
tion characterization yields the Hamilton- 
Jacobi partial differential equation of classical 
mechanics. 

The dynamic-programming approach can 
be applied to more general problems in the 
calculus of variations. 


G. Dynamic Programming and the Maximum 
Principle 


In general, the method of dynamic program- 
ming carries a more universal character than 
the maximum principle of optimal control 
theory. However, in contrast to the latter, this 
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method does not have a rigorous logical foun- 
dation. V. G. Boltyanskii [6] has presented a 
justification of the dynamic programming 
method. 

Let f;(x, u) (i=0, 1, ..., n) be defined for xe 
VCR" and ue UCR’, where V is an open set, 
and continuously differentiable on V x U. 
Suppose that two points x° and x! are given in 
V. Among all the piecewise continuous con- 
trols u(t)=(u,(0), ...,u,(t))e U which transfer 
the phase point moving in accordance with 


dx 


a =fi(x,u(t)), i=1,...,n, 


from x? = x(t) to x! = x(t,), find the control 
u(t) for which the functional 


s= |” fs Jolx(t), u(t))dt 


takes the smallest value. 

A continuous function @(x)=@(x,,...,X,) 1S 
called a Bellman function relative to a point 
ae V if it possesses the following properties: (1) 
(a) =0; (2) there exists a set M (the singular 
set of w{x)), which is closed in V and does not 
contain interior points, such that the function 
w(x) is continuously differentiable on the set 
V—M and satisfies the condition 


ueU 


sup( $ e i, u)— f(x, )=0 , xeV—M. 


The following theorem gives a sufficient opti- 
mality condition. 

Theorem: Assume that for dx/dt = f(x, u(t)) 
given in a region VCR" there exists a Bellman 
function q(x) relative to the point ae V with a 
piecewise smooth singular set. Assume, further- 
more, that for any point x°¢ V there exists 
a control u(t) which transfers the phase point 
from x° = x(t,) to a=x(t,) and satisfies the 
relation 


| “folx(),u(0))dt= — a(x, 


Then any such control u(t) is optimal in V. 

Recently, Vinter and Levis [7] obtained the 
following general result in this connection. The 
sufficient condition is given in terms of a solu- 
tion to the Bellman partial differential equa- 
tion. It is shown that if this equation is modi- 
fied so that it is actually an inequality, and if 
this inequality is required to be satisfied in a 
limiting sense only, then the condition is also 
necessary for optimality. 


H. Characteristic Features of Dynamic 
Programming 


The characteristic features of the dynamic- 
programming approach can be summarized in 


507 127 Ref. 
Dynamic Programming 


the following five points: (1) the advantage of 
lower dimensionality in comparison with the 
enumeration approach; (2) the possibility of 
finding maxima and/or minima of functions 
defined over restricted domains for which 
differential calculus may not work well; (3) the 
availability of numerical solutions in recursive 
form; (4) the possibility of formulating certain 
problems to which classical methods do not 
apply; and (5) the applicability of the method 
to most types of problems in mathematical 
programming, such as finventory and produc- 
tion control, optimal searching, and some 
optimal and adaptive control processes. 
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A. General Remarks 


The term econometrics can be interpreted in 
various ways. In its widest sense, it means 

the application of mathematical methods to 
economic problems and includes mathematical 
economics, mathematical programming, etc. 
However, here we use it to mean statistical 
methods applied to economic analysis. 

The object of econometrics is to provide 
methods to analyze relationships between 
economic variables. We classify these methods 
into four categories according to the types of 
relationships involved: (1) Analysis of causal 
relations: If a set of variables X,,...,X,, affects 
an economic variable Y, we can estimate the 
direction and extent of those effects on Y. 

(2) Analysis of equilibrium: When a set of 
economic variables Y,,..., Y,, is determined 
through a market equilibrium mechanism, we 
can analyze the structure of relationships 
that determines the equilibrium. (3) Analysis 
of correlation: When a set of economic vari- 
ables is affected simultaneously by some (un- 
known) common factors, we can analyze the 
correlation structure of the variables. (4) Ana- 
lysis of time interdependence: A process of 
development in time of a set of economic 
variables can be analyzed. 

There are two types of economic data: (a) 
macroeconomic data, representing quantities 
and variables related to a national economy as 
a whole, usually based on national census; and 
(b) microeconomic data, representing informa- 
tion about the economic behavior of individ- 
ual persons, households, and firms. Macro- 
economic data are usually given as a set of 
time series (— 421 Time Series Analysis A), 
while microeconomic data are obtained main- 
ly through statistical surveys and are given as 
cross-sectional data. These two types of data, 
related to macroeconomic theory and micro- 
economic theory, respectively, require differ- 
ent approaches; and sometimes information 
obtained from both types of data has to be 
combined; obtaining macroeconomic infor- 
mation from microeconomic data is called 
aggregation. 


B. Regression Analysis 


The most common technique for the first 

category of problems is tregression analysis, 
which is applied to both microeconomic and 
macroeconomic analysis. However, there are 
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problems peculiar to economic analysis, where 
a factor can seldom be controlled; and usually 
there are too many highly related independent 
variables. In such cases, if all possible indepen- 
dent variables are taken into the model, the 
accuracy of the testimators of the coefficients 
becomes extremely poor. Such a phenomenon, 
called multicollinearity, brings up the problem 
of selection of independent variables (— 403 
Statistical Models), to which no satisfactory 
solution has been given. Also, assumptions 
about the error terms may be dubious, the 
error terms may be correlated, or the variances 
may be different. If the tvariance-covariance 
matrix of the errors is given, the tgeneralized 
least squares method can be applied, but 
usually such a matrix is not available. 


C. Systems of Simultaneous Equations 


The second category of problems is peculiar 
to economic analysis and applies mainly to 
macroeconomic data. Suppose that Y=(Y;,, 
..., Yo) is a vector consisting of G economic 
variables, among which there exist G relation- 
ships that determine the equilibrium levels of 
the variables. We also suppose that there exist 
K variables Z=(Z,,...,Z,)’ that are indepen- 
dent of the economic relations but affect the 
equilibrium. The variables Y are called endog- 
enous variables, and the Z are called exog- 
enous variables. If we assume linear relation- 
ships among them, we have an expression 
such as 


Y=BY+TZ +u, (1) 


where B and T are matrices with constant 
coefficients and u is a vector of disturbances or 
errors. (1) is called the linear structural equa- 
tion system and is a system of simultaneous 
equations. By solving the equations formally, 
we get the so-called reduced form 


Y=MZ+v, (2) 


where /7=(1—B) tI, v=(I— B) tu. The 
relation of Y to Z is determined through the 
reduced form (2), and if we have enough data 
on Y and Z we can estimate /7. The problem 
of identification is to decide whether we can 
determine the unknown parameters in B and 
T uniquely from the parameters in the re- 
duced form. A necessary condition for the 
parameters in one of the equations in (1) to be 
identifiable is that the number of unknown 
parameters (or, since known constants in the 
system are usually set equal to zero, the num- 
ber of variables appearing in the equation) not 
be greater than K +1. If it is exactly equal to 
K +1, the equation is said to be just identified, 
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and if it is less than K + 1, the equation is said 
to be overidentified. 

If all the equations in the system are just 
identified, for arbitrary /7 there exist unique B 
and T that satisfy /7=(1 — B)~'T. Therefore, if 
we denote the tleast squares estimator of MH by 
IT, we can estimate B and T from the equation 
(I ~ B)/7=T. This procedure is called the 
indirect least squares method and is equivalent 
to the *maximum likelihood method if we 
assume normality for u. 

When some of the equations are over- 
identified, the estimation problem becomes 
complicated. Three kinds of procedures have 
been proposed: (1) full system methods, (2) 
single equation methods, and (3) subsystem 
methods. In full system methods all the para- 
meters are considered simultaneously, and if 
normality is assumed, the maximum likelihood 
estimator can be obtained by minimizing |(Y — 
ITZ) (Y — IZY |. Since it is usually difficult to 
compute the maximum likelihood estimator, a 
simpler, but asymptotically equivalent, three- 
stage least squares method has been proposed. 
The single equation methods and the sub- 
system methods take into consideration only 
the information about the parameters in one 
equation or in a subset of the equations, and 
estimate the parameters in each equation 
separately. There is a single equation method, 
called the limited information maximum like- 
lihood method, based on the maximum likeli- 
hood approach, and also a two-stage least 
squares method, which estimates J first by 
least squares, computes Y = /7Z, and then 
applies the least squares method to the model 


Y=BY4+TZ+a. 


These two and also some others are asymptot- 
ically equivalent. Among asymptotically 
equivalent classes of estimators corresponding 
to different information structures it has been 
established that the maximum likelihood 
estimators have asymptotically higher-order 
efficiency [5] (— 399 Statistical Estimation) 
than other estimators, and Monte Carlo and 
numerical studies show that they are in most 
cases better than others if properly adjusted 
for the biases. 

In many simultaneous equation models 
which have been applied to actual macro- 
economic data, the values of endogenous 
variables obtained in the past appear on the 
right-hand sides of equations (1). Such vari- 
ables are called lagged variables, and they can 
be treated, at least in the asymptotic theory of 
inference, as though they were exogenous. 
Hence exogenous variables and lagged endog- 
enous variables are jointly called predeter- 
mined variables. When many lagged variables 
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appear over many time periods and when 
some structure among the coefficients of those 
lagged variables can be assumed, such a model 
is called a distributed lag model. 

Sometimes it is necessary to include some 
nonlinear equations in the simultaneous equa- 
tion model. Such nonlinear simultaneous 
equation models are difficult to deal with, 
partly because the solution of the equation 
may not be unique, and in practical applica- 
tions ad hoc procedures are applied to obtain 
estimates of the parameters. 


D. Multivariate and Time Series Analysis 


Problems in the third category can be ap- 
proached by tmultivariate analysis techniques. 
Sometimes tprincipal component analysis and 
tcanonical correlation analysis have been 
applied to analyze the variations of a large 
amount of data. However, the practical mean- 
ing of the results obtained is often dubious. 

The fourth category is the problem of time 
series analysis. Sophisticated theories of sto- 
chastic processes have little relevance for 
economic time series, because usually the time 
series do not satisfy such conditions as being 
stationary or having the tMarkov property, 
etc. Recently, however, autoregressive moving 
average ((ARMA) and multivariate ARMA 
models [4] have been applied to macro- 
economic data, especially for the purpose of 
prediction and for determining the direction of 
causal relations (— 421 Time Series Analysis). 
Traditionally, fluctuations of economic time 
series have been thought to consist of trend, 
cyclic variation, seasonal variation, and error. 
Various ad hoc techniques have been used to 
separate or eliminate such components, but 
the theoretical treatment of such problems is 
far from satisfactory. 
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Albert Einstein (March 14, 1879—April 18, 
1955) was born of Jewish parents in the city of 
Ulm in southern Germany. He became a Swiss 
citizen soon after graduating from the Eid- 
gendssische Technische Hochschule of Zürich 
in 1900. Afterward, he obtained a position as 
examiner of patents at Bern, and while at that 
post, he published his theories on light quanta, 
tspecial relativity, and Brownian motion. 
After briefly holding professorships at the 
University of Ziirich and the University of 
Prague, he became a professor at the Univer- 
sity of Berlin in 1913. His general theory of 
relativity was announced in 1916, and in 1921 
he won the Nobel Prize in physics for his 
contributions to theoretical physics. To escape 
Nazi persecution, he fled to the United States 
in 1933, and until his retirement in 1945 he 
was a professor at the Institute for Advanced 
Study at Princeton. He advised President 
Roosevelt of the feasibility of constructing the 
atomic bomb, but after World War II, along 
with others who had been connected with the 
bomb, he was active in promoting the nuclear 
disarmament movement and the establishment 
of a world government. 

The theory of relativity raises fundamental 
epistemological problems concerning time, 
space, and matter. The results of the general 
theory were verified in 1919 by observations of 
the solar-eclipse. 

Through his latter years, Einstein continued 
to work on tunified field theory and on the 
generalization of relativity theory. 
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A. Maxwell’s Equations 


Mathematical formulation of electromagnetics 
leads to tinitial value and tboundary value 
problems for Maxwell’s equations according 
to the geometric nature of the medium. Max- 
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well’s equations for a vacuum are written in the 
form 


€,0E/ot =rotH—J,, Ey divE=p,, 


HoOH/et= —rotE—J,,,  uodivH=p,,, (1) 


where E and H are the electric and magnetic 
field vectors, p, and p,, the electric and mag- 
netic charge densities, J, and J„ the electric 
and magnetic current densities, £ and Ho con- 
stants, and the quantity 1/./&9 {49 =c the speed 
of light in vacuum (2.99797 x 108 m/s). Charge 
and current densities must satisfy the equations 
of continuity 


op,/6t+divJ,=0, 
Op, / Ot + div J, =. (2) 


Following upon the observation that, appar- 
ently, p,,=0 and J,,=0 in nature, we hence- 
forth set them equal to zero. This causes an 
asymmetry between the electric and magnetic 
quantities. On the other hand, the proposition 
“Om =0 and J,,=0” cannot be deduced from 
the classical theory itself. 

In the presence of matter, additional charge 
and current appear due to the electric and 
magnetic polarization P and M of the material. 
Therefore, in this case, it is necessary to make 
the following substitutions in (1): 


Pe p= p.—divP, 
J,~J=J,+6P/ét+rotM, 
H->H+M. (3) 


Moreover, if we define the electric flux density 
(or electric displacement) D and the magnetic 
flux density (or magnetic induction) B by 
D=28E+P, B=yp.(H+M), (4) 
then Maxwell’s equations (1) are transformed 
into | 
0D/ét=rotH—J,, divD=p,, 
0B/ôt = —rotE, divB=0. (5) 


In the electromagnetic field in a vacuum 
there is energy with a density 


U=(E/2)E* + (Uo/2)B’, (6) 


and energy flux with a density expressed by 
the Poynting vector 


S=ExH. (7) 


Between these quantities the following relation 
holds: 


du/dt + divS=0. (8) 


An electric charge g moving with velocity v 
in an electromagnetic field is subject to the 
Lorentz force 


F=qE+ qv xB. (9) 
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This force can be interpreted as being caused 
by the Maxwell stress tensor 


Ty, =(€9/2)(— E; Ep +28; E?) 
+ (ti9/2)(— H; H, + 26H”). (10) 


By introducing the scalar potential V and 
the vector potential A, we can express the field 
vectors as 


B=rotA, E=-—gradV—éA/ét. (11) 


Furthermore, if we impose an auxiliary con- 
dition (Lorentz condition) 


then we obtain from (1) the wave equations 
LV=—p./&, ODAS = Hode, (13) 


where [] = A — £o Ho0%/ôt? is called the ¥ Alem- 
bertian and is sometimes written as ©7. From 
(13) we conclude that the electromagnetic 

field can propagate in a vacuum as a wave 
with speed c= 1/\/ £o Ho. In tquantum theory 
the potentials V and A are regarded as being 
more fundamental than E and H themselves. 
However, they are not uniquely determined, in 
the sense that the gauge transformation 


V-V+eéyp/et, A>A—grady (14) 


with an arbitrary function wy of the space and 
time variables does not affect the fields. 
Maxwell’s equations are invariant under the 
tLorentz transformation. Therefore they can 
be written in 4-dimensional tensor form (— 
359 Relativity C). 
Maxwell’s equations can be regarded as 
the twave equations for tbosons with spin 1 
(‘photons). The equations of quantum electro- 
dynamics are obtained if we regard the field 
quantities as quantum-mechanical vari- 
ables (g-numbers) and then perform second 
quantization. 


B. Concrete Problems 


In solving Maxwell’s equations concretely, 
we usually make additional assumptions for 
polarizations, electric current, and field vectors 


P=y.E, M=y,H, J.=oE, (15) 


where Ye, Xm, and o are called the electric sus- 
ceptibility, magnetic susceptibility, and conduc- 
tivity, respectively. Then we have 


D=cE, B=unzH, (16) 


where £= £p + Xe Is the dielectric constant and 
U= Holl + Xm) is the magnetic permeability. 
Therefore the equations (5) become identical to 
(1) if £% and Ho are replaced by £ and p, respec- 
tively. (p,, and J„ are set equal to zero.) 
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Some cases of practical importance are 
given below. 

(1) Electrostatics. If the fields are time- 
independent and there is no electric current, 
then E and H are mutually independent. The 
static electric field is calculated from the solu- 
tion of the boundary value problem of the 
*Poisson equation AV = — p,/e. Specifically, V 
takes a constant value in each conductor. 

(2) Magnetostatics. For zero electric current, 
the problem of magnetostatics is solved in the 
same way as in electrostatics. For the case of 
nonvanishing stationary electric current, the 
problem is reduced to that of solving 


AA=—pJ,,  divA=0. (17) 


(3) Electric current in a conductor. A station- 
ary electric current in a conductor is governed 
by the equation of continuity div J =0, Ohm’s 
law J=oE, and a special case of Maxwell’s 
equation rot E=0. The electric current pro- 
duces heat (Joule heat) proportional to J-E 
per unit volume per unit time (Joule’s law). 
For certain substances, the specific resis- 
tance ao’ suddenly becomes negligibly small 
below a critical temperature. This is called 
superconductivity. 

(4) Quasistationary electric circuit. The 
problem appearing most often in electrical en- 
gineering is that of a quasistationary circuit. 
Its characteristic feature is that the electric 
currents exist only in the circuit elements 
(inductors, capacitors, and resistors) and in the 
lines connecting them. The current (both J, 
and 0D/ét) can be neglected in all other parts 
of the system. (This could be compared to 
the situation in dynamics where we consider 
systems of material points or of rigid bodies 
having a finite number of degrees of freedom, 
although every material body is essentially 
a continuum.) The system network is con- 
structed as a flinear graph with the circuit ele- 
ments as its branches. Topological tnetwork 
theory deals with the relation between the 
structure of the linear graph and the electri- 
cal characteristics of the network, whereas 
function-theoretic network theory deals with 
the relation between current and voltage at 
each part of the network. In the latter theory, 
current and voltage are considered as func- 
tions of the frequency of the sinusoidal alter- 
nating voltage applied to some point of the 
network. Together these constitute a unique 
theoretical system in engineering mathe- 
matics for designing system networks (— 282 
Networks). 

(5) Theory of electromagnetic waves. The 
theory of electromagnetic waves deals with the 
case where the changes of all field quantities 
are proportional to e'', and in addition the 
frequency w is so large that the term ôD/ôt in 
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(5) is of the same order as or larger than J,. In 
such a situation the electromagnetic field 
behaves like a wave. 

Problems of various types arise depending 
on the geometry of the conducting and dielec- 
tric substances, on the type of the energy 
source, etc. Important problems are: (i) radi- 
ation of a wave from a point source into free 
space; (ii) scattering of a plane wave by small 
bodies or cylinders; (iii) diffraction of a wave 
through holes in a conducting plate; (iv) reflec- 
tion and refraction of a wave at the boundary 
between different media; (v) wave propagation 
along a conducting tube (wave guide); and 
(vi) resonance of the electromagnetic field in a 
cavity surrounded by a conducting substance. 
Theoretical treatment similar to that for ordi- 
nary networks is possible for microwave cir- 
cuits consisting of wave guides, cavities, etc. 

(6) Wave guides. For electromagnetic waves 
with a harmonic time dependence e‘” propa- 
gating inside a hollow tube of uniform cross 
section extending in the z-direction with per- 
fectly conducting walls (called a wave guide), 
the Maxwell equations become 


rot E = iwB, div B=0, 


rotB=—ipewE, divE=0, (18) 
with the boundary condition 


nxE=0, n-B=0, 


where n is a unit normal at the boundary 
surface. A further reduction is gained by 
Fourier analysis in the z-variable. For har- 
monic z-dependence e'**, the transverse com- 
ponents E, =(e, x E) x e, and B, =(e, x B) xe, 
are determined from the z-components E, = 
e,‘ E and H,=e,-H by the following part of 
the Maxwell equations: 


ikE, + ime, x B,= grad, E., 
ikB, —ipewe, x E, = grad, B, (19) 


(where grad, is the transverse components of 
the gradient), up to the solutions for E, = B, 
=0, called transverse electromagnetic (TEM) 
waves, for which k= w/e, B,= +./pe e, x 
E,, and E, is a solution of the electrostatic 
problem in two dimensions rot E,=0, div E, 
=0. To have E, #0 for the TEM solution, it is 
necessary to have two or more surfaces, such 
as a coaxial cable (region between two con- 
centric cylinders) or a parallel-wire trans- 
mission line. Nonzero longitudinal compo- 
nents E, and B, are determined from the 
2-dimensional equations 


a 
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with boundary conditions E, =0 and ĝB,/ôn 
=0 on the wall, where (3/2n) denotes the 
partial derivative in the normal direction. The 
solutions with B,=0 are called transverse 
magnetic (TM) waves (or electric (E) waves) 
and those with E, =0 transverse electric (TE) 
waves (or magnetic (M) waves); for each of 
these cases the equations determine an eigen- 
wave number k for a given angular frequency 
œ (typically in a waveguide situation) or 
eigenfrequency v= «w/(2z) for a given wave- 
length 2 =2z/k (typically in a resonant cavity 
situation). 
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A. Definition 


A function of a finite number of real or com- 
plex variables that is talgebraic, exponential, 
logarithmic, trigonometric, or inverse trig- 
onometric, or the composite of a finite number 
of these, is called an elementary function. Ele- 
mentary functions comprise the most common 
type of function in elementary calculus. 

J. Liouville [1] defined the elementary func- 
tions as follows: An algebraic function of a 
finite number of complex variables is called an 
elementary function of class 0. Then e* and 
logz are called elementary functions of class 1. 
Inductively, we define elementary functions of 
class n under the assumption that elementary 
functions of class at most n—1 have already 


515 


been defined. Let g(t) and g(w,,...,w,) (1 < 
j<m) be elementary functions of class at most 
1 and f(z,,...,2,,) be an elementary function 
of class at most n—1. Then the composite 
functions g{f(Z,,...,Zm)) and f(g, (Wy, .--5Wp)s 

-+Im{(W1,---,W,)) (and only such functions) 
are called the elementary functions of class at 
most n. An elementary function of class at 
most n and not of class at most n — 1 is called 
an elementary function of class n. A function 
that is an elementary function of class n for 
some integer n is called an elementary function. 
In this article, we explain the properties of the 
most common elementary functions. 


B. Exponential and Logarithmic Functions of a 
Real Variable 


Let a>0, a#1. A function f(x) of a real vari- 
able satisfying the functional relation 


Fxty=fOfO),  f(=a, (1) 


satisfies f(n)=a" for positive integers n and 
f(—n)=1/a" for negative integers —n. In gen- 
eral, f(n/m)= “at for every rational number 
r=n/m. If we assume that f(x) is continuous, 
then there is a unique strictly monotone func- 
tion f(x) defined in (—0o, 00) whose range is 
(0, œ). The function f(x) is called the exponen- 
tial function with the base a and is denoted by 
a*, read “a to the power x” and also called a 
power of a with exponent x. Its inverse function 
is called the logarithmic function to the base a, 
and is denoted by log, x. The specific value 
log, x is called the logarithm of x to the base a. 
If g(x) =log, x, we have 


g(a)=1. (2) 


Hence we have xy= f(g(x)+g(y)). Therefore 
we can reduce multiplication to addition by 
using a numerical table for the logarithmic 
function. 


g(xy)=g(x) + 9(y), 


C. Logarithmic Computation 


The logarithm to the base 10 is called the 
common logarithm. If two numbers x, y ex- 
pressed in the decimal system differ only in the 
position of the decimal point (i.e., y=x- 10" for 
an integer n), they share the same fractional 
parts in their common logarithms. The in- 
tegral part of the common logarithm is called 
the characteristic, and the fractional part is 
called the mantissa. (We note that the word 
“mantissa” is also frequently used for the frac- 
tional part a in the ffloating point representa- 
tion x=a- 10", 10°’ <a<1, or 1<a<10.) The 
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common logarithms of integers have been 
computed and published in tables. 


D. Derivatives of Exponential and Logarithmic 
Functions 


The function f(x)=a* is differentiable, and 
S'(x)=k, f(x), where k, is a constant deter- 
mined by the base a. If we take the base a to 
be 


lim { 1 Nas 2.71828 
e= im ( +) se err : wie 
then we have k,=1. The number )(1/v!) is 
usually called Napier’s number and is denoted 
by e after L. Euler (his letter to C. Goldbach 
of 1731; — Appendix B, Table 6). In 1873, 

C. Hermite proved that e is a transcendental 
number. We sometimes denote e* by exp x; the 
term exponential function usually means the 
function exp x. The function e* is invariant 
under differentiation, and conversely, a func- 
tion invariant under differentiation necessarily 
has the form Ce*. The logarithm to the base e 
is called the Napierian logarithm (or natural 
logarithm), and we usually denote it by log x 
without explicitly naming the base e (some- 
times it is denoted by In x). The derivative 

of log x is 1/x, hence we have the integral 
representation 


logx= |“ (3) 


1 X 


The constant factor k, in the derivative of a” 
is equal to loga. The graphs of y=e* and 
y=logx are shown in Fig. 1. The functions e” 
and log(1 +x) are expanded in the following 
Taylor series at x =0: 


É (4) 


y 


logt += $ (PH, (5) 


v=1 v 


The power series in the right-hand sides of (4) 
and (5) are called the exponential series and 
the logarithmic series, respectively. The radii 
of convergence of (4) and (5) are œ and 1, 
respectively. 
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E. Trigonometric and Inverse Trigonometric 
Functions of a Real Variable 


The trigonometric functions of a real variable 
x are the functions sin x, cos x (— 432 Trig- 
onometry) and the functions tan x = sin x/cos x, 
cot x =cos x/sin x, sec x = 1/cos x, and cosec x 
=1/sinx derived from sin x and cos x. The 
derivatives of sin x and cos x are cos x and 
—sin x, respectively. They have the following 
Taylor expansions at x =0: 


g (=I) 


: — 2v+1 

sin x= 2 Ore D“ ; (6) 
eGP a 

aa Oy (7) 


The radii of convergence of (6) and (7) are both 
oon 

The inverse functions of sin x, cos x, and 
tan x are the inverse trigonometric functions 
and are denoted by arcsin x, arc cos x, and 
arc tan x, respectively. (Instead of this nota- 
tion, sin™! x, cos”! x, and tan! x are also 
used). These functions are infintely multiple- 
valued, as shown in Fig. 2. But if we restrict 
their ranges within the part shown by solid 
lines in Fig. 2, they are considered single- 
valued functions. To be more precise, we re- 
strict the range as follows: —2z/2<arcsinx < 
n/2, 0<arccosx<a, —2/2<arctanx <n/2. 


y yY=arctans 


=arcsin X 
‘ 


Fig. 2 


The functions having these ranges are called 

the principal values and are sometimes denoted 

by Arcsin x, Arecos x, and Arc tan x, respec- 
tively. The derivatives of these functions are 
(=x?) 2, ~(1—x?) t, (1 +x?) t}, respec- 
tively (— Appendix A, Table 9.1; for the Tay- 
lor or Laurent expansions of tan x, cot x, 
sec x, cosec x, arcsin x, arccos x, arctan x, etc., 
— Appendix A, Table 10.IV). 


F. Hyperbolic Functions 


Let P be a point on the branch of the hyper- 
bola x? — y?=1, x>0, and let O be the origin 
and A the vertex (1,0) of the hyperbola. De- 
note by 0/2 the area of the domain surrounded 
by the line segments OA, OP, and the arc AP 
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of the hyperbola. Then we define the coordi- 
nates of P to be (cosh 8, sinh 0) as functions of 
0. We have 


cosh x =(e* +e *)/2, 
sinh x = (e*—e*)/2, (8) 


called the hyperbolic cosine and hyperbolic sine, 
respectively. As in the case of trigonometric 
functions, we define the hyperbolic tangent by 
tanh x = sinh x/cosh x, the hyperbolic cotangent 
by coth x =cosh x/sinh x, the hyperbolic secant 
by sech x = 1/cosh x, and the hyperbolic cose- 
cant by cosech x = 1/sinhx. They are called 

the hyperbolic functions. The graphs of sinh x 
and cosh x are shown in Fig, 3. The trigono- 
metric functions are sometimes called circular 
functions. 


We now introduce the Gudermannian (or 
Gudermann function): 


0=gdu=2arctane"—27/2, 


1+sin0 


1 
u=gd~' 0=log|tan 0 + sec 6| =-log —-—.. 
2 ` 1—sin0 


Then the hyperbolic functions can be ex- 
pressed in terms of the trigonometric func- 
tions. For example, 


sinhu=tan@, coshu=sec, tanhu=sinð. 


G. Elementary Functions of a Complex 
Variable (— Appendix A, Table 10) 


(1) Exponential function. The power series (4) 
converges for all finite values if we replace x by 
the complex variable z and gives an tentire 
function of z with an tessential singularity at 
the point at infinity. This is the exponential 
function e7 of a complex variable z. It satisfies 
the addition formula (1), e7:*72 =e7'e72, and it 
is also the tanalytic continuation of the ex- 
ponential function of a real variable. For a 
purely imaginary number z=iy, we have the 
Euler formula 


e? =cosy+isiny. (9) 


The function w=e’ gives a tconformal map- 
ping from the z-plane to the w-plane, as shown 
in Fig. 4, which maps the imaginary axis of 
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the z-plane onto the unit circle of the w-plane 
(w=u+ iv). For z=x + iy (x, y are real num- 
bers), we have e7 =e*e'”; hence e7 is a tsimply 
periodic function with fundamental period 2zi. 


JY 


w-plane 


(2) Logarithmic function. The logarithmic 
function logz of a complex variable z is the 
inverse function of e7. It is an infinitely 
multiple-valued analytic function that has 
tlogarithmic singularities at z =0 and z = œ. 
All possible values are expressed by logz 
+ 2nzi (n is an arbitrary integer), where we 
select a suitable value log z. The principal value 
of logz is usually taken as logr + iĝ, where z 
=re”® (r=|z|, 0 is the argument of z) and 0< 0 
<2zx. (Sometimes the range of the argument is 
taken as —n <0 <n.) The principal value of 
logz is sometimes denoted by Logz. The 
power series (5) gives one of its ffunctional 
elements. The integral representation (3) holds 
for a complex variable z. The multivalency of 
logz results from the selection of a contour of 
integration; the integral of 1/z around the 
origin is 2zi, which is the increment of logz. 

(3) Power. The exponential function a’ for 
an arbitrary complex number a is defined to 
be exp(zloga). Similarly, z“ is defined to be 
exp(alogz). The function z^ is an algebraic 
function if and only if a is rational. In other 
cases, the function z“ is an elementary function 
of class 2. 

(4) Trigonometric functions, inverse trig- 
onometric functions, hyperbolic functions. 

The trigonometric, inverse trigonometric, and 
hyperbolic functions of a complex variable are 
defined by the analytic continuations of the 
corresponding functions of a real variable. For 
example, sin z and cosz are defined by the 
power series (6) and (7), respectively. They are 
entire functions whose zero points are nz and 
(n —4)z (n is an integer), respectively. They are 
also represented by tWeierstrass’s infinite 
product (— Appendix A, Table 10.VI). 

The functions tanz, cotz, sec z, and cosecz 
are tmeromorphic functions of z on the com- 
plex z-plane, and they are expressed by 
tMittag-Leffler partial fractions (— Appendix 
A, Tables 10.IV, 10.V). As can be shown from 
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(8) and (9), we have 


cosz=(e¥ +e7")/2,  sinz=(e¥—e~*")/2i, 


coshz=cosiz, sinh z= (sin iz)/i. (10) 


Each of these formulas (10) is called an Euler 
formula. For a complex variable, the trig- 
onometric and hyperbolic functions are com- 
posites of exponential functions, the inverse 
trigonometric functions are composites of 
logarithmic functions, and all of them are 
elementary functions of class 1. The definition 
of elementary functions by Liouville described 
in Section A refers, of course, to the functions 
of a complex variable. We remark that the 
inverse function of an elementary function is 
not necessarily an elementary function. For 
example, the inverse function of y=x—asinx 
is not an elementary function (— 309 Orbit 
Determination B). 

The derivative of an elementary function is 
also an elementary function. However, the 
tprimitive function of an elementary function 
is not necessarily an elementary function. The 
primitive function of a rational function or an 
algebraic function of tgenus 0 is again an 
elementary function. Similar properties hold 
for rational functions of trigonometric func- 
tions. Liouville [1] carried through a deep 
investigation of the situation where the in- 
tegral of an elementary function is also an 
elementary function. 
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A. Introduction 


The word “atom” is derived from the Greek 
word for indivisible. It turns out that an atom 
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is divisible into its constituent nucleus and 
electrons. The nucleus, in turn, consists of 
protons and neutrons (together called nu- 
cleons). Photons (quanta of electromagnetic 
waves), electrons, protons, and neutrons (de- 
noted by y, e, p, and n, respectively), along 
with many other subsequently discovered sub- 
nuclear particles, are called elementary parti- 
cles, while nuclei, atoms, and molecules are the 
composite particles composed of these elemen- 
tary particles. 

States of a particle form an irreducible (uni- 
tary) representation of the proper inhomoge- 
neous Lorentz group with positive energy (or 
a finite direct sum of such representations). 
Thus a mass (m>0) and a spin (j =0,4, 1,...) 
are assigned to each particle (— 258 Lorentz 
Group). For example, e, p, and n have spin 4 
and nonzero masses, while y has spin 1 and 
zero mass. 

Many elementary particles are unstable, 
decaying into other particles. The average 
lifetime is denoted by zq and its inverse is called 
the half-width. The time in which half of many 
samples of the same particle decays is called 
the half-life, given by tlog2. For example, 
electrons, protons, and photons are supposed 
to be stable (or at least to have very long life- 
times), while a neutron decays into a proton, 
an electron, and a neutrino v (f-decay) with a 
lifetime of about 15 minutes. 

From the study of the relativistic equations 
(Dirac equations) for wave functions of an elec- 
tron, P. A. M. Dirac predicted the existence of 
particles with the same mass as the electron, 
but of opposite electric charge (Dirac’s hole 
theory, 1930). These were discovered in 1932 
and called positrons. Every elementary particle 
is now believed to be associated with an anti- 
particle characterized by the opposite sign of 
the particle’s additive quantum numbers, the 
two being connected by the tPCT theorem. 
Hence the positron is the antiparticle of the 
electron. The antiproton, theoretically ex- 
pected for a long time and experimentally 
found in 1955, is the antiparticle of the proton. 
Antiprotons, antineutrons, and positrons are 
constituents of antimatter. The particles whose 
additive quantum numbers are all invariant 
under change of sign, such as photons (y), neu- 
tral pions (7°), etc., seem to be antiparticles of 
themselves and are said to be self-conjugate. 

Elementary particles have four distinct types 
of interaction: gravitational, weak, electro- 
magnetic, and strong, in increasing order of 
interaction strength. Gravitational and electro- 
magnetic interactions were recognized in 
earlier centuries because these interactions are 
of long range. A. Einstein put forward the idea 
of a light quantum or photon as a lump of 
electromagnetic energy behaving like a par- 
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ticle. Whether a corresponding quantum (grav- 
iton) exists for gravitational interactions is a 
question related to the existence of gravita- 
tional waves themselves, and is not yet settled. 

The nuclear force is an example of a strong 
interaction and is studied to elucidate nuclear 
structure and to derive the tcross sections of 
various collision processes involving nuclet. H. 
Yukawa predicted in 1935 the existence of a 
particle associated with the nuclear force, just 
as photons are associated with the electro- 
magnetic interaction. Its mass was predicted, 
from the range of the nuclear force, to be 
about 200 times the electron mass, which is 
intermediate between the masses of electrons 
and nucleons, and hence the particle was 
named a mesotron or a meson. These were 
found in cosmic rays in 1947 and, in fact, it 
was found that the mesons relevant to the 
nuclear force (now called pions and denoted 
by x*, x, n? according to their electric charge) 
decay into other kinds of particles called 
muons (denoted by u*, u~) with a charged- 
pion lifetime of 2.6 x 1078 sec (n+ >p* 4+ v(¥); 
the neutral pion z? decays, over a shorter life- 
time, into photons). The muons then decay 
into electrons and neutrinos (u* >et +v+7, 
v indicating antineutrinos) with muon life- 
times of about 2.2 x 1076 sec. Weak inter- 
actions are relevant to these decays as well as 
to the B-decay of the nucleus and the neutron. 
Since 1962 electron neutrinos v, and muon 
neutrinos v, have been distinguished in these 
decays, so that electron and muon numbers 
may be conserved. 

Since 1949, many new particles (unstable 
under weak interactions) have been gradually 
found in cosmic rays; these are called strange 
particles. Some of the early ones are hyperons 
(A, X,&), of spin 4, and kaons (K, K), of spin 0. 
For such strange particles, the strangeness 
quantum number, which is preserved in the 
strong interaction, has been introduced, and 
the Nakano—Nishijima—Gell-Mann formula 
concerning this number is known to hold. This 
says that Q = I, +4(B + S) for each elementary 
particle, where Q is the electric charge in units 
of that of the positron, I, is the third compo- 
nent of the isospin, B is the baryon number, 
and S is the strangeness. 

Quantum-mechanical wave functions for a 
system of identical particles seem to be either 
totally symmetric (Bose statistics) or totally 
antisymmetric (Fermi statistics) under permu- 
tations of particles. Accordingly, particles are 
either bosons (or Bose particles) or fermions (or 
Fermi particles). All bosons seems to have 
integer spins and all fermions half-odd-integer 
spins. This is the connection of spin and statis- 
tics and follows from certain axioms in quan- 
tum field theory (— 150 Field Theory). There 
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has been some discussion on intermediate 
statistics (parabosons and parafermtons). 


B. Families of Elementary Particles 


Elementary particles are classified into families 
of leptons, photons, and hadrons. 

The family of leptons now consists of elec- 
trons (e), muons (u), and tau-leptons (t), their 
accompanying neutrinos (ve, v,, v,), and their 
antiparticles; t was discovered in the late 
1970s. Experimentally, v, 1s not well estab- 
lished nor has the possibility v, = v, yet been 
excluded. Leptons have spin 4. They are char- 
acterized by having no strong interactions. 

The family of photons consists of photons 
and the recently discovered intermediary weak 
vector bosons. Gluons (— Section C (4)), if 
they exist, also belong to this family. 

The family of hadrons has a large number of 
members which are either mesons or baryons, 
the former being bosons and the latter fer- 
mions. We now have, in addition to pions and 
kaons, many resonant mesons (unstable under 
strong interaction) such as p-mesons and w- 
mesons. At present we have, in all, more than 
20 species of mesons. This number does not 
count spin, charge, and antiparticle degrees of 
freedom. The baryon subfamily includes nu- 
cleons, hyperons, and excited states, now con- 
sisting of more than 30 species, again not 
counting spin, charge, and antiparticle degrees 
of freedom. We have nucleonic resonances 
with spin as high as 11/2. 

Hadrons are now well understood as com- 
posites of subhadronic constituents called 
quarks (and antiquarks), although free quarks 
have not been observed. (Hence the problem of 
quark confinement has been discussed ex- 
tensively.) Mesons are systems made up of 
a quark and an antiquark. Baryons are sys- 
tems made up of three quarks. Therefore, at 
our present level of knowledge, the elemen- 
tary particles might be leptons, photons, and 
quarks (and the corresponding antiparticles). 


C. Methods in the Theory of Elementary 
Particles 


(1) tQuantum Field Theory. Application of 
the ideas of quantum mechanics to electro- 
magnetic fields and their interaction with elec- 
trons resulted in the formulation of quantum 
electrodynamics (and more generally quan- 
tum field theory). Application of quantum- 
mechanical perturbation theory to quantum 
electrodynamics with the fine-structure con- 
stant «= e7/(hc) (about 1/137) as an expansion 
parameter resulted in divergent expressions— 
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the so-called divergence difficulties. The ultra- 
violet divergence comes from integration over 
high momenta (of virtual particles), and the 
infrared divergence is due to the zero mass of 
photons. It was later found that the ultraviolet 
divergence can be combined with a small 
number of parameters of the theory (i.e., elec- 
tron mass, (possibly nonzero) photon mass, 
and electromagnetic coupling constant e) into 
a revised set of constants (called the renormal- 
ized masses and coupling constant), which are 
then equated to the observed finite values of 
these constants—a procedure called renor- 
malization. Physically, this renormalization is 
pictured to be effected by virtual photons (and 
electron-positron pairs) surrounding (bare) 
electrons and photons. The infrared divergence 
is supposed to be a reflection of the fact that 
electrons can be accompanied by infinitely 
many photons with negligibly small total 
energy (a situation made possible by the zero 
mass of photons); this cannot be experimen- 
tally analyzed (and is indistinguishable from a 
single electron). 

The relativistically covariant formulation 
of the renormalized perturbation theory of 
quantum electrodynamics proposed by S. 
Tomonaga, J. S. Schwinger, and R. P. Feyn- 
man (independently and in different forms), 
and in particular the Feynman rules and 
Feynman diagrams that lead to the Feynman 
integrals (— 146 Feynman Integrals), made 
possible detailed theoretical computations, 
and the computed values (such as the Lamb 
shift of hydrogen and the anomalous magnetic 
moment of an electron) fit marvelously well 
with observed values—an achievement con- 
sidered a great success of quantum electro- 
dynamics. F. J. Dyson more or less showed 
that the renormalization procedure really 
absorbs all the divergences in all orders of the 
perturbation expansion in terms of renormal- 
ized constants, though there were later refine- 
ments and elaborations of the proof. This 
work also leads to the division of quantum 
field theories into two classes: renormalizable 
theories, where (infinitely many) ultraviolet 
divergences can be absorbed into a finite 
number of constants by renormalized pertur- 
bation theory, and unrenormalizable theories. 
The question of whether perturbation series 
converge in some sense is an unsolved ques- 
tion of quantum electrodynamics. 

In quantum field theory, the central role is 
played by quantum fields, which are operator- 
valued generalized functions of a space-time 
point. Particle interpretations of any state at 
infinite past and infinite future are obtained in 
the theory from the asymptotic behavior of the 
fields (tasymptotic fields) at time —oo and time 
+œ, and from their relation, expressed by the 
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tS-matrix, describing how particles scatter by 
collision. Thus any model of quantum fields 
makes, in principle, a prediction about what 
particles appear and how they behave (asymp- 
totically) in mutual collisions. 

After the success of quantum electrodynam- 
ics, the perturbation theory of quantum fields 
was applied to systems of pions and nucleons; 
this proved to be unsuccessful, possibly due to 
the lack of an appropriate expansion param- 
eter. This has led some people to study the 
mathematically rigorous consequences of 
quantum field theories; these mathematical 
consequences do not require any perturbation 
calculation, simply following from a small 
number of mathematically formulated axioms 
believed to be satisfied by a large class of 
quantum field theories. This approach, re- 
ferred to as taxiomatic quantum field theory, 
has yielded a few physically meaningful con- 
sequences of general nature: analyticity of 
some S-matrix elements, the PCT theorem, 
and the connection of spin and statistics (— 
Section A). 

While the axiomatic approach provides a 
general framework, the tconstructive field 
theory developed later provides examples of 
quantum field theories that fit into such a 
framework. Because of its concrete nature, it 
can make statements about detailed properties 
of the model, such as the (non-)existence of 
composite particles, the establishment of per- 
turbation theory as an asymptotic expansion, 
and phase-transitions phenomena and the 
related broken symmetry as the coupling con- 
stant varies. It has, however, been successful 
for only 2 and 3 space-time dimensions. 


(2) Analytic S-Matrix Approach. Due to the 
failure of the perturbation approach in quan- 
tum field theory, a new approach was devel- 
oped based on assumptions about the analy- 
ticity properties of the S-matrix elements. The 
assumed analyticity properties were surmised 
from examination of Feynman integrals and of 
nonrelativistic potential scattering, and par- 
tially follow from axiomatic field theory. In 
this approach, the information that scattering 
amplitudes, or S-matrix elements, possess 
certain analytic properties with respect to 
energies, scattering angles, and so on is ex- 
pressed by means of integral representations. 
For example, the forward two-body scattering 
amplitude f(s) as a function of s = (energy in 
the center of mass system)’ is written as 
soi as (1) 


S$ —S 


~o 


Im f(s) is related to the total cross section by 
the optical theorem, which is a statement of 
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the unitarity of the S-matrix. Hence equation 
(1), called a dispersion relation, gives a relation 
among observable quantities. An integral rep- 
resentation for the general two-body scattering 
amplitude f(s, t,u) (2 incoming and 2 outgoing 
particles) has been proposed by S. Mandelstam 
and is called the Mandelstam representation, 
where s=(p;,+ p>)”, t=(p, +P3)*, and u= 
(pı + p4)* (squares in a Minkowski metric) 
and p1, P2, P3, P4 are the 4-momenta of the 
incoming and outgoing particles, with sign 
reversed for the latter. (Some relations hold 
among variables: p? = m? with m; the mass, 
> p;=0, s+t+u= > m?.) Under the inter- 
change of incoming and outgoing particles, 
the corresponding f’s are related by analytic 
continuation. This is called the crossing sym- 
metry. The study of the S-matrix directly 
from its analyticity and unitarity is called the 
S-matrix approach (— 386 S-Matrices). 

The analyticity of the two-body scatter- 
ing amplitude f as a function of the angular 
momentum / with a fixed s was investigated by 
T. Regge for nonrelativistic potential scatter- 
ing, and later the idea was applied to the S- 
matrix approach. The poles !=I{s) of f (fis 
considered to be a function of / for each fixed 
s) are called Regge poles. Regge trajectories /= 
I(s) for variable s <0 have been shown to play 
important roles in the high-energy behavior of 
scattering amplitudes for small values of the 
variable t (four-momentum-transfer squared). 

An approximate expression of S-matrix 
elements, with Regge poles and satisfying the 
crossing symmetry, was introduced by G. 
Veneziano and is called the Veneziano model. 
It has developed into the so-called dual reso- 
nance model (dual in the sense that s-channel 
poles are dual to t-channel poles) and has 
subsequently evolved into the string model 
of hadrons, according to which hadrons are 
viewed as systems composed of strings joining 
quarks (and antiquarks). 


(3) Group-Theoretical Approach. In connec- 
tion with the symmetry properties shown by 
the spectra and reaction patterns of hadrons, 
a group-theoretical approach has been de- 
veloped. For example, the similarity of the 
behavior of neutrons and protons in nuclei, 
apart from the difference in their electro- 
magnetic properties, was formulated as isospin 
invariance (under the group SU(2)). The sym- 
metry properties are sometimes understood in 
terms of new additive quantum numbers that 
are conserved or nearly conserved, and these 
properties are made more concrete in the final 
stage by the introduction of fundamental 
constituents carrying these quantum numbers. 

Based on the canonical formalism of quan- 
tum field theory, (Noether) currents, as 
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quantum-mechanical generators of the sym- 
metry, are introduced in association with con- 
served quantum numbers. The commutation 
relation of these currents, referred to as a cur- 
rent algebra, shows the structure of a Lie alge- 
bra expressing the symmetry of the Lagran- 
gian of the system of hadrons; for example, 
SU (3) x SU (3) for three species (flavors) of 
massless quarks (the first factor for vector 
currents and the second for axial-vector cur- 
rents). The approach showed remarkable suc- 
cess when combined with the hypothesis of 
the partially conserved axial-vector currents 
(PCAC), which requires the divergence of the 
axial-vector currents to be proportional to the 
pseudoscalar meson fields. 

Even if a theory has a symmetry under a 
group G in its formulation, a vacuum state of 
the theory might not have a symmetry under 
G. If that occurs, we speak of spontaneously 
broken symmetry. Under some assumptions, a 
particle of zero mass (which is connected to the 
vacuum by the current for the spontaneously 
broken symmetry) is associated with the spon- 
taneously broken symmetry. This statement is 
called Goldstone’s theorem, and the relevant 
particle of zero mass is called the Nambu- 
Goldstone boson. The PCAC hypothesis is 
believed to be connected with the spontaneous 
breakdown of the axial SU(3) symmetry with 
pions, kaons, and eta-mesons as Nambu- 
Goldstone bosons. This approach produced 
the Adler-Weisberger sum rule, which relates 
the weak axial-vector coupling constants to 
pion-nucleon scattering cross sections. 

A group-theoretical attempt to treat fer- 
mions and bosons on an equal footing resulted 
in the introduction of super Lie algebras (Z,- 
graded Lie algebras). The basic new ingredient 
in this approach ts a special class of generators 
roughly interpreted as the square root of the 
four-momentum, whose anticommutators 
(instead of commutators) are linear combina- 
tions of ordinary generators. A supermultiplet, 
which is an irreducible representation of a 
super Lie algebra, consists of both fermions 
and bosons. Extensions to local super Lie 
algebras (and also incorporation of gravitons 
into the framework) have been tried, with no 
realistic model emerging so far. 


(4) Non-Abelian Gauge Field Theory. Recently, 
quantum field theory has been revived as non- 
Abelian tgauge theory, resulting in what are 
considered to be successful qualitative and 
semiquantitative predictions. The quantization 
of the theory was at first carried out in terms 
of the tFeynman path integral, where fictitious 
particles, called Faddeev-Popov ghosts, appear 
through the precise definition of the functional 
measure of the path integral. The canonical 
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quantization has also been formulated with 
the explicit introduction of Faddeev-Popov 
ghosts from the beginning. Non-Abelian 
gauge field theory exhibits the very important 
property of asymptotic freedom, which states 
that the interaction at asymptotically high 
energies, or at very short distances, approaches 
that of free (noninteracting) theory. This prop- 
erty ts required for the description of hadronic 
systems made of quarks in view of the experi- 
mental observation of the scaling behavior of 
deep inelastic inclusive structure functions. 
Thus gauge theory is believed to describe the 
dynamics of systems of quarks in hadrons. It 
is called quantum chromodynamics, and the 
quantum of the gauge field is called the gluon. 
The recent development of gauge field theory 
has been accompanied by many technical 
refinements of quantum field theory, which 
include the methods of dimensional regulariza- 
tion and renormalization groups. 

Dimensional regularization starts with 
Feynman integrals defined for n-dimensional 
momenta with n #4, in order to give meaning 
to integrals divergent for n=4. Then Feynman 
integrals have poles at n=4, which are ab- 
sorbed into the unrenormalized constants by 
means of the renormalization procedure. This 
method is particularly suited for non-Abelian 
gauge theory, since it is the regularization 
method that keeps gauge invariance at each 
step of the calculation. l 

The trenormalization-group equation has 
been known for a long time. It results from 
the requirement that physical quantities 
should be dimensionally covariant under re- 
normalization of constants in the theory. The 
renormalization-group equation is usually 
written as a differential equation for a Green’s 
function, expressing the fact that a change of 
scale of momenta is balanced by changing 
coupling constants and masses. Further refine- 
ments are due to C. G. Callan, K. Symanzik, S. 
Weinberg, and G. ’t Hooft. 


D. Models of Elementary Particles 


Hadrons are now considered to be made of 
more fundamental constituents; at present, at 
least 15 species of subhadronic, fermionic 
constituents (quarks) have been proposed. 
Attempts to understand subhadronic structure 
have a long history, the main landmarks of 
which include the Fermi-Yang model (where 
m-mesons are supposed to be made of protons 
and neutrons), the Sakata model (where all 
hadrons are supposed to be made of protons, 
neutrons, and A-hyperons), and a variety of 
quark models developed from the eightfold 
way of M. Gell-Mann and Y. Ne’eman (new 
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assignments of representations of SU(3) to 
particles somewhat different from the Sakata 
model; for example, the octet representation 
for mesons and low-lying baryons, and the 
decuplet for excited baryons). Originally, 
quarks were supposed to come in 3 species 
(u-quarks, d-quarks, s-quarks), each carrying 
its own quantum number, now called the 
flavor quantum number. Then it was suggested 
that each flavored quark has three additional 
degrees of freedom, i.e., three new quantum 
numbers (called color quantum numbers) in 
order to account for experimental data: (1) the 
spin-statistics problem of baryonic ground 
state wave functions, (2) the decay rate of 
nm— 2y, (3) the Drell ratio {= total cross section 
for {e~ +e* >anything}, divided by the cross 
section for e7 +e* >” +u”). Recently, two 
new flavor degrees of freedom besides the old 
3 flavors (u, d,s) have been discovered, the 
carriers of which are c-quarks and b-quarks, c 
being the constituents of J/i/-particles, b of 
Y-particles. The combination of 5 flavors and 
3 colors results in 15 quarks, as stated earlier. 
The so-called standard model is a quantum 
field theory based on a local, non-Abelian 
gauge group SU(3) x SU(2) x U(1). The group 
SU (3) is called the color SU (3) group, which 
is supposed to be strictly unbroken and ex- 
presses the invariance of the theory under the 
local SU(3) transformation of the three color 
degrees of freedom. Quarks having spin 4 
transform as its 3-dimensional fundamental 
representation, and the vector gauge bosons 
transforming as its 8-dimensional regular 
representation are gluons. This part of the 
theory is quantum chromodynamics (QCD). 
The remaining part of the theory, based on 
the local gauge group SU(2) x U(1), is called 
the Glashow-Weinberg-Salam model or its 
hadronic extension quantum flavor dynamics 
(QFD), and unifies the electromagnetic and 
the weak interactions. This gauge group is 
supposed to be spontaneously broken with the 
only unbroken subgroup U(1) corresponding 
to the electromagnetic gauge transformation. 
The underlying mechanism for the spontaneous 
breakdown of the gauge group SU(2) x U(1) is 
not well understood, but conventionally it is 
assumed to occur through the so-called Higgs 
mechanism. This is a mechanism proposed by 
P. W. Higgs, whereby the Goldstone boson 
acquires a nonzero mass if the broken sym- 
metry occurs in the presence of an associated 
massless vector field (called a gauge vector 
field), which also becomes associated with 
massive bosons. The gauge vector bosons are 
identified with photons (y) (unbroken sym- 
metry and hence massless) and weak inter- 
mediary bosons (broken symmetry and hence 
massive), usually denoted by W and Z. Quarks 
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and leptons transform under the group SU(2) 
as doublet or singlet representations. The 
renormalizability of the spontaneously broken 
gauge field theory has been established by 

’t Hooft. Major predictions of the Glashow- 
Weinberg-Salam model, including the existence 
of the gauge bosons W and Z, have been 
borne out experimentally. 

Grand unified models attempt to unify 
QCD and QFD, employing a larger Lie group 
containing SU(3) x SU(2) x U(1) as a sub- 
group. The most popular ones are those based 
on the groups SU(5), SO(10) and on some of 
the exceptional groups. Super grand unified 
models attempt to unify QCD, QFD, and 
the gravitational interaction, with super Lie 
groups as a possible basis. Recently, there 
have been attempts to search for subquark 
structures. 
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133 (XIV.9) 
Ellipsoidal Harmonics 


A. Ellipsoidal Coordinates 


If a>b>c, then for any given (x, y,z)eR?, the 
three roots of the cubic equation in 0 


2 y? z? 


x 
= = 
PLO Pa ee 


F(0) 
are real and lie in the intervals 0> —c*, —c? > 
6>—b?, and —b*>6> —a?. Denoting these 
three roots by A, u, and v (they are labeled 

so as to satisfy the inequalities 4> —c?>p> 
—b?>v> —a’), F(A)=0, F(u)=0, and F(v)=0 
represent an ellipsoid, a hyperboloid of one 
sheet, and a hyperboloid of two sheets, re- 
spectively. They are confocal with the ellipsoid 


x? y? z? 


PEITO 1 =0, 


pass through the point (x, y, z), and mutually 
intersect orthogonally. 

The quantities 2, u, v are called the ellip- 
soidal coordinates of the point (x, y, z). Rect- 
angular coordinates (x, y, z) are expressed in 
terms of ellipsoidal coordinates (A, u, v) by the 
formula 


> (a? +A)(a? +p)(a? +y) 
a (a? — b*)(a? —c?) 


(1) 


and two others obtained from (1) by cyclic 
permutations of (a, b,c) and (x, y, z). 


B. Ellipsoidal Harmonics 


When a tharmonic function y of three real 
variables is constant on the surface 4 = con- 
stant, 4=constant, or v=constant in ellipsoidal 
coordinates, the function w is called an ellip- 
soidal harmonic. A solution of Laplace’s equa- 
tion Ay =0 in the form w= A(A)M(u) N(v) 

can be obtained by the method of separation 
of variables. The equation Ay =0 is written in 
the form 


ô ow L 
Donnas (ace) =0, 


where the summation is taken over the even 
permutations of (A, u, v), and 


A,=,/(a? +A)(b? +A) (c? +A). 
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The ordinary differential equation 


da dh 


is satisfied by A and also by M and N if we 
replace À by u and v, respectively. Equation (2) 
is called Lamé’s differential equation, with K 
and C the separation constants. 

Let K =n(n+ 1) for n=0, 1, 2,.... Then 
equation (2), for a suitable value (the eigen- 
value) of C, has a solution that is a polynomial 
in A or a polynomial multiplied by one, two, or 
three of ./a* +4, ./b? +4, and ./c? +4. 

Among these solutions 2n + 1 are linearly 
independent. We denote these solutions by A 
= fi"(A) (m=1,2, ...,2n+ 1). They are essen- 
tially equivalent to the Lamé functions, to be 
defined at the end of this section. To be pre- 
cise, by setting 


A+(a?+b?4+0*)/3=6, 
C=B+4+n(n+1)(a? +b? +c7)/3, 
e; =(b? +c? —2a’)/3, ...; 


ry, (a a )A(KAFOA (2) 


€i +é€,+e;=0, 


we have 

a+ 1/2 : 1/2 R 1/2 E 

dé? Gre č—e, Sey) de 

E n(n+ 1) +B 6) 
4(¢—e)(č—e:)(č—e3) 
This can also be written in the form 

2A 

— = (n(n + 1) (u) + B)A (4) 


du? 


by the change of variable ¢ = gə (u) with the 
Weierstrass tg-function. 

The differential equation (3) has č =e}, e3, 
e3, © as tregular singular points. A solution of 
(3) that is a polynomial in € or a polynomial 
multiplied by one, two, or three of ./E—e,, 
/€—e, and ./E—e; is called a Lamé function 
of the first kind. 


C. Classification of Lamé Functions 


The 2n + 1 linearly independent solutions 
f(A) of (2) are classified into the following 
four families. If n is an even number 2p, then 
p+1 solutions f(A) among the 2n+ 1 solu- 
tions are polynomials in 4 of degree p, and the 
other 3p functions are polynomials in / of de- 
gree p—1 multiplied by 


(b? +å)? +å), /(c7 +A)(a? +4), 
or 
./ (a? +4)(b? +4). 


Since all these polynomials are products with 
real factors of degree 1, the solutions belonging 
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to the first family are of the type 
Jn" (2)=(4—0,)(4— 82)... (4 — Onj2), (5) 


while solutions of the latter kinds are of the 
type 


b+? +A) 

IA = 4 (ce? +14)(a? +4) 

(a? + a)(b? +2) 
x(A—0,)(A—0,)...(A~Oy-1). (6) 


The functions (5) and (6) are called Lamé 
functions of the first species and of the third 
species, respectively. On the other hand, if n is 
an odd number 2p+ 1, then 3(p+ 1) solutions 
among the 2n+1 functions f(A) are of the 


type 
PA= 4 S/FA 
x (å —80,)(4— 03)... (4—8n-2)} (7) 


and the other p functions are of the type 


fA) =a +14)(b? +4) +) 
x (4—8,) (4—02)... (å— bn-3x2) (8) 


The functions (7) and (8) are called Lamé 
functions of the second species and of the fourth 
species, respectively. Hence in either case 
we have 2n + 1 linearly independent Lamé 
functions. 

When n is even, we obtain an ellipsoidal 
harmonic 


n{2 
Yr = I] (2—0) (u — 8,)(v — 9,) 


by multiplying f(A), f(W), and f7"(v) belong- 
ing to the first family. Also, in this case, by 
setting 
x2 y’ z? 
O, == ta ta! 
P” a?+0, b?+0, 2+0, 


= (A—86,)(u— 0,)(v —8,) 
(a? + 0,) (b? + 9,)(c?7 + 0) 


we have 
r =9,0,...9,/2 (9) 


up to constant coefficients. Utilizing Lamé 
functions of the third species {instead of func- 
tions of the first species) and formula (1), we 
find that 


Wr =(yz or zx or xy) x 0,03... On (10) 


For even n, every ellipsoidal harmonic ex- 
pressible in terms of polynomials in x, y, z of 
degree n can be written as a linear combina- 
tion of the functions (9) and (10), which are 
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called the ellipsoidal harmonics of the first and 
of the third species, respectively. Similarly for 
odd n, the ellipsoidal harmonics of the second 
and of the fourth species 


wr =(x or y or 2)0, 02... On- (11) 
Yr =xy20103 ... O n-3)/2 (12) 


are composed of the Lamé functions of the 
second and fourth species, respectively. 

For odd n, these forms are a complete sys- 
tem of ellipsoidal harmonics that are linearly 
independent and expressible in terms of poly- 
nomials in x, y, z of degree n. 

The zeros €,, €3,...,¢, of the Lamé func- 
tions are real, and ¢;4¢, (i#j). They never 
coincide with any one of e,, e,, and e;. If e, > 
€, >e3, then ¢,,...,¢, all lie between e, and 
ez. If mis an integer such that O0<m<p, we 
have one and only one Lamé function (with 
the species given) with m of its zeros lying 
between e, and e, and the remaining p—m 
zeros between e, and e, (Stieltjes’s theorem). In 
this way, a complete system of linearly inde- 
pendent Lamé functions of the specified type 
may be obtained, since m assumes p+ | differ- 
ent values. When the constant B appearing 
in the differential equation (3) takes specific 
values so that the equation has Lamé func- 
tions of the first kind as its solution, (3) also 
yields a solution A such that A= č7 +2 as 
č— œ. This function A is called the Lamé 
function of the second kind. 


D. Ellipsoids of Revolution (Spheroids) 


When the fundamental ellipsoid is a spheroid 


x?+y? z 


a c 


it is convenient to use the spheroidal coordi- 
nates (č, n, o) given by 


x=- D-n’) cosg, 
y=1/(é? —1)(1—n?)sing, (13) 
z=lčņ 1=</e 


—a 
for a? <c? (prolate) and 

x=l./(€? + 1)(1—n7) cosg, 
yal /(2°+)(U—n’)sing, 
z=léy, l= /a?—c? (14) 


for a? >c? (oblate). The solutions of Laplace’s 
equation, which are regular at all finite points, 
are given by 


Y = Pr (C) Pa (sin MP 
in the prolate case, and 


Y = Pr (iG) Pr'(a)sin MP 
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in the oblate case. Here P”" is the tassociated 
Legendre function of the first kind. Solutions 
which are regular outside a finite ellipsoid 
can be composed of the tassociated Legendre 
functions of the second kind, Q7'(¢) or Qr (ič) 
instead of P”(é) or PP (ié), respectively. 


E. Spheroidal Wave Functions 


Transforming the tHelmholtz equation in 
prolate spheroidal coordinates (13), we have 


1 , oF), 4 ow 
ieee ek oS erase 
E- oar Ge oje n( =) 

1 1 8? Y 

O then a 


(15) 


By separating variables in the form Y = 
X (éE) Y(n) mo, we have the equations 


which X and Y, respectively, must satisfy. The 
only difference between equations (16a) and 
(16b) arises from the fact that the domain of 
(16a) is given by 1< č whereas the domain of 
(16b) is given by —1<n<1. For the oblate 
spheroid, utilizing formula (14) we have 


EES So ov\ 2 =E) 
mae eie z(u ar 


1 1 8Y j 
Ae eS VY =0. 1 
(a aaar 0 (17) 


By separating variables as before, Y(n) satisfies 
the same equation as (16b), while X(€) satisfies 
equation (16a) with č replaced by ič. All these 
equations are of the type 


(18) 


A solution of (18) is known as a spheroidal 
wave function. The equation (18) has +1 as 
regular singular points and œ as an irregular 
singular point of class 1. Hence spheroidal 
wave functions behave like tLegendre func- 
tions in the interval [—1, 1] and like *Bessel 
functions in the neighborhood of œ. 


F. The Functions pe;'(x) and qe; (x) 


When we write a solution of (18), it is custom- 
ary to write x instead of z when z is contained 
in the interval [ —1, 1]. We denote solutions of 
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(18) which are regular on the whole domain 
—1<x<1 by pe™(x), and the corresponding 
eigenvalues by 4, ,, (assuming the boundary 
condition stated in this paragraph concern- 


ing singularities). In particular, when x0, 


equation (18) reduces to tLegendre’s associ- 
ated differential equation, the eigenvalues of 4 
become n(n + 1) (n is a positive integer), and 
the corresponding eigenfunctions become the 
associated Legendre functions of the first kind: 


d” P, 


Pan(x) =(1— x2)". 


(19) 


Hence pe”(x) is a solution which tends to a 
constant multiple of P(x) as x0. Using a 
system of orthogonal functions P% (x), we can 
expand pe;"(x) as 


(y= & Am P(x 
|/—n|=even number. (20) 


The coefficients A satisfy a recurrence 
formula 


(Het 1)+K? 


27 421—-1—2m?\ 
(21—1)(214+ 3) 


,—m—1)(l—m) „ 
QI—3) (21—11) "7 


_atmst Wt mt2) m 
(21+ 3)(21+5) miso =O. a 


The functions pe'"(x) and pe;"(x) are ortho- 
gonal in the domain —1<x<1. 

Another solution of (18) exists which corre- 
sponds to the same eigenvalue A, m, is inde- 
pendent of peṣř (x), and has the opposite parity: 
Ani Or (x) 


I> —m,I-—n=even 


Psy es 


jzm,j—n=odd 


Ba Pi O), (22) 


where the Az, are the same as in (20) and are 
determined by the recurrence formula (21), 


while for j>m +2, the By ; satisfy the recur- 
rence formula 


2i? +2j)-1—2m? 
(ete a 


Bj- D2+3) 
em DU ay 
B-3j-) 7? 


s(it+m+l)\(j+m+2) om 


(2) + 3)(2j+5) mj+2=0. 2) 


Since the associated Legendre function of 
the second kind 
QP(x)=(1 —x?)"? d™Q,/dx™ 
is of the form 
Qr (x)= P(x)log./(1 + x)/(l —x) 


+(1—x?)"™? x (a polynomial in x) 
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for !>m, the qeř (x) have x= +1 as singular 
points. 

By expressing the solution of equation (18) 
in integral form we find that pe™(x) satisfies an 
integral equation 


"Vn, mDCn (x) 


1 
-| (1—x?)"?(1— Syme pem(E)de, (24) 
-1 


where the coefficient v, „ is related to P” (0) or 
P? (0). 

In order to extend the domain of defini- 
tion of pe/"(x) and ge”"(x) beyond the interval 
[—1,1], we adopt, in the domain G obtained 
by deleting the interval [ —1, 1] from the com- 
plex plane, the Heine-Hobson definition of the 
associated Legendre function, 


P™(z) =(z22 — 1)" d™ P. /dz™, (25) 


instead of N. M. Ferrers’s definition (19), and 
construct a solution of (18) in G: 


pen(z)= ), Ani Pi"(z), 


izm 
|! —n|=even number, (26) 
which is like (20) and again satisfies the inte- 


gral equation (24). From this we can obtain 
the expansion formula 


m( y2 (z?— 1)” 


e, {zZ 
ág p.m 2)" 


x) (piel tm! am Zeta) 


12m (1—m)! KZ 
|1—n|=even number. (27) 


Multiplying this by a constant, we define 


ae n (z7—1)"? 
Jen (Z)= a 
FA 


x 5 (oyeg eka y F", 
izm i /Kz l ' 


(l+m)! 
(ae 


m 
n, 


m 
This expression asymptotically assumes the 

form 

jen (z)~sin(kz—nr/2)/kz 


for |z|» 1. In a similar manner we find a 
solution 


5 1 (z? — 1)" 
IRR a ae 


2 N;+12(KZ2) 
x ¥ (Depa ER, 
RFA Izm 


lzm KZ 
having the asymptotic form 


nem(z)~cos(kz —nn/2)/Kz. 
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Disregarding a constant factor, this coincides 
with the function defined by (22) with Q”(z)= 
(z*—1)"? d™Q,,/dz™ in place of the associated 
Legendre function of the second kind. 
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Elliptic Functions 


A. Elliptic Integrals 


Let (z) be a polynomial in z of degree 3 or 4 
with complex coefficients and R(z, w) a rational 
function in z and w. Then R(z,./@(z)) is called 
an elliptic irrational function. An integral of the 
type f R dz is called an elliptic integral. The 
origin of the name comes from the integral 
that appears in calculating the arc length of an 
ellipse. Any elliptic integral can be expressed 
by a suitable change of variables as a sum of 
elementary functions and elliptic integrals of 
the following three kinds: 


| dz 
Ja —2?)(1—k222) 
[= Ze 
k*z E 
1—2z? 


and 


| dz 

(1 —a?z?),/(1 —2?)(1 —k?z?) 

(— Appendix A, Table 16.1). These three kinds 
of integrals are called elliptic integrals of the 


first, second, and third kind, respectively, in 
Legendre-Jacobi standard form. This classifica- 
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tion corresponds to that of tAbelian integrals. 
The constant k is called the modulus of these 
elliptic integrals, and a is called the parameter. 
Let the four zeros of (z) be «,, a, %3, %4 (we 
take one of them as œ when ¢(z) is of degree 
3). The Riemann surface R corresponding to 
the elliptic irrational function has the zeros 
1, X2, 3, &, as tbranch points with degree 
of ramification 1, and is of two sheets and 

of tgenus 1. If the integrand does not have 

a pole with a tresidue, then the integral is 
multivalued only because the value (called 
the periodicity modulus) of the integral taken 
along the normal section (basis of the homol- 
ogy group) is not equal to zero. 


B. Elliptic Integrals of the First Kind 


When R is a function without singularities 
other than branch points, only the topological 
structure of R gives rise to the multivaluedness 
of the integral of R. The standard form is 


\. dz 
o /(1—2?)(1 — k?z?) 


p do 
-| SSS = WHF (Kp), (1) 
o ./1—k? sin? o 


where z=sin ø. This integral is the inverse 
function of sn w (— Section J). The periodicity 
moduli are 2iK' and 4K, where 


i dz 
ee e e 
7 a 


Lyte) 
er o 2 


=K(k), k?=1—k?. 


We call K(k) a complete elliptic integral of the 
first kind and F (k, @) an incomplete elliptic 
integral of the first kind (— Appendix A, Table 
16). Setting 


d (1+k’)sin o cos 1—k 
sing =— ks—, (2) 
~ 1—k?sin? o 1+k 


we have the relation 
F(k,p)=(+k,)F(k,, Pı)/2, 


which is called Landen’s transformation. Since 
kı <k when 0<k <1, this transformation 
reduces the calculation of elliptic integrals 

to those with smaller values of k. For two 
given positive numbers a and b, put a) =a, 

bo =b, and 4,4; =(4,+b,)/2, ba+1 =V nbn- 
Then the sequences {a,} and {b,} converge 
rapidly to the common limit, which is called 
the arithmetico-geometric mean of a and 

b, and is denoted by ag(a, b). The complete 
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elliptic integral satisfies the relation K(k)= 


n/(2ag(1,./1—k?)]. 


C. Elliptic Integrals of the Second Kind 


When R has poles with residue zero, its inte- 
gral has no singularities other than poles. 
The standard form is 


FO z 1—k?2? 
z= —— 
p 
-| ./i—k?* sin? ọ dọ = E(k, 9), (3) 
o 
where z=sin yg. We have 
" O'(u) E 
ro=| dn?udu=—— +—u 
0 Olu) 


if we set z=snu. Here, O is Jacobi’s theta func- 
tion, and 


aWw=8,( 2 u =). 
2K’ K 


where 9, is a theta function to be described in 
Section I and K, K’ are the same as in the case 
of elliptic integrals of the first kind. The 
quantity 


1— k?z? 


1—z 


n/2 T 
-| V1=# sin? odo= (4,5) 
0 2 


is called a complete elliptic integral of the 
second kind. 


D. Elliptic Integrals of the Third Kind 


When R has poles with nonzero residue its 
integral has logarithmic singularities. In this 
case, residues also contribute to multivalued- 
ness of the integral. The standard form is 


Z dz 
ro=f 
o (1—a?z?), /(1 —z7)( 


aj do 
o (1—a’sin? ọ) 


and it is expressed as 


sna 1 
F(z) =———_| = 1 
@) ae Ü 


if we set z=snu and a? =k? sn? a (— 
A, Table 16). 


1— k?z?) 


1— k? sin? p 


Olua) | oa 
oura Seay 


Appendix 


E. Elliptic Functions and Periodic Functions 


Historically the elliptic function was first intro- 
duced as the inverse function of the elliptic 
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integral. However, since it has been realized 
that elliptic functions are characterized as 
functions with double periodicity, it is now 
customary to define them as doubly periodic 
functions. 

If f(x), defined on a linear space X, satisfies 
the relation f(x + w) = f(x) for some we X and 
all xe X, the number w is called a period of 
f(x), and f(x) with a period other than zero 
is called a periodic function. The set P of all 
periods of f(x) forms an additive group con- 
tained in X. If a basis œ, ...,@, of the additive 
group P exists, its members are called funda- 
mental periods of f(x). 

Any continuous, nonconstant, periodic 
function of a real variable has only one posi- 
tive fundamental period and is called a simply 
periodic function. The ttrigonometric functions 
are typical examples: sin x and cos x have the 
fundamental period 27; tan x and cot x have 
the fundamental period z (— 159 Fourier 
Series). 

A single-valued nonconstant tmeromorphic 
function of n complex variables cannot have 
more than 2n fundamental periods that are 
linearly independent on the real number field. 
A function of one complex variable with two 
fundamental periods is called a doubly periodic 
function. 

Let w, w be the fundamental periods of a 
doubly periodic function. For a given number 
a, the parallelogram with vertices a, a+ a, 
a+a’,at+@+q’ is called the fundamental 
period parallelogram. The complex plane is 
covered with a network of congruent parallel- 
ograms, called period parallelograms, obtained 
by translating the fundamental period parallel- 
ogram through mw +n’ (m,n=0, +1, +2,...). 

A doubly periodic function f(u) meromor- 
phic on the complex plane is called an ellip- 
tic function. For simplicity, we usually denote 
the fundamental periods of an elliptic function 
by 2m, and 2, and introduce œw, defined by 
the relation w, +œ, +@,=0. The first, and 
therefore also higher, derivatives of any elliptic 
function are elliptic functions with the same 
periods. The set of all elliptic functions with 
the same periods forms a tfield. The number of 
poles in a period parallelogram is finite. The 
sum of the orders of the poles is called the 
order of the elliptic function. An elliptic func- 
tion with no poles in a period parallelogram is 
merely a constant (Liouville’s first theorem). 
The sum of the residues of an elliptic function 
at its poles in any period parallelogram is zero 
(Liouville’s second theorem). Hence there can 
be no elliptic function of order 1. An elliptic 
function of order n assumes any value n times 
in a period parallelogram (Liouville’s third 
theorem). The sum of the zeros minus the 
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sum of the poles is a period (Liouville’s fourth 
theorem). 


F. Weierstrass’s Elliptic Functions 


Weierstrass defined 


o1 


as the simplest kind of elliptic function. Here 
Q=2me, + 2nw3, with m, n integers. The sum- 
mation } extends over all integral values (pos- 
itive, negative, and zero) of m and n, except 

for m=n=0. (u) is an elliptic function of 
order 2 with periods 2%; and 2a , called a 
Weierstrass ¢9-function. The following func- 
tions €(u) and oa(u) are called the Weierstrass 
zeta and sigma functions, respectively: 


u 1 
w=5+X (at a +5) 
and 


coal (('-a)=e(arm)) 


These have quasiperiodicity, expressed by 


Clu + 2m) =C(u)+ 2n;, (4) 


o(u+2m,)= —e7M tod g(y), (5) 
Nit+n2+n3=0, 4,=C(o,), i=1, 2, 3; 

and they satisfy the relations 

plu)= —C'(u) (6) 
and 

ee (7 


du au) 


The function g({u) is an even function of u, 
and ¢(u) and o{u) are odd functions of u. By 
considering the integral f ¢(u)du once around 
the boundary of a fundamental period par- 
allelogram, we have 


3 — 3, x 
H2@,—N,@, >= tzi, Im (2 ‘)z0, (8) 
2 OA 
H32 — 4203 


which is called the Legendre relation. 
The derivative 


g'(u)= -27 1/u-—9) 


of a g-function is an elliptic function of order 
3 and bears the following relation to g(u): 


('(u))’ =4(e(u)) —g @(u)—g3 
=4(9(u)—e,)(@(u) —e2)(@(u)— es), 
g,= 605" 1/04, g3=1405''1/08, 


e=olo), i=1,2,3. (9) 


529 


By differentiating this relation successively, 
we see that o(u) is expressed as a polynomial 
in (u) if nis an even number and as a prod- 
uct of polynomials in g(u) and g'(u) if n is an 
odd number. 

In particular, writing ~(u)=z in (9) we find 
that the -function is the inverse function of 
the elliptic integral 


| ú dz 

u= 

æ / 4z? — 927—993 

(— Appendix A, Table 16.IV). 

Any elliptic function can be expressed in 
terms of Weierstrass’s functions. Specifically, 
let the poles of f(u) and their orders be a,, 
Q,...,4, and hi, ha, -.., hm, respectively, and 
let the principal part in the expansion of f(u) 
near the pole a, be 


hy Ay 


j=1 (u—a,) i 


k=1,2,...,m. (10) 
Then we obtain 
fuw=C+¥ (autua 

k=1 


hk (—1¥A;; 
4 > CUA 

ja (j— 1)! 
where C is a constant depending on f(u). This 
can be reduced to 


Sw=A+ Be) 


by using the addition theorems (— Appendix 
A, Table 16) for œ and zeta functions, where A 
and B are rational functions of D{u). Therefore, 
given two elliptic functions with the same 
periods, after expressing them as rational 
functions of p and ø’ in the above form and 
eliminating g and g’, we obtain an algebraic 
equation with constant coefficients. In par- 
ticular, for any elliptic function f(u) we ob- 
tain an falgebraic differential equation of the 
first order by using this method, with f'(u) an 
elliptic function with the same periods. Fur- 
thermore, the functions f(u + v), f(u), and f(v) 
satisfy an algebraic equation. Thus for any 
elliptic function an algebraic addition theorem 
holds. 


Mua), (11) 


G. Elliptic Functions of the Second Kind 


As an extension of the definition of elliptic 
functions, if a meromorphic function f satis- 
fies the relations 


f(u+2a,)= 1, f(u), TOEF Aom (12) 


(u; and u, are constants) with the fundamental 
periods 2w,, 2w3, we call f(u) an elliptic func- 
tion of the second kind. What we have called 
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simply an elliptic function can now be called 
an elliptic function of the first kind. For con- 
stants p and v, the function 


f(u)=e"a(u—v)/o(u) (13) 


is an example of an elliptic function of the 
second kind. In this case 


p= eram, j=1,2, (14) 


Furthermore, for given constants u, and H3, 
any elliptic function of the second kind is 
expressed as the product of an elliptic function 
of the first kind and the function (13) with p 
and v determined by (14). 


H. Elliptic Functions of the Third Kind 


If a meromorphic function f satisfies 
f(u+2m,)=e%"*f(u), i=1,3 (15) 


(a; and b; are constants) with periods 2%, 2m; , 
we call it an elliptic function of the third kind 
(— 3 Abelian Varieties I). 

The Weierstrass sigma function o(u) is an 
example of an elliptic function of the third 
kind. The functions o,, 63, and o3, defined by 
the equations 
o,(u)= ————_—_——.,__ i= 1, 2, 3, (16) 
are also elliptic functions of the third kind. In 
the case of elliptic functions of the second and 
third kinds, 2m, and 2@, are not periods in 
the strict sense defined earlier, but are conve- 
niently referred to as the periods. The func- 
tions o; (i= 1, 2,3) are called cosigma functions. 


I. Theta Functions 


The theta functions, or more strictly, elliptic 
theta functions, are defined by 


I (v,1)=2 $ (—1)"g"*!?" sin(2n + 1) x0, 
n=Q 

9, (v, 1)=2 X q+? cos(2n + 1)rv, 
n=0 


J (v, 1) =1+2 ¥ q” cos 2nazv, 


n=1 
9,(v,t)=1+2 ¥ (—1)"q" cos 2nnv, (17) 
n=1 


where g=e'™, Imt>0. We call (17) the q- 
expansion formulas of the theta functions, and 
we sometimes write % in place of 94. A theta 
function is an elliptic function of the third kind 
with periods 1 and z. Any elliptic function can 
be expressed as a quotient of theta functions 
(— Appendix A, Table 16 for specific exam- 
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ples). The g-expansion formula is quite suit- 
able for numerical computation because of 

its rapid convergence; its terms decrease as the 
n? powers of q as n> œ. 

An elliptic function with the fundamental 
periods 2m, and 2m, can also be viewed as 
having the periods 2m =2m,, 2w3 = —2a,. 
Consequently, theta functions formed with the 
parameter t= ,/w, can be expressed in terms 
of the parameter t’=@5/@, = —@,/w,= —1/1, 
and we have 


d (v, 1) =i Ad, (v/t, —1/1), 
I (v, t) = Ad, (v/T, —1/1), 
33(v, tT) = AI;(v/t, —1/t), 
34(v, 1) = Ad, (v/T, —1/2), 
A=, /i/t exp(—niv?/t). (18) 


These are called Jacobi’s imaginary trans- 
formations. If Im(—1/t)> 1, then |q|=1, and 
therefore the series in the g-expansion for- 
mula converges slowly. However, by an imag- 
inary transformation we get Im(t)> 1 and 
\q|=0, so that computations become much 
easier. 

Each of the theta functions satisfies, as a 
function of two variables u and t, the follow- 
ing partial differential equation of the heat- 
conduction type: 


6? G(u, t)/du? = 4ni8.9(u, 1)/6t (19) 
(— Appendix A, Table 16.II). 


J. Jacobi’s Elliptic Functions 


C. G. J. Jacobi defined elliptic integrals as 
inverse functions of elliptic integrals of the first 
kind in the Legendre-Jacobi standard form (1). 
They are, in the above notation, 


E alu)  3(0)9,0) 
SWEN ees u ORO) a 
PORO 
o3(u) 3, (0) 94 (v) 
„2U _ LORN) 
03 (u) 3; (0)3,(v) ? 


where w=./e, —e, u and v=u/2a,. 


These functions satisfy the relations 


(21) 


(22) 


sn?w+cn?w=1, k?sn?w+dn?w=1, (23) 


where 
p2 62783 _(92(0))" (24) 
e1 — e3 (94(0))4 


The constants k and k’=./1—k? are called 


the modulus and the complementary mod- 
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ulus, respectively. Furthermore, the relation 
dsnw/dw=cnwdnw holds. The function z 
= snw is the inverse function of the elliptic 
integral (1) (— Appendix A, Table 16.III). 
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A. General Remarks 


Suppose that we are given a relation R be- 
tween elements of a set X such that for any 
elements x and y of X, either xRy or its nega- 
tion holds. The relation R is called an equiva- 
lence relation (on X) if it satisfies the following 
three conditions: (1) xRx, (2) xRy implies yRx, 
and (3) xRy and yRz imply xRz. Conditions 
(1), (2), and (3) are called the reflexive, sym- 
metric, and transitive laws, respectively. To- 
gether, they are called the equivalence prop- 
erties. Condition (1) can be replaced by the 
following: (1') For each x there exists an x’ 
such that xRx’. The relation “x is equal to y” 
is an equivalence relation. If xRy means that x 
and y are in X, then R is also an equivalence 
relation. An equivalence relation is often de- 
noted by the symbol ~. The relations of con- 
gruence and similarity between figures are 
equivalence relations. If X is the set of integers 
and x= y means that x— y is even, then the 
relation = is an equivalence relation. 
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B. Equivalence Classes and Quotient Sets 


Let R be an equivalence relation. “xRy” is 
read: “x and y are equivalent” (or “x is equiva- 
lent to y”). The subset of X consisting of all 
elements equivalent to an element a is called 
the equivalence class of a. By (1), (2), and (3), 
each equivalence class is nonempty, the equiv- 
alence class of a contains a, and different 
equivalence classes do not overlap. Namely, 
X is decomposed into a fdisjoint union of 
equivalence classes. This tpartition is called 
the classification of X with respect to R. For 
example, the set of integers is classified into the 
equivalence class of even numbers and that of 
odd numbers by the relation =. Conversely, 
since the relation “x and y belong to the same 
member of a partition” is an equivalence re- 
lation, we can regard any partition as a classi- 
fication. An element chosen from an equiva- 
lence class is called a representative of the 
equivalence class. In the example we can take 
0 and 1 as the representatives of equivalence 
classes of even and odd numbers, respectively. 

X/R denotes the set of equivalence classes of 
X with respect to R, and is called the quotient 
set of X with respect to R. The mapping p: X > 
X/R that carries x in X into the equivalence 
class of x is called canonical surjection (or 
projection). The idea of equivalence relations 
can be generalized to deal with the case when 
X is a telass. 


C. Stronger and Weaker Equivalence Relations 


Let R and S be two equivalence relations on 
X. If xRy always implies xSy, then we say that 
R is stronger than S, S is weaker than R, the 
classification with respect to R is finer than the 
one with respect to S, or the classification with 
respect to S is coarser than the one with re- 
spect to R. The relations “x is equal to y” and 
“x and y are in X” are the strongest and the 
weakest equivalence relations, respectively. 
Any two equivalence relations on X are 
ordered by their strength, and the set of 
equivalence relations on X forms a fcomplete 
lattice with respect to this ordering. 
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A. General Remarks 


The origin of ergodic theory was the so-called 
ergodic hypothesis, which provided the foun- 
dation for classical statistical mechanics as 
created by L. Boltzmann and J. Gibbs toward 
the end of the 19th century (— 402 Statistical 
Mechanics). Attempts by various mathemati- 
cians to give a rigorous proof of the hypoth- 
esis resulted in the recurrence theorem of 

H. Poincaré and C. Carathéodory and the 
ergodic theorems of G. D. Birkhoff and J. von 
Neumann, which marked the beginnings of 
ergodic theory as we know it today. As the 
theory developed it acquired close relation- 
ships with other branches of mathematics, for 
example, the theory of dynamical systems, 
probability theory, functional analysis, number 
theory, differential topology, and differential 
geometry. 

The principal object of modern ergodic 
theory is to study properties of tmeasurable 
transformations, particularly transformations 
with an invariant measure. In most cases, the 
transformations studied are defined on a Le- 
besgue measure space with a finite (or c-finite) 
measure. A Lebesgue measure space with a 
finite measure (a-finite measure) is a tmeasure 
space that is measure-theoretically isomorphic 
to a bounded interval (to the real line) with the 
usual tLebesgue measure, possibly together 
with an at most countable number of atoms. It 
is known that any separable complete tmetric 
space with a complete regular Borel tproba- 
bility measure is a Lebesgue measure space 
with a finite measure. We assume, unless stated 
otherwise, that the measure space (X, 2, m) is a 
Lebesgue measure space. All the subsets of X 
mentioned are assumed measurable, and a pair 
of sets or functions that coincide almost every- 
where are identified. We use the abbreviation 
“a.e.” to denote “talmost everywhere.” 


B. Ergodic Theorems 


Let (X, Z, m) be a o-finite measure space. 

A transformation ọ defined on X is called 
measurable if for every BEB, 9 '(B)EB. A 
tbijective transformation ọ on X is called 
bimeasurable if both g and `+ are measur- 
able. A measurable transformation ¢ is called 
measure-preserving (or equivalently, the mea- 
sure m is invariant under ¢) if m(p~'(B))=m(B) 
holds for every B. It is called nonsingular if 
m(o~1(B))=0 whenever m(B)=0, and ergodic 
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if m((p_‘(B)U B)—(ọ~'(B)N B))=0 implies 
either m(B)=0 or m(X — B)=0. 

The mean ergodic theorem of von Neumann 
(Proc. Nat. Acad. Sci. US, 18 (1932)) states that 
if ọ is a measure-preserving transformation or 
(X, @,m), then for every function f belonging 
to the tHilbert space L,(X)=L,(X, B,m) (— 
168 Function Spaces), the sequence 


Aafis)=(¥ slo's) 


converges in the tnorm of L,(X) as no toa 
function f* that satisfies f*(@x) = f*(x) a.e. The 
individual (or pointwise) ergodic theorem of 
Birkhoff (Proc. Nat. Acad. Sci. US, 17 (1931)) 
states that for every f belonging to the tfunc- 
tion space L,(X), the sequence A, f(x) con- 
verges a.e. to f*. From either of these theo- 
rems it follows that for any set E satisfying 

gp ‘\(E)=E and m(E)< œ, the limit function f* 
satisfies |, f* dm= fp fdm. In particular, if m(X) 
=1 and ọ is ergodic, then the limit f* equals 
the constant f fdm a.e. This fact therefore gives 
a mathematical justification to the ergodic 
hypothesis, which states that the “time mean” 
(32i f(e*x))/n of what is observable over a 
sufficiently long time can be replaced by the 
“phase mean” f fdm. Both von Neumann’s 
theorem and Birkhoff’s theorem were sub- 
sequently generalized in various directions by 
many authors. 

(1) Mean ergodic theorems are concerned 
with the tstrong convergence of the sequence 
of averages A,=(Xf-) T")/n of the iterates of a 
*bounded linear operator T on some tBanach 
space. A generalization of von Neumann’s 
theorem due to F. Riesz, K. Yosida, and S. 
Kakutani dispenses with the assumptions that 
the linear operator T is induced by a measure- 
preserving transformation ¢ and that T acts 
on the Hilbert space L,(X). A version of this 
generalization states that if a linear operator T 
defined on a Banach space & satisfies the 
conditions 

yar s 
(i) supl- } T*| <œ, 
ng=0 


nzi 


(i) lim- IT" =0, 
n>on 


then for an element fe 7 the sequence of 
averages A„f converges strongly to an element 
f*e if and only if there exists a subsequence 
converging tweakly to f*. From this theorem 
of Riesz, Yosida, and Kakutani follows the 
L,-mean ergodic theorem (1 < p< œ) for so- 
called Markov operators: If T is a linear oper- 
ator defined on each of the Banach spaces 
L,(X) (1 <p < œ) by means of the formula 
Tf(x)=[ f(y) P(x, dy), where P(x, B) is the ttran- 
sition probability of a Markov process on 
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(X, 2) leaving the measure m invariant (i.e., 

| P(x, B)dm=m(B) for every Be 2) (— 261 
Markov Processes), then for every f belonging 
to L (X) (1 <p< œ) the sequence A„f con- 
verges in the norm of L,(X) to a limit function 
f*. 

(2) Birkhoff ’s ergodic theorem has been 
extended to the following individual ergodic 
theorem by E. Hopf (1954): If T is a tpositive 
linear operator mapping L,(X) into L; (X) 
and L,,(X) into La (X) with |T}; <1 and 
|| T ||, <1, then for every f in L,(X) the se- 
quence A„f converges a.e. to a limit f *. If 
T is a Markov operator, then T maps each 
L,(X) into itself and satisfies || T ||, <1 for 
each p (1 <p < œ), and therefore Hopf’s er- 
godic theorem applies to such T. Special cases 
of this theorem were proved earlier by J. 
Doob and by Kakutani. Later, N. Dunford 
and J. Schwartz showed that the assump- 
tion of the positivity of T can be dispensed 
with in Hopf’s theorem. For a positive linear 
operator T on L,(X) satisfying ||T ||, <1, R. 
Chacon and D. Ornstein (1960) proved that 
the ratio ergodic theorem holds: For every pair 
of functions f and g in L,(X) with g>0 a.e., 
lim, Leo T*f(x)/XR=} T*g(x) exists and is 
finite a.e. on the set {x| Df=4 T*g(x)>0}. This 
theorem extends earlier results of Hopf and 
W. Hurewicz dealing with special classes of 
operators arising from measurable transfor- 
mations. Hopf’s ergodic theorem can be de- 
duced from the Chacon-Ornstein theorem, 
while it is known that there are positive oper- 
ators T on L,(X) satisfying ||T ||, <1 for which 
lim,_,,,. Anf fails to exist on a set of positive 
measure for some feL (X). This shows that 
the assumption || T || ,, <1 is crucial in Hopf’s 
theorem. 

(3) As was the case in the original proof by 
Birkhoff of his ergodic theorem, every known 
proof of an individual ergodic theorem de- 
pends crucially on the so-called maximal 
ergodic lemma (or maximal inequality). For 
the case of a positive linear operator T on 
L (X) with ||T ||, <1, Hopf proved the rele- 
vant maximal ergodic lemma: If E(f) is the set 
{x|sup,>,A,f(x)>0} for each f in L’(X), then 
feg) fdm > 0. Hopf’s original proof of this 
lemma was quite intricate, but A. Garcia 
(1965) succeeded in giving an extremely sim- 
ple and elegant proof. From the maximal er- 
godic lemma the following so-called domi- 
nated ergodic theorem can also be obtained: 
Let T be a linear operator mapping each 
L,(X) into L,(X) satisfying ||T||,<1 for each 
p (1 <p< œ). If for fin L,(X) we let f(x)= 
SUPp>1|4nf (x)|, then if 1 < p< œ we have 


= 2P 
| fxyrdm SZE [i fexyram, 
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while if p=1 and m(X)< œ we have 
[Foamea] moo + | i/ootios" sid | 


This theorem was obtained first by N. Wiener 
(1939) for the special case of T induced by a 
measure-preserving transformation. M. Ak- 
coglu (1975) proved an isometric “dilation” 
theorem for positive contractions (i.e., positive 
linear operators T for which || T|| <1) in the 
space L,(X) for 1<p<oo, by means of which 
he was able to deduce a dominated ergodic 
theorem for an arbitrary positive contraction 
in L,(X) from the corresponding theorem for a 
positive linear isometry in L,(X) proved earlier 
by A. Ionescu-Tulcea. From this theorem of 
Akcoglu one can obtain the following indi- 
vidual ergodic theorem: If T is a positive con- 
traction in L,(X) for some p with 1 <p< œ, 
then for every f in L,(X), the sequence A, f 
converges a.e. 

(4) Both mean and individual ergodic theo- 
rems can be extended without difficulty to 
a continuous time parameter semigroup 
{T,|t>0} of bounded linear operators such 
that 7, 7;= T, (Tọ =I), under a suitable con- 
tinuity assumption on T, with respect to t, by 
replacing the discrete time average (L?_}, T*)/n 
with (f6 7,ds)/t. Further extensions to n- 
parameter semigroups were obtained by N. 
Wiener and by N. Dunford and A. Zygmund. 
For mean ergodic theorems, even further 
extensions were possible to tamenable semi- 
groups of bounded linear operators. For 1- 
parameter semigroups, the behavior of the 
mean at zero, (fo T,ds)/t as t0 (local ergodic 
theorem) or Affe *° T, fds as 410 or Atco 
(Abelian ergodic theorem), has also been in- 
vestigated by Wiener, U. Krengel, E. Hille, 
Yosida, and others. Abelian ergodic theorems 
are related to properties of the tresolvent of 
the semigroup {7;} (— 378 Semigroups of 
Operators and Evolution Equations). Further 
extensions of all these theorems in various 
directions have been given by many authors; 
for these extensions and related topics — 
[5-7]. 

(5) J. F. C. Kingman (1968) proved an inter- 
esting and useful extension of (both mean and 
individual) ergodic theorems, called the sub- 
additive ergodic theorem. A real-valued tsto- 
chastic process {X; ,|O<i<k, k=1,2,3,...} 
is called a subadditive process if it satisfies the 
following conditions: (i) Whenever i<j<k, 
Xia <X;,,+X;,,. (il) The tjoint distribution of 
{Xj+1,j+1} is the same as that of {X; ;}. (iii) The 
texpectation g, = E(Xo_,) exists, and satisfies 
gZ — Ak for some constant A and for all 
k> 1. Kingman proved that if {X;,,} is a sub- 
additive stochastic process, then the limit é = 
lim,,,.(1/n)Xp,, exists a.e. and in the mean, 
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and E(€)=lim,_.,(1/n)g,. Note that if g is a 
measure-preserving transformation on a finite 
measure space (X, #,m) and if f is an element 
of L,(X), then X;,= 2f} fog?! for 0<i<k, 
where k=1,2,3... defines a subadditive pro- 
cess (in fact, X;,=X;,;+ X;, for i<j<k). The 
multiplicative ergodic theorem, proved by V. 
Oseledec, which plays a significant role in 
ergodic theory of tnonhyperbolic smooth 
dynamical systems and which has found ap- 
plications also in talgebraic groups, can be 
derived from the subadditive ergodic theorem 
[8,9]. 


C. Recurrence and Invariant Measures 


In this section we assume that the measure 
space (X, 4,m) is nonatomic. A nonsingular 
measurable transformation ¢ defined on 

(X, B,m) is called recurrent (infinitely recurrent) 
if for every set B and for almost all xe B, there 
exists an ne Z* (infinitely many ne Z*) such 
that p"(x)eB. A set W is called wandering 
under ¢ if 9 ”(W)N o *(W)=@ for n#k. The 
transformation ¢ is called conservative if no 
sets of positive measure are wandering under 
o, and incompressible if B> ' B implies 
m(B— ọ 'B)=0. The following statements 
about a nonsingular measurable transforma- 
tion ọ are equivalent: (i) ọ is recurrent; (ii) @ is 
infinitely recurrent; (iii) ~ is incompressible; 
(iv) ọ is conservative. An immediate con- 
sequence of this is the following recurrence 
theorem of Poincaré (in the form formulated 
by Caratheodory): A measure-preserving 
transformation on a finite measure space is 
infinitely recurrent. In fact, in order for a non- 
singular measurable transformation ọ to be 
recurrent (and hence infinitely recurrent) it 

is sufficient that there exist a finite measure 

H invariant under p and equivalent to (i.e., 
mutually tabsolutely continuous with) the 
given measure m. 

The invariant measure problem is one of the 
basic problems in ergodic theory and is for- 
mulated abstractly in the following way: Given 
a nonsingular measurable transformation p on 
a o-finite measure space (X, #, m), find neces- 
sary and sufficient conditions for the existence 
of a finite (or o-finite) measure invariant under 
o and equivalent to m. The given measure m 
specifies only the class of equivalent measures 
among which an invariant measure is to be 
found. Therefore we can assume without loss 
of generality that m is a finite measure. For the 
remainder of this section, unless we explicitly 
state otherwise, we always mean by an invar- 
iant measure the one that is equivalent to m. 

The Poincaré recurrence theorem states that 
gy being recurrent is necessary for the existence 
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of a finite invariant measure. The recurrence of 
ọ is, however, not sufficient, since an ergodic 
transformation with an infinite but o-finite 
invariant measure is recurrent and has no 
finite invariant measure. A necessary and 
sufficient condition for the existence of a finite 
invariant measure was given by a theorem of 
A. Hajian and Kakutani (1964) which states: @ 
has a finite invariant measure if and only if ọ 
has no weakly wandering sets (a set W is called 
weakly wandering under ¢ if there exists an 
infinite subset {n,} of Z* such that p ™ WN 
go "W=2%,k#¥j). Hajian also proved that a 
bimeasurable transformation ¢ has a finite 
invariant measure if and only if ¢ is strongly 
recurrent in the following sense: For every set 
E with m(E£)>0, there exists a positive integer 
k=k(E) such that 

max m(o"” JEN E)>0 

O<j<k 

for every ne Z. 

H. Furstenberg [11] obtained the following 
striking extension of the Poincaré recurrence 
theorem: If a bimeasurable transformation 
o possesses a finite invariant measure, then 
for every set E with m(E)>0 and for every 
integer k >2, there exists n>1 such that 
MEN @"EN p7"EN...@* Y"£)>0. From this 
theorem one can deduce a difficult theorem 
of E. Szemerédi on tarithmetic progressions 


which states: Any subset of the integers having — 


positive tupper density contains arithmetic 
progressions of arbitrary length. In fact, it is 
not difficult to show that the theorems of 
Furstenberg and of Szemerédi are mutually 
equivalent, and for this reason the theorem of 
Fürstenberg is sometimes referred to as the 
ergodic Szemerédi theorem. 

For a nonsingular bimeasurable transfor- 
mation ọ, a pair of sets A and B are said to 
be countably equivalent under ¢ if there exist 
countable decompositions {A,|keZ*} and 
{B,|keZ*} for A and B, respectively, and an 
infinite subset {n,' of Z such that pA, = B, 
for each k. A and B are said to be finitely 
equivalent under ¢ if finite decompositions 
{A,} and {B,} can be chosen. It was proved by 
Hopf that (i) ọ is recurrent if and only if no set 
of positive measure is finitely equivalent under 
gy to one of its proper subsets, and (ii) g has a 
finite invariant measure if and only if no set of 
positive measure is countably equivalent under 
ọ to one of its proper subsets. It can be shown 
that if o is ergodic, then @ has no o-finite 
invariant measure if and only if every pair of 
sets of positive measure are countably equiva- 
lent under o. 

The first example of a transformation ad- 
mitting no o-finite invariant measure was 
constructed by Ornstein in 1960. Since then, a 
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number of simpler examples have been ob- 
tained by L. Arnold, Brunel, and others. It is 
now known that there are many different types 
of transformations having no o-finite invariant 
measure (— Section F). Furthermore, it was 
shown by Ionescu-Tulcea that in the group of 
all nonsingular bimeasurable transformations 
with a suitable metric, those having a o-finite 
invariant measure form a subset of the tfirst 
category. 

Various extensions of results on the invar- 
iant measure problem for nonsingular trans- 
formations to the case of Markov processes 
having nonsingular transition probabilities 
were obtained by Y. Ito, J. Neveu, S. Foguel, 
and others [12]. 

For investigation of detailed properties of 
particular transformations arising from classi- 
cal dynamical systems, problems in number 
theory, and so on, rather than the existence of 
invariant measures it is more important to 
determine a specific form of an invariant mea- 
sure with desirable properties and to develop 
methods to decide when such a measure is 
unique. Various people have considered spe- 
cial classes of transformations; these workers 
have been able to obtain explicit descriptions 
of invariant measures with nice properties for 
the transformations in question and have 
derived interesting consequences. Most im- 
portant among these is the so-called Gibbs 
measure, introduced and investigated by Ya. 
Sinai. He obtained this notion by generalizing 
the concept of the equilibrium Gibbs distri- 
bution, which plays a prominent role in tstatis- 
tical mechanics. It is defined in the following 
way: Let X be a compact metric space, p a 
homeomorphism on X, and jg a probability 
measure on X invariant under o. For a func- 
tion g belonging to L,,(X) and for m, n>0 let 


Enalalto)= | exp $ Hos) Jao 


X 


and form a sequence of probability measures 
Hm,n(g) absolutely continuous with respect to 
Ho, for which the tRadon-Nikodym derivative 


Am. n(X|19) _ EXP LE -ngl x) 
Apo (x) cove „(g | Ho) 


holds. A measure that is a limit point, in the 
sense of tweak convergence, of the sequence of 
measures Hm, ,(g) is called a Gibbs measure 
constructed from jg and g. It is clear that a 
Gibbs measure is invariant under ø. For mix- 
ing topological Markov shifts (— Section D), 
Sinai showed that if one starts with the invar- 
iant measure jọ of maximal entropy (-~ Sec- 
tion H), the existence of which was earlier 
shown by W. Parry, one can determine a class 
of functions g for which the Gibbs measure 
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is unique, i.e., u(g)=liM,, n+ Hm. n(g) exists. 
Sinai further investigated the properties of the 
unique measure u(g) in detail. R. Bowen and 
D. Ruelle, defining the Gibbs measure some- 
what differently, investigated the existence 
and uniqueness of such measures, thereby 
recapturing the results of Sinai in the case of 
mixing topological Markov shifts. With the 
aid of Markov partitions (— Section G), these 
results on Gibbs measures were carried over 
to the case of tAnosov and “taxiom A” diffeo- 
morphisms, and they provided essential tools 
in the investigation of the ergodic behavior of 
these transformations. For details — [13, 14]. 

The explicit form of the density of the in- 
variant measure with respect to the Lebesgue 
measure for the transformation associated 
with fcontinued fraction expansion was al- 
ready known to Gauss, and one can draw 
from it numerous conclusions about metric 
properties of continued fractions. The way in 
which Gauss was able to determine this invar- 
iant measure, however, was never explained. 
Recently, Sh. Ito, H. Nakada, and S. Tanaka 
(Keio Eng. Rep., 30 (1977)) developed an inter- 
esting method to describe the mechanism used 
to arrive at this density function for the invar- 
iant measure for the continued fraction trans- 
formation. They employed similar methods in 
subsequent work to determine explicit density 
functions for invariant measures for other 
related number-theoretic transformations and 
for certain classes of continuous mappings 
over an interval; by means of the explicit forms 
of invariant measures they were able to de- 
scribe the metric properties of these transfor- 
mations in detail. 


D. Examples and Construction of Measure- 
Preserving Transformations 


Examples of measure-preserving transforma- 
tions appear in many different contexts. We 
describe some of the important ones. 

(1) Let G be a *locally compact Abelian 
group Satisfying the second axiom of tcounta- 
bility (— 423 Topological Groups), Z a a- 
algebra of Borel subsets of G, and m its tHaar 
measure (normalized if G is compact). Then 
(G, 2,m) is a Lebesgue measure space. For a 
fixed element g,éG, define the transforma- 
tion 9, ,:G—G by ¢,,(g)=9 +9o. Then @,, is 
a bijective measure-preserving transforma- 
tion on (G, 2, m) and is called the rotation 
on G by the element go. If G is compact, then 
the rotation œ, is ergodic if and only if the 
cyclic subgroup generated by the element go is 
dense in G. If this happens, the element gg is 
called the topological generator of G. A group 
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is called monothetic if it has a topological 
generator. 

(2) If ọ is a group tendomorphism of a com- 
pact Abelian group G, then ọ preserves the 
Haar measure m. If ọ is a group tautomor- 
phism, then it is a bijective measure-preserving 
transformation on (G, 4, m). A continuous 
group automorphism @ induces a group auto- 
morphism ø* of the character group G*. 

The measure-preserving transformation ¢ is 
ergodic if and only if every character except 
the identity has an infinite orbit under the 
induced automorphism g*. When the group 
is the n-dimensional torus T”, a continuous 
group automorphism ¢ is uniquely repre- 
sented by an n x n matrix with integer entries 
and with determinant +1. In this case, ¢ is 
ergodic if and only if no roots of unity appear 
among the teigenvalues of the representing 
matrix. 

(3) Let (Y, £) be a measurable space, let 
(Y,,, %,) =(Y, £) for each ne Z, and define 
(Y¥*, ./*) to be the *product measurable space 
(lez Yn Inez Sna). The transformation @ 
defined on (Y*, ./*) by 


y*=(..., V1, Yo Y1) 
> p(y*)=(..., Y1 Vor Yis) 


with y= y,4, for each n is called the shift 
transformation. Let be a probability measure 
on (Y, £) such that (Y, 7, u) is a Lebesgue 
measure space, let „= u for each n, and define 
u* to be the tproduct measure [Inez Hn on 

(Y*, 4*). Then (Y*, /*, u*) is a Lebesgue 
measure space, and the shift transformation @ 
is a bijective measure-preserving transforma- 
tion. Considered with the product measure, @ 
is called a generalized Bernoulli shift. When the 
set Y is at most countable and the measure p 
on Y is given by a sequence {p;} of positive 
numbers with © p;= 1, ¢ is called a Bernoulli 
shift. Suppose that P(y, A) is a Markov tran- 
sition function on (Y, /) and r is a proba- 
bility measure invariant under P(y, A). Then 
we can define the Markov measure z* on the 
product space (Y*, ./*) by setting 


renf ni |, mard Poodvo) 


x P(y,,dy2)... P(Ys-1; dYs), 


for a cylinder set 


e-q) A) 
j<i i<k<its n>its 


and extending it to all of ./*. The shift trans- 
formation g preserves the Markov measure 
n*, Considered as a measure-preserving trans- 
formation on (Y*, ../*, 2*), ọ is called a Mar- 
kov shift. 
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A generalized Bernoulli shift is atways 
ergodic. A Markov shift is ergodic if and only 
if the corresponding Markov process is tirre- 
ducible, which is the case if and only if the 
following property is satisfied: For every pair 
of sets A and B with x(A)x(B)>0, there exists 

an neZ* such that f4 P"(y, B)dn >0. 
There are other measures besides the prod- 
uct measure and the Markov measure that 
can be defined on the product space (Y*, æ *) 
and are invariant under the shift transforma- 
tion g. For example, if Y is a Borel subset of 
R and .x is the a-algebra of Borel subsets of Y, 
then any tstationary stochastic process taking 
values in Y induces such a measure on 
(Y*,.¥*). When considered with a measure of 
this type, the shift transformation ø is called 
the shift associated with the stationary process. 
Properties of the shift associated with a 
tstationary Gaussian process have been inves- 
tigated by G. Maruyama, I. Girsanov, H. 
Totoki, and others. In particular, it is known 
that the shift is ergodic if and only if the tspec- 
tral measure for the tcovariance function of 
the associated Gaussian process is continuous 
[15]. 

(4) Other important examples of measure- 
preserving transformations arise from classical 
dynamical systems, which will be described in 
Section G. 

(5) There are several ways of constructing 
new measure-preserving transformations from 
given ones. We describe important cases. 

(1) Let ọ be a nonsingular, measurable, 
recurrent transformation (not necessarily 
measure-preserving) on a a-finite measure 
space (X, #,m), and let A be a set of positive 
measure. For xe A, let n(x)=min{neZ* | 
"(x)é A}. The transformation g,:A—>A de- 
fined by o,4(x)=o"™(x) is a nonsingular mea- 
surable transformation on the measure space 
(A, ANA, m,), where m,(B)=m(AN B)/m(A), 
and it is measure-preserving if ¢ is. We call 
p4 the transformation induced by ¢ on A. It is 
ergodic if @ is ergodic. 

(ii) Let ọ be a nonsingular measurable 
transformation on a a-finite measure space 
(X, 2,m), and suppose that {A,} is a countable 
(possibly finite) partition of X. Define a func- 
tion f:X>Z* by setting f(x)=n for xe A,, 
and let X ={(x,j)|xeX, 1<j<f(x)} bea 
subspace of the product measure space (X x 
Z*,Bx€,mx u), where @ is the o-algebra 
of all subsets of Z*, and u is the measure on 
(Z*,@) defined by u({n})=1 for each ne Z+. 
The transformation @ defined on X by ĝ(x,j) 
=(x,j+1) if 1<j</f(x) and =(¢(x), 1) if j= 
J(x) is a nonsingular measurable transforma- 
tion on the measure space (X,(@ x @).X, (mx 
u)z), and it is measure-preserving if @ is. We 
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call @ the transformation built from ¢ with the 
ceiling function f. If o is ergodic and m(A,)0 
as no, then @ is also ergodic. 

(iii) Suppose that wy is a measure-preserving 
transformation on (X, 2, m) and 9, isa 
measure-preserving transformation on 
(Y, £, u) for each xe X. Assume that the map- 
ping (x, y)> ,(y) is measurable with respect 
to the o-algebras Z x æ and £. The trans- 
formation @ defined on the product space 
(X x Y, Bx of, mx u) by A(x, y)= (W(x), P(Y) 
is measure-preserving and is called the skew 
product of y and {¢,}. If 9,=@ for all xe X, 
then we get a direct product transformation 
A(x, y) = (W(x), OC). 

(iv) A measure-preserving transformation p 
on (Y, £, u) is said to be a factor transforma- 
tion (or a homomorphic image) of a measure- 
preserving transformation w on (X, &, m) if 
there exists a measurable transformation y 
from X onto Y such that mon~! = u and py = 
ný. If 2 is a o-subalgebra of 2 and y leaves 
R invariant (i.e., Y~! 8B <B’), then y induces a 
factor transformation ø on the measure space 
(X, 2',m). Conversely, if a measure-preserving 
transformation ọ on (Y, £, u) is a factor trans- 
formation of y on (X, 2, m) via a mapping n, 
then Z' =n | oA is a o-subalgebra of 2 invar- 
iant under y. 

(6) A one-parameter family {p,|teR} of 
bijective measure-preserving transformations 
on a measure space (X, 2,m) is called a flow. 
A flow is called continuous if the mapping t> 
T, is weakly continuous where {T;} is the 
one-parameter family of tunitary operators 
on L?(X) induced by the flow {¢,}. A flow is 
called measurable if the mapping (t, x) > ¢,(x) 
is a measurable transformation of R x X into 
X. A measurable flow is continuous. A. Ver- 
shik and Maruyama proved that for any con- 
tinuous flow there exists a measurable flow, 
unique in a specified sense, which is spatially 
isomorphic (in the sense specified in Section E) 
to the given flow. 

Important examples of flows are given by 
classical dynamical systems (— Section G), 
and by continuous-time stationary stochastic 
processes. 

An important tool in the study of flows is 
provided by the theorem of W. Ambrose and 
Kakutani: Every measurable ergodic flow 
without a fixed point is spatially isomorphic to 
an S-flow. A measurable flow {¢,} is called an 
S-flow (special flow or flow built under a func- 
tion) if there exist a measure-preserving trans- 
formation @ of a measure space (X, 4, m) and 
an R*-valued function f on (X, 2, m) such 
that each ¢, is a measure-preserving trans- 
formation on the subspace X = {(x, u)|xeX, 
O<u<f(x)} of the product measure space 
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(X xR+, 2 x M,mx å) given by 
p(x, u) 


=(x,u+t) if —u<t<—u+ f(x), 


n—-1 
= (outi T orc) if 
n-1 n 
—ut § f(p*(x))<t<—ut+ ¥ f(e*(x), 
k=0 k=0 
=f 
=(orshusee X so) if 


—1 ~1 
SA flo") <t< Eg Z Ao), 
=- =-n+1 
n>1. Here æ is the a-algebra of Borel subsets 
of R*+, and å is the usual Lebesgue measure. 


E. Isomorphism Problems 


In this section we assume that the Lebesgue 
measure spaces considered are probability 
spaces. For simplicity, following the common 
usage among Russian mathematicians, we 
call a measure-preserving transformation on 
(X, 2,m) an endomorphism and a bijec- 

tive measure-preserving transformation an 
automorphism. 

An automorphism g; (a flow {g{")!) on 
(X,,4,,m,) is said to be spatially isomorphic 
(or metrically isomorphic) to an automorphism 
o, (a flow {¢!?}) on (X,, B, m,) if there exist 
sets N, and N, with m,(N,)=m,(N,)=0 anda 
bijective measurable transformation @ from 
X, —N; to X,—N, such that m,o0=m, and 
00, = 9,70 (Oe! = o 0 for each t). 

Classification of automorphisms and flows 
into isomorphism classes constitutes the cen- 
tral problem of modern ergodic theory. Prop- 
erties of automorphisms and flows that are 
preserved under spatial isomorphisms are 
called isomorphism invariants (or metric in- 
variants). There are several isomorphism tn- 
variants that are essential to the study of the 
isomorphism problem. We describe these 
below for the case of automorphisms. There 
are corresponding invarianis for flows as well. 


(1) Spectral Invariants. Two automorphisms ¢, 
and g, are said to be spectrally isomorphic if 
the tunitary operators T, and T, induced by 
@, and ọ, on the Hilbert spaces L,(X,) and 
L,(X>), respectively, are unitarily equivalent 
(i.e., there exists an isometric isomorphism V 
of L,(X,) onto L,(X,) such that VT, = T, V). 
Properties preserved under spectral isomor- 
phisms are called spectral invariants (or spec- 
tral properties). If p; and ọ, are spatially iso- 
morphic, it is clear that they are spectrally iso- 
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morphic; but the converse is not true in 
general. 

(i) The property of g being ergodic is a 
spectral property since @ is ergodic if and only 
if the number 1 is a simple eigenvalue of the 
induced unitary operator T. If @ is ergodic, 
then the set of all eigenvalues of the induced 
operator T forms a subgroup of the circle 
group, each eigenvalue is simple, and each 
teigenfunction has constant absolute value. If 
the tspectrum of the induced operator T con- 
sists entirely of eigenvalues, @ is said to have 
discrete spectrum (or pure point spectrum). A 
theorem due to von Neumann and P. Halmos 
—the first theorem on the question of isomor- 
phism-—states that two ergodic automor- 
phisms p, and ¢, with discrete spectra are 
spatially isomorphic if and only if they are 
spectrally isomorphic, which is the case if and 
only if the induced operators T, and T, have 
the same set of eigenvalues. Furthermore, 
every ergodic automorphism with discrete 
spectrum is spatially isomorphic to an ergodic 
rotation on a compact Abelian group [2]. 

Analogous results were obtained by L. 
Abramov for a bigger class of automorphisms, 
namely, for ergodic automorphisms having so- 
called quasidiscrete spectra. 

(ii) An automorphism ø is ergodic if and 
only if for every pair of sets A, B, 

n-1 
lim Al Y m(p*(A)N ») =m(A)m(B). 
n>% N \k=0 
Strengthening this condition, we can define p 
to be weakly mixing if for every pair of sets A, 
B 


3 


noo n 


lim (5 Imota) B-m Am (B) )=0 


strongly mixing if 


lim m(p*(A)N B)=m(A)m(B); 


and k-fold mixing if for arbitrary choice of sets 
A; j=0,1,...,k, 


limm(A),N@"A,N@e"A,... 1 @™A,) 
=m(A,)m(A,) one m(A,), 


where the limit is taken as n,,n3,...,n,— 

oo in such a way that n; <n, <... <n, and 
min; <jcx(Nj—Nj-1) > ©. The property of an 
automorphism ọ being weakly mixing or 
strongly mixing is a spectral property. For 
instance, ~ is weakly mixing if and only if the 
number 1 is a simple eigenvalue and is the 
only eigenvalue of the induced operator T. It 
is also known that ọ is weakly mixing if and 
only if the direct product automorphism ¢ x @ 
is ergodic. The set of all weakly mixing auto- 
morphisms forms a dense tG;-set in the group 
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of all automorphisms on (X, ¥, m) considered 
with the so-called weak topology (Halmos’s 
theorem). On the other hand, it was shown 
by V. Rokhlin that the set of all strongly mix- 
ing automorphisms is a set of first category 
with respect to the weak topology. However, 
there are only a few known examples of auto- 
morphisms that are weakly mixing but not 
strongly mixing. 

(tit) An automorphism @ is said to have 
countable Lebesgue spectrum if the maximal 
spectral type of the induced unitary operator 
T restricted to the torthocomplement of the 
subspace of constant functions in L, (X) is 
equivalent to the Lebesgue measure and its 
tmultiplicity is countably infinite. 

(iv) An automorphism ọ is called a K- 
automorphism (or Kolmogorov automorphism) 
if there exists a o-subalgebra 2, of 2 such that 
(a) Po D By and PB # Bo, (b) VnezP" Bo = B, 
and (c) Anez” Bo =, where M is the o- 
subalgebra of # consisting of null sets and 
their complements. The notion of a K-flow 
(or Kolmogorov flow) is defined similarly. K- 
automorphisms are k-fold mixing for all orders 
k and have countable Lebesgue spectra. 

Generalized Bernoulli shifts are all K- 
automorphisms. An ergodic Markov shift is a 
K-automorphism if and only if it is strongly 
mixing, which is the case if and only if the 
corresponding Markov process is tirreducible 
and taperiodic (— 260 Markov Chains B). A 
continuous group automorphism of a compact 
Abelian group is a K-automorphism if and 
only if it is ergodic. In particular, a continuous 
group automorphism of the n-dimensional 
torus T” is a K-automorphism if and only if no 
roots of unity appear among the eigenvalues of 
the representing matrix. Automorphisms and 
flows arising from classical dynamical systems 
also provide examples of K-automorphisms 
and K-flows. In particular, a tgeodesic flow on 
a surface of negative curvature is a K-flow, 
and each automorphism (except the identity) 
of this flow is a K-automorphism. For the shift 
transformation ọ associated with a stationary 
Gaussian process, it was shown by Maruyama 
[15] that ọ is (a) weakly mixing if and only if 
it is ergodic, (b) strongly mixing if and only 
if the covariance function of the associated 
Gaussian process tends to 0 as n>, and (c) 
a K-automorphism if and only if the fspec- 
tral measure of the covariance function is 
absolutely continuous with respect to the 
Lebesgue measure. 

(v) Examples of automorphisms having 
various types of spectra have been constructed 
by a number of authors by using stationary 
Gaussian processes and the theory of approxi- 
mation developed by A. Katok and A. Stepin 


[16]. 
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(vi) An ergodic automorphism with quasi- 
discrete spectrum has a mixed spectrum, that 
is, the spectrum of the induced unitary opera- 
tor T has a continuous component and eigen- 
values in addition to 1. Anzai (1951) con- 
structed a special class of skew product auto- 
morphisms having mixed spectra and showed 
that in this class there are automorphisms that 
are spectrally isomorphic but not spatially 
isomorphic. However, the question of whether 
two spatially nonisomorphic automorphisms 
exist among automorphisms having the same 
purely continuous spectrum remained un- 
answered for a long time, until in 1958 Kol- 
mogorov (Dokl. Akad. Nauk SSSR, (5) 119 
(1958)) settled it affirmatively by using a new 
isomorphism invariant called entropy. 


2. Generators and Entropy. (i) By a partition č 
={A,} of the space X we mean a collection of 
sets A, such that A, N A; = Ø whenever 141 
and |] A,=X. We denote by e the partition of 
X into individual points, and by v the trivial 
partition {X}. A partition into a finite (count- 
able) number of sets is called a finite (count- 
able) partition. A partition é is said to be 

finer than another partition ¢ (or ¢ is coarser 
than č) if for every Acé there is a set Bel such 
that ACB. For a collection {¢,} of partitions 
of X, we denote by \/, €, the coarsest partition 
that is finer than each €,, and by A, é, the 
finest partition that ts coarser than each €,. If 
€, = {An} is a sequence of countable (or finite) 
partitions, then \/,¢, is precisely the parti- 

tion of X into nonempty intersections of the 
form ()\, Ay,,, With A,,,,€& for each k. With 

a partition č of X we associate a o-subalgebra 
B(é) of B which is the a-algebra of all 2- 
measurable sets that are a union of elements in 
č. Two partitions € and ¢ are said to coincide 
a.e. if BE) = All) a.e. (i.e., for every Ac BE) 
there exists a set B in Z(¢) such that m(A U 

B— AN B)=0 and conversely). 

(ii) Suppose that ọ is an endomorphism of 
(X,¥,m) and é a partition of X. By gp 1é we 
mean the partition {gp '(A)| Ae}. If ọ is an 
automorphism, we also define pé = {~(A)| 
Aeé}. A partition é is called a generator for 
an endomorphism 9 if Vo "E=eas. If 
Viz -w P” =£ a.s., € is called a two-sided gen- 
erator for an automorphism ọ. 

An endomorphism ọ is said to be periodic at 
a point xe X if there exists a positive integer n 
such that p"(x)= x and aperiodic if the set of 
points of periodicity has measure zero. If the 
measure space (X, #,m) is nonatomic, then 
every ergodic endomorphism on it is aperi- 
odic. A theorem of Rokhlin states that every 
aperiodic automorphism ¢ has a countable 
two-sided generator. This implies that every 
such ¢ is spatially isomorphic to the shift 
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transformation on the infinite product space 
(Y*,.o*) considered with some invariant mea- 
sure y*, where each coordinate space Y= Y 
has at most a countable number of points. 
Krieger improved this result by showing that if 
an ergodic automorphism ¢ has finite entropy, 
then ¢ has a finite two-sided generator [17]. 
(iii) For a finite or countable partition č = 
{A,}, define the entropy H(¢) of the partition 
to be — È, m(A,)log(m(A,)) (the logarithms 
here and below are natural logarithms). We 
denote by Z the set of all partitions č with 
H({)< œ. If ée 7, then for any endomorphism 
ọ the limit 


h(p, ¢)= lim — au (Vv os) 


exists and is finite. The entropy h(q) of the 
endomorphism ¢ is defined to be sup {h(9, €)| 
€€#} and is an isomorphism invariant. Prop- 
erties of entropy have been investigated exten- 
sively since the notion was introduced by 
Kolmogorov. We cite a few results. 

(a) If a partition če ¥ is a generator for an 
endomorphism ø or a two-sided generator for 
an automorphism o, then h(g)=h(q, č) (Sinai’s 
lemma). (b) For every integer n, h(@")=|n|h(9), 
and for a measurable flow {@,}, hlo) =|t|h(@,) 
for every real number t. (c) If an automor- 
phism ¢ is periodic, then h(@)=0. (d) If g, is a 
factor transformation of p, then h(@,)<h(@,). 
(e) hlp x G2) =h(p,) + hlp). (There is a more 
complicated formula (due to Rokhlin) for the 
entropy of a skew product automorphism.) 

(f) If ọ is a recurrent automorphism and @, 

is the automorphism induced by ọ on a sub- 
set A with m(A)>0, then h(g,)=h(@)/m(A) 
(Abramov’s formula). (g) If o is a Bernoulli 
shift with probability distribution {p,}, then 
h(g)= —&, p, |og p,. (h) If g is a Markov shift 
based on the Markov transition probability P, 
(defined on a countable or finite state space) 
and an invariant measure z;, then 


h(p)= TL2 n Plog Pj 


(i) If ọ is ergodic and has a pure point spec- 
trum, or more generally, has a quasidiscrete 
spectrum, then h(¢)=0. (j) For an ergodic 
group automorphism ¢ on an n-dimensional 
torus, h(@)= > log{A|, where the sum is taken 
over all eigenvalues 4 of modulus > 1 of the 
representing matrix. (k) If an automorphism 
has positive entropy, then in L,(X) there exists 
a subspace invariant under the induced uni- 
tary operator T such that the spectrum of T 
restricted to this subspace is countable Le- 
besgue (Rokhlin’s theorem). It follows from 
(k) that automorphisms with ‘singular spectra 
or spectra of finite multiplicity must have zero 
entropy. In proving assertion (g), Kolmogorov 
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established for the first time the fact that there 
are uncountably many spatially nonisomor- 
phic Bernoulli shifts. 

(iv) An automorphism ø is said to have 
completely positive entropy if h(o, €)>0 for 
every partition ¢#¥v. It was shown by Rokhlin 
and Sinai that an automorphism ¢ has com- 
pletely positive entropy if and only if ọ is a K- 
automorphism. M. Pinsker proved that for 
every automorphism ¢ there exists a partition, 
called the Pinsker partition, that is invariant 
under @ and such that the factor transforma- 
tion of @ with respect to this partition has 
zero entropy and is the largest among the 
factor transformations of o with zero entropy. 

(v) Rokhlin showed that h(g~)=0 for an 
endomorphism ọ if and only if all of its factor 
transformations are automorphisms. An endo- 
morphism ¢ is called exact if Ao Q "e=v a.e. 
Rokhlin introduced a way to associate with 
each endomorphism a certain automorphism, 
called the natural extension, which reflects the 
properties of the endomorphism. For example, 
an endomorphism and its natural extension 
are simultaneously ergodic or nonergodic, are 
mixing of the same order, and have equal 
entropy. The natural extension of an exact 
endomorphism is a K-automorphism. 

(vi) Automorphisms p, and @, are said to be 
weakly isomorphic if each of them is a factor 
transformation of the other. Sinai proved 
(1964) that for each ergodic automorphism 
with positive entropy, there exists a factor 
automorphism having the same entropy and 
isomorphic to a Bernoulli shift, and hence it 
follows in particular that Bernoulli shifts with 
the same entropy are weakly isomorphic. 
Ornstein (Adv. in Math., 4 (1970)) went further 
and succeeded in proving the following re- 
markable result: Two Bernoulli shifts with 
equal entropy are spatially isomorphic. Par- 
tial results in this direction were obtained 
earlier by L. Meshalkin, J. Blum, and D. Han- 
son. In the proof of Ornstein’s theorem, essen- 
tial use was made of the following theorem of 
C. Shannon and B. McMillan, which plays a 
fundamental role in information theory (— 
213 Information Theory): Suppose that ¢ is 
an ergodic endomorphism on (X, ¥,m) and č 
is a partition of X. For a point xe X, let A,(x) 
denote the element in the partition \/=4 o *é 
that contains x. Then for almost all x, 


lim ( —“log mAy00)) 
exists and equals h(g, č). 

(vii) Techniques and ideas developed by 
Ornstein in his proof of the isomorphism 
theorem have been refined and extended fur- 
ther by himself, B. Weiss, N. Friedman, M. 
Smorodinsky, and others, and numerous re- 
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sults were subsequently obtained on the iso- 
morphism problem. We describe below some 
of the main results and basic concepts; for 
more detailed accounts — [21-24]. 

A sequence {€,} of partitions is said to be 
independent if for every choice of sets A,,..., A, 
with AE čno M \k-1 A,) = i=; m(A,) when- 
ever n,,M>,...,n, are all distinct. For a fixed 
é>0, two partitions € and ¢ are said to be g- 
independent if 


© |m(ANB)—m(A)m(B)| <e. 


Ae€é, Bel 


A pair (g, č), where ọ is an automorphism 
of (X, #,m) and € is a finite (or countable) 
partition of X, is called a process (on X). 
Be ~~ 9") is a o-Subalgebra of 2 invariant 
under o, and ¢ restricted to this o-subalgebra 
is a factor automorphism of ọ and is isomor- 
phic to a shift transformation on an infinite 
product space, where each coordinate space 
has the same number of points as the number 
of atoms in č. A process (¢, č) is called a Ber- 
noulli process (or an independent process) if the 
sequence of partitions {p"é|ne Z} is indepen- 
dent, and a weak Bernoulli (W.B.) process if 
for every ¢>0 there exists a k >0 such that 
the partitions V£ gé and V£ _, g'é are s- 
independent for all n>0. Let J be a finite 
set and x, $ be two functions from the set 
{1,2,...,N} to J. By dy(a, B) we denote (1/N) 
#{n|a(n)#B(n)}, where # denotes the car- 
dinality. dy defines a metric on the space 
J20 called the Hamming distance. Now, 
for a process (p, €) on X with €={A,|jeJ}, 
we define for x, ye X, dz(x, y) to be equal to 
dy (EN (x), E3 (y)), where for xe X, €*(x) is the 
point (GOJE), Ap "oped 2") 
and j(x) is the index jeJ for which xe A;. A 
process (q, č) is called very weak Bernoulli 
(V.W.B.) if for any ¢>0 there exists an No 
such that whenever N > N, there is a set G 
with m(G)> 1—« belonging to 2 (V - ep") 
satisfying the following condition: for any 
AEB (V-o 9"€) with ACG and m(A)>0, 
there exists a probability measure v on X x X 
satisfying (1) v(E x X)=m(E) and v(X x F)= 
m,(F) for all E, FeB and (ii) fy. xdi(x, y)dv< 
e. It is not difficult to show that W.B. pro- 
cesses are V.W.B. For processes (Q, č) on 
(X, B,m) and (@, č) on (X, Z,m), where both č 
and č are indexed by the same finite set J, we 
define dy((p, $, (@, E) to be inf {fy ydylčž 00, 
E*(x))dp(x, x)}, where inf is taken over all 
probability measures p on X x X satisfying 
p(E x X)=m(E) for all EeB and p(X x E)= 
m(E) for all Ee@. dy decreases with N, and 
we define d((@, €), (@, €)) to be limy,.. dy((@, ¢), 
(9, €)). Finally, a process (ø, č) is said to be 
finitely determined (F.D.) if for any ¢>0 there 
exists a 6>0O and N such that if (@, €) is any 
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process satisfying (i) ë and č are indexed by 
the same set J, (ii) hlo, €)—h(@, č) <ô, and 

(iii) dy((g, č), (p, €)) <4, then d((¢, č), (G, €)) <€. 
Let us call a partition č an F.D. generator for 
an automorphism @ if č is a two-sided gen- 
erator for g and the process (ọ, č) is finitely 
determined. A general isomorphism theorem 
proved by Ornstein states: If automorphisms 
and @ have the same entropy and both have 
F.D. generators, then p and @ are isomorphic. 
Ornstein and Weiss proved further that the 
process (¢, č) is F.D. if and only if it is V.W.B. 
From these basic results a number of results 
can be deduced. (a) Nontrivial factors of Ber- 
noulli shifts are Bernoulli. (b) Strongly mixing 
Markov shifts have two-sided weak Bernoulli 
generators and hence are isomorphic to Ber- 
noulli shifts. (c) Every Bernoulli shift can be 
embedded in a flow, which implies among 
other things that Bernoulli shifts have roots of 
all orders. (d) It was shown by Y. Katznelson 
that every ergodic group automorphism of an 
n-dimensional torus has a two-sided generator 
which can be shown to be V.W.B., and hence 
is also isomorphic to a Bernoulli shift. R. Adler 
and B. Weiss earlier proved by entirely differ- 
ent methods that on a 2-dimensional torus, 
two ergodic group automorphisms having the 
same entropy are spatially isomorphic. They 
have done this by constructing a two-sided 
generator č for such automorphisms which is 
also a Markov partition (— Section G). 

(viii) Among further positive results there 
are the following: (a) Except for a trivial 
change in the time scale, any two Bernoulli 
flows are isomorphic (Ornstein). (b) Every 
ergodic group automorphism of a compact 
group is isomorphic to a Bernoulli shift 
(Thomas and Miles; Lind). (c) A number of 
automorphisms and flows arising from classi- 
cal dynamical systems are shown to be Ber- 
noulli (— Section G). (d) Examples of exact 
endomorphisms arise in connection with prob- 
lems in number theory; for example, tcon- 
tinued fraction expansion and f}-expansion. 
The natural extensions of the exact endomor- 
phisms associated with continued fraction 
expansion and f-expansion are now known 
to be spatially isomorphic to Bernoulli shifts. 

(ix) Since many of the automorphisms that 
have been shown to be K-automorphisms are 
now known to be isomorphic to Bernoulli 
shifts, it is natural to expect that every K- 
automorphism is in fact Bernoulli. However, 
Ornstein (1973) constructed an example of a 
K-automorphism that is not a Bernoulli shift. 
A lot of work has since been done to investi- 
gate how bad K-automorphisms can be. It 
turns out that K-automorphisms share almost 
none of the finer properties of Bernoulli shifts. 
For example: (a) There are uncountably many 
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nonisomorphic K-automorphisms of the same 
entropy (Ornstein and P. Shields). (b) There is 
a K-flow that is not Bernoulli, and there are 
uncountably many nonisomorphic K-flows 
having the same entropy at time one (Smoro- 
dinsky). (c) There exists a K-automorphism 
not isomorphic to its inverse (Ornstein and 
Shields). (d) There exists an automorphism 
that cannot be written as a direct product of a 
K-automorphism and an automorphism with 
zero entropy. This example shows that a con- 
jecture made earlier by Pinsker was false (Orn- 
stein). (e) There are weakly isomorphic K- 
automorphisms that are not isomorphic (Polit 
and Rudolph). 


F. Weak Equivalence and Monotone 
Equivalence 


(1) Weak Equivalence. In order to construct 
examples of ffactors in the theory of von Neu- 
mann algebras (— 308 Operator Algebras), 

F. Murray and von Neumann considered 
various ergodic groups of bimeasurable non- 
singular transformations on a finite measure 
space. In this context a group Y= {g} of trans- 
formations is called ergodic if m((g~!(B)U B)— 
(g '(B)N B))=0 for every ge% implies either 
m(B)=0 or m(X — B)=0. A measure u is said 
to be invariant under the group = {g} if u is 
invariant under every transformation g in 4. 
Murray and von Neumann’s construction (the 
so-called group measure space construction) 
gives a type II, tfactor if the group admits a 
finite equivalent invariant measure, a type II, 
factor if it admits an infinite (but o-finite) equiv- 
alent invariant measure, and a type III factor if 
it has no o-finite equivalent invariant measure. 
In this connection we note that Hajian and Y. 
Itô extended the theorem of Hajian and Kaku- 
tani and proved that an arbitrary group = 
{g} of nonsingular bimeasurable transforma- 
tions admits a finite invariant measure if and 
only if no set of positive measure is weakly 
wandering under the group @ (a set W is said 
to be weakly wandering under a group & if 
there exists an infinite subset {g,|neZ*} of Y 
such that g,(W)N9,(W)= @ for n#k). Analo- 
gously to the terminologies used in the theory 
of von Neumann algebras, we define ergodic 
countable group Y (or an ergodic bimeasurable 
nonsingular transformation ø) to be of type 
I, I, or IH if Y (or o) has a finite equivalent 
invariant measure, a o-finite infinite equivalent 
invariant measure, or no o-finite equivalent 
invariant measure, respectively. 

For a bimeasurable nonsingular transfor- 
mation ¢, define the full group [o] to be the 
group of all bimeasurable nonsingular trans- 
formations Ņ such that, for some n=n(x), W(x) 
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= "(x) for almost all x. Two transformations 
gy, and @, are said to be weakly equivalent if 
there exists a bimeasurable nonsingular trans- 
formation @ such that 6[g,]0-' =[@,]. H. Dye 
proved that any pair of type IT, transforma- 
tions are weakly equivalent. It easily follows 
from this that the same is true for type IT,, 
transformations. Krieger showed, on the other 
hand, that among type ITI transformations 
there are uncountably many weakly non- 
equivalent ones. In fact, Krieger [28] intro- 
duced an invariant for weak equivalence, 
called the ratio set, by means of which he 
classified type III transformations into mutu- 
ally weakly nonequivalent subclasses III, for 
O<A<1. Krieger showed further that (a) for 
0<A<1, every pair of transformations in the 
class HI, is weakly equivalent; (b) in the class 
III, there are uncountably many mutually 
weakly nonequivalent transformations; and 
(c) two ergodic transformations are weakly 
equivalent if and only if the corresponding 
factors constructed via the group measure 
space construction are tx-isomorphic. An 
ergodic flow of nonsingular transformations 
(not necessarily measure-preserving) called the 
associated flow was constructed independently 
by Krieger [29] and by T. Hamachi, Y. Oka, 
and M. Osikawa [30], and was shown to give 
another effective invariant for weak equiva- 
lence. Krieger showed that the mapping which 
assigns to each ergodic transformation its 
associated flow gives a tbijective mapping 
between the set of all weak equivalence classes 
of ergodic type III, transformations and the 
set of all isomorphism classes of aperiodic 
conservative ergodic flows. In this connection 
the theorem of U. Krengel and I. Kubo on the 
representation of ergodic flows, extending the 
theorem of Ambrose and Kakutani mentioned 
in Section D, plays a significant role. 

A bimeasurable nonsingular transformation 
8 is called a normalizer of another transforma- 
tion ¢ if it satisfies 0[@]=[@]9. The set of all 
normalizers “M (p) of a transformation o forms 
a group called the normalizer group which 
contains the full group [~] as a subgroup. One 
can introduce a suitable topology to (@) 
to make it a complete separable metrizable 
group. Hamachi [31] has shown that, for type 
IIF transformations o, the quotient group 
N(p)/[e]”, where [p] denotes the closure of 
[o], is algebraically and topologically isomor- 
phic to the tcommutant of the associated flow. 

Results obtained by Krieger and others 
mentioned above were motivated in part by 
corresponding developments in von Neumann 
algebra theory due mainly to A. Connes (— 
308 Operator Algebras). From the recent deep 
result of Connes on the uniqueness of tapprox- 
imately finite-dimensional factors of type II, 
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it follows that every approximately finite- 
dimensional factor with the exception of type 
III, is *-isomorphic to a factor constructed 
from an ergodic transformation via the group 
measure space construction. 


(2) Monotone Equivalence. The theory of weak 
equivalence discussed above deals with the 
structure of orbits of a transformation or of 
groups of transformations. In fact, transfor- 
mations p and w are weakly equivalent if and 
only if there exists a bimeasurable nonsingular 
transformation 0 mapping the -orbit of 
almost all x onto the W-orbit of 0(x). A some- 
what more stringent notion of equivalence 
dealing with orbit structure is called mono- 
tone equivalence or Kakutani equivalence. We 
say that measurable flows {¢,} and {@,} of 
measure-preserving transformations on finite 
measure spaces (X, 2, m) and (X, 2, m), respec- 
tively, are monotonely equivalent if there 
exists a bimeasurable measure-preserving 
transformation 8 on X to X such that for 
almost all xe X and all te R, 09,(x)= Pra, (x), 
where t(t, x) is a monotone increasing function 
of t. Two S-flows (— Section D) built over the 
same base transformation with different ceiling 
functions are monotonely equivalent, and it 
can be shown that monotonely equivalent 
ergodic flows are isomorphic to S-flows built 
over the same base transformation. This in- 
duces an equivalence relation on transforma- 
tions, also called monotone equivalence or 
Kakutani equivalence: Two transformations p 
and @ are monotonely equivalent if they can 
serve as base transformations of the same (or 
equivalent) flow. This notion of equivalence 
was introduced by Kakutant in [32], where he 
showed that g and @ are monotonely equiva- 
lent if and only if there are sets E c X and 

Ec X such that the induced transforma- 

tions (— Section D) o; and @; are isomorphic. 
Abramov’s formula mentioned in Section E 
implies that there are at least three classes of 
transformations (and flows) that are mutually 
nonequivalent: transformations (flows) of zero, 
finite, and infinite entropy. Nothing much was 
done in this equivalence theory for a num- 

ber of years until in 1975 J. Feldman and E. 
Satayev independently showed that there are 
monotonely nonequivalent transformations 
within each class of zero, positive, and infinite 
entropy. Since then extensive work has been 
done by Feldman, Satayev, A. Katok, Orn- 
stein, Weiss, D. Rudolph, M. Ratner, and 
others, and numerous results have been ob- 
tained. For detailed accounts of this devel- 
opment — [22, 24, 33, 34]. The main idea 
employed by Feldman and Satayev was to 
introduce a new metric called an (metric 

and then to define a notion corresponding to 
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V.W.B. of the isomorphism theory described 
in Section E by using an f- instead of a d- 
metric. They then showed that this property, 
called loosely Bernoulli (L.B.) by Feldman and 
monotonely very weak Bernoulli (M.V.W.B.) by 
Satayev, stays invariant under monotone 
equivalence, and that there exist transforma- 
tions with and without this property. An f- 
metric is defined by starting off with an f,- 
metric on the space J‘!:?--"} instead of the 
Hamming distance dy and proceeding in 
exactly same way as for the definition of d, 
namely, by extending fy to f and fy and 
finally to f. For « and f in J2, f(a, B) 
is defined by setting 1 — fy(«, B) equal to 1/N 
times the maximal integer n for which there 
are positive integers j,<j,<...<j,,k,<k,< 
. <k, with a( j) = B(k,), i=1,...,n. Ornstein 
and Weiss discovered that by substituting f 
for d one can develop a theory of monotone 
equivalence that parallels the isomorphism 
theory described in Section E. One can define 
a process (¢, č) to be finitely fixed (F.F.) by 
substituting f for d in the definition of F.D. 
process, and, as was mentioned earlier, to be 
L.B. by doing the same in the definition of the 
V.W.B. process. Ornstein and Weiss proved: 
(a) If pọ and @ have zero entropy, finite posi- 
tive, or infinite entropy and if both have F.F. 
generators, then they are monotonely equiv- 
alent. (b) Let ọ have an F.F. generator. If 
h(p)=0, then ọ is monotonely equivalent 

to an irrational rotation of the circle. If 0< 
h(@) < œ, then o is monotonely equivalent 

to a Bernoulli shift of finite entropy. If h(@)= 
oo, then o is monotonely equivalent to a 
Bernoulli shift of infinite entropy. (c) If g has 
an F.F. generator, then (o, €) is F.F. for all 
nontrivial partitions €. (d) (ọ, €) is F.F. if and 
only if it is L.B. It is known further that within 
each entropy class there exist uncountably 
many monotonely nonequivalent transforma- 
tions. A measurable flow {@,} is called L.B. if 
it can be represented as an S-flow built over 
an L.B. transformation. The L.B. flows of 
zero entropy are those monotonely equivalent 
to a Kronecker flow on a 2-dimensional 
torus, and L.B. flows of finite positive (infinite) 
entropy are those monotonely equivalent to 
the Bernoulli flow of finite (infinite) entropy. 
The direct product of an L.B. flow and a Ber- 
noulli flow is L.B., while it was shown by M. 
Ratner that the horocycle flow (— Section G) 
is L.B. but its direct product with itself is not. 


G. Classical Dynamical Systems 
By a classical dynamical system we mean a 


tdiffeomorphism or a flow generated by a 
smooth vector field on some fdifferentiable 
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manifold M”. Such a system is nonsingular 
with respect to a measure defined by any 
*Riemannian metric on M”. For a fixed Rie- 
mannian metric, we call measures smooth 

if they have a smooth density with respect 
to the measure given by the metric. 

(1) Among classical dynamical systems, geo- 
desic flows have been investigated most exten- 
sively. Let Y,(M) be the unitary ttangent bun- 
dle over the manifold M”. A point (x, e)e 
2 (M) defines a unique tgeodesic through x 
in the direction of e. The geodesic flow on 
A (M) is the flow defined by ¢,(x, e) =(x,, e), 
where x, is the point in M” reached from x 
after time t under a motion with unit speed 
along the geodesic determined by (x, e), and e, 
is the unit vector at x, tangent to the geodesic. 
The classical tLiouville theorem in this con- 
text implies that the measure on .Y,(M) that is 
the product of the measure on M” induced by 
the metric and the Lebesgue measure on the 
(n— 1)-dimensional sphere gives a smooth in- 
variant measure for the geodesic flow. A wide 
class of systems arising from mechanics can 
be described as geodesic flows. 

Hopf and G. Hedlund proved that if the 
manifold M” is compact and has constant 
negative curvature, then the geodesic flow is 
strongly mixing. Later, by using the theory of 
group trepresentations, I. Gel’fand and S. 
Fomin proved that the spectrum of a geodesic 
flow on a compact manifold of constant nega- 
tive curvature is Lebesgue, and is even count- 
able Lebesgue in the case where the mani- 
fold is of dimension 2. F. Mautner and later L. 
Auslander, L. Green, and F. Hahn extended 
this algebraic method to flows obtained under 
the action of some one-parameter subgroup of 
a tLie group acting on its thomogeneous space 
and obtained extensive results for the case of 
‘nilpotent and some tsolvable Lie groups [35]. 

(2) The flow on an n-dimensional torus de- 
fined by 


O(X4,X2, atts Ses) 
=(x,+@,t,x,+@ ,t,...,X,+@,8) 


is called a translational flow or a Kronecker 
flow. The numbers w,,@,,...,@, are called 
frequencies. Every orbit of {¢,} is dense in the 
torus if and only if the frequencies are linearly 
independent over Z. The motion under a 
translational flow with independent frequen- 
cies is called a quasiperiodic motion. A trans- 
lation flow for a quasiperiodic motion has 
discrete spectrum. 

(3) Sinai obtained a useful criterion for a 
classical dynamical system to be a K-system. 
Let M” be compact, and suppose that {@,} is a 
flow on M” defined by a smooth vector field 
and preserving some smooth measure u. A 
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one-parameter group {y,} of transformations 
of the space M”, given by a vector field, is 
called the flow transversal to the flow {¢,} if 
(i) the decomposition of the space M” into the 
trajectories of the flow {w,} is invariant under 
{o}; (ti) the limit lim,_,o lim, „o(W,(t, x) — t)/ts = 
a(x) exists for the function W(t, x), which is 
defined to be the time length of the segment 
{9;W,(x)|O<u<t} of the trajectory of the flow 
{w,}. Sinai’s fundamental theorem states that if 
a flow {@,} is ergodic and has a transversal 
ergodic flow {y} for which { a(x) dy <0, then 
{@,} is a K-flow. If «(x)<0, then we can even 
drop the assumption that {¢,} is ergodic. If 
fa(x)du>0, the theorem holds for the flow 
{Q-1}- 

A geodesic flow on a 2-dimensional mani- 
fold of constant negative curvature always has 
a transversal flow, called a horocycle flow. The 
ergodicity of a horocycle flow was proved by 
Hedlund. It follows from Sinai’s fundamental 
theorem, therefore, that a geodesic flow on a 
surface of constant negative curvature is a 
K-flow. Sinai proved even more: A geodesic 
flow on any surface of negative curvature ts 
a K-flow. There is an extension of the notion 
of transversal flow to higher dimensions, 
called transversal field. Using this notion Sinai 
proved that a geodesic flow on a manifold 
(of any dimension) of constant negative cur- 
vature is a K-flow. Finally, Ornstein and 
Weiss established that geodesic flows on com- 
pact manifolds of negative curvature are 
Bernoulli. 

(4) D. Anosov considered a class of flows 
and diffeomorphisms satisfying a condition 
that characterizes unstable motions such as 
geodesic flows on a manifold of negative cur- 
vature. They are now called Anosov flows (or 
Y-flows) and Anosoy diffeomorphisms (or Y- 
diffeomorphisms) (— 126 Dynamical Systems). 
Anosov proved that if an Anosov flow has a 
smooth invariant measure, then it is ergodic 
and either it has a continuous nonconstant 
eigenfunction or it is a K-flow. Anosov diffeo- 
morphisms with smooth invariant measures 
are K-automorphisms. Sinai constructed for 
transitive Anosov diffeomorphisms (and for 
Anosov flows) a special partition of the under- 
lying manifold M called a Markov partition 
having desirable properties. The importance of 
such partitions lies in the fact that they enable 
one to represent such diffeomorphisms as 
Markov shifts. Starting with a measure invar- 
iant for such a Markov shift having the maxi- 
mal entropy (— Section H), Sinai constructed 
a Gibbs measure (— Section C), which turned 
out to be unique in this case and gave rise to 
a natural invariant measure u for the corre- 
sponding diffeomorphism. He proved further 
that the diffeomorphism ¢ considered as an 
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automorphism of the measure space (M, 2, u) 
is a K-automorphism. It was shown subse- 
quently by R. Azencott that @ is in fact Ber- 
noulli with respect to u. Sinai investigated 
further the uniqueness of the invariant mea- 
sure attaining the maximal entropy for transi- 
tive Anosov diffeomorphisms, and he was able 
to show among other things that the set of 
transitive C”-Anosov diffeomorphisms that 
do not have an invariant measure absolutely 
continuous with respect to the measure in- 
duced by the Riemannian metric on M con- 
tains an open dense subset. The methods em- 
ployed by Sinai in these investigations were 
extended further by D. Ruelle, Bowen, and 
others. Bowen, in particular, was able to con- 
struct Markov partitions for a wider class of 
diffeomorphisms, namely, those satisfying the 
so-called axiom A introduced earlier by S. 
Smale, and characterized the Gibbs measures 
for diffeomorphisms in this class by means of 
the variational principle (— Section H). For a 
more detailed account of the results of Sinai 
and Bowen — [13, 14]. 

(5) An important example of a system that is 
neither an Anosov nor an “axiom A” system 
because of nonsmoothness has been studied by 
Sinai: the simplest mechanical model due to 
Boltzmann and Gibbs of an ideal gas, which is 
described as a system generated by tiny rigid 
spherical pellets moving inside a rectangular 
box and colliding elastically. Sinai succeeded 
in proving that this is a K-system, thereby 
giving an affirmative answer to the classical 
question of the ergodicity of the basic model 
of statistical mechanics [38]. It was shown 
subsequently by M. Aizenman, S. Goldstein, 
and J. Lebowitz that this system is Bernoulli. 
Sinai also investigated another nonsmooth 
system of classical importance: the system 
describing the motion of a billiard ball on a 
square table with a finite number of convex 
obstacles. He showed that such a system, is a 
K-system. Ornstein and G. Gallavotti then 
showed that Sinai’s methods in fact show that 
the system is Bernoulli. Sinai’s methods were 
extended further by L. Bunimovich and I. 
Kubo to study properties of billiard systems in 
more complicated domains. For more detailed 
account of these matters — [24]. The results 
obtained by Sinai and Bowen for Anosov 
and “axiom A” systems were extended to 
even wider class of systems by A. Stepin, R. 
Sacksteder, M. Brin, and Ya. Pesin (— [24]). 


H. Miscellany 


(1) An arbitrary aperiodic automorphism can 
be approximated by periodic automorphisms. 
More precisely, if ọ is an aperiodic automor- 
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phism, then for any positive integer n and 

6 >0, there exists a periodic automorphism y 
of period n such that m({x| p(x) #(x)})< 
(1/n)+ ô (theorem of Halmos and Rokhlin). 
The question as to how quickly this approxi- 
mation can be carried out has been investi- 
gated in detail by Katok, Stepin, and others. 
The rate of this approximation was shown to 
have a close relationship with the entropy and 
spectral properties of the automorphism ø. By 
utilizing this relationship, various examples of 
automorphisms with specified spectral prop- 
erties have been constructed [16]. 

(2) When an arbitrary automorphism ¢ is 
given on a Lebesgue measure space (X, ¥, m), 
there exists a unique decomposition č = {A,} 
of X such that (i) each A, is invariant under 
y and (it) except for a negligible set (in a 
specified sense) of A,, each A, is turned (in a 
natural manner) into a Lebesgue measure 
space, and the restriction of g to A, is an 
ergodic automorphism. This decomposition is 
called the ergodic decomposition of X with 
respect to o. There is a corresponding decom- 
position with respect to a flow. A formula also 
exists that enables us to compute the entropy 
h(@) in terms of the entropies of ergodic com- 
ponents of ọ. 

(3) Let X be a compact metric space and 
@:X—X athomeomorphism. N. Krylov and 
N. Bogolyubov showed that there always 
exists on X a Borel probability measure u that 
is invariant under ọ. Let FY be the collection of 
all Borel probability measures on X, and let 
P, be the subset of P consisting of those invar- 
iant under g. Then P and &, are both convex 
sets compact with respect to the tweak « topol- 
ogy. If & is the set of all extreme points in 
P, then by the tKrein-Milman theorem 4, is 
not empty. A measure u in Z, belongs to &, 
if and only if ọ is ergodic with respect to u. 
When the set &, consists of a single element, o 
is called uniquely ergodic; ¢ is called minimal if 
for every point xe X, the orbit of x under p = 
orb, (x)= {@"(x)|ne Z} is dense in X. ¢ is 
called strictly ergodic if it is both minimal and 
uniquely ergodic. A theorem of J. Oxtoby 
states that ọ is strictly ergodic if and only if for 
every continuous real-valued function f on X, 
the sequence of averages (X7_} f(p*(x)))/n 
converges uniformly to a constant M(/). 
There are homeomorphisms that are minimal 
or uniquely ergodic but not strictly ergodic. 

(4) R. Jewett proved that any weakly mix- 
ing measure-preserving transformation ona 
Lebesgue space is spatially isomorphic to a 
strictly ergodic transformation. Krieger ex- 
tended the result by showing that if the en- 
tropy of an ergodic transformation ø is finite, 
then ¢ is spatially isomorphic to a strictly 
ergodic transformation. Similar results were 
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obtained for flows by K. Jacobs, M. Denker, 
and E. Eberlein (— [39]). 

(5) A topological analog of the notion of 
entropy, called topological entropy, was in- 
troduced by Adler, A. Konheim, and J. 
McAndrew. This is defined as follows: For 
every open covering . of a compact topolog- 
ical space X, let N(.o7) be the number of sets 
in the minimal subcovering of s. For open 
coverings . and &, let A v 2 be the open 
covering {AN B| Aes, BEB}. For any open 
covering .¥ and a continuous mapping p 
on X, the limit lim, „(log N(A v o7! £ v 
e VETETT A) )/n = hoplP, X) exists. Topo- 
logical entropy h,,,(@) of the continuous 
transformation ¢ is now defined by h,,,(~) = 
sup {h,.,(@, )| an open covering of X}. 

L. Goodwyn showed that hop) 2h, (@) 
for any g-invariant probability measure u, 
where h,(@) is the measure-theoretic entropy 
of g regarded as a y-preserving transforma- 
tion. T. Goodman went further and succeeded 
in proving that h,,,(~)=sup{h,(@)| ua 9- 
invariant probability measure}. In connec- 
tion with these results there is interest in the 
question of the existence and uniqueness of 
an invariant measure for g with maximal 
entropy, i.e., a g-invariant measure u for which 
h,(~)=h,,,(@). Such a measure does not 
always exist, and even if it does it may not 
be unique. The notions of topological en- 
tropy and of measures with maximal entropy 
are generalized by Ruelle in the following 
way. For an open covering ., a continu- 
ous mapping ¢, and a real-valued con- 
tinuous function g on X, let Z,,(.o7, @,g) be 
equal to inf{LocrSUPreo EXP LE-09(P(x))}, 
where the inf is taken over all subcover- 
ings I of the covering J ve 'Wv...v 
o "oJ. Then a finite limit P(x, p, g)= 
lim, |/nlog Z,,(.A, p, g) exists, and the 
quantity P(g, g)=sup{ P(A, p, g)| 2 an open 
covering of X} is called topological pressure. 
When g=0, P(@, g) reduces to h,,,(@). Ruelle 
proved for texpansive mappings ¢ that P(g, g) 
=sup{h,(y)+ fgdu]u is a g-invariant proba- 
bility measure}. This assertion is called the 
variational principle for the topological pres- 
sure. The variation principle was proved for 
general continuous mappings o by P. Walters. 
If a -invariant measure u satisfies P(@, g) 
=h,(o)+ gdu, then x is called an equilibrium 
state for g with respect to ø. It is known that 
for expansive mappings ọ every continuous 
function g on X has an equilibrium state. 

(6) Application of ergodic theory to problems 
in analytic number theory has been made by 
several authors. Ergodic or mixing properties 
of particular measure-preserving transforma- 
tions that arise in connection with various 
problems in number theory have been ex- 
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ploited to give answers to these problems. 
New and more striking applications of ideas of 
ergodic theory to different types of questions 
in number theory have been started by Yu. 
Linnik, Fürstenberg, W. Veech, T. Kamae, and 
others (— [40, 41]). 

7. Most of the results discussed in this arti- 
cle dealt with the action of a cyclic group of 
transformations or of one-parameter flow. 
There are significant extensions of many of 
these results to different types of group ac- 
tions. For recent developments — [22]. 
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When F. Klein succeeded K. G. C. von Staudt 
as professor at the Philosophical Faculty of 
Erlangen University in 1872, he gave an in- 
auguration lecture entitled “Comparative Con- 
sideration of Recent Geometric Researches,” 
which later appeared as an article [1]. In it he 
developed a penetrating idea, now called 

the Erlangen program, in which he utilized 
group-theoretic concepts to unify various 
kinds of geometries that until that time had 
been considered separately. 

The concept of transformation is not new; it 
was, however, not until the 18th century that 
the concept of transformation groups was 
recognized as useful. The theory of tinvariants 
of linear groups and the tGalois theory of 
algebraic equations attracted attention in the 
19th century. In the same century, tprojective 
geometry made remarkable progress, for 
example, when A. Cayley and E. Laguerre 
discovered that metrical properties of Eu- 
clidean and tnon-Euclidean geometries can be 
interpreted in the language of projective geom- 
etry. Cayley proclaimed, “All geometry is 
projective geometry.” After learning geometry 
under J. Pliicker, Klein made the acquaintance 
of S. Lie. Both men understood the impor- 
tance of the group concept in mathematics. Lie 
studied the theory of continuous transforma- 
tion groups, and Klein studied discontinuous 
transformation groups from a geometric stand- 
point. Klein was thus led to the idea of the 
Erlangen program, which provided a bird’s- 
eye view of geometry. 

Klein’s idea can be summarized as follows: 
A space S is a given set with some geometric 
structure. Let a transformation group G of S 
be given. A subset of S, called a figure, may 
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have various kinds of properties. The study of 
the properties that are left invariant under all 
transformations belonging to G is called the 
geometry of the space S subordinate to the 
group G. Let this geometry be denoted by 

(S, G). Two figures of S are said to be congruent 
in (S, G) if one of them is mapped to the other 
by a transformation of G. The geometry (S, G) 
is actually the theory of invariants of S under 
G, with the term invariants to be understood in 
a wider sense; it means both invariant quan- 
tities and invariant properties or relations. 

Replacing G in (S, G) by a subgroup G’ of G, 
we obtain another geometry (S, G’). A series of 
subgroups of G gives rise to a series of geom- 
etries. For instance, let A be a figure of S. The 
elements of G leaving A invariant form a sub- 
group G(A) of G that operates on S =S— A. 
We thus obtain a geometry (S’, G(A)) in which 
A is called an absolute figure. In this way, 
many geometries are obtained from projective 
geometry. Klein gave numerous examples. 

It is noteworthy that he mentioned even the 
groups of frational and thomeomorphic 
transformations. 

Klein’s idea not only synthesized the geom- 
etries known at that time, but also became a 
guiding principle for the development of new 
geometries. 

In 1854 G. F. B. Riemann published his 
epochmaking idea of Riemannian geometry. 
This geometry has a metric, but in general 
lacks congruence transformations (isometries). 
Thus Riemannian geometry is a geometry 
that is not included in the framework of the 
Erlangen program. The importance of Rie- 
mannian geometry was acknowledged when it 
was used by A. Einstein in 1916 as a founda- 
tion of his general theory of relativity. H. 
Weyl, O. Veblen, and J. A. Schouten dis- 
covered geometries that are generalizations of 
affine, projective, and tcanformal geometries 
in the same way as Riemannian geometry is 
a generalization of Euclidean geometry. It 
became necessary to establish a theory that 
reconciled the ideas of Klein and Riemann; 

E. Cartan succeeded in this by introducing the 
notion of tconnection (— 80 Connections). 
However, the Erlangen program, which gave 
an insight into the essential character of clas- 
sical geometries, still maintains its role as one 
of the guiding principles of geometry. 
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A. General Remarks 


The data obtained by observations or mea- 
surements in astronomy, geodesy, and other 
sciences do not usually give exact values of the 
quantities in question. The error is the dif- 
ference between the approximation and the 
exact value. The theory of errors originated 
from systematic work with data accompanied 
by errors, and the statistical treatment of ex- 
perimental data was the main concern in the 
beginning stages (— 397 Statistical Data Anal- 
ysis). However, due to the recent development 
of high-speed computers it has become pos- 
sible to carry out computations on a tremen- 
dously large scale, and the detailed analysis of 
errors has become an absolute necessity in 
modern numerical computation. Hence the 
analysis of errors in relation to numerical 
computation has become the center of re- 
search in error theory. 


B. Errors 


One rarely makes a mistake in counting a 
small number of things; therefore the exact 
value of the count can be determined. On the 
other hand, the exact value in decimals is 
never obtainable for a continuous quantity, 
say length, no matter how fine measurements 
are made, and a large or small stochastic error 
is thus inevitable in measuring a continuous 
quantity. A discrete finite quantity is a digital 
quantity, and a continuous quantity is an 
analog quantity. The natures of these two 
quantities are quite different. The values of 

a digital quantity are distributed on some 
discrete set, while the values of an analog 
quantity are distributed with a continuous 
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probability. Thus there is the possibility of 
error even for treatment of digital quantities, 
although checking the results for these quan- 
tities is easy. It is preferable to regard digital 
quantities as being analog quantities if the 
possible values are densely distributed. 

On the other hand, when an appropriate 
tanalog computer is not available, analog 
quantities receive treatment similar to digital 
quantities. They are expressed as x times some 
unit, and x is expanded in the decimal or 
binary systems. An approximation to such 
an expansion is obtained by rounding off a 
numeral at some place, the position depending 
on the capacity for computation by available 
methods. There are two ways of rounding off 
numbers, the fixed point method and the float- 
ing point method. The former specifies the 
place of digits where the rounding off is made, 
and the latter essentially specifies the number 
of significant digits. 

Classification of errors. (1) Errors of input 
data are the errors included in input data 
themselves. Such input-data errors include 
the errors that occur when we represent con- 
stants such as 1/3, wes n by finite decimals. 

(2) Truncation errors occur in approximate 
expressions for the computation formulas 
under consideration. (3) Roundoff errors occur 
in taking some finite number of digits from the 
earlier digits in the numerical value at each 
step. If the computation of an infinite number 
of digits were actually possible, no errors of 
this type would appear. Recently, it has been 
considered more preferable to call this “com- 
putational error.” 

The difference between fixed point and 
floating point rounding off is that the former 
is better suited for operations of addition and 
subtraction and the latter is better for multi- 
plication and division. In fixed point rounding 
off, if a number is multiplied many times by 
numbers less than 1, a so-called underflow may 
occur, and many digits may disappear; a great 
deal of information can thus be lost. In com- 
putation for scientific research that involves 
frequent multiplication and division, floating 
point rounding off is preferable. It should be 
noted that rounding off for addition and sub- 
traction may also cause a critical loss of in- 
formation. This phenomenon is called cancel- 
ing digits. For instance, in the subtraction 
7.6325071 — 7.6318425 = 0.0006646, where the 
subtrahend and minuend share several early 
significant digits, the difference loses those 
digits. Thus, relative errors may be magni- 
fied tremendously. By taking a large number 
of significant digits, such a situation may 
be avoided to some extent. So-called high- 
precision computation shows its effectiveness in 
such cases. Similarly, when a small number b is 
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added to a large number a, the result may be 
just a and the information of b could be lost 
completely. This kind of loss of information 
can often cause serious trouble. 


C. Methods of Error Analysis 


In order to analyze the propagation of errors, 
let us assume that all numbers are carried to 
infinitely many digits so that no roundoff error 
occurs. Suppose that we are to evaluate the 
function y= f(x,,.--,X,) when x,,X2,...,X, are 
assigned. Let y be the truncation error of an 
approximate expression. If an input error ô; for 
x; exists, then the corresponding error for y is 


Moreover, suppose that at the final step we 
round off to get a result with a finite number 
of digits, by which an error e is introduced. 
Then the final error 6 for y is 

n of 
a? A bj. 
This procedure is performed for each step 
needed in the computation. If y= f(x,,...,X,) 
is a specified step, then the input error 6, for 
that step involves all the errors arising before 
that step, i.e., ô; is an accumulated error. For- 
ward analysis is a method to estimate the total 
accumulated error from the initial input data. 
It is usually quite difficult to obtain precise 
estimates by means of this method. In con- 
trast, J. H. Wilkinson proposed the following 
backward analysis. Here, the computational 
value y is considered as the exact result for 
the modified initial data X,,...,X,, say y= 
fi- %,), and the estimates for |x;— x;,| 
are given. For example, in the binary floating 
point arithmetic of u bits, we always have the 
relation: 


computational value of a+b 
=a(1+6)+b(1 +6), 


with |ô], |e] <27“, even when cancellation or 
loss of information occurs. Wilkinson has 
made a deep investigation of error analysis for 
linear computation, algebraic equations, and 
eigenvalue problems by means of backward 
analysis [4, 5]. 


D. Proliferation of Errors 


The phenomena usually called “accumulation 
of errors” should more appropriately be called 
the proliferation of errors, where the algorithm 
itself includes a particular mechanism to in- 
crease a small error indefinitely. An example is 
the recursion formula for *Bessel functions 
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stated as 
Jatt (x) or. (2n/x)J,(x) —Ja-1 (x). 


It is customary to compute J,(x) by this for- 
mula, starting with the values of Jy(x) and 
J, (x), with x given. By putting J,_,(x)=y,, 
J (x) =z,, the recursion formula can be re- 
garded as a linear transformation of the 
point P,(y,,Z,) in a plane into another point 
Pari (Ynt1>Zn+1) where 


Yn+1 =2n> Za+1 = —Vnt(2n/x)zZ,. 


The teigenvalues 44, 2, of this tdifference equa- 
tion satisfy the following: As long as n<|x|, we 
have |A,|=|A,|=1, while if n>|x|, 4, is greater 
than 1 and increases rapidly as n tends to 
infinity. Consequently, even the slightest dis- 
crepancy in the position of P, gives rise to a 
greatly magnified error in the result [3]. 

Many studies have been made of the propa- 
gation of errors and of instability phenomena 
in the numerical solution of ordinary differen- 
tial equations (— 303 Numerical Solution of 
Ordinary Differential Equations; [2]). 
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A. History 


Attempts to construct axiomatically the geom- 
etry of ordinary 3-dimensional space were 
undertaken by the ancient Greeks, culminating 
in Euclid’s Elements (— 187 Greek Mathemat- 
ics). The fifth postulate of Euclid’s Elements 
requires that two straight lines in a plane that 
meet a third line, as shown in Fig. 1, in angles 
a, P whose sum ts less than 180°, have a com- 
mon point. In the Elements, two straight lines 
in a plane without a common point are said 
to be parallel. It can be proved from other 
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axioms in the Elements that if x + 8 = 180°, the 
two lines / and I’ in Fig. 2 are parallel. Hence 
given a line / and a point P not lying on J, 


Fig. 1 Fig. 2 


there exists a line I’ passing through P that is 
parallel to l. The fifth postulate ensures the 
uniqueness of the parallel l passing through 
the given point P. For this reason, the fifth 
postulate is also called the axiom of parallels. 
Utilizing this axiom, we can prove the well- 
known theorems on parallel lines, the sum of 
interior angles of triangles, etc. The axiom 
plays an important role in the proof of the 
Pythagorean theorem in the Elements. The 
axiom is also called Euclid’s axiom. 

However, Euclid states this axiom in a quite 
complicated form, and unlike his other axioms, 
it cannot be verified within a bounded region 
of the space. 

Many mathematicians tried in vain to de- 
duce it from other axioms. Finally the axiom 
was shown to be independent of other axioms 
in the Elements by the invention of non- 
Euclidean geometry in the 19th century (— 
285 Non-Euclidean Geometry). 

The term Euclidean geometry is used in 
contrast to non-Euclidean geometry to refer to 
the geometry based on Euclid’s axiom of par- 
allels as well as on other axioms explicit or 
implicit in Euclid’s Elements. It was in the 19th 
century that a complete system of Euclidean 
geometry was explicitly formulated (— 155 
Foundations of Geometry). From the stand- 
point of present-day mathematics, it would be 
natural to define first the group of motions by 
the axiom of free mobility due to H. Helm- 
holtz (— Section B) and then, following F. 
Klein, to define Euclidean geometry as the 
study of properties of spaces that are invariant 
under the groups of these motions (— 137 
Erlangen Program). 


B. Group of Motions 


Let P be an tordered field and A” the n- 
dimensional taffine space over P. Let B" be an 
r-dimensional affine subspace of A", B’"' an 
(r—1)-dimensional subspace of B", B"? an 
(r— 2)-dimensional subspace of B" +, etc. In 
the sequence of subspaces B", B’',..., B®, 
each B*— B*~' consists of two thalf-spaces (k = 
r,r—1,..., 1). Let H* be one of these half- 


139 C 


Euclidean Geometry 


spaces. Then the sequence of half-spaces H”, 
H"! ..., H! is called an r-dimensional flag, 
denoted by 8’ (nzr 1), and B" and H” are 
called the principal space and the principal 
half-space of §’, respectively. If f is a tproper 
affine transformation of A”, f(H"), f(H’ *), 
..., {(H') form an r-dimensional flag 8’. We 
write {(§")=&’. 

Let A” be the group of all proper affine 
transformations of A”. The subgroup 8” of W” 
with the following two properties is called the 
group of motions, and any element of 8” is 
called a motion (or congruent transformation). 
(1) Let r be an integer between 1 and n, and let 
$H", R" be any two r-dimensional flags. Then 
there exists an element f of B” that carries 5’ 
to K": f(H')=K". (2) Let f, g be two elements of 
B” with (H) =K", g(H)=K’, and let p be any 
point on the principal space of $”. Then f(p)= 
g(p), that is, f, g have the same “effect” on 
the principal space. In particular, when r=n, 
then f =g. That A” possesses a subgroup 8" 
with properties (1) and (2) is called the axiom 
of free mobility. 

When n= 1, it is easy to see that the ele- 
ments of B! are only those elements f of W! 
that can be expressed in the form f(x)= + 
x+a(aeP). When n>2, P must satisfy the 
following condition in order that a subgroup 
B” with properties (1) and (2) exists in W": If a, 
beP, then P contains an element x such that 
x? =a? +b?. When this condition is satisfied, 
the ordered field P is called a Pythagorean 
field. Every treal closed field (e.g., the field R of 
real numbers) is Pythagorean. If B” exists, its 
uniqueness is assured by (1) and (2). Further- 
more, if P contains a square root of every 
positive element (this condition is satisfied, for 
example, by R), then conditions (1) and (2) are 
reducible to the case r =n only, i.e., conditions 
(1) and (2) for other values of r follow from (1) 
and (2) with r=n. Hereafter, we assume the 
existence of B”. 

Suppose that we have A" > B" > B* (n> 
r>k>0), and let 9 be a flag with the prin- 
cipal space B":§’ =(H",..., H*,..., H’). Let 
K&' be another flag with the same principal 
space B": R" =(K’,..., K*,..., K"), where we 
suppose that H’= K/ for k>j>1, whereas for 
r>i>k-+1, we suppose that H’ and K' are 
different half-spaces on B’ divided by B+. The 
flag 8" is denoted by H}. An element f of B” 
with {(H") = 9; is called a symmetry (or reflec- 
tion) of B” with respect to BY. It leaves every 
point on B* invariant, and its effect on B” is 
determined only by B* independently of the 
choice of half-spaces in $” and S” (subject to 
the conditions mentioned above). In partic- 
ular, the symmetry of A” with respect to a 
point A° =p is called a central symmetry with 
respect to the center p; and the symmetry of A” 
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with respect to a hyperplane A” ' = h is called 
a hyperplanar symmetry. They are uniquely 
determined by p and h, respectively, and are 
denoted by S, and S(h), respectively. If H(A") is 
the set of all hyperplanes of A”, then B” is 
generated by {S(h)|he H(A")}. Furthermore, if 
p, q are two points of A”, the composite S,,S, is 
a parallel translation by 2: pq (Fig. 3). The 
parallel translations generate a normal sub- 
group T” of B". For p, qe A", the element of T” 
that carries p to q is denoted by t,,. The 


x'=Sp(x) 


x x" =Sq(z') 


Fig. 3 


subgroup of 8” that leaves a point p of A” 
invariant is denoted by Of. Obviously, we have 
O! = tip, On t,q\- Thus all the OF (for pe A”) are 
isomorphic. We call Of the orthogonal group 
around p and any element of O; an orthogonal 
transformation around p. More generally, any 
element of B” that leaves a subspace A* of A” 
invariant is called an orthogonal transforma- 
tion around the subspace A*. 

An element of 8” that preserves the orienta- 
tion of A”, i.e., is represented by a proper 
affinity with a positive determinant, is called a 
proper motion. Proper motions form a sub- 
group Bg of B". Rotations are, by definition, 
orthogonal transformations belonging to 85. 
Sometimes 8% is called the group of motions; 
then 8” is called the group of motions in the 
wider sense. In this article, however, we shall 
continue to use the terminology introduced 
above. 

The study of the properties of A” invariant 
under 8" is n-dimensional Euclidean geometry. 
Since W” >B”, every proposition in affine 
geometry (— 7 Affine Geometry) can be con- 
sidered a proposition in Euclidean geometry, 
but there are many propositions that are 
proper to Euclidean geometry. Sometimes the 
subgroup of QI" generated by 8” and the 
homotheties of A”, i.e., elements of QL” repre- 
sented by ‘scalar matrices, is called the group 
of motions in the wider sense, and the study of 
properties of A” invariant under this group is 
called n-dimensional Euclidean geometry in the 
wider sense. 


C. Length of Segments 


Two figures F, F' in A” are said to be congru- 
ent if there exists an fe" such that f(F)=F’. 
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Then we write F = F’. The congruence relation 
is an tequivalence relation. Let s=pq, s’=p’q’ 
be two tsegments in A”. We say that s, s’ have 
equal length when s=s’. Length is an attri- 
bute of the equivalence class of segments. The 
length of s is denoted by |s|. All segments of 
the form pp are congruent, and we define |pp| 
=0. If we are given a length and a thalf-line 
starting from a point p, we can find a unique 
point q on it such that |pq|=the given length 
(Fig. 4). Let r be a point on the extension of 
pq. The length |pr| is then uniquely deter- 
mined by |pq| and |qr|. It is defined as the 


Fig. 4 


sum of the lengths: |pr|=|pg|-+|qr|. With 
respect to the addition thus defined, lengths of 
segments in A” form a fcommutative semi- 
group with the cancellation law, which can be 
extended to an tAbelian group M with 0 as 
the identity element (— 190 Groups P). 

Let |s| 40 and |s’| be any length. On a half- 
line starting from p, we can find points q, r 
with |p| = Isl, |pr| =|s'|. Then the element 
pr/pq=/eP (— 7 Affine Geometry) is a posi- 
tive element of P uniquely determined by |s| 
and |s’]. We call A the measure of |s’|] with the 
unit |s| and denote it by |s’]:|s|. If P is tArchi- 
medean, 4 can be represented by a real num- 
ber (— 149 Fields N). We have (|s‘|+|s”|):[s| 
=(Is"|:]s]) + (s"]:]s]), (s kIs DAs IIs” h= 
|s”|:|s| (if |s| 40). Thus the mapping |s'|— 
|s’|:|s| sends the additive semigroup of 
lengths to that of the positive elements of P. 
This is actually an isomorphism, which can be 
extended to an isomorphism of M onto the 
additive group of the field P. 

Let o(X, X’, ..., X”) be a thomogeneous 
rational function of a finite number of vari- 
ables X, X’,..., X*. Ifa relation @(A, 1’, ..., 2%) 
=0 holds for A=|s|:|so|, A’ =|s'|:|Sol, ..., A” 
=|s*|:|so|, where |sp| is a length 40, then 
p(A,,45,-.-,44)=0 holds also for 4, =|s|:|s,], 


A =|s'|:]5,|,---,47 =s": Isil, where |s,] is any 
other length 40. Hence, in this case, the ex- 
pression ¢(|s|, |s], ...,|s*|)=0 is meaningful. 


D. Angles and Their Measure 


An angle / AOB is a figure constituted by two 
half-lines OA, OB starting from the same point 
O but belonging to different straight lines. The 
point O is called the vertex, and the two half- 
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lines OA, OB are called the sides of 7 AOB 
(Fig. 5). Two congruent angles are said to have 
the same measure, denoted by | Z AOB] or 
sometimes simply by «. Let H? =(H?, H!) (H' 
=half-line QR) be a given 2-dimensional flag 
and the given measure of an angle. Then we 
can find a unique half-line QP in the given 
half-plane H? such that | Z POR|=« (Fig. 6). 
The angle Z POR is said to “belong” to 9°. 
Let K? be the half-plane separated by the 
line PUQ containing the half-line QR. Then 
H?N K? is called the interior of the angle 


A 


0 B 


Fig. 5 


L PỌR. Let Z P’QR be another angle belong- 
ing to $°. If the interior of the latter angle is 
a subset of the interior of Z POR and QR + 
OP’, then | Z PQR] is said to be greater than 
| L P’OR|, and we write | Z POR|>|Z P’QR|. 
Actually, it can be shown that > is a relation 
between the measures of 7 POR and / P’QR, 
and that the set of measures of angles forms a 
‘linearly ordered set with respect to the rela- 
tion > defined in the obvious way. When 
|Z POR|>|ZP’OR|, we write | 2 POR|= 
|Z POP’|+|Z P’OR|. Actually, these are rela- 
tions between measures of angles. Further- 
more, if the measure « of an angle is given, the 
set of measures of angles <« forms a linearly 
ordered set order-isomorphic to a segment 
and satisfying: (i) If f <y then there exists a ô 
such that B+6=y; (ii) B+6=6+ B; (B,+8,)+ 
B3 =f, +(B.+8;) if all these sums exist; and 
(iti) Bi + 6=8,+6 implies £, = $2. When P is 
Archimedean, these properties imply that the 
measure of angles <| 7 POR| can be repre- 
sented by positive real numbers <k (k is any 
given positive number) such that the relations 
of ordering an addition are preserved. 

This one-to-one correspondence between 


the measures of angles and a subset S of the 


interval (O, k] of real numbers can be extended 
to a correspondence between the measures of 
general angles and the subset of R obtained 
from S. When P =R, then we have S =(0,k], 
and any real number appears as a measure of 
a general angle. We can choose 4 PQR and 
the positive number k arbitrarily, but it is 
customary to choose them as follows. Suppose 
we are given an angle / AOB. Let the exten- 
sions of the half-lines OA and OB in the oppo- 
site directions be OA’ and OB’, respectively. 
The angles  AOB and / A’OB are called 
supplementary angles of each other, and so are 
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¿L AOB and £ AOB’. The angles ¿Z AOB and 

/ A’OB are called vertical angles to each other 
(Fig. 7). Any angle is congruent to its vertical 
angle, and an angle that is congruent to its 
supplementary angle has a fixed measure. 


Fig. 7 


Such an angle (or its measure) is called a right 
angle. In the description of the measurement of 
angles, we usually consider the case where the 
special angle Z. POR is a right angle, and we 
set k= 7/2. (The existence and uniqueness of 
the right angle can be proved.) An angle that is 
greater (smaller) than a right angle is called an 
obtuse (acute) angle. A general angle whose 
measure is twice (four times) a right angle is 
called a straight angle (perigon). Sometimes we 
choose as the “unit angle” 1/90 of a right 
angle, which is called a degree (hence a right 
angle = 90 degrees, denoted by 90°); 1/60 of a 
degree is called a minute (1° = 60 minutes, 
denoted by 60’), and 1/60 of a minute ts called 
a second (1’= 60 seconds, denoted by 60”). If, 
as usual, we put the right angle equal to 2/2, 
then the unit angle is (2/z)(right angle). This is 
called a radian, and 1 radian = 180°/r = 
57°17'44,806 ...” =57.3°. 

If a straight line m intersects two straight 
lines J, I’, eight angles a, f, y, 6, a’, By’, o 
appear, as in (Fig. 8). In this figure, a and a’, B 
and f’, y and y’, and 6 and 0’ are called corre- 
sponding angles, while « and y’, $ and 6’, y and 
a’, and 6 and f’ are called alternate angles to 
each other. When / and I are parallel, each of 
these angles is congruent to its corresponding 
or alternate angle. 

The Pythagorean theorem asserts that if a 
triangle AABC is given for which Z ABC is a 
right angle (Fig. 9), then |AB|? +|BC|? =|CA|? 
(which makes sense since X? + Y*—Z? isa 
homogeneous polynomial). 


Fig. 9 


Fig. 8 


E. Rectangular Coordinates 


When two straight lines /, m intersect, two 
pairs of vertical angles appear. If one of these 
angles is a right angle, then all are. Then we 


552 


say that / and m are orthogonal (or perpendic- 
ular) to each other, and write / Lm. Let! bea 
line and A’ an r-dimensional subspace of A” 
(1<r<n-— 1) intersecting lat a point O=A’'N 1. 
If [is orthogonal to all lines on A” passing 
through O, then / is said to be orthogonal to 
A’, and we write ILA” (Fig. 10). If A"! is any 
hyperplane in A”, then there exists a unique 
line / through a given point P of A” that is 
orthogonal to A” '; this l is called the per- 
pendicular to A” ' through P, and the 


Fig. 10 


intersection IN A"~' is called the foot of the 
perpendicular through P. When A”! is given, 
the mapping from A” to A”! assigning to 
every point P of A” the foot of the perpendic- 
ular through P is called the orthogonal projec- 
tion from A" to A”?. 

Let §"=(H", H""',...,H') be an n- 
dimensional flag of A” and O the initial point 
of the half-line H'. Then we can find a point E; 
in H' (i=1,2,...,n) such that OU E; LOUE; 
(i~j,i,j=1,2,...,n). Moreover, if |e| is any 
unit of length, then E; can be chosen uniquely 
so that |OE,|=|e| (i= 1, 2, ...,2). Then O, 
E,,...,E, are tindependent points in A”, and 
we have A"=OQUE, U... U E,„. Thus we have a 
frame X =(0; E,,...,E,) of A” with O as origin 
and the E; as unit points. Such a frame is 
called an orthogonal frame. A coordinate 
system with this frame, called an orthogonal 
coordinate system adapted to $”, is uniquely 
determined by $”. A motion is characterized 
as an taffinity sending one orthogonal frame 
onto another or onto itself. 

Utilizing an orthogonal coordinate system, 
the lengths of segments and the measures of 
angles can be expressed simply. Let (x,,...,x,) 
be the coordinates of X with respect to such a 
coordinate system. Then the length of the 
segment |OX | (with |e| as unit) is equal to 
(£2; x7)”, and when Y is another point, with 
coordinates (y,,...,¥,), OAX, OF Y, then we 
have 


Ei- Xi 
(Diy xp) (Lia y)" 


In particular, we have OUX LOU Y if and 
only if X- x; y;=0. 

We may write x = OX for the tlocation 
vector of X. Then the taffinity Ax +b is a 
motion if and only if A is an forthogonal ma- 
trix. Thus the tinner product (x, y) is invariant 


cos| 2 XOY|= 
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under motions; it therefore has meaning in 
Euclidean geometry. If we put |x|=(x, x)!”, 
then the right-hand sides of the formulas for 
|OX| and cos| Z XOY| can be written as |x| 
and (x, y)/(|x|-|y|). More generally, we have 
|X Y|=|y —x|. This is the Euclidean distance 
(or simply distance) between X and Y. Then A” 
becomes a tmetric space with this distance, i.e., 
a Euclidean space. Historically, the notion of 
metric spaces was introduced in generalizing 
Euclidean spaces (— 273 Metric Spaces). 


F. Area and Volume 


The subset I” of A" consisting of points 
(x,,...,X,) with respect to an orthogonal co- 
ordinate system, with O<x,;<1,i=1,...,n, is 
called an n-dimensional unit cube. A function 
m that assigns to tpolyhedra in the wider sense 
P,Q,... in A” nonnegative real numbers m(P), 
m(Q), ... is called an n-dimensional volume if it 
satisfies the following four conditions: (1) m(@) 
=0. (2) m(PUQ)+m(PNQ)=m(P)+m(Q). (3) 
If P is sent to Q by a translation, then m(P)= 
m(Q). (4) m(J")= 1. It has been proved that 
such a function is unique and has the property 
that P=Q implies m(P)=m(Q). Thus the con- 
cept of volume can be defined in the frame- 
work of Euclidean geometry. More generally, 
if the affinity f(x)= Ax +b sends P onto Q, 
then m(Q)=cm(P), where c is the absolute 
value of the determinant | A|. If P is covered by 
a finite number of hyperplanes, then m(P)=0, 
and if P is a tparallelotope with n independent 
edges a,,...,a,, then m(P)=abs|a,,...,a,]|, 
where |a,,...,a,| is the determinant of the 

n x n matrix with a; as the ith column vector, 
and abs x is the absolute value of the real 
number x. If P is an tn-simplex whose vertices 


have location vectors Xo, X;,...,X,, then we 
have 

1 | ee eee | 
m(P)=—abs 

n! Xo X; X, 


The volume of any polyhedron can be ob- 
tained by dividing it into n-simplexes and 
summing their volumes. If P is an r- 
dimensional polyhedron in A’, then the r- 
dimensional volume of P is obtained by divid- 
ing P into r-simplexes and summing their 
r-dimensional volumes (in the respective r- 
dimensional Euclidean spaces containing 
them). In particular, when r= 1, we speak of 
length (e.g., the length of a broken line), and 
when r= 2, of area. If V is the r-dimensional 
volume of an r-dimensional parallelotope with 


r edges a,,...,a,, we have the formula 
(a;,a4) (a;,a,) 

y7= : 
(a,,a,) (a,, a,) 
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The notion of measure of point sets other than 
polyhedra is a generalization of the notion 

of volume of polyhedra (— 270 Measure 
Theory). 


G. Orthonormalization 


Let O, A,,...,A, be n+ 1 tindependent points 
in A”. Then the n vectors OA, =a,, i=1,...,n, 
are independent. The points O, A,,..., A, 
determine an n-dimensional flag §" of A" as 
follows. Let H, be the half-line OA,, H, be the 
half-plane on the plane OU A, U A, separated 
by the line OU A, in which A, lies, ..., H, be 
the half-space on A” separated by the hyper- 
plane OUA,U...UA,_, in which A, lies. Let 
b,,.-.,b, be the unit vectors of the rectangular 
coordinate system adapted to $”. Suppose 
further that we are given a rectangular coordi- 
nate system. Then b,,...,b, can be obtained 
from a,,...,a, by the following procedure, 
called orthonormalization (E. Schmidt): First 
put b, =a,/|a,|, so that |b,|=1. Then ec, = 
a,—(a,,b,)b, satisfies (b,,c,)=0, c, #0. 
Put b, =c,/|c,|. Then we have (b,,b,)=0, 
|b,|=1. When b,,...,b;_, are obtained in 
this way, so that (b;, b,)=6, for 1<j,k<i—1, 
then ¢; =a; —(a;,b,)b, — ... — (a;, b;-1)b;-; 
satisfies (b;, c;)=0, c; #0. Hence b; =¢,/Ie;| 
added to b,,...,b;_, retains the property 
(b;, b,) = ô; for 1 <j, k <i, and this procedure 
can be continued to i=n. 

Two vectors u, v are called orthogonal (de- 
noted uLvy) if (u, v)=0, and u is called normal- 
ized when |u| = 1. Thus any two of the vectors 


b,,...,b, are orthogonal, and each of them is 
normalized. Between given vectors a,,...,4, 
and b,,...,b, we have the relation {a,,...,a;} 


(=the linear space generated by a,,... 
Disses b}, i=1,...,7. 

Let M, , M, be two subspaces of the linear 
space M of the vectors of the Euclidean space 
A". If any element of M, is orthogonal to any 
element of M,, then Mt, and Mt, are called 
orthogonal and written M, |M,. For any 
proper subspace Wi, of Mi, it can be shown by 
the method of orthonormalization that there 
exists a unique proper subspace M, of IR such 
that Dt= Mt, UM, M, LM. Such a subspace 
M, is called the orthocomplement of W, (with 
respect to M). Then M, NM, = {0} follows, 
and hence i= Wt, + M,. Every element a of 
Mt is therefore written uniquely in the form 
a,+a,,a,EMt,, a EM; we call a, the M- 
component of a and a, the orthogonal compo- 
nent of a with respect to M. The mapping 
from M to M, assigning a, to a is called the 
orthogonal projection from Wi to WM; it is a 
linear and tidempotent mapping. 


a= 
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H. Distance between Subspaces 


Since the Euclidean space A” is a metric space, 
the distance is defined between any two non- 
empty subsets of A” (— 273 Metric Spaces). 
Let A’, B° be two subspaces of dimensions r, s 
of A", and let d be the distance between them. 
Then it can be shown that there exist points 
pé A’, qe B° such that d= pq, and if p'e A’, 
q'e B are any other points with d=p’q’, then 
pq =p'q'. In particular, when r=0 and s=n— 1 
(i.e., when A" =p is a point and BS=B" ' is a 
hyperplane), the distance d can be obtained as 
follows: If (a, x)=b is an equation of B"! and 
p is the location vector of p, then d=|(a, p)— 
b|/lal. If }aj|=1 in this equation of B""!, 

then d is given simply by |(a, p)— b|. An equa- 
tion (a, x)=b of a hyperplane is said to be in 
Hesse’s normal form if |a| = 1. 


I. Spheres and Subspaces 


The set of points in a Euclidean space lying at 
a fixed distance r from a given point is called 
the sphere of radius r with center at the given 
point. If p is the location vector of the center of 
this sphere with respect to a given rectangular 
coordinate system, then the equation of the 
sphere is |x — p|=r or (x, x) — 2(p, x) +(p, p) — 
r? =0. The set of points lying at equal dis- 
tances from k+ 1 points with location vectors 
Po: P1; --->P, (KÈ 1) is a linear subspace of the 
space (which may be @% or the entire space). If 
these points are independent, then the sub- 
space has dimension n—k, where n is the di- 
mension of the entire space. In particular, if 
these points are vertices of an n-dimensional 
tsimplex, then there is a unique sphere passing 
through them, called the circumscribing sphere 
of the simplex. In this case, the simplex is said 
to be inscribed in the sphere. If po, p,,...,P, 
are location vectors of the vertices of the sim- 
plex, then the equation of the circumscribing 
sphere of the simplex is given by 


1 1 1 1 
Po Pi Pa x|=0. 
Po Pt Pp, Xx? 


When n=2 or 3, there are many classical 
results concerning the circumscribing circle of 
a triangle, the circumscribing sphere of a sim- 
plex, and other figures related to a triangle or 
a simplex. 
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A space satisfying the axioms of Euclidean 
geometry is called a Euclidean space. An taffine 
space having as standard vector space an n- 
dimensional Euclidean tinner product space 
over a real number field R is an n-dimensional 
Euclidean space E”. In an n-dimensional Eu- 
clidean space E”, we fix an torthogonal frame 
Y=(0,E,,...,E,), e=OE;, (e;, e)=ô;. The 
frame X determines *rectangular coordinates 
(X,,X2,-.-,X,) of each point in E". We can thus 
establish a one-to-one correspondence be- 
tween E" and R’= {(x,,...,x,)|x;ER}. In this 
sense we identify £" and R” and usually call R” 
itself a Euclidean space. The 1-dimensional 
space R' is a straight line, and the tCartesian 
product of n copies of R! is an n-dimensional 
Euclidean space (or Cartesian space). Given 
points x =(x,,X,...,X,) and py=(y,,y2,.-.5),) 
in the Euclidean space R”, the ‘distance d(x, y) 
between them is given by 


Wi —Xy)? +... + Yn Xn). 


Thus the distance d(x, y) supplies R” with the 
structure of a tmetric space. We call x; the ith 
coordinate of the point x, the point (0, ..., 0) 
the origin of R”, and the set of points {x} —co 
<x;< 0; x,;=0, j Ai} the x-axis (or ith co- 
ordinate axis). For an integer m such that 
—1<m<n, we define m-dimensional ‘sub- 
spaces in R”; a —1-dimensional subspace is 
the empty set, a 0-dimensional subspace is 

a point, and a 1-dimensional subspace ts a 
straight line. If we take an orthogonal frame, 
an m-dimensional subspace is represented 

as an R™(— 139 Euclidean Geometry; 7 Affine 
Geometry). 

As a ttopological space, R” is flocally com- 
pact and fconnected. A bounded closed set in 
R” is fcompact (Bolzano-Weierstrass theorem). 

Given a point a=(a,,...,a,) in R” and a real 
positive number r, the subset {x |d(x,a)<r} of 
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R” is called an n-dimensional solid sphere, solid 
n-sphere, ball, n-ball, disk, or n-disk with center 
a and radius r, its tinterior {x|d(x,a)<r} an n- 
dimensional open sphere, open n-sphere, open 
ball, open n-ball, open disk, or open n-disk, 

and its tboundary {x|d(x,a)=r} an (n—1)- 
dimensional sphere or (n — 1)-sphere. In partic- 
ular, a 2-dimensional solid sphere is called a 
circular disk, its interior an open circle, and its 
boundary a circumference. A disk or a circum- 
ference is sometimes called simply a circle. 
The family of n-dimensional open spheres 
with center a gives a base for a neighborhood 
system of the point a. Suppose that we are 
given a sphere S and two points x, y on S. 

The points x, y are called antipodal points on 
the sphere S if there exists a straight line L 
passing through the center of S such that SN 
L={x, y}. The segment (or the length of the 
segment) whose endpoints are antipodal points 
is called the diameter of the solid sphere (or 

of the sphere). The notion of tdiameter (— 273 
Metric Spaces) of a solid sphere or of a sphere 
considered as a subset of the metric space 

R’ coincides with the notion of diameter of 
the corresponding set defined above. When 
n> 3, the intersection of a sphere and a 2- 
dimensional plane passing through the center 
of the sphere is called a great circle of the 
sphere. For m such that 1 <m <n, we consider 
an m-dimensional solid sphere or an (m— 1)- 
dimensional sphere in an m-dimensional 

plane R”. These spheres are also called m- 
dimensional solid spheres or (m — 1)-dimen- 
sional spheres in R”. 

In particular, the solid sphere of radius 1 
having the origin as its center is called the unit 
disk, unit ball, or unit cell, and its boundary is 
called the unit sphere. (In particular, when we 
deal with the 2-dimensional space R*, we use 
the term circle instead of sphere, as in unit 
circle.) The points (0, ...,0, 1) and (0,...,0, —1) 
are called the north pole and south pole of the 
unit sphere, respectively. The (n —2)-dimen- 
sional sphere, which is the intersection of the 
unit sphere and the hyperplane x, =0, is called 
the equator; the part of the unit sphere that is 
“above” this hyperplane (1.e., in the half-space 
x, = 0) is called the northern hemisphere, and 
the part that is “below” the hyperplane (i.e., in 
the half-space x, <0) the southern hemisphere. 

Let a;, b; be real numbers satisfying a; < b; 
(i=1,2,...,n). The subset {x|a;<x,;<b,,i= 
1,2,...,n} of R” is called an open interval of 
R”, and the subset {x| a; <x; <b;} a closed 
interval. They are sometimes called rectangles 
(when n= 2), rectangular parallelepipeds, or 
boxes. An open interval is actually an open set 
of R”, and a closed interval is a closed set. In 
particular, the closed interval {x|O<x,;<1, 
i=1,2,...,n} is called the unit cube (or unit n- 
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cube) of R”. We can take the set of open inter- 
vals as base for a neighborhood system of R”. 
All the tconvex closed sets (for example, 
closed intervals) having interior points in 
R" are homeomorphic to an n-dimensional 
solid sphere. A topological space J” that is 
homeomorphic to an n-dimensional solid 
sphere is called an n-dimensional (topological) 
solid sphere, (topological) n-cell, or n-element. A 
topological space S”! homeomorphic to an 
(n— 1)-dimensional sphere is called an (n— 1)- 
dimensional topological sphere (or simply 
(n— 1)-dimensional sphere. The spaces I” and 
S"! are torientable ttopological manifolds 
whose orientations are determined by assign- 
ing the generators of the (relative) thomology 
groups H,(I", I") and H,_,(S"~'), respectively 
(both are infinite cyclic groups). By means 
of the tboundary operator @:H,(I",I")> 
H,,_,(S" +), the orientation of 1" or S"~’ deter- 
mines that of the other. 
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Leonhard Euler (April 15, 1707—-September 18, 
1783) was born in Basel, Switzerland. In his 
mathematical development he was greatly 
influenced by the Bernoullis (— 38 Bernoulli 
Family). He was invited to the St. Petersburg 
Academy in 1726 and remained there until 
1741, when he was invited to Berlin by Fre- 
derick the Great (1712—1786). Euler was active 
at the Berlin Academy until 1766, when he 
returned to St. Petersburg. Already having 
lost the sight of his right eye in 1735, he now 
became blind in his left eye also. This, how- 
ever, did not impede his research in any way, 
and he continued to work actively until his 
death in St. Petersburg. 

Euler was the central figure in the mathe- 
matical activities of the 18th century. He was 
interested in all fields of mathematics, but 
especially in analysis in the style of tLetbniz, 
which had been passed down through the. 
Bernoullis and was developed by him into a 
form that led to the mathematics of the 19th 
century. Through his work analysis became 
more easily applicable to the fields of physics 
and dynamics. He developed calculus further 
and dealt formally with complex numbers. He 
also contributed to such fields as *partial dif- 
ferential equations, the theory of telliptic func- 
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tions, and the tcalculus of variations. He con- 
tributed much to the progress of algebra and 
theory of numbers in this period, and also did 
pioneering work in topology. He had, how- 
ever, little of the concern for rigorous founda- 
tions that characterized the 19th century. He 
was the most prolific mathematician of all time, 
and his collected works are still incomplete, 
though some seventy volumes have already 
been published. 
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A. History 


By the evaluation of a function f(x) we mean 
the application of algorithms for obtaining the 
approximate value of the function. The evalu- 
ation methods are classified roughly into two 
groups: (1) evaluation of functions using ap- 
proximation formulas, and (2) evaluation using 
microprogramming techniques. The advent 
of high-speed computers has brought about 
drastic changes in the evaluation of functions. 
Before the introduction of high-speed com- 
puters in the 1940s, mathematical tables had 
played a prominent role. The first issue (pub- 
lished in 1943) of the journal Mathematical 
tables and other aids to computation (MTAC), 
the predecessor of the journal Mathematics of 
computation, was primarily concerned with 
tables of mathematical functions. One of the 
aims of this journal was to facilitate the ex- 
change of information on errors in the tables. 
The tables, obtained in the past by tedious 
hand calculation, can now be easily prepared 
by high-speed computers, and there was a 
period when more accurate and extensive 
tables were published one after another. It 

is ironic, however, that high-speed comput- 
ers revolutionized numerical analysis and 
prompted a shift of emphasis in the field, 
beginning in the 1950s, away from the use of 
numerical tables and toward exploration of 
the most efficient methods of approximation of 
the functions, thus causing a rapid decrease in 
the need for tables. 

Recently, a significant trend in computer 
design has replaced the conventional logic 
control section with “stored logic,” or micro- 
programmed control, stored in high-speed, 
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nondestructive Read-Only Storage (ROS). For 
example, the microprogrammed control used 
for the unified COordinate Rotation DIgital 
Computer (CORDIC) algorithm is effective in 
calculating elementary functions because of its 
simplicity, accuracy, and capability of high- 
speed execution via parallel processing. It is 
not clear whether the applications of approxi- 
mation formulas heretofore in use will be 
superseded by microprogramming techniques 
in all kinds of computers. However, the value 
of the approximation formulas recognized in 
the 1950s has been declining insofar as elemen- 
tary functions are concerned. Recently, as a 
method for the evaluation of functions based 
on a new viewpoint, some “unrestricted” al- 
gorithms have been proposed by Brent [1], 
which are useful for the computation of ele- 
mentary and special functions when the re- 
quired precision is not known in advance or 
when high accuracy is necessary. It is expected 
that methods for the evaluation of functions 
using approximation formulas will be further 
developed as microprogramming techniques 
and unrestricted algorithms see wider use. 


B. Evaluation of Functions Using 
Approximation Formulas 


Suppose we approximate a function f(x) by a 
function g(x) using the following class of func- 
tions p;(x) and q,(x). Let continuous functions 
p(x), ..., Pa(x) and q,(x),...,4,(x) defined on 
a closed interval [a, b] satisfy the following 
conditions: (i) p,,...,p, and q3, ---, qm are both 
linearly independent; (ii) there exist at most a 
finite number of zeros for ©, b.q,(x) in [a,b] 
for any choice of b4, b3, ..., bm, except for the 
case b, =b,=...=b,,=0; (iii) there is a con- 
tinuous function g(x) with a nonzero denomi- 
nator in [a,b] representable as 


alx)= > apito) | È bao) (1) 


where a;,i=1,...,n, and b;, j=0,1,...,m, are 
constants. Then g(x) is called a generalized 
rational function based on a class of functions 
{p(x)},i=1,...,n, and {q,(x)}, j=1,...,m. If 
p(x)=x'~! and q,(x)=x/"+, (1) is reduced to a 
rational function. If m=1 and q,(x)= 1, (1) 

is a linear combination of p,(x) and is called 
an approximation of linear type to f(x); and 
further, if p,(x)=x‘~', (1) is reduced to a poly- 
nomial. The crux of the approximation prob- 
lem lies in the criterion to be used in choosing 
the approximate constants in (1). There are 
three methods for choosing them, which lead 
to three types of approximation of major 
importance: (1) interpolatory approximations, 
(ii) tleast-squares approximations, and (iii) 
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min-max error approximations (sometimes 
called best approximations). As a major aim of 
computer approximation of a function is to 
make the maximum error as small as possible, 
the third type of approximation has been used 
for digital computers. 

For every continuous function f(x), there 
always exist min-max error approximations of 
the form (1). In generating a min-max error 
approximation in practice, we essentially 
depend upon the following conditions and 
theorems. Let a function space F be a d- 
dimensional linear space. If for fe E which is 
not identically zero, there exist at most (d—1) 
zeros of f(x) in [a, b], then F is called the 
unisolyent space or the Haar space. 

Let one of the best approximations to 
a continuous function f(x) be g(x). If the 
linear space D of all the functions of the form 
{Z a.p(x)+ È b,g(x)q,{x)} is unisolvent, then 
the best approximation is unique. For the 
case where g(x) is an approximation of linear 
type to f(x), Haar [2] proved the following 
theorem. 

Let g(x) be the best approximation to f(x) 
and g(x)= D7", a;p,(x). A necessary and suffi- 
cient condition for the best approximation to 
be always unique is that the m-dimensional 
linear space generated by the functions p,, p>, 

.-, Pm be unisolvent. In particular, we have 
the best unique polynomial approximation 
(one variable) of f(x) defined on [a, b]. 

Let the necessary and sufficient condition 
of the previous theorem be satisfied in a d- 
dimensional linear space D. A necessary and 
sufficient condition for g(x) given by (1) to be 
the best approximation of f(x) is that there 
exist points a S Xo < X; <... <Xg-1 < Xa SÞ, 
called deviation points, such that (—1)'( f(x) — 
g(x;)) =p (i=, 1, ...,d). 

A great number of algorithms are known 
by means of which one can calculate the best 
approximation g(x) of a function f(x) for the 
given values of n and m and for p,{x)=x!! 
and g(x)=x/~' in (1). However, the following 
three kinds of algorithms are used most fre- 
quently for generating min-max approxima- 
tions on computers: Remes’s second algorithm 
[3], the differential correction algorithm, and 
Yamauti’s folding-up method. 


C. Evaluation of Elementary Functions Based 
on Microprogramming 


Chen [6] has given a general algorithm for 
calculating an elementary function z = f(x) as 
follows. Let F(x, y)= yg(x) +h(x), where y is 
some parameter for evaluating zo = f (xo) = 
F(X, Yo). We assume that (xo, Yọ) has been 
given and that z, is unknown. Suppose that a 
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new point (Xk+1, Yz+1) 18 obtained from (x;, Yy) 
according to a pair of transformations, X+; = 
(Xk Y and y+: =Y (Xk Yı), keeping the 
value of F(xọ, yo) invariant. If x, is forced to 
converge to x,, and if g(x,,)=1 and h(x„)=0, 
then zo = f(xo)= F{xo; Yo) = F(X Y=. = 
F(X, Yo)= Yo. In this procedure it is necessary 
to determine g(x), h(x), (x), and W(x) for 

the given f(x). Most of the elementary func- 
tions can be evaluated by Chen’s algorithm, 
which is essentially identical with Specker’s 
Sequential Table Look-up (STL) method 
based on addition formulas [7], e.g., logx = 
log(xa)—log« or e*=e* fe’, The iteration 
equations (2) of CORDIC given later can also 
be derived by the STL method with complex 
numbers. 

The use of coordinate rotation to evaluate 
elementary functions ts not new. In 1956 and 
1959 Volder [8] described the CORDIC for 
the calculation of trigonometric functions, 
multiplication, division, and conversion be- 
tween the binary and the decimal and r-adic 
number systems. Daggett, in 1959, discussed 
the use of CORDIC for decimal-binary con- 
version. It was recognized by Walther [9] in 
1971 that these algorithms could be merged 
into one unified algorithm. Consider coordi- 
nate systems parameterized by m in which the 
radius R and angle A of the vector P=(x, y) 
are defined as R =(x* +my?)'? and A =m"? 
arctan(m'? y/x). The basis of Walther’s algo- 
rithm is the coordinate rotation in a linear 
(m=O), circular (m= 1), or hyperbolic (m= —1) 
coordinate system, depending on which func- 
tion is calculated. Iteration equations of 
CORDIC are as follows. The point (x;.,, ¥i41, 
Z;+1) 18 obtained from the point (x,, y;, z;) by 
means of the transformation 


Xi+1 =X; + my,d;, 

Yi+ı = Yi Xi9;, (2) 
Zi = Zi 4%, 

where m is a parameter for the coordinate 
system, x; =m! arctan (m"?6,), and 6, is a 
suitable value, e.g., +2~' The angle A;,, and 
radius R;,, are A;,, =A;—a,; and R;,,=R; x 
K,;=R, x (1+m6?)'?. After n iterations we find 
A, =A j—a and R,= Ro x K, and then 


X,=K {xo cos(am"?)+ yom’? sin(am'”)}, 
Vn = K {yo cos(am'?)—xgm 12 sin(am"?)}, 
Zn — Zo + X, 


where «= 7-34, and K =J]?_} K;. These 
relations are summarized in Table 1 for m=1, 
m=O, and m= —1 in the following special 
cases. (i) The value of A is forced to converge 
to zero; y, 0. (ii) The value of z is forced to 
converge to zero; z,—0. 


142 D 558 
Evaluation of Functions 
Table 1. Input and Output Functions for CORDIC 

Input Quantity 
Function m Xo Yo Zo to be 0 Output 
sint 1 1/K 0 t Z,70 y,—sint 
cost 1 1/K 0 t 2,70 xX, Cost 
tan 't 1 1 t 0 y,-0 z,—~tan't 
XZ 0 x 0 A Z,70 Vp XZ 
y/x 0 x y 0 Yn 0 2, y/x 
sinh t —1 1/K_, 0 t Z,20 y, >sinht 
cosht —| 1/K_, 0 t Z, 70 x,—cosht 
tanh !t —1 1 t 0 y,-70 z,—tanh't 


n~-1 


K_,=[] @-6?)"” 


n-1 
K,=[] (+622, 
j=0 


D. Fast Fourier Transform (FFT) 


When a function f(x) can be taken to be 
periodic, it is advantageous to use trigono- 
metric polynomials as least-squares approxi- 
mating functions. The summations arising in 
least-squares approximations based on the 
trigonometric polynomials play an important 
role in various applications, and a quite effi- 
cient algorithm for the evaluation of such 
sums was developed by Cooley and Tukey 
[10] in 1965, known as the fast Fourier trans- 
form (FFT). Let x,,,(0<m<N-—1) bea set of 
complex numbers, and consider 


X,=(1/N) ¥. xqexp(—2nimn/N) 
0 


(0<n<N-1). (3) 


Equation (3) is often called the discrete 
Fourier transform (DFT) of the sequence xm, 
it being analogous to the continuous tFourier 
transform, 


T 


xan] Sons —2nint/T)dt 


0 
(O<n<N), (4) 


where x(t) is a periodic function of t with 
period T. X,, 1s called the nth Fourier coeffi- 
cient of a set of N equally spaced samples of 
size N for the function x(t) (t=jT/N,O<j< 
N). In the same way as for the continuous 
Fourier transform, the discrete transform can 
be inverted to yield 

N-1 


X,= È} Xmexpl2ninm/N) 


m=0 


(0<n<N-1). 


Here x, is called the coefficient of the inverse 
Fourier transform, and X,, and x, thus form a 
transform pair. 

The FFT algorithm for a finite sequence of 
length N in (3) is based on the fact that the 
calculation of (3) can be performed in stages by 
using the direct product decomposition if N = 
N, N3, with N, and N, relatively prime. For 


i=9 


example, the 2-dimensional Fourier transform 
coefficient is given by 


Xn jong = LIAN: N2)] x 

N-i N31 

$} Dd exp(—2zi(n,m,/N, +n2m/N2))Xm,.m, 
m,=0 m,=0 


(O<n, <N,-1, O<n,<N,- 1). 


If by an elementary operation we mean one 
complex multiplication and one complex ad- 
dition, we can evaluate X,,_,,, through (N, N3)? 
such operations using Horner’s scheme. By 
the direct product decomposition method, 
however, the (N, N,)” operations can be re- 
duced to only N, N,(N, + N3) operations. 
Because the matrix corresponding to the trans- 
formation mentioned above is a direct product 
of N; x N, and N, x N, matrices, we can per- 
form the calculations in two stages: first to 
obtain Čm,n, for O<m,<N,—1 and 0<n,<N, 
—1 and then to obtain X,, n, forO<n,<N,—1 
and O0O<n, <N,—1. We have 


N,-1 

Emn, =(1/N2) 2 exp(—2mingm,/N)Xm,.m> 
m,=0 
N,-1 

Xn n, =(1/N;) » exp(—2ninym,/Ni)om,n,- 
m,=0 


This direct product decomposition method is 
well known for 2- or 3-dimensional Fourier 
transforms. Even for a 1-dimensional Fourier 
transform of length N = N, N3, if N, and N, are 
relatively prime, one can use the method of 
direct product decomposition. Even when N, 
and N, are not prime, we can use a method 
of “pseudo”—direct product decomposition to 
reduce the number of operations. Namely, we 
can put m=m,+N,m, (O<m,<N,,0<m,< 
N,);n=Nj,n, +n, (O<n, <N,,0<n,<N)). 
Then, similarly as before, X, in (3) can be 
rewritten as 

N,-1 
X,=(1/N,) }, exp(—2zin,m,/N,) 


m,=0 


x exp(—2rim; n/N, N2))¢ 


Mj, ñg) 
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where 
Ny-1 
owe =(1/N.) >, exp(—2rinym/N2)Xm,+N,m 
m,=0 
(5) 
which is a DFT of length N,. If we put 
Smm, = EXP( — 2m n3 /(N, Na))Čm, n> (6) 
then 
N,-1 . 
X,=(1/N,) }, exp(—2ninymy/Ny)n, ny: 
m,=0 
(7) 


Formula (7) is nothing but a DFT of length 
N,. Thus an FFT of length N =N, N, can 

be calculated by decomposing it into three 
stages as follows: (1) obtain N, transforms 

of length N, in (5); (it) multiply čm,,n, by 
exp(—2zim,n,/N) in (6) (phase rotation); (iii) 
calculate N, transforms of length N, in (7). 

If either or both of N, and N, can be factored 
further so that, e.g., N=N,N,=N,N,,No.= 
..., an FFT of length N, can be calculated 
similarly by decomposing it, and so on, and 
in this way one can reduce the total number 
of operations. This is the principle of FFT 
pointed out by H. Takahasi. When N is a 
power of two, it can be shown that the FFT 
algorithm requires approximately N log, N/2 
operations. 


E. Padé Approximation 


Let f(x)~cg+c¢,x+0¢,x7?+4+... be a formal 
power series. For any pair of nonnegative 
integers (p,q), we define the (p, q)th Padé ap- 
proximation of f(x) as follows: The Padé ap- 
proximation is a rational function 


(dg ta,x+a,x?+...+a,x?) 
[bo tbx +b2x? +... +b, x4) 


satisfying the condition that all terms in the 
formal power series 


(bo +bix +... +0, x) (Cp +e, x +...) 
—(a4g+ayx+...+4a,x?) 


should vanish up to the term x?**. An infi- 
nite matrix whose (p, q)th entry is the (p, q)th 
Padé approximation is called the Padé table 
for f(x). The Padé approximation is uniquely 
determined, provided that every Hankel 
determinant 


Cpo+q Cptqti = 


never vanishes. 
When we expand f(x) into a continued 
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Col  %X| œx] 
Te ae 


then its 2pth and (2p+ 1)th approximate frac- 
tions are the (p, p)th and (p+ 1, p)th Padé 
approximation of f(x), respectively. 
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A. General Remarks 


The notable relation between the lengths of 
certain families of curves in a plane domain 
and the area of the domain has long been 
recognized and utilized in function theory. 

L. V. Ahlfors and A. Beurling formulated this 
relation by introducing the notion of extremal 
length for families of curves [1]. Although 
there are various definitions of extremal 
length, they are essentially the same except for 
one due to J. Hersch [3] and A. Pfluger [2]. 

The image of an interval under a continuous 
mapping is called a curve. We say that it is 
locally rectifiable if every ‘continuous arc 
of the curve is trectifiable. Let C be a finite 
or countable collection of locally rectifiable 
curves in a plane and p, 0< p < œ, be a tBaire 
function defined in the plane. Represent C in 
terms of arc length s (— 246 Length and Area), 
and set <C, p> = fc p ds. For a family T of finite 
or countable collections C, p is called admis- 
sible if <C, p> >1 for every Cer. If no CeT 
consists of a finite or countable number of 
points, then p = œ is always admissible for 
I. Call inf {f f p? dxdy}, where p runs over 
admissible Baire functions, the module of I, 
and denote it by M(T). The reciprocal A(T) = 
1/M (T) is called the extremal length of I. If 
no Baire functions are admissible for T, we 
set 4([.)=0. The extremal length is defined 
equivalently in two other ways as follows: 

Let p be a nonnegative Baire function, and 
put L(T, p)=inf{<C, p>|CeQ}, then A(T) = 
sup L(T, p)’/{ | p? dx dy, where p runs over 
nonnegative Baire functions. Next, let D be 
the collection of nonnegative Baire func- 
tions p such that f f p?dxdy<1; then A(I)= 
sup{L(T, p)?|pe®}. We obtain the same 
value for A(T) if we-require an admissible p to 
be flower semicontinuous. If p is required to 
be continuous, then the extremal length de- 
fined by Hersch and Pfluger is obtained. As is 
shown in example (1) of Section B, there is a 
case where the two definitions actually differ. 

If an admissible p yields M(T)= f f p* dx dy, 
then p|ds| is called an extremal metric. Beur- 
ling gave a necessary and sufficient condition 
for a metric to be extremal [4]. 

We list four properties of extremal length: 
(1) A) > A(T) if T, c F. (2) M(Ua T) < 
È,„M(T,). (3) Let {Tp} and F be given. Suppose 
that there are mutually disjoint measurable 
sets {£,} such that each CET, is contained 
in E,. If each element of | „T, contains at 
least one CeT, then M(T)> £, M(I,), and 
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hence 
M (u r,) = X M(I,). 


If each CeT contains at least one CeT, for 
every n, then A(T) >>, A(T). (4) Let f be an 
analytic function in a domain Q and {C} be 
given in Q. Denote by f(C) the image of C by 
f. Then 2({C}) <A({ f(C)}). The equality holds 
if f is one-to-one. This shows that A({C}) is 
conformally invariant. 


B. Extremal Distance 


Let Q be a domain in a plane, 6Q its bound- 
ary, and X,, X, sets on QU GQ. The extremal 
length of the family of curves in Q connecting 
points of X, and points of X, is called the 
extremal distance between X, and X, (relative 
to Q) and is denoted by 2o(X1, X2). 

Example (1). Let Q={z||z| <2}, X,=6Q, 
and X, be a countable set in |z|< 1 such that 
the set of accumulation points of X, coincides 
with |z|= 1. Then A,(X,, X,)= œ, but the 
extremal distance in the sense of Hersch and 
Pfluger is equal to (2x) ‘log 2. 

Example (2). In a rectangle with sides a and 
b, the extremal distance between the sides of 
length a is b/a. 

Example (3). Let Q be an annulus r; < 
|z|<r,. The extremal distance A between 
the two boundary circles of Q is equal to 
(2x)~' log(r,/r,). The extremal length of the 
family of curves in Q homotopic to the bound- 
ary circles is equal to 1/4. 

Example (4). Let Q be a domain in the ex- 
tended z-plane such that œo e Q. Let zE Q and 
{|Z—Zo| =r} <Q, and denote by A, the ex- 
tremal distance between {|z—z)|=r} and a set 
X <éQ relative to Q. Then 4,— (27) + logr 
increases with r. We call the limit the reduced 
extremal distance and denote it by Jp(X, o0). 
*Robin’s constant for tGreen’s function in Q 
with pole at z= œ is equal to 2n49(AQ, o0). 

Extremal length is also defined on Riemann 
surfaces. Some classical conformal invariants 
can be given in a generalized form in terms of 
extremal length. The notion of extremal length 
has applications in various branches of func- 
tion theory, such as tconformal and *quasi- 
conformal mappings, the tPhragmén-Lindelof 
theorem, the tcoefficient problem, the ttype 
problem of Riemann surfaces, and studies of 
the boundary properties of functions of finite 
Dirichlet integrals. It is also applied to prob- 
lems in differential geometry. Extending the 
notion of extremal length, M. Ohtsuka con- 
sidered extremal length with weight, and B. 
Fuglede introduced the notion of generalized 
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module in higher-dimensional spaces [8]. 
These notions have useful properties and 
applications. 


References 


[1] L. V. Ahlfors and A. Beurling, Conformal 
invariants and function-theoretic null-sets, 
Acta Math., 83 (1950), 101-129. 

[2] A. Pfluger, Extremallangen und Kapazitat, 
Comment. Math. Helv., 29 (1955), 120-131. 
[3] J. Hersch, Longuers extrémales et théorie 
des fonctions, Comment. Math. Helv., 29 
(1955), 301—337. 

[4] M. Ohtsuka, Dirichlet problem, extremal 
length and prime ends, Van Nostrand, 1970. 
[5] Y. Kusunoki, Some classes of Riemann 
surfaces characterized by the extremal length, 
Proc. Japan Acad., 32 (1956). 

[6] M. Ohtsuka, On limit of BLD functions 
along curves, J. Sci. Hiroshima Univ., 28 
(1964). 

[7] T. Fujrr’e, Boundary behaviors of Dirich- 
let functions, J. Math. Kyoto Univ., 10 (1970). 
[8] B. Fuglede, Extremal length and functional 
completion, Acta Math., 98 (1957), 171-219. 
[9] J. A. Jenkins, Univalent functions and 
conformal mapping, Erg. Math., Springer, 
1958. 

[10] J. Vaisala, Lectures on n-dimensional 
quasiconformal mappings, Lecture notes in 
math. 229, Springer, 1971. 


143 Ref. 
Extremal Length 


144 Ref. 
Fermat, Pierre de 


144 (XXI1.23) 
Fermat, Pierre de 


Pierre de Fermat (August 20, 1601—January 
12, 1665) was born into a family of leather 
merchants near Toulouse, France. He became 
an attorney and in 1631 a member of the 
Toulouse district assembly. When not engaged 
in such work, he did research in mathematics, 
so that he consigned his results only to his 
correspondence or to unpublished manu- 
scripts. The manuscripts were published post- 
humously by his son in 1679 and are known as 
Varia opera mathematica. His research into 
number theory, stimulated by Bachet’s (1581- 
1638) translation of the Arithmetika of Dio- 
/phantus (published in 1621), made Fermat’s 
name immortal and initiated modern num- 
ber theory. He posed the famous Fermat’s 
Problem, which has yet to be solved (— 145 
Fermat’s Problem). He began analytic geome- 
try by studying the theory of conic sections of 
Apollonius, and utilizing this theory he dealt 
with the notions of tangent lines, maximal 
(minimal) values of functions, and quadrature, 
which made him a pioneer in calculus. He also 
wrote a precursory work in the theory of prob- 
ability in the course of his correspondence 
with *Pascal. *Fermat’s principle is important 
in the field of optics, where it is known as the 
law of least action. Unlike ‘Descartes, he em- 
phasized the revival rather than the criticism 
of Greek mathematics. 
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The last theorem of Fermat (c. 1637) asserts 
that if n is a natural number greater than 2, 
then 


x" y"=2" (1) 


has no rational integral solution x, y, z with 
xyz #0. In the case n=2, equation (1) has 
integral solutions called Pythagorean numbers 
(— 118 Diophantine Equations). Fermat read 
a Latin translation of Diophantus’ Arith- 
metika, in which the problem of finding all 
Pythagorean numbers ts treated. In his per- 
sonal copy of that book, Fermat wrote his as- 
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sertion as a marginal note at the point at which 
n=2 in equation (1) is treated and added the 
famous words, “I have discovered a truly 
remarkable proof of this theorem which this 
margin is too small to contain.” It is not 
known whether Fermat actually had a proof. 
Fermat’s problem asks for a proof or disproof 
of this conjecture, which itself has not been 
solved despite centuries of efforts by many 
mathematicians; but its study has promoted 
remarkable advances in number theory. In 
particular, E. E. Kummer’s theory of ideal 
numbers and the development of the theory of 
teyclotomic fields were originally conceived in 
treating Fermat’s problem. 

In this article, we consider only those in- 
tegral solutions x, y, z of equation (1) with 
xyz #0 that are relatively prime. We also 
restrict ourselves to the cases n=] (odd prime) 
and n=4, without loss of generality. 

For smaller values of n, the nonsolvability of 
equation (1) was proved long ago, for n=3 by 
L. Euler (1770), and later again by A. M. Le- 
gendre; for n=4 by Fermat and Euler; for n=5 
by P. G. L. Dirichlet and Legendre (1825); and 
for n=7 by G. Lamé (1839). S. Germain and 
Legendre found some results on more general 
cases, but the most remarkable result was ob- 
tained by Kummer (J. Reine Angew. Math., 40 
(1850), Abh. Akad. Wiss. Berlin (1857)). 

Let I be an odd prime, ¢ a primitive Ith root 
of unity, and h the fclass number of the cyclo- 
tomic field Q(¢). Then the class number h, of 
the real subfield Q(t +671) of Q(f) divides h. 
We call h, =h/h, and h, the tfirst and fsecond 
factors of h, respectively. 

(1) If / is tregular, that is, if (h,/)=1, then 
x'+ y'=z' has no solution (Kummer, 1850). 

There are infinitely many irregular primes 
[3]; those under 100 are 37, 59, and 67. There 
are 7,128 regular primes and 4,605 irregular 
primes between 3 and 125,000. It is not yet 
known whether there are infinitely many regu- 
lar prime numbers, although the beginning 
part of the sequence of natural numbers con- 
tains a larger number of these than the num- 
ber of irregular prime numbers. The condition 
(i, h)=1 is equivalent to saying that the numer- 
ators of *Bernoulli numbers B,,, (m=1,2,..., 
(!—3)/2) are not divisible by / (Kummer, 1850). 

Kummer obtained a result on irregular 
primes (1857) which was improved later as 
follows. Note that if / is not regular then h, is 
divisible by / (Kummer, 1850) (— 14 Algebraic 
Number Fields). 

(2) If (h,,/)=1 and the numerators of Ber- 
noulli numbers B,,,; (m= 1,2, ...,(/—3)/2) are 
not divisible by 1°, then x'+ y'=2' has no 
solution (H. S. Vandiver, Trans. Amer. Math. 
Soc., 31 (1929)). By computation Vandiver 
confirmed that x'+ y'=2' has no solution for 
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1<619. At present, this procedure has been 
continued for /< 125,080 using computers by 
the method of D. H. Lehmer, E. Lehmer, and 
H. S. Vandiver, Proc. Nat. Acad. Sci US, 40 
(1954) (S. S. Wagstaff, Math. Comp., 32 (1978)). 

When the condition (xyz, /)=1 or (xyz, J)=I 
is added, we speak of Case I or Case II, respec- 
tively. The following theorems hold for Case I. 

(3) If x'+ y'=z! has a solution in Case I, 
then 


BomJSi-am (t)=0(mod!), m=1,2,...,(1—3)/2, 


(2) 


holds for —t=x/y, y/x, y/z, z/y, x/z, and z/x, 
where f (=E r" r, and B,, is the mth 
Bernoulli number. This is called Kummer’s 
criterion (D. Mirimanov, 1905). 
A simplification of the above result is 
(4a) If x'+ y'=2!' has a solution in Case I, 
then 


(2'-' —1)/1=0 (mod!) 


(A. Wieferich, J. Reine Angew. Math., 136 
(1909)). This result created a sensation at the 
time of its publication. It was first shown that 
1093 and 3511 are the only primes with /< 
3700 for which the above congruence holds; it 
is presently known that no other / with 1<6 x 
10° satisfies this congruence. The criterion 
(4a) was gradually improved by Mirimanov 
(1910, 1911), P. Furtwängler (1912), Vandiver 
(1914), G. Frobenius (1914), F. Pollaczeck 
(1917), T. Morishima (1931), and J. B. Rosser 
(1940, 1941). For example: 

(4b) If x’ + y'=z' has a solution in Case I, 
then 


(m'~! — 1)/1=0 (mod!) (3) 


holds for all m with 2 <m <43. By means 

of this result, Rosser (1941) showed for l< 
41,000,000, and D. H. Lehmer and E. Lehmer 
(Bull. Amer. Math. Soc., 47 (1941)) showed for 
1< 253,747,889 that x’ + y'=z' has no solution 
in Case I. 

We have hitherto been concerned with 
rational integral solutions of x’ + y'= z". We 
may also consider the problem of proving or 
disproving that «' + f'=y' has no solution 
a, P, y with «By 40 in the ring of talgebraic 
integers of Q(¢). Case I means the impossibil- 
ity of 


of + fi+y'=0, (afy,D=1, (4) 
and Case II means the impossibility of 
a+ fl=el™y', (ay, I)=1, (5) 


where n is a natural number, g is a tunit in 
Q(g), and A=1—(. We have the following 
results: 

(1*) If (h, /)=1, then neither equation (4) nor 
equation (5) has a solution (Kummer, 1850). 
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(2*) Under the same conditions as in state- 
ment (2), equation (4) has no solution. If we 
additionally restrict «, B, y to relatively prime 
integers of Q({ +¢7+) and replace 4 by (1 — 
¢1—¢7"), then equation (5) also has no solu- 
tion (Vandiver, 1929). 

(4b*) If equation (4) has solutions «, $, y in 
Q(¢), then congruence relation (3) holds for all 
m with 2<m<43 (Morishima, 1934). 

When / is sufficiently large, we have the 
results of M. Krasner (C. R. Acad. Sci. Paris 
(1934)) and Morishima (Proc. Japan Acad., 11 
(1935)). 

Bibliographies are given in Vandiver and 
Wahlin [1] and Vandiver [2]. 
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A. Introduction 


As the S-matrix or tGreen’s function in quan- 
tum field theory is usually prohibitively dif- 
ficult to calculate, perturbative expansions in 
terms of coupling constants have been em- 
ployed since the beginning of the theory (— 
386 S-Matrices). R. P. Feynman (Phys. Rev., 76 
(1949)) invented a way of calculating the series 
in terms of Feynman integrals. His method 
drastically simplified the preceding method 
due to S. Tomonaga and J. S. Schwinger, even 
though, as was later shown, the two methods 
are theoretically equivalent (F. J. Dyson, Phys. 
Rev., 75 (1949)). A Feynman integral is an 
integral associated with a Feynman graph 
according to the Feynman rule explained 

in Section B. Feynman integrals inherit the 
troublesome problem of divergence, and some 
recipe which systematically provides them 
with a definite meaning is needed. Such a 
recipe is given by the renormalization the- 

ory of Tomonaga, Schwinger, Feynman, and 
Dyson. A mathematically rigorous renormali- 
zation theory was given by N. N. Bogolyubov 
and O. S. Parasiuk (Acta Math., 97 (1957)), 
later supplemented by K. Hepp (Comm. Math. 
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Phys., 2 (1966)). See also W. Zimmermann 
(Comm. Math. Phys., 15 (1969)) and S. A. 
Anikin et al. (Theor. Math. Phys., 17 (1973)). 
Furthermore, E. R. Speer [1] gave a mathe- 
matically convenient recipe of renormalization 
(under the condition that massless particles 
are irrelevant). Although the series expansion 
in coupling constants is a divergent series 
even after renormalization (— 386 S-Matrices), 
the study of Feynman integrals has given 
much insight into the qualitative aspects of the 
S-matrix, and in particular, into its analytic 
structure (e.g., R. J. Eden et al. [2]). In this 
respect the discovery of the Landau-Nakanishi 
equations, which describe the location of sin- 
gularities of Feynman integrals, was crucially 
important (L. D. Landau, N. Nakanishi, and 
J. Bjorken, 1959; — Section C). Later, R. E. 
Cutkosky found a formula which gives tmpor- 
tant information concerning the ramification 
of Feynman integrals near their singularities 
(— Section C). It gave impetus to J. Leray’s 
mathematical study of Feynman integrals 
from the viewpoint of integration of multi- 
valued analytic functions (Leray, Bull. Soc. 
Math. France, 87 (1959)). Such studies were 
subsequently carried out by D. Fotiadi, M. 
Froissart, J. Lascoux, F. Pham, etc.; — [3-5] 
and references cited there for this topic. An 
extensive study by G. Ponzano, T. Regge, 
Speer, and J. M. Westwater on the mono- 
dromy structure of Feynman integrals is 
closely related to the studies by Pham and 
others (— Regge in [6] and references cited 
there). On the other hand, the progress of 
microlocal analysis has thrown new light on 
the Feynman integrals and has given a unified 
foundation to these various other studies (— 
Section C; also Pham, M. Kashiwara, and T. 
Kawai in [6], M. Sato et al. in [6] and refer- 
ences cited there). 


B. Definitions 


First, the notion of Feynman graphs is intro- 
duced. A Feynman graph is sometimes called a 
Feynman diagram. A Feynman graph G con- 
sists of finitely many points (called vertices) 
{Vj} j-1,.... finitely many 1-dimensional seg- 
ments (called internal lines) {L,};-,,...y and 
finitely many half-lines (called external lines) 
FLE}, =1.....n all of which are located in a 4- 
dimensional affine space. Each of the end- 
points W,* and W, of L, and the endpoint of 
L£ coincide with some vertex V;. A four-vector 
P, =(P;.0> Pr, 15 Pr.2» Pr,3) 18 associated with each 
external line LE and a constant m,>0 is as- 
sociated with each internal line L,. For sim- 
plicity, we usually suppose, in addition, that 
each internal line and each external line are 
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oriented, that W,* Æ W, and that G is con- 
nected. The orientation of a line is indicated by 
the symbol > . Given an orientation, we de- 
fine the incidence number [j:/] to be +1 or 
—1 according to whether L, ends or starts from 
V,. In other cases, [ j:/] is defined to be zero. 
The incidence number [j:r] is defined in the 
same way but with L? replacing L, (Fig. 1). 


Fig. 1 

In this example of a Feynman graph, the internal 
lines L, and L, do not intersect and this diagram 
should be drawn in R*, not in R?; this is indicated 
by `X. For convenience, multiple lines such as L, 
and L, are usually drawn in a curvilinear manner, 
as shown. 


The Feynman rule associates the following 
integral F(p) with each Feynman graph G: 


ra= | a 9 (Lp Cir p+ Dea LAD 


y i(k?-m +, /-1 0) 
N 
x$ dtk, (1) 
l=1 
where k? =k? o — È}? k7,. Here 1/(k? —mj + 
</—1 0) means lim, jo(1Ak? —m? + /—1 8) 
(— 125 Distributions and Hyperfunctions). 

Here we consider the case where the interac- 
tion Lagrangian density does not contain 
differential operators (i.e., direct coupling) and 
all relevant particles are spinless. In general, 
we should multiply the integrand of F,(p) by a 
matrix of polynomials of the p, and k;. 

F;(p) has the form 5(S,,Li:r]p,)fa(p); we 
often investigate f,(p) instead of F,(p). The 
function f;,(p) is studied on M=4.,{peR™| 
È; [j:r]p,=0}, and is called a Feynman 
amplitude. The integral (1) is not well defined 
as it stands because of the following prob- 
lems: (a) Is the product appearing in the inte- 
grand well defined? (b) Is the integral conver- 
gent? The first problem is not serious if m, #0 
(— 274 Microlocal Analysis E) However, the 
second problem, called the ultraviolet diver- 
gence, is serious. The renormalization proce- 
dure is intended to overcome this difficulty. 
When some m, is equal to zero, even the first 
problem, called the infrared divergence, is 
serious. See D. R. Yennie et al. (Ann. Phys., 13 
(1961)) and T. Kinoshita (J. Math. Phys., 3 
(1962)) for analyses of the infrared divergenee. 
In this article we always assume for simplicity 
that every m, is strictly positive, even though 
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such an assumption is too restrictive from the 
physical viewpoint. 

Calculations of Feynman integrals are often 
done by means of the parametric representa- 
tion of the integral (1), of the form 


(zun) shy 


a 611 — LL a) da 
0 U(a)?(V(p, a) + /_| Q) Nt 20-2? 


where U(x) and V(p, a) are determined by the 
topological structure of the graph G. See [7] 
for the derivation of this formula and the 
treatment of the integral written in this form. 
It is useful not only for the study of the sin- 
gularity structure of F,(p) but also for the 
study of spectral representations, etc. (—, e.g., 
[7,8]). Note that several different notations 
are used in the literature for the parametric 
representation of the integral. Hence one 
should be careful in referring to papers which 
use parametric representations. 


C. Analytic Properties of Feynman Integrals 


The celebrated result of Landau (Nuclear 
Phys., 13 (1959)), Nakanishi (Prog. Theor. 
Phys., 22 (1959)) and Bjorken (thesis, Stanford 
Univ., 1959) asserts that the singularities of a 
Feynman amplitude f,(p) are confined to the 
subset L.*(G) (called a positive-~ Landau- 
Nakanishi variety) of M ={peR*"|>,,[j:r]p, 
= 0}, defined by the following set of equations 
(called Landau-Nakanishi equations), where u,, 
w;, and k; are real four-vectors and o is a real 
number, all of which are to be eliminated to 
define relations among the p, (note, however, 
that a positive-« Landau-Nakanishi variety is 
not strictly a subvariety of M, because of the 
constraint (2e): 


Lam 


Uu, = 
j 


Cir, (r=1,...,n), (2a) 


il 


n N 
DL lest L Lj:I]kı=0 (j=1,...,n°, 
r=1 i= 


(2b) 
Li:]wj=a,k, (1=1,...,.N), (2c) 
j=l 
a(k?7—m7)=0 (I=1,...,N), (2d) 
ot, = 0, (2e) 
with some 
a, #0. (2f) 


Usually a Landau-Nakanishi variety (resp., 
equation) is called a Landau variety (resp., 
equation) for short. The equation (2a) is 
conventional; (2b) represents the energy- 
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momentum conservation law at the vertex V; 
(2d) corresponds to the mass-shell constraint 
(if «,=20) on the internal line L,. Since (2c) 
entails ¥),¢(C)a,k, =0 for a closed circuit 
(=loop) C of G for some set of values ¢,(C) = 
+1 or 0 with ¢,(C) being 0 if L, does not be- 
long to C, the equation (2c) is usually called 
the closed-circuit condition. By attaching w, to 
the vertex V, and associating «,k, to the inter- 
nal line L,, we get a diagram representing a 
multiple collision of classical point particles, 
since the relations (2a)—(2f) are just the class- 
ical conditions for such a collision (S. Coleman 
and R. E. Norton, Nuovo Cimento, 38 (1965)). 
The fact that the Landau-Nakanishi equations 
admit such an interpretation is neither acci- 
dental nor superficial in view of the tmacro- 
scopic causality of the S-matrix. Note also that 
there is another interpretation of the Landau- 
Nakanishi equations, which emphasizes their 
resemblance to tKirchhoff’s law (Nakanishi, 
Prog. Theor. Phys., 23 (1960)). Such a resem- 
blance can be used to study the structure of 
Feynman integrals from the viewpoint of 
tgraph theory. See [7] and the references cited 
there for this topic. 

An important observation by Pham and 
Sato (1973), which opened a way to the micro- 
local analysis of Feynman integrals and the 
S-matrix, is the following: If we consider the 
Landau-Nakanishi equations to define a sub- 
variety of S*M, the fspherical cotangent bun- 
dle of M, by eliminating only w, k, and a, 
then the resulting variety describes the tsin- 
gularity spectrum of f¢(p). More precisely, 
S.S.f¢(p) is confined to the set (p; ft u), 
where (p; u) satisfies the Landau-Nakanishi 
equations. The rigorous proof of this state- 
ment was given by Sato et al. in [6]. The 
subset of S*M or /-1S*M thus obtained 
is denoted by ¥ *(G) and is also called a 
positive- Landau-Nakanishi variety. It is 
noteworthy that the microlocalization of the 
classical result of Landau, Nakanishi, and 
Bjorken had essentially been achieved in a less 
sophisticated manner by D. Iagolnitzer and H. 
P. Stapp (Comm. Math. Phys., 14 (1969)) in the 
framework of S-matrix theory. The variety 
defined by (2a)—(2d) and (2f) is denoted by 
L(G) or Z(G) and is a Landau-Nakanishi 
variety. In a neighborhood of p° in L*(G), 
J¢(p) is the boundary value of a holomorphic 
function f,,(p) whose domain of definition is 
determined by u-vectors (— 274 Microlocal 
Analysis E). Furthermore, in simple cases one 
can verify that f¢(p) can be analytically con- 
tinued to define a holomorphic function on 
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the universal covering space U — L(G)© of U — 
L(G) for a complex neighborhood U of p°, 
where L(G)© denotes a complexification of 
L(G). Hence we can discuss the difference of 
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fo(yp) and f(D). where y denotes a loop en- 
circling L(G)°. Cutkosky (J. Math. Phys., 1 
(1960)) observed that it can be expressed 

(in simple cases) by the integral obtained by 
replacing 1/(k? —m? + /—10) in the right- 
hand side of (1) by —2x./—16* (k? —m?) = ger 
— 2r, /—1Y (k, o)ôl(k? —m?), where ¥(k,,o) 
denotes the tHeaviside function. A result of 
this type is called a discontinuity formula and 
is now obtained for the S-matrix itself in a 
suitably modified manner, i.e., the discontinu- 
ity formula holds beyond the framework of 
perturbation theory (— 386 S-Matrices). As is 
mentioned in 386 S-Matrices, the discontinuity 
formula is closely related to Sato’s conjecture 
on the tholonomic character of the S-matrix 
(T. Kawai and Stapp in [6]). Actually, M. 
Kashiwara and Kawai (in [6]) proved that 
fc(p) satisfies a tholonomic system of linear 
differential equations whose characteristic 
variety is confined to the extended Landau- 
Nakanishi variety L(G). They further proved 
that the system has regular singularities (— 
274 Microlocal Analysis G). Their result gives, 
on one hand, a precise version of Regge’s 
statement to the effect that f,(p) is a generaliza- 
tion of a thypergeometric function (in Battelle 
Rencontres, C. DeWitt and J. A. Wheeler (eds.), 
Benjamin, 1968), and, on the other hand, a 
rigorous proof of the fact that fel p) is a Nils- 
son class function. This fact is closely related 
to the works of D. Fotiadi, J. Lascoux, Pham, 
and others. Kashiwara and Kawai (Comm. 
Math. Phys., 54 (1977)) also showed that the 
holonomic character of the Feynman ampli- 
tude is an important clue for understanding 
the so-called hierarchical principle, which had 
been proposed and studied in connection 

with the tMandelstam representation by the 
Cambridge group (Eden et al. [2]). Further- 
more, Kashiwara et al. (Comm. Math. Phys., 
60 (1978)) gave a useful expression of f,(p) at 
several physically important points by analyz- 
ing the microlocal structure of the holonomic 
system that f,,(p) satisfies. Thus the use of 
*(micro-) differential equations in analyzing 
Feynman amplitudes has turned out to be 
effective in understanding their singularity 
structures in a unified manner. 
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A. General Remarks 


E. Stiefel [2] introduced certain tdiffeomor- 
phism invariants of tdifferentiable manifolds 
by considering a field of a finite number of 
linearly independent vectors attached to each 
point of a manifold; and H. Whitney [3] ob- 
tained the notion of fiber bundles as a com- 
pound idea of a manifold and such a field of 
tangent vectors. S. S. Chern [4] emphasized 
the global point of view in differential geo- 
metry by recognizing the relation between the 
notion of fconnections (due to E. Cartan) and 
the theory of fiber bundles. The theory of fiber 
bundles is also applied to various fields of 
mathematics, for example, the theory of tLie 
groups, fhomogeneous spaces, tcovering 
spaces, and general vector bundles, vector 
bundles of class C’, or analytic vector bundles. 
Homological properties of fiber bundles are 
studied by means of tspectral sequences, and 
cohomology structures of several homoge- 
neous spaces and several characteristic classes 
are determined explicitly by means of fcoho- 
mology operations. Also, the group K(X), 
formed by equivalence classes of vector bun- 
dles over a finite (CW complex X, is a tgener- 
alized cohomology group, treated in tK-theory, 
in which further development is expected 
(— 237 K-Theory). 


B. Definitions 


Let E, B, F be topological spaces, p: E—>B a 
continuous mapping, and G an teffective left 
topological ttransformation group of F. If 
there exist an topen covering {U,} (x€ A) of B 
and a homeomorphism g,: U, x Fxp ‘(U,) for 
each «e A having the following three prop- 
erties, then the system (E, p, B, F, G, U,, @,) ÍS 
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called a coordinate bundle: (1) po (b, y)=b 

(be U,, ye F). (2) Define @,,,:F = p~'(b) (be U,) 
by Pa.b Y)= Palb, y); then gp.(b)=9500,,,€G 
for be U, N Ug. (3) gga: Ua N Us > G is continu- 
ous. We say that this bundle is equivalent 

to a coordinate bundle (E, p, B, F, G, Ui Py) 

if Guslb)= Qi, O Pab EG (bE U, N U;) and 

Gua? U, U, —>G is continuous. An equivalence 
class €=(E, p, B, F, G) of coordinate bundles is 
called a fiber bundle (or G-bundle), and E is 
called the total space (or bundle space), p the 
projection, B the base space, F the fiber, and G 
the bundle group (or structure group). Also, U, 
of a coordinate bundle (E, p, B, F, G, U,, @,) 
belonging to the class & is called the coordinate 
neighborhood, o, the coordinate function, and 
Jsa the coordinate transformation (or transition 
function). 

Let €=(E, p, B, F, G) and &' =(E’, p’, B', F,G) 
be two fiber bundles with the same fiber and 
group. A continuous mapping ¥: EE’ is 
called a bundle mapping (bundle map) from £ to 
¢' if the following two conditions are satisfied: 
(1) There is a continuous mapping y :B>B' 
with p'o P= Wop. (2) ,,(b)= 91.0 Poo, ,€ 
G (bE UN Y (Va), b= hb), and Ypa: ULN 
Y~ (V;)>G is continuous, where {U,, pa} and 
{Vi Ọ„} are pairs of coordinate neighbor- 
hoods and functions of & and č’, respectively. 
Moreover, if y is a homeomorphism, then ¥ is 
also a homeomorphism and Y! is a bundle 
mapping. 

Let €=(E, p, B, F,G) and &’ =(E", p', B, F,G) 
be two fiber bundles with the same base space, 
fiber, and group. If there is a bundle map- 
ping ‘¥:E-+F’ such that w: B>B as described 
before is the identity mapping, then we say 
that ¢ is equivalent (or isomorphic) to č and 
write €¢=¢’. Take the same coordinate neigh- 
borhoods {U,}, and let g,, and gg, be the co- 
ordinate transformations of č and é’, respec- 
tively. Then =€’ if and only if there are 
continuous mappings 1,:U,—>G with g;,(b) = 
1p D)Gpa(b)A,(b)* (be U, N Up). 

For a system {g,,} of coordinate transforma- 
tions of a fiber bundle, we have g,.(b) g,,{b) = 
Jya(b) (bE U, Us N U,). Conversely, given an 
open covering {U,} and a system of gga: Us N Ug 
G satisfying, this condition, there is a unique 
G-bundle (E, p, B, F, G) with {gpa} as a system 
of coordinate transformations. Actually, E is 
the tidentification space of E={(b, y, «)| be U,} 
cB x F x A obtained by identifying two points 
(b, y, a), (b’, y’, p) with b=b’, y’=g,,(b)- y, and p 
is defined by p{(b, y, «)} =b, where the index 
set A= {a} is considered a discrete space. 


C. Principal Fiber Bundles 


A fiber bundle n =(P, q, B, G, G) is called a 
principal fiber bundle (or simply principal 
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bundle) if G operates on G by left translations. 
This ts also defined by the following con- 
ditions: G is a right ttopological transfor- 
mation group of P, and there exist an open 
covering {U} of B and homeomorphisms ø: 
U x Gq (U) with qe(b,g)=b, (b, g): g' = 
(b, gg) (be U;g,g' €G). A bundle mapping 
Y:P—>P between two principal bundles y = 
(P,q, B, G) and g =(P’, q’, B’, G) is also defined 
as a continuous mapping ¥ with ‘P(x: g)= 
Y(x)-g. 


D. Associated Fiber Bundles 


Let n =(P, q, B, G) be a principal bundle, and 
let F be a topological space having G as an 
effective left topological transformation group. 
Then G is a right topological transformation 
group of the product space P x F by (x, y): g= 
(x-g,g ‘+ y)(xeP, ye F,geG). Consider the 
‘orbit space P x, F =(P x F)/G, and define the 
continuous mapping p:P xq FB by p{(x, y)} 
= q(x). Then n xg F =(P x, F, p, B, F,G) is 

a fiber bundle, called the associated fiber 
bundle of y with fiber F. On the other hand, n 
is called an associated principal bundle of ¢ = 
(E, p, B, F, G) if €=n x, F. A principal bundle 

n having the same coordinate transformations 
as € is an associated principal bundle of €, and 
two fiber bundles are equivalent if and only if 
their associated principal bundles are equiva- 
lent. Therefore, given a fiber bundle č, there 
exists a principal bundle y such that €=7 x, F. 


E. Examples of Fiber Bundles 


(1) Product bundle. (B x F,p,, B, F, G), where 
pı» the projection of the product space, is 
called a product bundle if there is just one 
coordinate neighborhood B and the coordi- 
nate function is the identity mapping of B x F. 
A bundle that is equivalent to a product bun- 
dle is called a trivial bundle. 

(2) A tcovering (F, p, Y) is a fiber bundle 
whose fiber is the discrete space p~! (yo) 

(yo€ Y), and the structure group is a factor 
group of the tfundamental group z,(Y, yo). In 
particular, a tregular covering is a principal 
bundle. . 

(3) Hopf bundle. Let A be the real number 
field R, the complex number field C, or the 
quaternion field H, 2=dimg A, and A"*! the 
(n+ 1)-dimensional linear space over A. Iden- 
tify two points (Zo, ...,Z,), (Zo, --- , Z4) of the 
subspace A”*? — {0} (0 is the origin) if there is a 
ZEA such that z;=z;z (i=0,...,n). Then we 
obtain the identification space P"(A), called the 
n-dimensional projective space over A. Let S7 
(=S**D"1 the (A(n+ 1)— 1)-sphere) be the 
unit sphere in A"*?. Then SQ is the topological 
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transformation group of Sk by the product of 
A, and the orbit space S% /SÌ is P"(A). Fur- 
thermore, (S%, q, P”(A), SÌ) (q is the projection) 
is a principal bundle called the Hopf bundle (or 
Hopf fibering). These comments are valid also 
for n= œ. When n=1, P'(A) is homeomorphic 
to S, and the Hopf bundle is (S?*7', q, S4, S*~') 
(A=1,2,4). A Hopf bundle is defined similarly 
for A4=8 using the tCayley algebra, and the 
projection q:S?*~! +S? (A=2, 4, 8) is the Hopf 
mapping (Hopf map). 

(4) Let G be a topological group, H its 
closed subgroup, and r:G—G/H, r(g)=gH the 
natural projection. If there exist a neighbor- 
hood U of r(H)eG/H and a continuous map- 
ping f: U >G such that ro f is the identity 
mapping, then we say that H has a local cross 
section f in G, and (G/K, p, G/H, H/K, H/Ko) is 
a fiber bundle for any closed subgroup K of H 
(where p is the natural projection gK >gH and 
Ka» is the largest norma! subgroup of H con- 
tained in K). The associated principal bundle 
of the latter fiber bundle is (G/Ko, p, G/H, 
H/K,). Any closed subgroup H of a *Lie 
group G has a local cross section in G; hence 
the above bundles can be obtained. 


F. Vector Bundles 


A system €=(E, p, B) of topological spaces E, B 
and a continuous mapping p: EB is called an 
n-dimensional real vector bundle if the follow- 
ing two conditions are satisfied: (1) p~! (b) is a 
real vector space for each be B. (2) There exist 
an open covering {U,} (xe A) of B and a co- 
ordinate function g,:U, x F % P~'(U,) for each 
xe A, where F =R"; furthermore, the g, ,: 

R" =p /(b) are isomorphisms of vector spaces. 
In this case, gp,(b) = Pg. O Pap: R" = R" ( be 
U,N Up) is an element of the general linear 
group GL(n, R). Hence a vector bundle ts a 
fiber bundle with fiber R” and group GL(n, R), 
and the converse is also true. A 1-dimensional 
vector bundle is called a line bundle. A vector 
bundle é’ =(E’, p’, B) is called a subbundle of a 
vector bundle ¢=(E, p, B) if E’ c E, p E’=p’, 
and p’'(b) is a vector subspace of p~'(b) for 
each be B. 

Let €, and č, be two vector bundles of 
dimension n; and n, with the same base space 
B. Let E be the union of the direct sum p,'(b) 
+ p3! (b) for be B, and define p: E>B by 
p(p;1(b)+ pz '(b))=b. Take the same coordi- 
nate neighborhoods U, for č, and €,, and 
define p,: U, x R"*”"2—>p~'(U,) by 9,(b, y) = 
(olr +92,)(y) (yER":*"2 =R" +R"), where 
the gi: U, x R" ~p; '(U,) are the coordi- 
nate functions of €,. Then E is topologized by 
taking the family {~,(O)} (O is open in U, x 
R"™*"2) as the topen base, and we obtain 
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an (n, +n,)-dimensional real vector bundle 

(E, p, B), denoted by é, ® č, and called the 
Whitney sum of č, and €,. Similarly, we can 
define the tensor product č; © €,, the p-fold 
exterior power ^? č (or bundle € of p-vectors), 
and the bundle of homomorphisms 
Hom(é,,€,), of dimension n,n, (5), and 

nn, respectively, using the tensor product 
R Q R” = R™”2, the p-fold texterior power 
APR"= Ro! (the space (RP of p-vectors 

in R"), and the space of homomorphisms 
Hom(R", R”2)= R”":”2, (For the last one, we use 
Hom((9},) ', 92.) instead of Hom(!.,, 92,).) 
Hom(é, e')=é* is called the dual (vector) 
bundle of é, where e! is the trivial line bundle. 
If we use coordinate transformations, ®, ®, 
AP, and č* are obtained by the direct sum, 
*Kronecker product, matrix of *p-minors, and 
transpose of matrices, respectively. If č, is 

a subbundle of €,, the quotient bundle ¢, /¢, 

of dimension n, —n, is defined by using the 
quotient vector space R": /R"™2 = R" "2, and 

č @(é,/é,) is equivalent to ¢,. These oper- 
ations preserve the equivalence relation of 
bundles. Also, ® and ® are commutative up 
to equivalence and satisfy the associative and 
distributive laws. For each € having a finite- 
dimensional fCW complex as base space, there 
is a č such that € @ ¢' is trivial. 

Using the complex number field C or the 
quaternion field H instead of the real number 
field R, we can define similarly the complex 
vector bundle or the quaternion vector bundle 
and the operations ®, Q, etc. For a complex 
vector bundle €, the complex conjugate bun- 
dle is defined by the complex conjugate of 
matrices. 

(5) Tangent bundles, tensor bundles. Let M 
be an n-dimensional tdifferentiable manifold of 
class C”. Consider the ttangent vector space 
T,(M) at pe M, set T(M)=|)pem T,(M), and 
define x: T(M)>M by n(7,(M))=p. Fora 
‘coordinate neighborhood U, of p with local 
coordinate system (x,,...,X,), each point of 
x ‘(U,) is represented by L7., f,6/6x;, and 
x *(U,) has a coordinate system (x4, ..-,Xy; 
fis- fa). Hence T(M) is a C’”'-manifold, and 
I(M)=(T(M), n, M, R", GL(n, R)) ts an n- 
dimensional real vector bundle. T(M) is called 
the tangent (vector) bundle, its dual bundle 
T*(M) the cotangent (vector) bundle, and the 
tensor product T(M)@ ...@ T*(M)@...a 
tensor bundle of M. The line bundle A" T*(M) 
is called the canonical bundle of M. 

For a tcomplex manifold M, T(M) is a 
complex manifold and T(M) is a complex 
vector bundle. Therefore these bundles are 
defined as complex bundles. 

(6) Tangent r-frame bundle. In the preceding 
example, the space of all ftangent r-frames of 
M is a bundle space with base space M and 
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group GL(n,R). It is called the tangent r-frame 
bundle (or the bundle of tangent r-frames) of 
M. 


G. The Classification Problem 


For a fiber bundle €=(E, p, B, F, G) and a 
continuous mapping W :B'— B, consider the 
subspace E’ = {(x,b')e E x B’| p(x) =(b')} of 
E x B' and the projections p': E’> B’ and Y: 
E'>E. Then w*é=(E’, p’, B', F,G) is a fiber 
bundle, and ¥ is a bundle mapping from w* é 
to č; w*é is called the induced bundle of č by 
y. Let {U,} and {gga} be the systems of coordi- 
nate neighborhoods and transformations of é. 
Then {y*(U,)} and {g,,0W} are correspond- 
ing systems of w* č. If ¥:E’>E is a bundle 
mapping from č’ to č having w: B’> B as the 
mapping of base spaces, then &’ =” €. Also, 
we have p*E,=*E, if č =<); (yoy) ¿= 
w’*(W? €). If € is a principal bundle, then y” ë 
is also principal, and y*(y xg F)=(W*n) x¢F. 
For a tparacompact space B’, p*# f= WH č if 
Yı, Y2: B'—B are thomotopic. 

For a topological group G, a principal 
bundle é(n, G)=(E(n, G), p, B(n, G), G) is called 
an n-universal bundle if E(n, G) is tn-connected 
(n< œ); its base space B(n, G) is called an n- 
classifying space of G. In particular, č(%0, G) 
=¢€,=(E,, P, Bg, G) is called simply a universal 
bundle and B; a classifying space of G. Then 
we have the classification theorem: Let B be a 
CW complex with dim B <n; then the set of 
equivalence classes of principal G-bundles with 
base space B is in one-to-one correspondence 
with the thomotopy set 2(B; B(n, G)) of con- 
tinuous mappings of B into B(n, G). Such a 
correspondence is given by associating with 
the induced bundle w* E(n, G) a continuous 
mapping yw: B— B(n, G), called the character- 
istic mapping or classifying mapping (charac- 
teristic map or classifying map) of y* €(n, G). 
Furthermore, if G is an effective left topo- 
logical transformation group of F, the set of 
equivalence classes of G-bundles with base 
space B and fiber F is in one-to-one corre- 
spondence with 2(B; B(n, G)). The correspon- 
dence is given by associating y*(é(n, G) xF) 
to y. 


H. Construction of Universal] Bundles 


For an arbitrary topological group G, J. W. 
Milnor [6] constructed a universal bundle (Eg, 
p, Bg, G) in the following manner. The join 
Eg=Go...oGo... of countably infinite 

copies of G is defined as follows: A point e of 
Eg is the symbol t; 9, ®... @ tingin <i, < 
in<...<i,,,m=1,2,3,...), where bises bjn are 


> im 


real numbers satisfying t, > 0, t; +. +t, = L, 


1471 
Fiber Bundles 


and g; is an element of the i, th copy of G, 
k=1,...,m. Here we can omit t; g; if t; =0. 
Regard E, as a topological space with a weak 
topology such that the coordinate functions 
emt, eg; are continuous. Define the right 
action of G on Eg by (t; gi, ®© -Ð tinin) 9 = 
ti (g 9) -O tingin 9). Let Bg and p: Eg> 
Bg be the identification space of Eç by this 
action of G and the identification mapping. 
Then (Eg, p, Bg, G) is a universal bundle for G, 
and B, is called the classifying space for G. Eg 
and B, are sometimes written as EG and BG. 
In particular, a classifying space B, is a count- 
able CW complex for any countable CW 
group G (i.e., a topological group that is a 
countable CW complex such that the mapping 
gg ' of G into G and the product mapping 
G x G>G are both cellular). The following 
examples of classifying spaces for Lie groups 
are also useful. Note that every CW complex 
B, of a given G has the same thomotopy type. 


I. Examples of Universal Bundles 


(1) G is either O(n), U(n), or Sp(n): Let A and å 
be as in (3) of Section E. According as A is R, 
C, or H, we let U(n, A) be the torthogonal 
group O(n), the tunitary group U(n), or the 
‘symplectic group Sp(n). Then the *Stiefel 
manifold Vain, m(A)= U(m+n, A)/T,, x U(n, A) 
(Inis the unit element of U(m, A)) is A(n+ 1) 
—2)-connected. Hence the principal bun- 
dle č({å(n + 1)—2, U(m +n, A))=(Vpin. mA), 
M m+n, m^), U(m, A)) from (4) of Section E is a 
(A(n+ 1)—2)-universal bundle of U (m, A), 
where the base space Mmin m(A)= U(m--n, A)/ 
U(m, A) x U(n, A) is the tGrassmann manifold. 
(2) G is either O(c), U(o0), or Sp(oo). The 
examples in (1) are valid for m, n= œ. Con- 
sider the tinductive lim#t group U(co, A)= 
(J, U(n, A) under the natural inclusion 
U(n, A)c U(n +1, A), and supply the infinite 
classical group U (20, A) with the weak topol- 
ogy (this means that a set O of U (œ, A) is 
open if and only if each ON U (n, A) is open in 
U(n, A)). Then the infinite Stiefel manifold 
Vn+n,m(A) and the infinite Grassmann manifold 
Mrin.m(A) (m= œ or n= œ) are defined as 
before, and we have 


Mo m(A) == U M min ml), 


and so on. Furthermore, these manifolds are 
CW complexes, and V,, m(A) (m< œ)is œ- 
connected. Although U (co, A) is not a Lie 
group, U (m, A) x U (n, A) has a local cross 
section in U(m +n, A) for m, n< œ [7]. There- 
fore, setting n= æ in (1), (co, U(m, A)) isa 
universal bundle of U(m, A), and the infinite 
Grassmann manifold M,,, ,,(A) is a classifying 
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space Bum, a) Also, ¢(A(n + 1)—2, U (œ, A)) 
in (1) is a (A(n + 1)—2)-universal bundle of 
U(w, A) (n< œ). 

(3) G is either SO(m) or a general! Lie group. 
For the frotation group SO(m), we have €(n— 
1, SO(n) = (Van, m(R), Ps My n,ms SO(n) and 
Bso = Mo, m, Where Min +n.m=SO(m+n)/ 
SO(m) x SO(n) is the oriented Grassmann 
manifold. For any compact Lie group G, we 
have €(n— 1, G) =(Vn+n,m(R), p, O(m + n)/G x 
O(n), G), where G < O(m). For any connected 
Lie group G, we have (n, G)=€(n, G,) x@,G, 
where G, is the maximum compact subgroup 
of G (since G/G, is homeomorphic to a Eu- 
clidean space, €(n, G) reduces to &(n, G,)). 


J. Reduction of Fiber Bundles 


Let G be a topological group and H its closed 
subgroup. We say that the structure group of a 
G-bundle é is reduced to H if € is equivalent to 
a G-bundle whose coordinate transformations 
take values in H. For a principal H-bundle 

Yo =(P, q, B, H), the associated H-bundle no xy 
G=(P x 4G, p, B, G) with fiber G is defined, 
where H operates on G by the product of G; 
it is also a principal G-bundle if we define 

an operation of G on P xG by {(x,g9)} g = 
{(x,gg')}. For a principal G-bundle y, we say 
that 7 is reducible to an H-bundle if there is a 
principal H-bundle no with y=, X pG, and 
we call no a reduced bundle of y. It is easy to see 
that the group of a G-bundle č is reducible to 
H if and only if the associated principal G- 
bundle of & is reducible to H. Also, if yo is a 
reduced bundle of y, then w*n, is a reduced 
bundle of y*n. 

Now, assume that H has a local cross sec- 
tion in G and G/H is co-connected. Then 
for an n-universal bundle č(n, H) of H, 

é(n, H) x G is an n-universal bundle of G (n- 
connectedness of E(n, H) xy G is shown by the 
thomotopy exact sequence of tfiber spaces). 
Therefore, by the classification theorem, the 
group of any G-bundle is reducible to H, and 
the equivalence classes of G-bundles are in 
one-to-one correspondence with those of H- 
bundles. 

(1) A G-bundle is trivial if and only if its 
group is reducible to e (identity element). A 2n- 
dimensional differentiable manifold M of class 
C* has an talmost complex structure if and 
only if the group of the tangent bundle T(M) is 
reducible to GL(n, C), i.e., T(M) is considered 
as an n-dimensional complex vector bundle. 

(2) Since GL(n, R) O(n) x R"" +2 and 
GL(n,C)x U (n) x R”, n-dimensional real (com- 
plex) vector bundles can be considered as O(n) 
(U(n))-bundles with fiber R”(C”). 
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K. Homotopy and Homology Theory of 
Bundles 


Since a fiber bundle is a tlocally trivial fiber 
space, the exact sequence and the spectral 
sequence of fiber spaces (— 148 Fiber Spaces) 
are applicable to fiber bundles. For example, 
the cohomology structures of homogeneous 
spaces of classical groups have been deter- 
mined by A. Borel, J.-P. Serre, and others. 

(1) Characteristic class. For a classifying 
space B, of a topological group G, we have an 
isomorphism z,(B,)=7,-,(G) of homotopy 
groups and the following classification theo- 
rem of fiber bundles over the n-sphere S”. The 
set of the equivalence classes of principal G- 
bundles or G-bundles with fiber F over the 
base space S” is in one-to-one correspondence 
with the set z,,_,(G)/29(G) of equivalence 
classes under the operation of G on z,,_,(G) 
given by the inner automorphisms of G; such a 
correspondence is given by associating with 
each principal G-bundle y =(P, q, S", G) the 
class (called the characteristic class of n) con- 
taining the image A(z,) of a generator 1,€7,,(S") 
by the homomorphism A:7z,(S")=7,(P, G) 
—>T,-1(G). Take U, and U, (the open sets of 
S” such that the last coordinates t,,,; are > 
—1/2 and < 1/2, respectively) as coordinate 
neighborhoods of y. Then the restriction T = 
gı2| S"! represents the characteristic class 
of n, where g,,:U,U,-—-G is the coordinate 
transformation and S”! is the equator of S”. 

(2) For the principal bundle n =(SO(n+ 
1), q, S", SO(n)), the mapping T:S""'—+SO(n) 
is given by 


lp- 0 
Pitt) == 2040p{ 0 4) 


(/, is the unit matrix of degree n). Hence the 
tmapping degree of the composite q'o T:S” ' 
>S"! (of T and the natural projection q’: 
SO(n)—S"~') is equal to 0 if n is odd and 2 if 
nis even. From this fact and the homotopy 
exact sequence, we have 


Ten ( Vin+n.m(R)) 


ifm=1 or nis even 


Z 
Tai if m>1 and nis odd 


for the real Stiefel manifold V,,,,,,,(R), which is 
(n—1)-connected. 

(3) Sphere bundles. An O(n + 1)-bundle with 
fiber S” is called an n-sphere bundle. The set of 
equivalence classes of n-sphere bundles with 
base space S” is in one-to-one correspondence 
with z,,-;(O(n+ 1))/z9(O(n+ 1)). For example, 
any l-sphere bundle over S” (m> 3) and any 
n-sphere bundle over $° is trivial. Every 3- 
sphere bundle over S* is equivalent to one of 
fém nlm an integer, n a positive integer}, where 
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č m.n is defined as follows: Let p, a: S° —0O(4) be 
defined by p(q)q' =4q'q"', o(q)q' =qq' (4,q' are 
tquaternions of norm 1). Then these mappings 
represent generators of 23(O(4))=73(S° x 
S?)= Z +Z, and €,, ,, is the 3-sphere bundle 
over S* corresponding to the element m{p} + 
n{o}en;(O(4)) (i.e., to the mapping f,,,,:S°> 
O(A) defined by f,,, .(9)(q)=4"""a'q_"). (Here 
we use the fact that the operation of the ele- 
ment re O(4) (r(q)=q7') is given by rpr `! =p, 
ror '=poa }.) 


L. Cross Sections 


For a fiber bundle €=(EF, p, B, F, G}, a cross 
section f: By E over a subspace B, (< B)is a 
continuous mapping such that po f is the 
identity mapping of By; a cross section over B 
is called a cross section of €. A bundle č ts 
trivial if and only if the associated principal 
bundle of € has a cross section. More gener- 
ally, given a principal bundle y =(P, q, B, G) 
and a closed subgroup H having a local cross 
section in G, y ts reducible to H if and only if 
the associated bundle ( x ¢(G/H) = (P/H, q’, B), 
G/H) with fiber G/H has a cross section. 

Suppose that the base space B of the fiber 
bundle €=(E, p, B, F, G) is a tpolyhedron. We 
denote the tr-skeleton of B by B” and consider 
the problem of extending cross sections f,: B" 
— E successively for r=0, 1, .... Clearly, there 
is a cross section fo. For each r-simplex o of B, 
we have (p~! (0), p,o, F)=(o x F, p,,o, F) since 
c is ‘contractible. Hence there is a bundle 
mapping g,:0x Fxp ‘(c) with pog,=P,. 
Assume the existence of a cross section f,_: 
B"? >E, and consider the mapping h; = p, 0 
0, o(f,_-,|6):¢>F (p,:0 x F—>F is the pro- 
jection and a is the boundary of o). Then if 
h, 1s extensible to h,:o-—F, an extended cross 
section f,:0-—E of f,_,|¢ 1s defined by f,(b)= 
,(b, h,(b)) (beo), and the extension f,: B’> 
E of f,_, is defined by f.jo=f,, f,| B = 
f.. If 2,_,(F)=9, for example, there is an 
extension h, of h; since (a, 6) ~(V", S’~'), and 
f,-1 is extensible to a cross section f,. 

Now assume that the base space B of a G- 
bundle č =(E, p, B, F, G) is an tarcwise con- 
nected polyhedron and F is t(n — 1)}-connected. 
Then there is a cross section f: B"> E con- 
structed by the stepwise method of the previ- 
ous paragraph. But if n, (F)#0, we have an 
obstruction to extending f over B"*!. Now we 
explain how to measure this obstruction. Sup- 
pose that F is tn-simple. Then for each (n+ 1)- 
simplex o of B, the mapping h,:¢- F, defined 
by f as in the previous paragraph, determines 
a unique element c(/)(o) of the homotopy 
group 7,,(F). Hence we have a tcochain c(f)e 
C"*'(B;12,(F)), and f is extensible to a cross 
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section over B"*! if and only if c({)=0. Thus 
there is a cross section over B”** if and only 

if the set of c( f) for every cross section f over 
B” contains the cochain 0; {c(f)} is considered 
as a measure of the obstruction. 

Let w:I-B be a tpath. We consider the 
space I x F and the canonical projection p, : 
Ix FI. Then there is a bundle mapping Q: 
Ix F>E with poQ=wopy,, since w* f= 
(I x F,p,,1,F); and a homeomorphism wy: F œ% 
F is defined by wy = Go 4,02)0Q;'09,,,,, 
where b, = w(e) (c=0, 1), p.: U, x Fp *(U,) is 
a coordinate function of a coordinate neigh- 
borhood U,3b,, and Q,(=Q|ex F):F xp +(b,). 
The homeomorphism w, induces an isomor- 
phism w,:7,(F)=n,(F), and 72,(F) forms a 
tlocal coefficient on B. Then the cochain c{ f) 
is a tcocycle with the local coefficient 2,(F), 
called the obstruction cocycle of f. Further- 
more, the set {c( f)} for every cross section 
f:B">E is a cohomology class c”**(é)e 
H”"*! (B; 2,(F)) (local coefficient), and c”*'(é) 
is called the primary obstruction to the con- 
struction of a cross section. There is a cross 
section over the (n+ 1)-skeleton B"*' if and 
only if c"*!(é)=0. The local coefficient z,(F) 
is trivial if B is fsimply connected or, for 
example, if the structure group G of ¢ is con- 
nected (in which case č is called an orientable 
fiber bundle); when this is true, c”*!(¢) is an 
element of H"*!(B;7,,(F)), where z,(F) is not a 
local coefficient. Furthermore, if c"*!(¢)=0 
and z,(F)=0 (n<i<m), then the secondary 
obstruction O"'!(é)< H™*!(B; 7,,(F)) is de- 
fined similarly (— 305 Obstructions). 


M. Stiefel-Whitney Classes 


Let €=(E, p, B, F, O(n)) be an O(n)-bundle over 
an arcwise connected polyhedron B and let 

€° be its associated principal bundle. Con- 
sider the Stiefel manifold V, ,,=Vp,,-,(R) = 
O(n)/I,-, X O(k), which is (k — 1)-connected, 
and the associated bundle & = €° x guyVp nx 
with fiber V,,,,-,. The primary obstruction 
Welch (Ee HB: (Vin) (k= 
0,1,...,n—1) is called the Stiefel-Whitney 
class of €. We have 2W,..,(€)=0 unless k=n— 
1 and k is odd. Hence we usually consider 
W., (č)e H**' (B; Z). č is orientable, i.e., the 
group of € is reducible to SO(n), if and only if 
W,(¢)=0. The Stiefel-Whitney classes of an 
n-dimensional tdifferentiable manifold M 

are defined to be those of the tangent bundle 
T(M). Since the orientability of M coincides 
with that of T(M), M is orientable if and only 
if W,(M)=0. The condition W,,,(M)=0 is 
necessary for the existence of a continuous 
field of orthonormal tangent (n—k)-frames 
over M (if k=n—1 this condition is also suffi- 
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cient). Also, when M is closed, W,(M) is equal 
to 7(M) yu, where n is the fundamental coho- 
mology class of M and y(M) is the tEuler 
characteristic of M (— 56 Characteristic 
Classes B). 


N. Chern Classes 


For a U(n)—bundle €=(E, p, B, F, U(n)), the 
primary obstruction C,,,(€)=c7**7(E%e 
H?**?(B; Z) (k=0, 1,...,n—1) of the associ- 
ated bundle €* = €° x uin Vn. n-x(C) is called the 
Chern class of č. If we consider & as an O(2n)- 
bundle by U(n) < O(2n), then W,,,,(€)=0 and 
W,,(¢) = C,(€) (mod 2). The Chern classes of a 
real 2n-dimensional almost complex manifold 
are defined to be those of the tangent bundle 
T(M)(— 56 Characteristic Classes C). 


O. Bundles of Class C’, Analytic Bundles 


A fiber bundle € =(E, p, B, F, G) is called a fiber 
bundle of class C (r=0, 1, ... 00, œ) if E, B, F 
are differentiable manifolds of class C", G is a 
Lie group and a transformation group of F of 
class C’, and p and the coordinate functions 
are differentiable mappings of class C”. Bun- 
dles of class C? are usual G-bundles, and those 
of class C® are real analytic fiber bundles. 
Similarly, complex analytic fiber bundles are 
defined by the notions of complex manifolds, 
*complex Lie groups, and tholomorphic map- 
pings. For example, the universal bundles 
€(n— 1, O(m)) and n(2n, U(m)) are real and com- 
plex analytic principal bundles, respectively, 
and the tangent bundle T(M) of a C’*! (or 
complex) manifold is a C” (or complex ana- 
lytic) vector bundle. The operations of the 
Whitney sum, etc., are defined analogously for 
these vector bundles. 

The equivalence of C” (complex analytic) 
bundles is defined by means of bundle map- 
pings that are C’-differentiable (holomorphic). 
Bundles of class C” (r < œ) are classified by C” 
mappings into a classifying space in the same 
manner as for bundles of class C°. Also, the 
connection of class C” (— 80 Connections) in 
C” bundles is an important notion. 

For complex analytic bundles, a similar 
classification has been obtained for restricted 
spaces by K. Kodaira, Serre, S. Nakano [8], 
and others. The classification of complex ana- 
lytic bundles over a tStein manifold is reduced 
to that of bundles of class C? (Oka’s principle 
[9]), and similar results are valid for C®- 
manifolds. The complex analytic (or holo- 
morphic) connection does not necessarily exist, 
and M. F. Atiyah [10] found the condition 
for its existence and its relation with Chern 
classes. 
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P. Microbundles 


A system x: B-> EB of topological spaces E, 
B and continuous mappings i, j is called an n- 
dimensional microbundle over B if for each 

be B, there exist a neighborhood U of b, a 
neighborhood V of i(U), and a homeomor- 
phism kh: V% U x R” with hoi|U=i,, j|V=p,o 
h(i;:UxUx0cU xR’, and p,:U x R">U is 
the projection). Let Ho(n) be the topological 
group of all homeomorphisms of R” onto itself 
fixing the origin with compact-open topology. 
Then the equivalence classes of n-dimensional 
microbundles over B are naturally in one-to- 
one correspondence with the equivalence 
classes of H)(n)-bundes with base space B and 
fiber R” [13]. 

In the category of polyhedra and PL (tpiece- 
wise linear) mappings, the notion of a PL 
microbundle can be defined in the same man- 
ner. The structural group of n-dimensional PL 
microbundles is defined, in a generalized sense, 
as a complete ‘semi-simplicial complex [11]. 
The tangent PL microbundle is defined for any 
ttopological (PL) manifold. J. Milnor classified 
tsmoothings of PL manifolds by means of PL 
microbundles [11] and then showed that the 
ttangent vector bundles and its tPontryagin 
classes of smooth manifolds are not topologi- 
cally invariant [12]. 

For a tPL embedding f: MN between PL 
manifolds, if there is a neighborhood E of 
{(M) in N and a PL mapping p: E-M so that 
a diagram viMSE4SM isa PL microbundle, 
then v is called a normal PL microbundle of f. 
In this case, f is tlocally flat. 

There is a locally flat PL embedding be- 
tween PL manifolds which admits no normal 
PL microbundle [14]. 


Q. Block Bundles 


As the normal bundle theory for locally flat 
PL embeddings of PL manifolds, the concept 
of block bundle was introduced independently 
by C. P. Rourke and B. J. Sanderson [15], M. 
Kato [16], and C. Morlet [17]. Let E bea 
tpolyhedron, and let K be a cell complex. A 
set {E,|¢€K} of PL balls in E indexed on K is 
called a q-block structure of E if the following 
three conditions are satisfied: (1) | Jsex Eo = E; 
(2) for each ce K, there ts a tPL homeomor- 
phism h,:o x I4E, such that h,(t x I4)=E, 
for each face t of o, where J =[—1, 1]; and (3) if 
E,N E, # Ø, then EN E, =E,, where t=0 Np. 
For o€ K, E, is called the block over o, and 
h, in (2) is called a trivialization of E,. Then a 
triple (E, K, {E,|oe K} is referred to as a q- 
block bundle over K and is denoted by ¢/K. 
Another block bundle č'/K =(E’, K, {E,|oe 
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K}) over the same complex K is said to be 
isomorphic with ¢/K if there is a PL homeo- 
morphism g: EE’, called an isomorphism, 
such that g(E,)= E; (oe K). A PL embedding 
i:|K|>E is a zero-section of ¢/K if for each 
o€K, there is a trivialization h,:0 x >E, 
such that h,(x, 0)=i(x) (xeo). In this case 

we say that ¢/K is a block bundle with a zero- 
section i:|K|—E. There is a unique zero- 
section of €/K up to isomorphism of €/K onto 
itself. For every flocally flat PL embedding 
between PL manifolds M and W of codimen- 
sion q and for any cell division K of M,a 
derived neighborhood N of f(M) in W admits 
a unique g-block bundle v/K =(N, K, {N,|ae 
K}) with f:M>N as a zero-section up to 
isomorphism respecting the zero-section [15]. 
The block bundle v/K is called a normal block 
bundle of f: MW. 
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A. General Remarks 


J.-P. Serre [1] generalized the concept of fiber 
bundles to that of fiber spaces by utilizing the 
covering homotopy property (— Section B). 
He applied the theory of tspectral sequences, 
due to J. Leray, to the (cubic) tsingular (co)- 
homology groups of fiber spaces. These are 
quite useful for determining (co)homology 
structures and homotopy groups of topo- 
logical spaces, and are now of fundamental 
importance in algebraic topology. 


B. Definitions 


Let p: E—>B be a continuous mapping between 
topological spaces, and let X be a topological 
space. Then we say that p has the covering 
homotopy property with respect to X if for any 
mapping f: XE and thomotopy g,: X >B 
(O0<t<1) with po f=@p, there is a homotopy 
fi XE (O<t<1) with fp=f and po f,=g,. 
We call (E, p, B) a fiber space (or fibration) if p 
has the covering homotopy property with 
respect to every cube [”={(x,,...,x,)|O<x;< 
1},n=0,1,... (then p has the covering homo- 
topy property with respect to every (CW 
complex). Then E is called the total space, 
p the projection, B the base space, and F, = 
p +(b) the fiber over be B. 

Let E, B, F be topological spaces and p: E> 
B a continuous mapping. We call (E, p, B, F) 
a locally trivial fiber space if for each be B, 
there exist an open neighborhood U of b and 
a homeomorphism @:U x F—p~!(U) with 
pog(b’, y)=b' (b'eU,yeF). In this case, p has 
the covering homotopy property with respect 
to each tparacompact space; hence (E, p, B) is a 
fiber space. A tfiber bundle is clearly a locally 
trivial fiber space. 
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C. Path Spaces 


Another important example of a fiber space is 
a path space. A path in a topological space 

X is a continuous mapping w:I1> X (I= 

[0, 1]). Given subsets Ao and A, of X, the 
path space Q(X; Ao, A;) is the space of all 
paths w:!—X satisfying w(e)e A, (e=0, 1) 
topologized by tcompact-open topology. De- 
fine p,:Q(X; Ao, A,)> 4, by p,(w) = w(e) (e= 

0, 1). Then (Q(X; Ap, A1), Pa, A,) is a fiber space; 
in fact, p, has the covering homotopy property 
with respect to every topological space. In 
particular, the total space of the fiber space 
(Q(X; X, *), Po, X) (+e X) is tcontractible, and 
the fiber po '(*)=Q(X; *, *)=QX over « is the 
tloop space of X with base point *. For a 
continuous mapping f: Y->X, consider the 
space E,={(y, we Y x Q(X; X, X)| f(y) =w(0)} 
and the continuous mapping p:E,—X de- 
fined by p(y, w)=w(1). Then YCE,, and Y is 
a tdeformation retract of E,; furthermore, 
(E,,p, X) is a fiber space with f=p| Y. 


D. Homotopy Groups of Fiber Spaces 


For thomotopy groups of fiber spaces, the 
Hurewicz-Steenrod isomorphism theorem 
holds: Let (E, p, B) be a fiber space and F= 

p '(«*) the fiber over the base point +e B. 

Then p,.:2,(E, F)>72,(B) is an isomorphism for 
n>2 and a byection for n=1. By this theorem, 
we have the homotopy exact sequence of a fiber 
space: 


1 Tg (B) 40, (F)“511,(E) S 0,,(B) >... 
Furthermore, the more general exact sequence 
2 1(Z; QB), n(Z; F)y-S2(Z; E) Šn(Z; Bh 


is valid for each CW complex Z, where 
m(Z; X) denotes the thomotopy set of map- 
pings from Z to X relative to the base point. 

Example (1). A cross section of a fiber space 
(E, p, B) is a continuous mapping f:B>E with 
pof=1.If(E, p, B) has a tcross section or the 
fiber F is a tretract of E, then z,(E)22,(B)+ 
7,(F). If F is contractible in E, then z,(B)= 
n,(E) + n,- (F) (n> 2). 

Example (2). (E, p, B) is called an n- 
connective fiber space if B is tarcwise connec- 
ted, E is tn-connected, and p, :7;(E)—>n;(B} is 
an isomorphism, for i>n. For each arcwise 
connected space B and integer n, there is such 
a fiber space. 

Example (3). For a CW complex X, there are 
topological spaces X, and continuous map- 
pings f,:X +X, Qn+1:Xn+1 >X, (n=0,1,...) 
with the following four properties: (1) X, (0< 
n<m) is a point if X is m-connected; (ii) fa»: 
r;(X) = r; (X,) (<n); üi) (Xp. dys Xn-1) is a fiber 
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space, and its fiber is an tEilenberg-MacLane 
space K(z,(X), n); (iv) qno fn is thomotopic to 
Ja-1- Such a system {X,,, fas qa} is called the 
Postnikov system of X and is in a sense con- 
sidered a decomposition of X into Eilenberg- 
MacLane spaces. 


E. Spectral Sequences of Fiber Spaces 


The cohomological properties of fiber spaces 
are obtained mainly from the following results 
(which are valid similarly for homology except 
for properties of products). Assume that the 
base space B of a given fiber space (E, p, B) is 
tsimply connected and the fiber F = p~!(«) is 
arcwise connected, and let R be a commuta- 
tive ring with unit. Then the spectral sequence 
(of singular cohomology) of the fiber space 

(E, p, B) (with coefficients in R) is defined to be 
a sequence {E,,d,} satisfying the following 
properties: 

(i) E=}, E1 are bigraded R-modules, and 
d= Lp,q4?"" are R-linear differentials such that 
d AEP EEPE, 

(ii) E?;4 =Kerd?4/Imd? "4*"~!, which means 
that E,,, = H(E,). 

(iii) E?-4=0 for p<0 or q<0, EP = EMA =... 
= E’:4 for r>max(p,g + 1). 

(iv) E, has a product for which E?4- EPT c 
EP*P. 444 and d (u: v)=(d,u) v +(—1} u: 

d,v (ue E!1). Furthermore, the induced prod- 
uct in H(E,) coincides with the product in 
E41 

(v) E» is the tbigraded module associated with 
some filtration of the cohomology module 
H*(E; R); that is, H"(E;R)=D°">DI" 13... 
> D"° > D"*! 1 = 0 and ERI = DP 4/pPtha}, 
Furthermore, the fcup product ~ in H*(E; R} 
satisfies D”4 ~~ DPT c DP*?'-4*4 and coincides 
with the given product in E,. 

(vi) E34 = H?(B; H4(F; R)), and the product 

in E, coincides with the cup product in 

H*(B; H*(F; R)). 

(vii) The composition of H"(B; R)= E}? > EF? 
>... > E}? = Ev = D™? H"(E; R) is equal to 
p*, and the composition of H"(E; R) = D>"—> 
BS E?” c... c ED” c ED” = H”(F; R) is 
equal to i* (i: F c E), where each —> is the 
projection onto the quotient module. In the 
sequence 


H"! (F; R) H"(E, F; R)2-H"(B; R), 


we have 0* "(Im p*)= E2""!, Coim p* = E® °, 
and d,,: E®” 1! —>E™°? is equal to the trans- 
gression t* = p* } o 0*:0* '(Im p*)+Coim 
p*. Each element of 6*~!(Im p*) is called 
transgressive. 

In the following examples, we assume that R 
is a principal ideal ring. 

Example (4). Let k be a commutative 
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field, and assume that dim, H,(B; k) and 

dim, H,(F; k) are finite. Then for the tPoincaré 
polynomial P,(t)=%,,5,t", 6, = dim, H,,(X; k), 
we have P,(t)= PO P(t) —(1 +t)o(t), where 
(t) is a polynomial with nonnegative coeffi- 
cients (Leray). In particular, for the tEuler 
characteristic y(X)= Px(—1), we have y(E) 

= 7(B)x(F). Also, if i, :H,(F; k) > H,(E; k) is 
monomorphic for each n>0, then P,(t)= 
P(t) Pe (0). 

Example (5). Isomorphism theorem: If 
H,(B; R)=0 (0<n<r) and H,(F; R)=0(0<n 
<s), then p,:H,(E, F; R)-> H,(B; R) is isomor- 
phic for 0<n<r-+s and epimorphic for n=r + 
s, and we have the following homology exact 
sequence: 


...> H,(F; R) H,(E; R)"3 H,(B; R) 
3H, (F; R)>..., n<r+s 


(similarly for the cohomology). 

Example (6). Assume that H"(F; R) = 
H"(S"; R) (S" is the r-sphere, r > 1). Let E=M, 
be the tmapping cylinder of p: E—>B and P: 
E-B be the continuous mapping defined 
by p. Then the Thom-Gysin isomorphism ĝ: 
H" 1(B; R)= H"(E, E; R) (n> 0) with g9(a)= 
p*(a)~ g(1) holds (— 114 Differential Top- 
ology G). Also, we have the Gysin exact 
sequence: 


... > H"(B; R)2> H"(E; R)> H" (B; R) 
4,H"(B:R)>..., 


where g satisfies g(a) =a ~ Q=Q~ a (Q= 
g(1)e H"*'(B; R)). Here Q is equal to the 
image of a generator of H’(F; R)=R by the 
transgression t*, and 2Q=0 if r is even. (These 
results hold also for r=0 and R=Z,.) 
Example (7). Assume that H"(B; R)= 
H"(S"; R) (r>2). Then we have H" "(F; RS 
H"(E, F; R) (n20) and the Wang exact 
sequence: 


.. > H"(E: R)Š H”(F; R) 
£, H" (F; RH UES R)>..., 


where 0 satisfies O(a — 8)=0(x)— B+ 
(—1)" Ya~ 0(p) (x, pe H”(F; R)). 

Example (8). For a field k of odd character- 
istic, if H"(E;k)=0 (n >0) and the algebra 
H*(F;k)is generated by a finite number of 
elements of odd degree, then H*(F; k) = 
Ak(Xis -<-> Xi) (the texterior algebra) and 
A*(B;k)=k[y,,..-,y], where y,=t*(x,) (A. 
Borel). 
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A. Definition 


A set K having at least two elements is called a 
field if two operations, called taddition (+) 
and fmultiplication (-), are defined in K and 
satisfy the following three axioms. 

(1) For any two elements a, b of K, the sum 
a+b is defined; the associative law (a + b)+ 
c=a+(b+c) and the commutative law a +b = 
b +a hold; and there exists for arbitrary a, b 
a unique element x such that a+ x =b; that is, 
K is an tAbelian group with respect to the ad- 
dition (the tidentity element of this group is 
denoted by 0 and is called the zero element of 
K). 

(2) For any two elements a, b of K, the prod- 
uct ab (=a; b) is defined; the associative law 
(ab)c = a(bc) and the commutative law ab = ba 
hold; and there exists for arbitrary a, b with 
a#0 a unique element x such that ax =b, that 
is, the set K* of all nonzero elements of K is 
an Abelian group with respect to multiplica- 
tion. K* is called the multiplicative group of 
K, while the identity element of K* is denoted 
by 1 and is called the unity element (unit ele- 
ment or identity element) of K. 

(3) The distributive law a(b+c)=ab+ac 
holds. In other words, a field is a tcummuta- 
tive ring whose nonzero elements form a 
group with respect to multiplication. 

A noncommutative ring whose nonzero 
elements form a group is called a noncom- 
mutative field (skew field or s-field). It should 
be noted that sometimes a field is defined as a 
ring whose nonzero elements form a group 
without assuming the commutativity of that 
group, and in this case our “field” defined 
before is called a commutative field. (The term 
“skew field” is sometimes used to mean either 
a commutative or a noncommutative field.) In 
this article we limit ourselves to commutative 
fields (for noncommutative fields — 29 As- 
sociative Algebras). 
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B. General Properties 


Since a field K is a commutative ring, we have 
properties such as a0=0a=0, (— a)b=a(—b) 
= —ab for elements a, b in K. If a tsubring k of 
K is a field, we say that k is a subfield of K or 
K is an overfield (extension field or simply 
extension) of k. If a field K has no subfield 
other than K, K is called a prime field. 

A map f ofa field K into another field K’ is 
called a (field) homomorphism if it is a ring 
homomorphism, i.e., if it satisfies f(a + b)= 
f(a)+ f(b), f(ab) = f(a) f(b). Since a field is 
tsimple as a ring, every (field) homomorphism 
is an injection unless it maps everything to 
zero. A homomorphism of K into K’ is called 
an isomorphism if it is a bijection, and K and 
K' are called isomorphic if there exists an 
isomorphism of K onto K’. An isomorphism of 
K onto itself is called an automorphism of K. 

If there is a natural number n such that the 

n 
sum ni=1+...+1 of the unity element 1 is 0, 
then the minimum of such n is a prime number 
p, called the characteristic of K. On the other 
hand, if there is no natural number n such that 
nl =0, we say that the characteristic of K is 0. 


C. Examples of Fields 


The rational number field Q consisting of all 
rational numbers, the real number field R 
consisting of all real numbers, and the complex 
number field C consisting of all complex num- 
bers, are all fields of characteristic 0. A subfield 
of the complex number field C is called a num- 
ber field. The rational number field is a prime 
field, and every prime field of characteristic 0 is 
isomorphic to the rational number field. For 
the ring Z of all rational integers, the residue 
class ring modulo a prime number p is a field 
Z/pZ={0,1,2,...,p—1 (mod p)} of character- 
istic p, called the residue class field for p. Thus 
Z/pZ is a prime field, and every prime field of 
characteristic p is isomorphic to Z/pZ. If the 
number of the elements of a field K is finite, K 
is called a finite field. Z/pZ is an example of a 
finite field. 


D. Extensions of a Field 


In order to express that K is an extension field 
of k, we often use the notation K/k. Subfields 
of K containing k are called intermediate fields 
of K/k. Consider two extensions K ,/k, and 
K,/k,, and let gp: K,—>K, be an isomorphism 
which induces an isomorphism w:k,—>k}. 
Then we call ọ an extension of Ųų. Suppose that 
k,, K, are given fields and K, contains a 
subfield k, isomorphic to k,. Then there exist 
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an extension field K, of k; and an tsomor- 
phism g@:K,—>K, which is an extension of the 
given isomorphism w:k,—>k,; construction of 
the field K, is often called the embedding of 
k, into K,. When K, and K, are extensions 
of k, an isomorphism: K; >K, is called a k- 
isomorphism if it leaves every element of k 
invariant. 

In an extension K/k, let S be a subset of K. 
The smallest intermediate field of K/k contain- 
ing S is called the field obtained by adjoining S 
to k or the field generated by S over k, denoted 
by k(S). The field k(S) consists of those ele- 
ments in K each of which is a rational ex- 
pression in a finite number of elements of S 
with coefficients in k. An extension field k(t) 
obtained by adjoining a single element t to k is 
called a simple extension of k, and in this case t 
is called a primitive element of the extension. 
The trational function field k(X) with coeffi- 
cient field k is a simple extension of k with a 
primitive element X. 

When subfields k, (A€ A) of a field K are 
given, the smallest subfield of K containing all 
these subfields exists and is called the com- 
posite field of the k,. 


E. Algebraic and Transcendental Extensions 


An element « of an extension field K of a field 
k is called an algebraic element over k if « is a 
tzero point of a nonzero polynomial, say, f(X) 
=day+a,X+...+a,X" with coefficients in k. 
If x is not algebraic over k, then « is called a 
transcendental element over k. An algebraic 
element « is always a root of an irreducible 
polynomial over k which is uniquely deter- 
mined up to a constant factor (ek*) and ts 
called the minimal polynomial of « over k. K is 
called an algebraic extension of k if all elements 
of K are algebraic over k; otherwise we call K 
a transcendental extension of k. If K, is an 
algebraic extension of K and K is an algebraic 
extension of k, then K, is also an algebraic 
extension of k. In an arbitrary extension field 
K of k, the set of all algebraic elements over 

k forms an algebraic extension field of k. A 
simple extension k(t) with a transcendental 
element t is isomorphic to the rational func- 
tion field of one variable with coefficient field 
k. If tis an algebraic element over k, k(t) is 
isomorphic to the tresidue class field of the 
polynomial ring kL X] modulo the minimal 
polynomial f(X) of t over k. 


F. Finite Extensions 


An extension field K of a field k is called a 
finite extension if K has no infinite set of ele- 
ments that are tlinearly independent over k, 
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i.e., if K is a finite-dimensional linear space 
over k. The dimension of the linear space over 
k is called the degree of K over k and is de- 
noted by (K :k) (or [K:k]). If K is a finite 
extension of k and L is a finite extension of K, 
then L is also a finite extension of k and (L: K) 
(K:k)=(L:k). Every finite extension field of k 
is an algebraic extension of k and is obtained 
by adjoining a finite number of algebraic 
elements to k. Conversely, every field obtained 
by adjoining a finite number of algebraic 
elements to k is a finite extension of k. If K = 
k(a) with an algebraic element «, then (K : k) is 
equal to the degree of the minimal polynomial 
of « over k, also called the degree of « over k. 
Every element of k(x) is expressed as a poly- 
nomial in & with coefficients in k. On the other 
hand, for any nonconstant polynomial f(X) of 
kX] there exists a simple extension k(«) such 
that « is a root of f(X). 


G. Normal Extensions 


An algebraic extension field K ofa field k is 
called a normal extension of k if every irreduc- 
ible polynomial of k[X] which has a root in 
K can always be decomposed into a product 
of linear factors in K[X]. An extension field K 
of k is called a splitting field of a (nonconstant) 
polynomial f(X)ek[X] if f(X) can be decom- 
posed as a product of linear polynomials, i.e., 
f(X)=c(X —x (X —a,)...(X —4,), cek, ae K. 
A splitting field K of f(X) (f(X)ek[X]) is 
called a minimal splitting field of f(X) if any 
proper subfield L of K (K > L2>k)is nota 
splitting field of f(X). A minimal splitting field 
of f(X) is obtained by adjoining ail the zero 
points of f(X). A finite extension field of k is a 
normal extension if and only if it is a minimal 
splitting field of a polynomial of k[ X]. For 
any given (nonconstant) polynomial f(X)e 
kX], there exists a minimal splitting field of 
f(X), and all minimal splitting fields of f(X) 
are k-isomorphic. 


H. Separable and Inseparable Extensions 


An algebraic element g over k is called a sepa- 
rable element or an inseparable element over k 
according as the minimal polynomial of x over 
k is tseparable or tinseparable. An algebraic 
extension K of k is called a separable extension 
of k if all the elements of K are separable over 
k; otherwise, K is called an inseparable exten- 
sion. An element « is separable with respect 

to k if and only if the minimal polynomial of 

æ over k has no double root in its splitting 
field. If x is inseparable, then k has nonzero 
characteristic p, and the minimal polyno- 

mial f(X) of « can be decomposed as f(X)= 
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(X —0,)?(X — 0)” .. (X — am)”, r> 1, where 
X1, A2, -> , Xp are distinct roots of f(X) in its 
splitting field; between the degree n of f(X) 
and the number m of distinct roots of f(X), 
the relation n=mp’ holds. In particular, if 
a?” ek for some r, we call « a purely inseparable 
element over k. An algebraic extension of 

k is called purely inseparable if all elements 

of the field are purely inseparable over k. In 
an algebraic extension K of k the set of all 
separable elements forms an intermediate 
field Ky of K/k. The field Kg is called the 
maximal separable extension of k in K. If K is 
inseparable over k. i.e., if K # Ko, then the 
characteristic of k is p #0, and K is purely in- 
separable over Ko. The degrees d=[K,:k] 
and p’=[K:K,] are denoted by [K:k], and 
[K :k];, respectively. A separable extension 
of a separable extension of k is also separable 
over k, and every finite separable extension 
of k is a simple extension. 

If no inseparable irreducible polynomial in 
k[X] exists, we call k a perfect field; otherwise, 
an imperfect field. Every field of characteristic 
0 is a perfect field. A field of characteristic p 
(#0) is perfect if and only if for each aek the 
polynomial X?” ~a has a root in k. Every alge- 
braic extension of a perfect field is a sepa- 
rable extension and a perfect field. Any imper- 
fect field has an inseparable, in fact purely 
inseparable, proper extension. 


I. Algebraically Closed Fields 


If every nonconstant polynomial of k[X] can 
be decomposed into a product of linear poly- 
nomials of k[X], or equivalently, if every 
irreducible polynomial of k[X] is linear, k is 
called an algebraically closed field; k is alge- 
braically closed if and only if k has no alge- 
braic extension field other than k, and hence 
every algebraically closed field is perfect. 

For any given field k there exists an algebrai- 
cally closed algebraic extension field of k 
unique up to k-isomorphisms (E. Steinitz); 
hence we call such a field the algebraic closure 
of k. To proceed further, suppose that we are 
given a field k and its extension K. If there is 
no algebraic element of K over k outside of k, 
i.e., if k is the intersection of K and the alge- 
braic closure of k, then we say that k is alge- 
braically closed in K. The complex number 
field is an algebraically closed field (C. F. 
Gauss’s fundamental theorem of algebra; 

— 10 Algebraic Equations). 


J. Conjugates 


Let k be a field and K an algebraic extension 
of k. Two elements a, f of K are called conju- 
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gate over k if they are roots of the same irre- 
ducible polynomial of kL X] (or equivalently, 
if the minimal polynomials of « and $ with 
respect to k coincide); in this case we call the 
subfields k(a), k($) conjugate fields over k. The 
conjugate fields k(a) and k() are k-isomorphic 
under an isomorphism ø such that o(a) =f. In 
particular, if K is a normal extension of k, the 
number of conjugate elements of an element « 
of K is the number of distinct roots of the 
minimal polynomial f(X) of x, which is inde- 
pendent of the choice of a normal extension K 
containing k. The element « is separable if and 
only if the number of conjugate elements in K 
is the same as the degree of f(X). On the other 
hand, k(«) is normal over k if and only if k(a) 
coincides with all its conjugate fields. 

Let « be a separable algebraic element over 
k, and let x, =a, %>,...,%, be conjugate ele- 
ments of æ over k. The product A=a,4,...4, 
and sum B=a,+a,+...+«a, are elements of k. 
Indeed, if f(X)=X"+c,X" '4+...+c, is the 
minimal polynomial of « with respect to k, we 
have A=(—1)"c,, B= —c,, and A and B are 
called the norm and the trace of «, respectively, 
denoted by A= N (a), B= Tr(a). Let K bea 
finite separable extension of degree n over k, 
and let « be an element of K. Then the degree 
m of the minimal polynomial of « is a divisor 
of n; that is, n= mr with a positive integer r. 
We define the norm and the trace of « with 
respect to K/k by Ngala) =N (ay, Trela) = 
rTr(a), respectively. Then these quantities 
satisfy Nka (ap) = Ngala) Ngah) Treal + B)= 
Treal) + Treal P) for x, Be K. (For the Galois 
theory of algebraic extensions — 172 Galois 
Theory.) 


K. Transcendental Extensions 


Let K be an extension of k and u,,...,u, be 
elements of K. An element v of K is said to 

be algebraically dependent on the elements 
U,,U>,...,uU, if v is algebraic over the field 
k(u,,uz,...,u,). A subset S of K is called 
algebraically independent over k if no ues is 
algebraically dependent on a finite number 

of elements of S different from u itself; S is 
called a transcendence basis of K over k if S is 
algebraically independent and K is algebraic 
over k(S). Furthermore, if K is separable over 
k(S), then S is called a separating transcendence 
basis of K over k. There always exists an alge- 
braically independent basis of K over k, and 
the tcardinal number of S depends only on K 
and k; this cardinal number is called the tran- 
scendence degree (or degree of transcendency) 
of K over k. (When S is an infinite set, we 
sometimes say that the transcendence degree is 
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infinite.) In particular, if K =k(S) with an 
algebraically independent S, K is called a 
purely transcendental extension of k. 

An extension K of k is called a separably 
generated extension, or simply a separable 
extension, if every finitely generated intermedi- 
ate field of K/k has a separating transcendence 
basis over k. If K itself has a separating tran- 
scendence basis over k, then K is separably 
generated, but not conversely. 

A purely transcendental extension field of k 
having a finite transcendence degree n is also 
called a rational function field in n variables 
over k, and a finite extension of such a rational] 
function field is called an algebraic function 
field in n variables over k. 

Let K and L be extension fields of k, both 
contained in a common extension field. We 
say that K and L are linearly disjoint over k if 
every subset of K linearly independent over k 
is also linearly independent over L, or equiva- 
lently, if every subset of L linearly independent 
over k is also linearly independent over K. An 
algebraic function field K =k(x,,X3,...,X,) 
over k (whose transcendence degree is <n) is 
called a regular extension of k if K and the 
algebraic closure k of k are linearly disjoint. In 
order that K be regular over k it is necessary 
and sufficient that k be algebraically closed in 
K and that K be separably generated over k. 


L. Derivations 


A map D ofa field K into itself is called a 
derivation of K if it satisfies D(a +b) = D(a)+ 
D(b) and D(ab)=aD(b) + bD(a) for all a, be K. 
The set of elements c of K for which D({c)= 

0 is a subfield. If the characteristic of K is 

p( #0), then D(x’)=0 for all xe K. Let k bea 
subfield of K. A derivation D of K is called a 
derivation over k if D(c)=0 for all cek; the 
totality of derivations over k is a tk-module. If 
K=Kk(x,,X,...,X,) Is an algebraic function 
field over k, then the k-module of derivations 
over k has finite dimension s( <n) and we 

can choose s suitable elements u,,u,,...,u, of 
K such that K is separably algebraic over 
k(u,,u,...,u,). Generally, the transcendence 
degree r of K over k does not exceed s, and K 
is separably generated over k if and only if 
r=s. 


M. Finite Fields 


Finite fields were first considered by E. Galois 
(1830), so they are also called Galois fields. 
There is no finite noncommutative field (Wed- 
derburn’s theorem, J. H. M. Wedderburn, 
Trans. Amer. Math. Soc., 6 (1905)). A simple 
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proof for this was given by E. Witt (Abh. Math. 


Sem. Univ. Hamburg, 8 (1931)). The character- 
istic of a finite field is a prime p, and the num- 
ber of elements of the field is a power of p. 
Conversely, for any given prime number p and 
natural number «a, there exists a finite field 
with p* elements. Such a field is unique up to 
isomorphism, which we denote by GF(p*) or 
F,(q =p’). For any positive integer m, GF(p™*) 
is an extension field of GF(p*) of degree m and 
a tcyclic extension. Every element a of GF(p*) 
satisfies a” =a; hence a has its pth root in 
GF(p*). Therefore every finite field is perfect. 
The multiplicative group of GF (p°) is a cyclic 
group of order p*—1. 


N. Ordered Fields and Real Fields 


A field K is called an ordered field if there is 
given a ‘total order in K such that a>b im- 
plies a+c>b+c for all c and a>b, c>0 im- 
plies ac > bc. The characteristic of an ordered 
field is always 0. An element a of K is called a 
positive element or a negative element accord- 
ing as a>0 or a<0. For an element a of K the 
absolute value of a, denoted by |a], is aor —a 
according as a>0 or a <0. If we define neigh- 
borhoods of a by the sets {x|a—e<x<a+e} 
with positive elements ¢, K becomes a tHaus- 
dorff space. If, for any two positive elements 

a, b of K, there exists a natural number n such 
that na> b, then we call K an Archimedean 
ordered field. Two ordered fields are called 
similarly isomorphic if there exists an isomor- 
phism between them under which positive 
elements are always mapped to positive ele- 
ments. The rational number field and the 

real number field are examples of Archime- 
dean ordered fields, while every Archimedean 
ordered field is similarly isomorphic to a sub- 
field of the real number field. (For the struc- 
ture of non-Archimedean ordered fields, see 
[8].) A field k is called a formally real field (or 
simply real field) if —1 (1 is the unity element 
of k) cannot be expressed as a finite sum of 
squares of elements of k. The real number field 
is a model of formally real fields. More gener- 
ally, every ordered field is a formally real field. 
A formally real field is called a real closed field 
if no proper algebraic extension of it is a for- 
mally real field. The real number field is a real 
closed field. The algebraic closure of a real 
closed field is obtained by adjoining a root of 
the polynomial X? + 1. If a is a nonzero ele- 
ment of a real closed field, then either a or —a 
can be a square of an element of the field. 
Every real closed field can be made an ordered 
field in a unique way, namely, by defining 
squares of nonzero elements to be positive 
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elements. Since it is known that every formally 
real field is a subfield of a real closed field, it 
follows that every formally real field is an 
ordered field and therefore is of characteristic 
0. The problem of constructing an ordered 
field out of a formally real field is closely re- 
lated to the existence of valuations of a certain 
type [8]. The notion of formally real fields 
was introduced by E. Artin (Abh. Math. Sem. 
Univ. Hamburg, 5 (1927)). By making use of the 
theory of formally real fields, Artin succeeded 
in solving affirmatively fHilbert’s 17th prob- 
lem, which asked whether every positive 
definite rational expression (i.e., a rational 
expression with real coefficients that takes 
positive values for all real variables) can be 
expressed as a sum of squares of rational ex- 
pressions. More precisely, it was shown by A. 
Pfister that every positive definite function in 
R(X,,...,X,) is a sum of at most 2” squares. 
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A. History 


When a quantity w(x), such as velocity, is 
defined at every point x in a certain region of 
a space, we say that a field of the quantity y 
is given. This general concept is used in many 
branches of science. Here we confine ourselves 
to some branches of physics, in particular to 
the quantum theory of fields, which describes 
telementary particles. 

The ‘theories of elasticity and thydro- 
dynamics (in particular, concerning fEuler’s 
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equation of motion) deal with displacement 
and velocity fields, respectively. However, a 
field in a vacuum (ether), which is quite differ- 
ent from a field in a space filled with matter, 
first became a subject of physics in telectro- 
magnetism. M. Faraday (1837) introduced the 
electromagnetic field and discovered its funda- 
mental laws, and J. C. Maxwell (1837) com- 
pleted the mathematical formulation. On the 
basis of this formalism, A. Einstein (1905) 
established the theory of trelativity and later 
developed the general theories of relativity and 
of gravity. 

Although quantum theory originated from 
the problem of blackbody radiation, the quan- 
tum theory of the electromagnetic field was 
first developed by P. A. M. Dirac (1927) after 
the development of fquantum mechanics. 
Along similar lines P. Jordan and E. P. Wigner 
(1927) quantized the matter wave (electron 
field), and W. Heisenberg and W. Pauli (1929) 
developed the quantum theory of wave fields 
in general. Subsequently, Jordan and Pauli 
(1927), Pauli (1939), S. Tomonaga (1943), J. 
Schwinger (1948), and others reformulated the 
theory in a relativistically covariant manner. 
Quantum electrodynamics, dealing with an 
electromagnetic field interacting with electrons 
(and positrons), has given excellent agreement 
with experimental measurements when for- 
mulated in this way. Divergence difficulties 
inherent to quantized field theory were by- 
passed by the renormalization procedure of 
Tomonaga, Schwinger, R. P. Feynman, and F. 
J. Dyson (1947). 

On the other hand, H. Yukawa (1934) ap- 
plied the concept of the quantized field to the 
interpretation of nuclear force and predicted 
the existence of 7-mesons. Many kinds of new 
particles have since been found, including z- 
mesons and muons. The field theory of z- 
mesons can explain the qualitative features of 
the meson-nucleon system. Similar theories 
can be formulated for other types of unstable 
particles that have been found in cosmic rays 
since 1949. 

During the progress of meson theory, vari- 
ous types of fields were investigated, and a 
general theory of elementary particles was 
developed. Dirac (1936) proposed a general 
wave equation for elementary particles, and 
Pauli and M. Fierz (1939) proved the con- 
nection of spin and statistics (— the end of 
Section D). Schwinger (1951) derived quantum- 
mechanical equations of motion and commu- 
tation relations from a unified variational 
principle. For the cases where no interaction is 
present, a general theory of elementary par- 
ticles consistent with the requirements of rela- 
tivity and of quantum theory was established 
as the theory of free fields. 
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B. Relativistically Covariant Classical Fields 


Relativistically covariant fields f(x) on the 
level of classical theory are functions of the 
space-time point x with either real or com- 
plex values depending on the index a (which 
distinguishes different fields) [1]. A finite- 
dimensional representation D* of SL(2, C) (— 
258 Lorentz Group) on either a real or com- 
plex vector space is assigned to each «, and 
the index r of (x) refers to components in 
this representation space. Each model is 
specified in terms of a Lagrangian density 
L(x) that is a function (typically a polynomial) 
of F(x), ĉa pr (x) (ô, denoting é/éx*, p =0, 
...,3) and their complex conjugates. Z is 
taken to be invariant under the replacement 
of p(x) and ô p(x) by X, D%(A),,e3(x) and 
&s,,D*(A),, ACA), 6, ps (x) for all AE SL(2, C) 
(called the Lorentz invariance of Z) and pos- 
sibly under the replacement of f(x) and 
8,03(x) by Ep U pP(x) and Ep UP) 
(called an internal symmetry). 

The fields are supposed to satisfy the follow- 
ing partial differential equation, called the field 
equation, and are obtained as the tEuler equa- 
tion of the variational problem for the action 
integral I = f (x)dx: 


3 
6L(x)/0Q(x)— 3 (0/6x") (6.L(x)/6(0* pF (x) 
=0, 


3 
BL(xOGAX)— DY) (6/0x") (.L(x)/0(0" GF (x))) 


u=0 
=0 


(the bar indicates the complex conjugate), 
where the second equation is identical to the 
first for a real field. For complex fields, A(x) is 
chosen such that I =T (possibly except for a 
surface term) in order to ensure that the above 
two equations are complex conjugates of each 
other. 

The invariance of J under a Lie group of 
(local and/or volume-preserving point) trans- 
formations of fields implies a conservation law 
for a certain quantity (Noether’s theorem; — 
e.g., E. L. Hill, Rev. Mod. Phys., 23 (1951)). For 
translational invariance (x*— x" +a"), the 
energy-momentum (stress) tensor 


T* (x) 
= — by L(x) + Y [60L(x)/0(0, 97 (x) 0, Gr x) 


+ 6L(x)/O(6, OF (X))O, Pr (X)} 


(the complex conjugate terms in all equations 
to be suppressed for real fields) satisfies the 
differential conservation law 03-9 0,T",(x)=0 
as a consequence of the field equations, which 
in turn implies the time-independence of the 
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following quantity provided that T*,(x) 
vanishes sufficiently fast at spatial infinity: 


= [Tess n= | Tdk (x? =t). 


These are the total field Hamiltonian (energy) 
and momentum. 

In a general situation, T> (=E, gT”; 
where g*” is the Minkowski metric tensor with 
the signature 1, —1, —1, —{) is not necessarily 
symmetric in 4 and v. F. J. Belinfante (Physica, 
6 (1939); 7 (1940)) has given a formula (by a 
change of (x) via so-called 4-divergence) for 
obtaining a symmetrized energy-momentum 
tensor 0”, for which H and P, are equal to 
those defined from the above T”, and for 
which the differential conservation law is 
satisfied. 

Lorentz invariance implies the conservation 
law &,,0,M*’*(x)=0 for the angular momen- 
tum density 


MHH x) =x T(x) — x4 T(x) 


+ È. {10 L/l prx prx) 


+ 0L/8(0, 6) (x)) Se" E) 
a x"O#4(x) —x*@**(x) 


and the time-independence of the total angular 
momentum 


wri | mdy (x° =t), 


e Sr?" is antisymmetric in 4, v and D*(A),s 
Ors + Ep, v Spe Euy/2 for A(A),, = Juy + Euy With 

an morima a E,y- (We have (A(4)x} = 

E, A(AMEX’ and A(A)yy = Zp Gap ACA?) 

A continuous one-parameter group U(p) 
of internal symmetries implies the conserva- 
tion law &,,6,,J"(x)=0 for the 4-current den- 
sity (charge (4:=0) and current (u= 1, 2,3) 
densities) 


J”(x)= 2 {0L/0(6,,02(x))A7? oF (x) 


+ OL /0(8, @2(x))A GPx) 


and the time-independence of the charge 
[ J°(x)a?x (x° =t), where U'(0)% =A". An 
example of U(p) is the multiplication by 
expiQ*p (called a gauge transformation of the 
first kind), where Q° is an integer with Q7= 
for real fields o‘. 

Examples of Lagrangian densities for non- 
interacting fields (called free Lagrangian den- 
sities) are: 


real scalar field: 


(Ly g” O,9(x) 0, p(x) -m p(x)?)/2, 
complex scalar field: 


(Èg 6, P(x) 6, p(x) —m? P(x) p(x), 
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Dirac field: 
DE prH CN OuWs(X) — OW (x) yrs Ws (2) 
~m}, px) yx) 
N = Es) 2” = 06) are tDirac’s y- 
matrices), 


massive real vector field: 


— (1/4) 3 g% g (0x A(x) — 6, A;lx)) 


KAuy 


x (6, A, (x) ~ 6, A, (x)) Hey g" A, (xX) A (x), 
uv 

where D“ is trivial for scalar fields, [1,0] ® 
(0, 1] for the Dirac field, and [1, 1] for vector 
fields. 

For interacting fields, some interaction parts 
are added to the sum of such free Lagrangian 
densities of relevant fields. Some examples are: 


P(ọ)(or g@*) interaction: 

P(g(x)) (or ge(x)*), 
Yukawa-type interaction: 

g Bours WAX) Pes We(X) A(X), 
Fermi interactions: 

g Lin Gu Er WONO) 

x (Lys Why (FOE we (x)) 

(O* = 1 (scalar, no index k), y” (vector, Gy = Guys 
y*y” (tensor, Gyw = Su guy), y?y" (pseudovector, 
Grr = gau V3 = iy? Yyy’), y3 (pseudoscalar, no 
index k). 


C. Heuristic Theory of Quantized Fields 


For free fields, a quantization procedure 
similar to the usual quantum mechanics (— 
351 Quantum Mechanics; 377 Second Quan- 
tization) leads to the following type of canon- 
ical commutation or anticommutation rela- 
tions among fields (and their first time deriva- 
tives) at time 0; for example, 


real scalar field: 

[p(0, x), $0, y)]- =id*(x —y), 

[(0, x), (0, y)] - =[6(0, x), o(0, y)] - =0, 
Dirac field: 

[y,(0, x), Y0, y)], =5,,0°(x —y), 

[y,(0, x), ¥,(0, y)]. =LW,(0, x), #0, y)] + =0. 
([A, B], = AB + BA, 6,,=0 for r#s and 6,,= 1, 
ô? (x —y) = I1}; ô(x* — y*).) The free-field equa- 
tions then lead to the following 4-dimensional 
commutation relations: 


real scalar field: 

Lolx) e(y)J- = —iA,,(x— y), 
Dirac field: 

w(x), Wy yJ = (È, Ysp 
massive real vector field: 

[A,(x), A] - = if guy +m 70,0, Anl — Y). 


Here A,, is the tinvariant distribution. There is 
a unique representation of such relations for a 


—im By) An (X za y), 
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field (called the Fock representation) with a 
vector Q (called the free vacuum vector) which 
is annihilated by f p(x) f(x)dx, by f(x) f(x) dx 
and f ,(x)f(x)dx, or by f A, (x) f(x)dx when- 
ever fe *f(x)dx=0 for p? >0 (here, p: x= 

È Guvp"x"), and which is cyclic (i.e., polynomials 
of (smeared-out) fields generate a dense subset 
of Q). The free fields in the Fock represen- 
tatign satisfy the Wightman axioms (— Sec- 
tion D). 

In the Fock representation of canonical 
fields a translationally invariant vector must 
be a free vacuum vector, up to multiplication 
by a complex number. In order to construct a 
model of a translationally invariant interaction 
among canonical fields with a unique vector of 
minimal energy (a vector which is called the 
true or interacting vacuum and which must 
be translationally invariant if unique), one 
must look for some other suitable representa- 
tion. Such a no-go theorem is called Haag’s 
theorem. 

The earliest formulation of interacting quan- 
tized fields was developed heuristically by 
formal manipulation in the Fock representa- 
tion, described in textbooks of quantum field 
theory [2-6]. It can be mathematically justi- 
fied if the so-called cutoff is introduced by 
limiting the space to a finite volume, possibly 
changing the space into a lattice and smooth- 
ing out fields (effectively cutting off the high- 
energy part of the interaction) (A. M. Jaffe, O. 
E. Lanford III, and A. S. Wightman, Comm. 
Math. Phys., 15 (1969)). The full theory is then 
expected to be obtained by taking a limit of 
the true vacuum expectation values as various 
cutoff parameters are removed. This is the aim 
of constructive field theory (— Section F), 
which has been achieved for some space-time 
models of dimension 2 and 3. 

In quantum field theory, the S-matrix is 
given in terms of (the mass-shell restriction of 
the Fourier transform of) the vacuum expec- 
tation value of the time-ordered product of 
fields, called t-functions (— Section E). In the 
heuristic approach, it is given in terms of the 
following Gell-Mann-—Low formula (its imagi- 
nary time version being mathematically used 
in constructive field theory): If the field g(x) = 
et 9((0,x))e ~= with H =H, +H, (Hp is 
the free Hamiltonian as the generator of the 
time translation for the free field g(x) and H, 
is the interaction Hamiltonian, such as P(@)) 
and x?>...>x°, then 


(Q, P(x) ed p(x,)Q) 


= lim (eTO, p(x)... p(x,)e FTO) 


AQ e77"), 


where Q, is the free vacuum. The numerator 
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can be written as 
(Qo, U(T, XD) Gal DU (x9, Ke axe 
Po(X,) U(x. T)Qo) 


in terms of U (t, s) =e" "oe ~it 9Ħ o ~isHo and the 
canonical field po at time 0. The covariant 
perturbation series due to Tomonaga, Schwin- 
ger, Feynman, and Dyson is obtained by sub- 
stituting the following expansion, which is the 
iteration of the Duhamel formula: 


a5 (=o at, al ANEI 


Here, H,(t)=e""eH,e "0, Each term can be 
represented by connected tFeynman diagrams 
(the denominator canceling out all discon- 
nected graphs) and computed according to the 
*Feynman rule, yielding tFeynman integrals. 
Each expression so obtained (formally) may 
be a divergent integral (in the absence of the 
cutoff), in which case one tries to cancel it out 
by modifying the original Hamiltonian with 
the alteration of parameters such as mass 

and coupling constant (by an amount called 
the renormalization constant, which is diver- 
gent in the absence of the cutoff) or possibly 
by the addition of terms again involving 
renormalization constants. If this can be 
achieved in terms of a finite number of renor- 
malization constants, the model or the Hamil- 
tonian ts called renormalizable [7]. If the 
divergent integral appears only in a finite 
number of graphs (counting the same sub- 
graph of an infinite number of different dia- 
grams as one graph), the model is called super- 
renormalizable. 


D. Axiomatic Quantum Field Theory 


Relativistically covariant fields ọ,(x) on a Hil- 
bert space # are called Wightman fields if the 
following four Wightman axioms are fulfilled 
[8-10]: 

(1) The fields ~,(x) are operator-valued distri- 
butions: For each C”-function f of rapid de- 
crease on the Minkowski space (f € F(R), 


_@,(f) is an operator defined on a common 


domain D dense in # and satisfying o,(f)D< 
D (the domain of g,(f)* contains D), g, (A*I D 
= p;( f) for another index & and (¥, g,( f)®) 
for any Y, and Ẹ in D is linear and continuous 
in f relative to the topology of the Schwartz 
space Y(R*). 

(2) Relativistic covariance: There exists a 
continuous unitary representation U(a, A) 
(ae R* and AeSL(2, C)) of the universal cover- 
ing group Ž! of the trestricted inhomogeneous 
Lorentz group Y! on # such that U(a, A)Dc 
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D and 


U (a, A)Pa( f) U (a, A)* = 2 D(A)qg Pp fa, A)» 


where fa, 4(x) = f(A(A)7"(x —a)) and (D(A),,) is 
a finite-dimensional representation of SL(2, ©). 
The index x usually consists of tundotted and 
dotted indices, interchanged in x, and of other 
indices, interchanged among them in g.) 

(3) Locality: If the supports of f and g are 
mutually spacelike, then 


Pal I) Pp(9) = €(a; B) Op GOS) 


on D, where e(a, B)= +1. If e(a, 8) = —1 exactly 
when D(A)qq=D(A)gg= —1 for A= —1 (i.e., 
when both p, and @, are Fermi fields), they 
are said to satisfy the normal commutation 
relations. 

(4) Spectrum conditions: Let U(a, 1)=e'*’?. 
The joint spectrum of P” (1=0, 1, 2, 3) is in the 
forward cone V, ={peR*|p- p>0, p°>0}, 
with a point spectrum of multiplicity 1 at p=0. 
A vector © belonging to the point spectrum 0 
of P* is called the true (or interacting) vacuum 
and is required to be in D. 

Usually D is taken to be minimal, namely, D 
is the linear hull of Q and 9, (fi)... Pa (fQ 
with all possible n, «1, ...,%,5 fis -+ Ja. By means 
of the nuclear theorem, it is possible to define 
a linear operator gy, (f) for f(x,,...,x,) in 
S(R*), linear and continuous in f on D such 
that it coincides with @, (J1) --- Pa, (Jn) if f(x) 
=I], f(x). If such operators are introduced, 
then the linear hull of Q and g,,__, (f)Q is 
taken to be D. 

Under the foregoing axiom, the vacuum 
expectation values of the products of field 
operators define tempered distributions W,(x) 
= W,,...2,(%15+++,Xp), Called Wightman functions, 
such that 


WATT fi) = (Q, Pa, Si) Pau Sn) Q). 

The notion of a connected graph in pertur- 
bation theory corresponds to the following 
notion of truncated Wightman functions W7: 


WID=5 -1 (m-1)! F T] Wildy, 


Hr} k=1 


WAD= > dT] W), 

m {Ty} k=1 
where J indicates a set of variables x;e M, jel, 
{I,} is a partition of I into m subsets, with the 
order of the x’s in each I, remaining the same 
as in I, and the sum over {/,} extending over 
all possible partitions. If the spectrum of {P*} 
in Q+ is contained in V"={peM|p-p>m’, 
p° >m} for some m>0 (the mass gap), then 
W7 has an exponential clustering property at 
spatial infinity. For example W,"(x,+4,,..., 
x, ta,e™® 0 as a distribution in x if m <m, 
a? =0 for all j and R=max|a,—a,|> 00. 
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If the mass operator (P- P)"? has an iso- 
lated point spectrum at m with the eigenspace 
HA (m), then there are sufficiently many n, «= 
(a,,...,%,), and f(x,,...,x,) such that p, f)QE 
H,(m), and the linear hull of such vectors are 
dense in #,(m). In terms of the f; satisfying 
Papl f)) Qe H (m) (some of the ms may coin- 
cide) and the solutions g(x) = | g(p)exp i(px — 
co,(p)x°)d* p, w,(p)=(p* + m7)", of the Klein- 
Gordon equation, then the following limits, 
called out and in states, exist: 


out 


yee (hy se h,)= lim Q(t, gı) Q(t, Gn) Q, 


Q(t, gj) = | eat + (t, x), B Xn; + (t, x)) 


x fixi +... + Xn) Gilt, x)dx,... dx,,,dx, 
h,=(22)° J (P), (HRE X (m). 
It defines the S-matrix elements as follows: 
S(hi -hp hi hh) 
=(P*(h; ...h,), (h... h) 


This is called Haag-Ruelle scattering theory, 
and the existence proof is based on the cluster- 
ing property of W’ and an asymptotic esti- 
mate for the behavior of the g’s for large t 
and all x (R. Haag, Phys. Rev., 112 (1958); D. 
Ruelle, Helv. Phys. Acta, 35 (1962)). The out 
and in states can be interpreted in the limit of 
infinite future and past as the state where n 
particles are moving at velocities v; related to 
the spectrum p” of P“ through the relation p° = 
m;/(1 —v?)"?, p=m,v,/(1 —v;)'? (with probabil- 
ity amplitude proportional to h,(p)) (H. Araki 
and R. Haag, Comm. Math. Phys., 4 (1967)). 
Since 


(Ph, hy), Ph... hy) 


= Onn’ 3 II (h;, hpo) 
P j=1 


where the sum is over all permutations P, Y™ 
can be viewed as a unitary mapping from the 
*Fock space #° over LE æ (m) to the closed 
subspace #°™ spanned by ‘P"'(h, ...h,), n= 
0, 1,2,... (Q for n=0, h, itself for n=1). The 
free fields on #7 and #"" are called (out and 
in) asymptotic fields. Likewise, ‘¥'” is a unitary 
mapping from #° to H1. If Hi" =H, then 
the matrix element S can be viewed as a ma- 
trix element of a unitary operator on #°, 
called an S-matrix. Its unitary transform by 
Yt and by Y™ coincide and define a unitary 
operator, sometimes called an S-operator, on 
Hi = f™. The equality H = H = H™ is 
called the completeness of the scattering states 
or asymptotic completeness. 

If the scattering states are complete, then 
the following LSZ asymptotic condition, due 
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to Lehmann, Symanzik, and Zimmermann 
(Nuovo Cimento, 1 (1955)), holds: 


weak- lim Q(t, g) = azu(h;), 


in 


where a*(h) is the tcreation operator (a*(h) - 
Wh, ...h,) = (hh, ...h,) for either out or 

in states), Papl QEA (m)) is not required, 
and (Q, Papl f)Q)=0 is assumed instead. This 
leads to an explicit expression for the S-matrix 
elements in terms of t-functions, which takes 
the following simple form if all particles have 
spin 0, all fields ọ, are scalar (D(A), =6,,), and 
all n, can be taken to be 1. First define the 
connected part S.(h, ... h„; hi... hh) from S by 
exactly the same equation as truncated Wight- 
man functions with S to be set to 0 if n=0, 1 
or n' =0, 1 except when n=n' =1 (S(h, h’)= 

(h, h’)). Then 


S.(hy exe hy; his nae h,) 


=|] (i (P; Pj— ni) ty (Disses 


Pitts sees ra Il (—iZ;"?h(p,) dQ,(p,)), 


j=l 


where p? =(p; +m?)'? (the set of such p is 
called the positive mass shell; there the vanish- 
ing factors (p;' p;— m°) cancel the poles of #7), 
dQ, „(p)i is the Lorentz invariant measure 
(21p?Nd*p (=8(p p- m?)d*p) on the posi- 
tive mass shell, he #,(m) is represented by a 
measurable function h(p) with the inner prod- 
uct (h, h) = f h(p)h'(p) dQ,,(p), Z; is defined by 
(h, pa (HA) = Z}°?{(h, f) for all helm) and 
f(p)= OT) -32 f e'P'* f(x)dx, and 7,’ is the 
Fourier transform of the truncated time- 
ordered function (t-function) defined by 


t. (pi, p= | Son. nxen È px) 
j=l 


xT] (2n)-3?d*x,, 


jal 


Ty (x) a A(x°; Cp) We. a(x) (x = (x, Ee Xn))s 


We (x) =(Q, Papp (XP pe Papin XP(ny) 2)" » 


in which the sum is over all permutations P, T 
indicates the truncated Wightman functions, 
0(x°; Cp) is the characteristic function of the 
CONE xpa) > X pra) > --- > Xp (the time ordering) 
possibly smeared out by convolution with a 
C-function of compact support (so that it can 
multiply a distribution), where the formula 
does not actually depend on the smearing 
functions. (Usually fields p, are normalized so 
that Z;=1.) 

The r-functions (O. Steinmann, Helv. Phys. 
Acta, 33 (1960); D. Ruelle, Nuovo Cimento, 19 
(1961); H. Araki, J. Math. Phys., 2 (1961)) are 
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defined by 
r(x)= > O(x°; Cp/C;) We a(x) 
P 


(- 2, 40?; C/C) Waat) 


(often with an additional factor (—i)"~!), where 
the Fourier-Laplace transform 6(q°; C p) of 
0(x°; C,) is a rational function and 0(x°; C,/C) 
is the inverse Fourier transform of its bound- 
ary value as Imq° eC, tending to 0, and the 
cone C; and hence r; is specified by a consistent 
choice of signs of ©, Imq? for all nonempty 
proper subsets 7 of (1,...,n). The Fourier 
transform of r,(x) is a boundary value of an 
analytic function common to all r; and coin- 
cides with ¿7 for p°eC,. Making use of this 
relation, some analyticity properties of the S- 
matrix (including TCP symmetry) have been 
proved ([11, 12]; J. Bros, H. Epstein, and 

V. Glaser, Comm. Math. Phys., 1 (1965); 
Epstein, Glaser, and A. Martin, Comm. Math. 
Phys., 13 (1969); Epstein, J. Math. Phys., 8 
(1967)). 

The two-point function for scalar fields has 
the following simple expression in terms of 
tinvariant distributions, sometimes called 
the Kallen-Lehmann representation with the 
Kallen-Lehmann weight p: 


(Q, e(x)e(y)Q) = | A(x — y)dp(k7). 


Under the Wightman axiom (with the 
normal commutation relations), there exists 
an antiunitary operator ©, called the TCP 
operator, which satisfies the relations 


@Q=2, Ula, A\@~!=U(—a, A), 
= U(0, —1), 
Oo,(x)O™* =(—1)"(— i)” pal =X); 


where y is the number of undotted indices in «, 
N=0Oif D(A),,=1 for A= —1 (Bose fields), and 
N =1 if D(A),,= —1 for A= —1 (Fermi fields). 
The choice of +1 for &(«, x) in the locality 
axiom cannot be opposite to the normal com- 
mutation relations for any a except for the 
trivial case ~,(x)=0 ([8-10]; G. Liiders and B. 
Zumino, Phys. Rev., 110 (1958); N. Burgoyne, 
Nuovo Cimento, 8 (1958)). This is called the 
connection of spin and statistics. For a general 
choice of +1 for e(a, £), the Wightman axioms 
imply existence of a certain number of even- 
oddness conservation laws (i.e., the existence of 
a unitary representation u of a group (Z,)' for 
some l such that u(g)Q=Q, u(g)g,(f)ulg)* = 
X.(9)9,( f) for some characters y, on the group) 
so that the Klein transforms @,(x)= u(g,),(x) 
(for some choice of g,) of the original fields 
satisfy the Wightman axioms with the normal 
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commutation relations ([8-10]; H. Araki, 
J. Math. Phys., 2 (1961)). 


E. Theory of Local Observables 


Except for the technical assumption of 
operator-valued distributions, physical con- 
tents of the Wightman axioms have been for- 
mulated in terms of the von Neumann algebra 
(0) for bounded open sets ©, generated by 
those observables which can be measured in 
the space-time region 0, as follows: 

(1) Isotony: If ©, > ©,, then A(0,)> A(0,). 

(2) Covariance: U (a, A).o(@)U (a, A)* = 
A(A(A)O +a). 

(3) Locality: If Q;e./(@,) and ©, is spacelike to 
0, then [Q,,Q0,]=0. (No signal can propa- 
gate faster than the speed of light.) 

In addition, the spectrum condition is as 
before (the stability of the vacuum) and, since 
we restrict our attention to the closed span of 
A(0)Q for all ©, Q is assumed to be cyclic for 
(Jo (©). (Then the latter is irreducible.) By 
treating Q(x)= U(x, 1)QU (x, 1)* for Qe.A(0) 
as a (noncovariant but localized) field, Haag- 
Ruelle scattering theory and the analyticity 
properties of the S-matrix described above for 
Wightman fields hold in exactly the same way 
in the theory of local observables. The notion 
of an algebra of local observables has been 
a concern of R. Haag since the late 1950s, 
with the consequent analogy to Wightman 
fields being demonstrated by Araki in his 
Zurich lectures of 1961-1962. Hence the 
above axioms are sometimes called Haag- 
Araki axioms. 

With the help of the additional axiom 
(A(O,)UA(C))" = S(O, U 02), called the addi- 
tivity, the vacuum vector Q is cyclic and sep- 
arating for (0) for any bounded open 0 
(Reeh-Schlieder theorem) and (0) = (Ô), 
where @ is the double cone {x||x°}+|x|<L} if 
O={x||x°|+|x|<L, |x|<e} for any e>0, for 
example (Borchers theorem). 

A merit of the Haag-Araki axioms is that 
these axioms have direct physical interpre- 
tation. In particular they always imply the 
commutativity at spacelike separation of sup- 
ports, in contrast to the anticommutativity for 
Fermi fields. This then necessitates the con- 
sideration of the representations associated 
with some other states, such as states with an 
odd number of fermions and representations of 
the C*-algebra .° of quasilocal observables 
(generated by all ./(()), which are nonequiva- 
lent to the vacuum representation and are 
called superselection sectors. This viewpoint 
was introduced by Haag and D. Kastler (J. 
Math. Phys., 5 (1964)) and the C*-algebra 
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version of the Haag-Araki axioms ts called the 
Haag-Kastler axioms. 

If ©’ denotes the causal complement of © 
(i.e., the set of all points spacelike to ©), then 
the assumption ./(@) =. (C) is called duality 
and is proved for a certain type of region, 
which includes the double cone in the case of 
free fields. With the assumption of the duality 
for double cones in the vacuum sector, S. 
Doplicher, Haag, and J. Roberts (Comm. Math. 
Phys., 13 and 15 (1969); 24 (1971); 35 (1974)) 
succeeded in the analysis of superselection 
sectors and clarified the connection of spin and 
statistics in a much more satisfactory fashion, 
as well as the anticommutativity of intertwin- 
ing operators for superselection sectors for 
Fermi statistics. 


F. Constructive Field Theory 


An effort to make mathematical sense out of 
the heuristic theory of quantized fields and to 
produce examples of Wightman fields and the 
associated system of local observables has 
been pushed forward by J. Glimm and A. Jaffe 
since the mid 1960s and is known as construc- 
tive field theory. Since 1972, the Euclidean 
methods, already known in some sense, have 
become extremely powerful central tools, are 
collectively known as Euclidean field theory 
[13-16]. 

The Wightman function W(z, ...z,) is ana- 
lytic at the Schwinger points z;=(ix), x,) (xe R*) 
if x; AX, for j #k, and its value S(x; ...x,)= 
W(z, ...Z,) is called the Schwinger function. 
The axioms for Schwinger functions equiv- 
alent to Wightman axioms are known as 
Osterwalder-Schrader axioms (Comm. Math. 
Phys., 31 (1973); 42 (1975)). The positivity 
axiom reflecting the positive definite metric 
of the Hilbert space for Wightman fields is 
known as O-S positivity (or T -positivity or 
reflection positivity). 

Since the Schwinger function is symmetric 
in its variables, it can be viewed as the expec- 
tation value of the product of (commuting) 
random fields, called Euclidean fields, if an 
additional positivity holds. This idea was put 
forward by K. Symanzik in the 1960s. E. Nel- 
son then realized that Euclidean fields for free 
fields have the Markov property, and he devel- 
oped Euclidean Markov field theory. A work 
of Guerra in 1972 utilizing Nelson symmetry 
revealed the extreme power of this approach, 
and the whole of constructive field theory has 
been studied in Euclidean formulation with 
remarkable results for super-renormalizable 
models in space-time of dimension 2 and 3. 
The key point is the Feynman-Kac-Nelson 
formula, which expresses Schwinger functions 
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as functional integrals and reveals a mathe- 
matical connection between Euclidean field 
theory and classical statistical mechanics. 


G. Gauge Theory 


*Electrodynamics in terms of the 4-vector 
potential A,,(x) is invariant under the local 
gauge transformations A,(x)>A,(x)+ ô A(x) 
(and the associated transformation of the 
charged fields) for A satisfying the wave equa- 
tion (JA=0. This leads on one hand to com- 
plication in the canonical quantization of the 
fields A,,(x) and, on the other hand, to the 
necessity of an indefinite inner-product space 
as exemplified by the Gupta-Bleuler formalism 
of quantum electrodynamics. In such a formal- 
ism, the physical Hilbert space (with positive 
definite metric) is introduced by considering 

a specific subspace (physical subspace) of a 
semidefinite metric and taking the quotient by 
its null subspace. 

The corresponding theory with a non- 
commutative gauge group is known as the 
theory of Yang-Mills fields. In order to restore 
the formal unitarity of the S-matrix in the 
perturbation series in terms of Feynman dia- 
grams, L. D. Faddeev and V. N. Popov (Phys. 
Lett., 25B (1967)) introduced fictitious particles, 
called Faddeev-Popov ghosts. T. Kugo and I. 
Ojima (Prog. Theoret. Phys., Suppl., 66 (1979)) 
developed a canonical quantization scheme for 
the Yang-Mills field which naturally intro- 
duces additional fields corresponding to the 
Faddeev-Popov ghosts and specifies the phy- 
sical subspace by means of the simple condi- 
tion that it be the kernel of the generator of 
BRS transformations, earlier introduced by C. 
Becchi, A. Rouet, and R. Stora. 

Gauge theory can be formulated on a lattice 
of finite volume as a kind of classical statistical 
mechanics. This is known as lattice gauge 
theory, and the important issue currently being 
investigated is whether or not its limit, as the 
lattice interval tends to 0 (the continuum limit) 
and the volume tends to infinity, produces a 
nontrivial quantum theory of gauge fields. 
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A. The Number of Finite Groups of a Given 
Order 


A group is called a finite group if its order is 
finite (— 190 Groups). Since the early years of 
the theory of finite groups, a major problem 
has been to find the number of distinct isomor- 
phism classes of groups having a given order. 
It is almost impossible, however, to find a 
general solution to the problem unless the 
values of n are restricted to a (small) subset of 
the natural numbers. Let f(n} denote the num- 
ber of isomorphism classes of finite groups of 
order n. If p is a prime number, then f(p)=1 
and any group of prime order is a cyclic group. 
If p is prime, then any group of order p? is an 
tAbelian group and f(p°)=2. If p and q are 
distinct primes and p> gq, then f(pq)=2 or 1 
according as p is congruent to 1 modulo q or 
not. If p=1 (mod qg), there is a non-Abelian 
group of order pq as well as a cyclic group of 
order pq. For small n, the value of f (n) is as 
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follows: 


n 8 12 16 18 20 24 27 28 30 32 60 
fm) 5 514 5 515 5 4 45113 


For any n, f(n)>1. When p is prime, f(p”) is 
known for m<6: f(p°*)=5, f(p*)=15 if p>2. 
For f(p*) see O. Schreier, Abh. Math. Sem. 
Univ. Hamburg, 4 (1926). For f(2°) see [11]. 
Set f(p™)=p' and !=Am?>. Then A—2/27 as 
m— oo (G. Higman, Proc. London Math. Soc., 
10 (1960); C. C. Sims, Symposium on Group 
Theory, Harvard, 1963). 


B. Fundamental Theorems on Finite Groups 


The following are some fundamental theorems 
useful in studying finite groups. 

(1) The order of any subgroup of a finite 
group G divides the order of G (J. L. La- 
grange). The converse is not necessarily true. If 
a finite group G contains a subgroup of order 
n for any divisor n of the order of G, then G is 
a tsolvable group. Furthermore, if G contains 
a unique subgroup of order n for each divisor 
n of the order of G, then G is a cyclic group. 

Let p be a prime number. Let the order of a 
finite group G be p"m, where m is not divisible 
by p. A subgroup of order p” of G is called a p- 
Sylow subgroup (or simply a Sylow subgroup) 
of G. The importance of this concept may be 
seen from the next theorem. 

(2) A finite group contains a p-Sylow sub- 
group for any prime divisor p of the order of 
the group. Furthermore, p-Sylow subgroups 
are conjugate to each other. The number of 
distinct p-Sylow subgroups is congruent to 1 
modulo p. In general, the number of distinct p- 
Sylow subgroups of G that contain a given 
subgroup whose order is a power of p is con- 
gruent to 1 modulo p (Sylow’s theorems). 

(3) A p-group is a fnilpotent group (a finite 
group is called a p-group if its order is a power 
of p). Thus any finite p-group G of order >1 
contains a nonidentity element in the tcenter 
of G. Furthermore, any proper subgroup of G 
is different from its tnormalizer. A paper by P. 
Hall (Proc. London Math. Soc., (2) 36 (1933)) is 
a classic and fundamental work on p-groups. 

A group G of order 8 with 2 generators a, t 
and relations of =1, tot™! 1 o*=17 is 
called the quaternion group. This group is 
isomorphic to the multiplicative group consist- 
ing of {+1, +i, +j, +k} in the tquaternion 
field. A generalized quaternion group is a group 
of order 2” with 2 generators a, t and rela- 
tions o?" =1, tot! 1 72=¢2"”. A non- 
Abelian group all of whose subgroups are 
normal subgroups ts called a Hamilton group. 
A Hamilton group is the direct product of a 
quaternion group, an Abelian group of odd 


=o 


=o 
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order, and an Abelian group of exponent 2 
(i.e., p? =1 for each element p). 

Let G be a finite group, and let G= G>G;, 
>...>G,={1} be a tcomposition series of G. 
The set of isomorphism classes of the simple 
groups G;-1/G, i= 1, 2, ...,r, is uniquely deter- 
mined (up to arrangement) by the tJordan- 
Holder theorem. Thus the two most funda- 
mental problems of finite groups are (i) the 
study of the simple groups and (ii) the study of 
a group with a given set of composition fac- 
tors. The first is one of the leading problems of 
the theory, although it has been in a state of 
stagnation until rather recently (— Section J). 
As to the second problem, initial works by H. 
Wielandt and others are under way (partic- 
ularly in the direction of various generaliza- 
tions of Sylow’s theorems). For the class of 
finite solvable groups, the first problem has a 
rather trivial solution; only the second prob- 
lem is important, and even in this case the 
theory seems to leave something to be desired. 


C. Finite Nilpotent Groups 


A finite group is nilpotent if and only if it is 
the fdirect product of its p-Sylow subgroups, 
where p ranges over all the prime divisors of 
the order. Any maximal subgroup of a nilpo- 
tent group is normal. The converse holds for 
finite groups; that is, a finite group is nilpotent 
if and only if all its maximal subgroups are 
normal. 


D. Finite Solvable Groups 


One of the most profound results on finite 
groups, an affirmative answer to the long- 
standing Burnside conjecture, is the Feit- 
Thompson theorem (Pacific J. Math., 13 
(1963)): A finite group of odd order is solvable. 
The index of a maximal subgroup of a finite 
solvable group is a power of a prime number 
(E. Galois). But the converse is not true. The 
unique simple group of order 168 has the 
property that all maximal subgroups are of 
prime power index. A finite solvable group 
contains a self-normalizing nilpotent subgroup 
(i.e., a nilpotent subgroup H such that Ng(H) 
=H), and any two such subgroups are conju- 
gate (R. W. Carter, Math. Z., 75 (1960); cf. W. 
Gaschitz, Math. Z., 80 (1963), for a generali- 
zation). Such a subgroup is called a Carter 
subgroup and is an analog of a Cartan subal- 
gebra of a Lie algebra. But unlike Cartan 
subalgebras, most simple groups do not con- 
tain any self-normalizing nilpotent subgroups. 
A finite solvable group of order mn (m, n are 
relatively prime) contains a subgroup of order 
m; two subgroups of order m are conjugate; if l 
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is a divisor of m, then any subgroup of order / 


is contained in a subgroup of order m (P. Hall). 


The converse of the first part of this theorem is 
also true: A finite group is solvable if it con- 
tains a subgroup of order m for any decompo- 
sition of the order in the form mn, (m,n)=1 (P. 
Hall’s solvability criterion). This generalizes 
the famous Burnside theorem asserting the 
solvability of a group of order p%q’, where 
both p and q are prime numbers. If the se- 
quence of the quotient groups of a tprincipal 
series of a finite group G consists of cyclic 
groups, then the group G is called supersoly- 
able. A finite group is supersolvable if and 
only if the index of any maximal subgroup is a 
prime number (B. Huppert, Math. Z, 60 (1954)). 
If p is the largest prime divisor of the order of 
a finite supersolvable group G, then a p-Sylow 
subgroup of G is a normal subgroup. 


E. Hall Subgroups 


A subgroup is called a Hall subgroup if its 
order is relatively prime to its index (see the 
theorems of P. Hall on finite solvable groups). 
There is no general theorem known on the 
existence of a Hall subgroup. If a finite group 
G has a normal Hall subgroup N, then G con- 
tains a Hall subgroup H that is a complement 
of N in the sense that G=NH and NN H=1; 
furthermore, any two complements are conju- 
gate (Schur-Zassenhaus theorem). The analog 
of Hall’s theorem on finite solvable groups 
fails for nonsolvable groups. But if a finite 
group, solvable or not, contains a nilpotent 
Hall subgroup H of order n, then any sub- 
group of an order dividing n is conjugate to a 
subgroup of H (Wielandt, Math. Z., 60 (1954); 
cf. P. Hall, Proc. London Math. Soc., 4 (1954), 
for a generalization). There are some gener- 
alizations of these results for maximal 7- 
subgroups which may not be Hall subgroups. 


F. 2-Solvable Groups 


Let z be a set of prime numbers. Denote the 
set of prime numbers not in z by 7’. A finite 
group is called a z-group if all the prime divi- 
sors of the order belong to z. A finite group is 
called z-solvable if any composition factor is 
either a z’-group or a solvable z-group. If z= 
{p} consists of a single prime number p, we 
use terms such as p-group or p-solvable (in- 
stead of { p}-solvable). Let G be a n-solvable 
group. A series of subgroups Pyp=1CNy¢ 
P,SN,&... EP, N,=G defined by the prop- 
erties that P,/N;_, is the maximal normal z- 
subgroup of G/N,_, and N,/P, is the maximal 
normal z’-subgroup of G/P, is called the n- 
series of G, and the integer / is called the z- 
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length of G. A solvable group is z-solvable for 
any set x of prime numbers. A z-solvable 
group contains a Hall subgroup which is a z- 
group and also a Hall subgroup which ts a z’- 
group; an analog of Hall’s theorem on finite 
solvable groups holds. Hall and Higman (Proc. 
London Math. Soc., 7 (1956)) discovered deep 
relations between the p-length of a p-solvable 
group and invariants of its p-Sylow subgroup. 
For example, the p-length is 1 if a p-Sylow 
subgroup is Abelian. 


G. Permutation Groups 


The set of all permutations on a set Q of n 
elements forms a group of order n! whose 
structure depends only on n. This group is 
called the symmetric group of degree n, de- 
noted by S,. Any subgroup of S, is a permuta- 
tion group of degree n. When it is necessary to 
mention the set Q on which permutations 
operate, $, may be denoted as S(Q), and a 
subgroup of S(Q) is called a permutation 
group on Q. The set of n elements on which S, 
operates is usually assumed to be {1, 2, ..., n}, 
and an element o of S, is written as 


bk 2. nc ht 
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where i’ is the image of i by a:i’ = a(i). The 
element o may be written as (...)(abc...z)(...), 
which means that o cyclically maps a into b, b 
into c, and so on, and finally z back into a. In 
the example, o =(1 2 3)(45)(6). It is customary 
to omit the cycle with only one letter in it, 
such as (6) in the example. With this conven- 
tion, o =(1 2 3)(45) may be an element of S, for 
any n È 5, leaving all the letters i>6 invariant. 
A cycle of length / is an element o of S, which 
moves / letters a,,...,a, cyclically and fixes 

all the rest; i.e., o=(a,,...,4,). Then an ex- 
pression such as o =(1 2 3)(4 5) is the same as 
the product of two cycles (1 2 3) and (45). In 
general, any permutation can be expressed as 
the product of mutually disjoint cycles (two 
cycles (a,,...,a,) and (b,,...,5,,) are said to be 
disjoint if a; # b; for all i and j). Furthermore, 
the expression of the permutation as the prod- 
uct of mutually disjoint cycles is unique up to 
the order in which these cycles are written. A 
cycle of length 2 is called a transposition. Any 
permutation may be written as a product of 
transpositions. This expression is not unique, 
but the parity of the number of transpositions 
in the expression is determined by the permu- 
tation. A permutation is called even if it is the 
product of an even number of transpositions 
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and odd otherwise. The symmetric group S, 
contains the same number of even and odd 
permutations. 

The totality of even permutations forms a 
normal subgroup of order (n!)/2(n > 2), called 
the alternating group of degree n and usually 
denoted by A,. An even permutation is the 
product of cycles of length 3. The alternating 
group A, of degree 5 is the nonsolvable group 
of minimal order. This fact was known to 
Galois. If n4, then the alternating group A, 
is a simple group and a unique proper normal 
subgroup of S,,. If n=4, A, contains a normal 
noncyclic subgroup V of order 4. In this case 
A, and V are the only proper normal sub- 
groups of S4. A noncyclic group of order 4 is 
called a (Klein) four-group. If n>5, the sym- 
metric group S, is not a solvable group. This 
is the group-theoretic ground for the famous 
theorem, proved by Ruffini, Abel, and Galois, 
which asserts the impossibility of an algebraic 
solution of a general algebraic equation of 
degree more than four. If n <4, S, is solvable. 
The group S, has a composition series with 
composition factors of orders 2, 3, 2, 2, and 
S, is realized as the group of motions in 3- 
dimensional space which preserve an octahe- 
dron. Hence S, is called the octahedral group. 
Similarly, A,(A.) is realized as the group of 
motions in space which preserve a tetrahedron 
(icosahedron); thus A, is called the tetrahedral 
group and A, the icosahedral group. 

These groups have been extensively studied 
in view of their geometric aspect. The group of 
motions of a plane which preserve a regular 
polygon is called a dihedral group. If a regular 
polygon has n sides, then the group has order 
2n. Sometimes a Klein four-group is included 
in the class of dihedral groups (for n= 2). The 
dihedral groups, octahedral group, etc., are 
called regular polyhedral groups. A finite 
subgroup of the group of motions in 3- 
dimensional space is either cyclic or one of 
the regular polyhedral groups. A dihedral 
group is generated by two elements of order 2. 
Conversely, a finite group generated by two 
elements of order 2 is a dihedral group. This 
simple fact has surprisingly many conse- 
quences in the theory of finite groups of even 
order [15, ch. 9]. A dihedral group of order 
2n contains a cyclic normal subgroup of order 
n, and hence is solvable. 

If n #6, every automorphism of S, is inner. 
The order of the group of automorphisms of 
Ss is twice the order of Sẹ. The index (S,: H} of 
a subgroup H of S, is at Jeast equal to n unless 
H=A,,. H (S,:H)=n, then H is isomorphic to 
S,- lfn #6, S, contains a unique conjugate 
class of subgroups of index n. But S$, contains 
two such classes, which are exchanged by an 
automorphism of Sẹ (— Section I). 
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H. Transitive Permutation Groups 


A permutation group G ona set Q is called a 
transitive permutation group if for any pair 

(a, b) of elements of Q, there exists a permuta- 
tion of G which sends a into b. Otherwise G is 
said to be intransitive. Let G be a transitive 
permutation group on a set Q, and let a be an 
element of Q. The totality of elements of G 
which leave a invariant forms a subgroup of G 
called the stabilizer of a (in G). The index of 
the stabilizer is equal to the number of ele- 
ments of Q, the degree of G. Thus the degree of 
a transitive permutation group G divides the 
order of G (a fundamental theorem). 

The concept of orbits is important. Let G be 
a permutation group on a set Q. A subset I of 
Q is called an orbit of G if it is G-invariant and 
G acts transitively on T. In other words, a 
subset T of Q is an orbit of G if the following 
two conditions are satisfied: (i) If ae land 
geéG, the image g(a) also lies in T; and (ii) if 
a and b are two elements of T, there exists 
an element x of G such that b= x(a). Thus 
each element x of G induces a permutation 
(x) on T. The set of all these permutations 
,(x)(xEG) forms a permutation group on T, 
which may be denoted by p (G). Then (G) is 
transitive on T, and ¢, is a homomorphism of 
G onto ¢,(G). Thus the number of elements in 
an orbit T is a divisor of the order of G. It is 
clear that the set Q on which G acts is the 
union of mutually disjoint orbits T}, ..., I, of 
G. This implies that the degree of G is the sum 
of the numbers of elements in the orbits I. 
The resulting equation often contains non- 
trivial relations. If œ; denotes the homomor- 
phism ¢,, defined before, then G is isomorphic 
to a subgroup of the direct product of the 
groups 9,(G), i=1, 2,...,7. 

A transitive permutation group is called 
regular if the stabilizer of any letter is the 
identity subgroup {1}. A transitive permuta- 
tion group is regular if and only if its order 
equals its degree. Any group can be realized 
as a regular permutation group (Cayley’s 
theorem). A transitive permutation group 
which is Abelian is always regular. 

Let G be a transitive permutation group on 
a set Q. If the stabilizer of an element a of Q is 
a maximal subgroup, G is called primitive, and 
otherwise imprimitive. A normal subgroup, 
which is #{1}, of a primitive permutation 
group is transitive. An imprimitive permuta- 
tion group induces a decomposition of the set 
Q into the union of mutually disjoint subsets 
A,,...,A, (s> 1) such that each A; contains at 
least two elements, and if xe G maps an ele- 
ment a of A; onto an element b of A,, then x 
maps every element of A; into A,: x(A;)=A,. 
The set {A,,...,A,} is called a system of im- 
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primitivity. A subset A of Q is called a block if 
x(A)MA equals A or the empty set for all x in 
G. A block is called nontrivial if A#Q and A 
contains at least two elements. Each member 
of a system of imprimitivity is a nontrivial 
block. A transitive permutation group is 
primitive if and only if there is no nontrivial 
block. 

A permutation group G on a set Q is called 
k-transitive (or k-ply transitive, where k is a 
natural number) if for two arbitrary k-tuples 
(a,,...,4,) and (b,,...,5,) of distinct elements of 
Q, there is an element of G which maps a; into 
b; for all i=1, 2,...,k. If k>2, G is called multi- 
ply transitive. A doubly transitive permutation 
group is always primitive. The symmetric 
group S, of degree n is n-transitive, while the 
alternating group A, is (n— 2)-transitive for 
n> 3. Conversely, an (n — 2)-transitive permu- 
tation group on {1,2,...,n} is either S, or A,- 
For multiply transitive permutation groups 
which are simple, see the list in Section I. If 
k26, no k-transitive groups are known at 
present except S, and A,,. If the Schreier con- 
jecture (— Section I) is true, then there are no 
6-transitive permutation groups except S, and 
A,, (Wielandt, Math. Z., 74 (1960); H. Nagao, 
Nagoya J. Math., 27 (1964); O’Nan, Amer. 
Math. Soc. Notices, 20 (1973)). 

Two 5-transitive permutation groups other 
than S, and A, are known: the groups M,, and 
M,, of degrees 12 and 24, respectively, dis- 
covered by E. L. Mathieu in 1864 and 1871. 
The stabilizer of a letter in M,3(M>,) is a 4- 
transitive permutation group of degree 11 (23), 
denoted by M,,(M,,). No 4-transitive permu- 
tation groups other than S,, A,, M; (i= 11, 12, 
23, and 24) are known. The groups M,,, Mirs 
M34, M33 and the stabilizer M3, of a letter 
in M,, are called Mathieu groups. They are 
simple groups which have quite exceptional 
properties. For Mathieu groups, see E. Witt, 
Abh. Math. Sem. Univ. Hamburg, 12 (1938). A 
k-transitive permutation group G on Q of 
degree n and order n(n—1)...(n—k+1) has 
the property that no nonidentity element of G 
leaves k distinct letters of Q invariant. If k>4, 
such a group is one of the following: S, A,+2, 
M2, and M,, (C. Jordan). For k=2 and 3, 
see H. Zassenhaus, Abh. Math. Sem. Univ. 
Hamburg, 11 (1936). 

A multiply transitive permutation group G 
contains a normal subgroup S such that S is a 
non-Abelian simple group and G is isomorphic 
to a subgroup of the group Aut S of the auto- 
morphisms of S, except when the degree n of G 
is a power of a prime number and G contains 
a regular normal subgroup of order n which is 
an telementary Abelian group (W. S. Burn- 
side). Furthermore, in these exceptional cases, 
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G is at most 2-transitive if n is odd, while G is 
at most 3-transitive if n is even but more than 
4. The symmetric group S, of degree 4 is the 
only 4-transitive group that contains a proper 
solvable normal subgroup. 

A transitive extension of a permutation 
group H on Q is defined as follows. Let oo be a 
new element not contained in Q. A transitive 
extension G of H is a transitive permutation 
group on the set {Q, oo} in which the stabilizer 
of œ is the given permutation group H on Q. 
Transitive extensions do not exist for some H. 
Suppose that a permutation group H admits a 
transitive extension G that is primitive. If H is 
simple, then G is also simple unless the degree 
of G is a power of a prime number. Construct- 
ing transitive extensions has been an effec- 
tive method for constructing sporadic simple 
groups. 

Permutation groups of prime degree have 
been studied extensively since the last century, 
partly because of their connection with al- 
gebraic equations of prime degree. Let p be a 
prime number. A transitive permutation group 
of degree p is either multiply transitive and 
nonsolvable or has a normal subgroup of 
order p with factor group isomorphic to a 
cyclic group of order dividing p— 1 (Burnside). 
Choose two cycles x and y of length p in S,. If 
y is not a power of x, then the subgroup <x, yò 
generated by x and y is a multiply transitive 
permutation group which is simple. The struc- 
ture of <x, yò is not known despite its simple 
definition. More attention has been paid to 
groups of degree p, where p is a prime number 
such that (p—1)/2 =q is another prime number. 
The problem is to decide if such nonsolvable 
groups contain the alternating group A,. The 
Mathieu groups M,, and M,; are the only 
known exceptions for p> 7. The search for 
additional exceptions has been aided by the 
development of high-speed computers. It is 
known that there ts no exceptional group 
of degree p=2q+ 1 for 23<p<4079 (P. J. 
Nikolai and E. T. Parker, Math. Tables Aids 
Comput., 12 (1958); see N. Ito, Bull. Amer. 
Math. Soc., 69 (1963), for further results in this 
direction). 

A Frobenius group is a nonregular transitive 
permutation group in which the identity is the 
only element leaving more than one letter 
invariant. A Frobenius group of degree n 
contains exactly n—1 elements which displace 
all the letters. These n— 1 elements together 
with the identity form a regular normal sub- 
group of order n. This is a theorem of Frobe- 
nius; all the existing proofs depend on the 
theory of characters. The regular normal 
subgroup of a Frobenius group is nilpotent 
(J. G. Thompson, Proc. Nat. Acad. Sci. US, 45 
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(1959); [15, ch. 10; 16, ch. 3]). A Zassenhaus 
group is a transitive extension of a Frobenius 
group. 


I. Finife Simple Groups 


All simple groups of finite order were com- 
pletely classified in February 1982 (— Section 
J; [23]). These are divided into the following 
four classes: (1) cyclic groups of prime order, 
(2) alternating groups of degree >5, (3) sim- 
ple groups of Lie type, and (4) other simple 
groups. 

The subclass (1) consists of cyclic groups of 
prime order p for any prime number p. Abelian 
simple groups belong to this subclass. The sub- 
class (4) consists of twenty-six sporadic simple 
groups including five Mathieu groups. All 
sporadic groups, other than the five Mathieu 
groups, are of recent discovery. 

Simple groups of Lie type are analogs of 
simple Lie groups, and include the classical 
groups as well as the exceptional groups and 
the groups of twisted type. 

Classical groups are divided into four types: 
tlinear, tunitary, tsymplectic, and torthogonal 
(— 60 Classical Groups). Let q =p” be a power 
of a prime number p. Consider a vector space 
V of dimension n> 2 over the field F, of q 
elements, except in the unitary case where V 
is a vector space of dimension n>2 over the 
field F, of q? elements. Let f be a nondegener- 
ate form on V which ts Hermitian in the uni- 
tary case (with respect to the automorphism 
of order 2 of F, over F,), tskew symmetric 
bilinear in the symplectic case, and tquadratic 
in the orthogonal case. In the orthogonal case, 
the dimension of V is assumed >3. Consider 
the group of all linear transformations of V 
(linear case) of determinant 1, or the group of 
all linear transformations of determinant 1 
which leave the form f invariant (in other 
cases). In the orthogonal case, take the com- 
mutator subgroup. With each of these groups, 
the factor group of it by its center is a simple 
group with a few exceptions. 

There are several notations to denote these 
groups. E. Artin’s notation for simple groups, 
which is reasonably descriptive and simple, 
follows the name of the simple group: n and q 
are as described in the preceding paragraph, g 
is the order of the simple group, and (a, b) 
denotes the greatest common divisor of two 
natural numbers a and b. 

Linear simple group, L,,(q): 


g=gh 7 TV (q'—1)/d,  d=(n,q—1). 
i=2 
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Unitary group, U,(q): 
g=gr? TT (q'—(-1d, d=(nq+1). 
i=2 


The structure of unitary groups does not de- 
pend on the form. 

Symplectic group, S,(q), n =2m: 
g=q™ [](q7%—-l/d, d=(2,q-1). 
In the symplectic case, the dimension n of the 
space V must be even, so n= 2m, and the struc- 
ture does not depend on the form. 


Orthogonal group in odd dimension n= 
2m+ 1, Orm+1(Q): 


g=q™ [| (q@”— td, d=(2,q-1). 
i=1 


The structure does not depend on the form in 
odd dimension. l 

Orthogonal groups in even dimension n = 
2m: There are two inequivalent forms, one 
with tindex m (which is maximal) and the other 
with index m— 1. The two orthogonal groups 
are denoted by O,,,(é,q), c= + 1, where e= 1 if 
the form is of maximal index and —1 other- 
wise. Then 


m-1 
g=q™ Yq" —s) | | (q™— 1/4, 
i=1 


d=(4,q4"— £). 


The value of ¢ is determined by the form f: 
e=1 if f is equivalent to X74, x2;-,;X2;, and 
e= =l if f ~x] + yxy x2 +x3 + Lie Xzi-1 X2 
where the polynomial t? + yt + 1 is irreducible 
over F,- 

There are other ways to denote these 
groups. Let X = X (x, *) be a group of nonsin- 
gular linear transformations of a vector space 
V. Two asterisks indicate two invariants, such 
as the dimension of V and the number of ele- 
ments in the ground field. The notation SX 
stands for the subgroup of X consisting of 
linear transformations with determinant 1, and 
the notation PX stands for the factor group of 
the linear group X by its center. Thus PX is 
a subgroup of the group of all projectivities 
of the projective space formed by the linear 
subspaces of V. The following list is self- 
explanatory, except the last term in each row, 
which is the notation of L. E. Dickson [1]: 


L,(q)= PSL(n, q)= LF (n, q) 
U,(q) = PSU (n, q)= HO(n, q’) 
S2n(4) = PSp(n, q) = A(2n, q). 


(LF: linear fractional group; HO: hyperortho- 
gonal group; A: Abelian linear group.) If f is a 
nondegenerate quadratic form, then the sub- 
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group of GL(n, q) consisting of all the elements 
leaving the form f invariant may be denoted 
by O(n, q, f). Let Q(n, g, f) denote the commu- 
tator subgroup of O(n, q, f). Set e=1 if f is of 
maximal index, and s= —1 otherwise. Then 


O,(€, q) = PQ(n, q, f). 


Dickson’s notation for orthogonal groups is 
complicated and seldom used. 

Finite simple groups corresponding to 
Lie groups of some exceptional type were 
studied by Dickson early in this century, but 
C. Chevalley (Tohoku Math. J., (2) 7 (1955)) 
proved the existence, simplicity, and other 
properties of groups of any (exceptional) type 
over any field by a unified method. Simple Lie 
algebras over the field C of complex numbers 
are completely classified, and according to the 
classification theory they are in one-to-one 
correspondence with the tDynkin diagrams. 
Let L be a simple Lie algebra (over C) corre- 
sponding to the tDynkin diagram of type 
X (— 248 Lie Algebras). Let L=Ly>+ ZL, 
be a Cartan decomposition of L, where a 
ranges over the troot system A of L. It is pos- 
sible to choose a basis B of L with the following 
properties (tChevalley’s canonical basis): 
B consists of e,¢ L,(ae A) and a basis of Lo; 
the structure constants of L with respect to B 
are all rational integers; the automorphism 
x,(€) in the fadjoint group defined by 


X4(S)=exp(Gade,) (CEC) 


maps each element of B into a linear combi- 
nation of elements of B with coefficients which 
are polynomials in č with integer coefficients. 
Thus the matrix A,(&) representing the trans- 
formation x,(€) with respect to B has coeffi- 
cients which are polynomials in & with integer 
coefficients. 

The elements of B span a Lie algebra L, 
over the ring Z of integers. Let F be a field and 
form L;=F @,L,. Then L, is a Lie algebra 
over F, and the set B may be identified with a 
basis of Lp over F. Let t be any element of F, 
A,(t) be the matrix obtained from A,(€) by 
replacing the complex variable č by the ele- 
ment ¢, and finally x,(t) be the linear trans- 
formation of Lp represented by the matrix 
A,(t) with respect to B. The group generated 
by the x,(t) for each root « and each element t 
of F is called the Chevalley group of type X 
over F. The commutator subgroups of the 
Chevalley groups are simple, with a few excep- 
tions which will be stated after the complete 
list of simple groups of Lie type. Suppose that 
X = Anm Dn, or Eg (— 248 Lie Algebras S). 
Then the Dynkin diagram of type X has a 
nontrivial symmetry; let it be > f. Suppose 
that the field F has an automorphism o of the 
same order as the order of the symmetry of the 
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Dynkin diagram. Let @ be the automorphism 
of the Chevalley group which sends x,(t) to 
X(t’). Let U (resp. V) be the subgroup of the 
Chevalley group generated by x,(t) with «>0, 
te F (x(t), B <0), and let U'(V') be the sub- 
group consisting of all the elements of U(V) 
which are left invariant by 0. The group gen- 
erated by U! and V’ is called the group of 
twisted type. If the order of ø is i, this group is 
said to be of twisted type 'X. In all but one 
case, the group of twisted type is simple (see 
R. Steinberg, Pacific J. Math., 9 (1959)). The 
value iis 2 except when X = D,. Since D, 
admits symmetries of orders 2 and 3, there are 
two twisted types. If X = B,, G,, or F,, then 
the diagram has a symmetry. If the character- 
istic p of the ground field F is 2, 3, or 2 accord- 
ing as X = B,, G, or F, and the field F has an 
automorphism o such that (t°) =t? for any 
te F, then a procedure similar to the one de- 
scribed before is applicable, and the group of 
twisted type X’ is obtained (R. H. Ree, Amer. 
J. Math., 83 (1961)). The group of twisted type 
is simple if the field F has more than three 
elements. 

The following list contains all the simple 
groups of Lie type. For each classical group, 
we list the type followed by identification: 


Ay = Lat (Q) (n>1) 
7An=Un+1(4) (n>1) 
B, = Oon+1(9) (n> 1) 
Cn = Son(Q) (n> 2) 
D, = O2,(1, 9) (n> 3) 
*Dy=Or(—1,q)  (n>3) 


For other groups the type of the group is 
followed by the customary name or notation, 
if any, and the order g: 


B, Suzuki group, Sz(q), g=22"*! 
g=4°(q—1)(q? +1) 
`D, g=4'?(q* +q* + I)(q°—1)(q?-1) 
G) g=4°(q°—1)(q?—1) 
G, Ree group, Re(q), g=32"*? 
g=q°(q° + 1)(q-1) 
F, g=4"*(q'? —1)(q?—1)(q°—1) 
x(q? -1) 
F, q=?" g=q'?(q°+1)(q*—1) 
x(q? + 1)(q—1) 
Es dg=q**(q'*—1)(q? —1)(q°—1) 
x (g® — 1)(45 —1)(q? — 1) 
d=(3,q—1) 
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Es dg=q°°(q** — 1)(q’ + 1)(q? —1) 
x (qf ~ 1)(4° + 1)(q? — 1) 
d=(3,q+1) 
E, dg=q°*(q’*—1)(q*—1)(g?—-) 
x(q? — 1)(q® — 1)(q° — 1)(q? ~ 1) 
d=(2,q—1) 
Ex, g=q'?*(g??—1)(q**~1)(q?°—-1) 
x(q'®—D(q'*—Yiaq?—-Y 
x (q° — 1)(q? —1). 


B3: M. Suzuki, Proc. Nat. Acad. Sci. US, 46 
(1960); G, and F3: Ree, Amer. J. Math., 83 
(1961): G,: Dickson, Trans. Amer. Math. Soc., 2 
(1901), Math. Ann., 60 (1905); other Chevalley 
groups: Chevalley, Tohoku Math. J., (2) 7 
(1955); twisted types: Steinberg, Pacific J. 
Math., 9 (1959), J. Tits, Séminaire Bourbaki 
(1958), Publ. Math, Inst. HES (1959), D. Hert- 
zig, Amer. J. Math., 83 (1961), Proc. Amer. 
Math. Soc., 12 (1961). 

Nonsimple cases: L(2), L,(3), U;(2), and 
Sz(2) are solvable groups of orders 6, 12, 72, 
and 20, respectively. The groups 0,(2), G,(2), 
G;(3), and F4(2) contain normal subgroups 
of indices 2, 2, 3, and 2, respectively. These 
normal subgroups are simple and identified as 
L4(9), U3(3), £,(8) in the first three cases. The 
normal subgroup of F,(2) is not in the list of 
simple groups of Lie type and is quite excep- 
tional (Tits’s simple group, Ann. Math., (2) 80 
(1964)). 

Isomorphisms between various simple 
groups: L(q)= U2(q)=S,(q) = 03(q); Osla) = 
Sala); O4(t, g) = Lala) x Lla); O4(-1,q) = 
L2(q’); O6(1, g) = La(q); Oo( -1, 4) = Uld); 
Oon+1(Q) = S2,(q) if q is a power of 2; L,(2)=S,; 
L,(3)= Ay; L2(4)=L2(5)= As; La(7)= L3(2); 
L,(9)= Ag; L4(2)= Ag; U4(2)= S4(3). If q is odd 
and 2n>6, then S,,(q) and O,,..,(q) have the 
same order but are not isomorphic. L,(4) and 
L4(2) have the same order but are not isomor- 
phic. There is no other isomorphism or coin- 
cidence of orders among the known simple 
groups (Artin, Comm. Pure Appl. Math., 8 
(1955)). 

The groups of the automorphisms of simple 
groups belonging to subclasses (1), (2), and 
(3) are known. For the simple groups of Lie 
type, see Steinberg, Canad. J. Math., 10 (1960). 

The following list of twenty-six groups con- 
sists of all the simple groups that belong to 
class (4): 

Five Mathieu groups whose orders are 


M,31:9=7,920=2*-3?-5- 11 
M, 9:9 =95,040 = 26-33-5-11 
M5919 =443,520=27-3?-5-7-11 
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M3:g = 10,200,960 = 27 -32-5-7-11-23 
My 41g =244,823,040 = 2!0-33-5-7- 11-23. 


The other twenty-one groups have been 
discovered since 1964. Each group is identified 
by the symbol (x);, indicating that it is the ith 
group discovered in the year 19x. The list 
continues with the name or names of dis- 
coverers, the order of the group, and a brief 
description. 

(64),: Z. Janko, g= 175,560 = 2°-3-5-7- 
11-19. A subgroup of the Chevalley group 
G,(11). See J. Algebra, 3 (1966). 

(67): M. Hall and Z. Janko, g = 604,800 = 
27-33-5?-7, a transitive extension of U;(3) 
of degree 100. 

(67),: D. G. Higman and Sims, g= 
44,352,000 = 29 : 37-53-7- 11, a transitive 
extension of M,, of degree 100. It is a normal 
subgroup of index 2 in the group of automor- 
phisms of a certain graph with 100 vertices. 

(67),: Suzuki, g = 448,345,497,600 = 2'3- 37. 
5*- 7-11-13, a transitive extension of G,(4); 
defined from the automorphism group of a 
graph of 1782 vertices. 

(67),: J. McLaughlin, g = 898,128,000 = 
27-3°-53-7-11, a transitive extension of 
U,(3); defined from a graph of 275 vertices. 

(68),: G. Higman, Z. Janko, and J. McKay, 
g = 50,232,960 = 27 -3°- 5-17-19, a transitive 
extension of the group which is obtained from 
L,(16) by adjoining the field automorphism of 
order 2. The existence was verified by using a 
computer. 

(68),, (68)3, (68)4: J. H. Conway, 


g=27)-39- 54-72-14 - 13-23 
= 4,157,776,806,543,360,000, 

g=2'8-36-53-7-11-23, 

g= 29-37-5367 14-23, 


The big group is obtained from the automor- 
phism group of a lattice in 24-dimensional 
space, and the two smaller ones are subgroups 
of it. The lattice was defined by J. Leech in 
connection with a problem of close packing of 
spheres in 24 dimensions (Canad. J. Math., 19 
(1967)). 

(68);: B. Fischer, g=2'?-3°-5?°7-11-13= 
70,321,751,654,400, a transitive extension of 
U,(2) derived by means of a certain graph. 

(69),: D. Held and others, g=2!°:3°-5?-7?: 
17 = 4,030,387,200. 

(69),: B. Fischer, g =2*®-3'3-5?-7-11-13- 
17-23 =4,089,470,473,293,004,800. 

(69),: B. Fischer, g=27'-3'©-5?-73-11-13- 
17-23-29 =1,255,205,709, 190,661,72 1,292,800. 

(71),: R. N. Lyons and C. C. Sims, g = 
28-37-56-7- 11-3137-67. 

The existence of (69), and (71), was verified by 
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using computers; (69), and (69), were derived 
by means of certain graphs. For a more de- 
tailed account of these simple groups, see J. 
Tits, Séminaire Bourbaki (1970), No. 375, and 
the references [18, 19, 20.] 

(72),: A. Rudvalis, g =2!*-33-53-7-13-19, 
a transitive extention of Tits’s simple group, 
i.e., a normal subgroup of F4,(2) of index 2. 
Concerning this group (72),, see the article 
of J. H. Conway and D. B. Wales, J. Algebra, 
27 (1973). 

(73),: M. O’Nan, g=2°:3*-5-73- 11-19-31. 
This group (73), was discovered by O’Nan and 
the existence was verified by C. Sims, using a 
computer. 

(73)2: B. Fischer, g =2*!-313-5°-77-11-13- 
19- 23-31-47. 

(73)3: B. Fischer and R. Griess, g =246 -37° - 
57-78 117-139-17-19-23-29-31-41-47- 
59-71. 

(74),: J. G. Thompson, g=2!5-3!10. 53.72- 
13:19-31. 

(74): K. Harada, g=2!4-36-56.7-11-19. 

The existence of (73), was suggested by B. 
Fischer, and then that of (73), by B. Fischer 
and R. Griess. Shortly after this, the exis- 
tence of (74), and (74), was suggested by J. G. 
Thompson, and then Thompson and Harada 
proved the existence of these groups with the 
aid of P. Smith and S. Norton, using com- 
puters. The existence of (73), was established 
in 1976 by Leon and Sims, using a computer. 
Very recently (July 1980), R. Griess has an- 
nounced that the group (73), is realized as 
a group of automorphisms of a 196,883- 
dimensional commutative nonassociative 
algebra over the rational numbers, 

(75),: Z. Janko, g=27!-33-5-7-113-23-29. 
31-37-43. This group (75), was discovered 
by Z. Janko. The existence was verified by 
using a computer. 

For a more detailed account of the twenty- 
six sporadic simple groups, see [21]. 

Simple groups of order <1000 are A, 

(g = 60), L(7) (168), A6(360), L(8) (504), 
and L,(11) (660). All simple groups of order 
<20,000 are known. 

Among the known simple groups, the fol- 
lowing multiply transitive permutation repre- 
sentations are known: Alternating groups A, 
(degree n), Ag and A, (degree 15), A, (degree 
10), A, (degree 6), Mathieu groups M; (degree 
i), My; (degree 12), L,(q) (degree (q”—1)/ 
(q—1)), L(p) (degree p for p=5,7, 11), U3(q) 
(degree 1 +q°), Sz(q) (degree 1 + 2), Re (3”) 
(degree 1 + 35"), S,,(2) (degrees 2”~1(2"+ 1)), 
and the Higman-Sims group (67), (degree 176) 
and, the Conway group (68), (degree 276). 
Among them, A, (degree n, n> 5), M; (degree i), 
M,, (degree 12), L,(2”) (degree 1 +2”), and 
L,(5) (degree 5) are triply transitive. 
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There are several remarkable properties of 
known finite simple groups which have been 
conjectured to hold for arbitrary finite simple 
groups. One of the most famous is the Schreier 
conjecture, which asserts that the group of 
outer automorphisms of a simple group is 
solvable. This has been verified for all known 
cases. Another conjecture says that a finite 
simple group is generated by two elements. 
This has also been verified for almost all 
known groups. In many cases, there is a gen- 
erating set of two elements, one of which has 
order 2. There is no counterexample known to 
disprove the universal validity of this property. 
Except for Sz(q), the orders of known simple 
groups are divisible by 12. 


J. Classification of Finite Simple Groups 


The objective of classification theory is to find 
the complete list of finite simple groups; this 
was accomplished in February 1982, following 
the series of important works mentioned 
below. 

The order of a finite non-Abelian simple 
group is divisible by at least three distinct 
prime numbers (W. S. Burnside; — Section D). 
The order of a finite non-Abelian simple group 
is even (W. Feit and J. G. Thompson; — Sec- 
tion D). These theorems are special cases of 
the following theorem: If G is a finite non- 
Abelian simple group in which the normalizer 
of any solvable subgroup # {1} is solvable, 
then G=L,(q)(q > 3), Sz(2?"*')(n> 1), Az, 
L,(3), U;(3), M,,, or Tits’s simple group. In 
particular, a minimal simple group is tsomor- 
phic to L(p)(p=2 or 3 (mod 5), p> 3), L(2”), 
L,(3”), $z(2?), or L3(3), where p is a prime 
number (a finite non-Abelian simple group is 
called a minimal simple group if all proper 
subgroups are solvable). This theorem is 
proved in a series of papers by J. G. Thomp- 
son (Bull. Amer. Math. Soc., 74 (1968), Pacific 
J. Math., 33 (1970), 39 (1971), 48 (1973), 50 
(1974), and 51 (1974)). The method which 
Thompson used in these papers has since been 
generalized in various ways by many authors 
to establish a number of important theorems. 
There are also some interesting consequences 
of this theorem concerning solvable groups. 
For example, a finite group is solvable if and 
only if every pair of elements generates a sol- 
vable subgroup. 

Let G be a non-Abelian simple group of 
even order and S one of its 2-Sylow subgroups. 
Then S is neither cyclic nor a generalized 
quaternion group (W. S. Burnside, R. Brauer, 
and M. Suzuki). If S is a dihedral group, then 
G=A, or L,(q) (q odd 25) (D. Gorenstein and 
J. H. Walter). These theorems deal with the 
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cases where S is “small.” The study in this 
direction has culminated in the classification of 
finite simple groups all of whose 2-subgroups 
are generated by at most four elements (D. 
Gorenstein and K. Harada, Mem. Amer. Math. 
Soc., 147 (1974)). 

If a 2-Sylow subgroup of a finite non- 
Abelian simple group G is an Abelian group, 
then G=L,(r) (r=0, 3 or 5 (mod 8), r> 3), or 
else G possesses an element of order 2 whose 
centralizer is isomorphic to Z, x L3(q) (q=3 or 
5 (mod 8), 4> 3) (J. H. Walter). In the latter 
case, G is called a group of Janko-Ree type 
(J-R type for short), and if g 45, it is called a 
group of Ree type. If g=5, then G =(64),, the 
Janko’s simple group of order 175,560. The 
Ree groups Re(q) are groups of Ree type. Since 
the discovery of Re(q), it has long been an 
open problem to show that there are no other 
groups of Ree type. Very recently, the com- 
bined work of E. Bombiere, J. G. Thompson 
and others settled the problem (Inventiones 
Math., 58 (1980)). 

A surprisingly short proof of the Walter’s 
theorem above is given by H. Bender (Math. 
Z., 117 (1970)). Bender’s method applies to a 
much larger class of groups. A subgroup A of 
a finite group G is said to be strongly closed if 
AN Nl A) c A for each geG. D. M. Gold- 
schmidt proved (Ann. Math., 99 (1974)) that if 
A is a strongly closed Abelian 2-subgroup, 
then the subgroup Go generated by the con- 
jugates of A possesses a normal series Gy > 
G, >G, with the properties: G, is of odd order, 
G,/G, is a 2-group and is contained in the 
center of Go/G,, and either Go =G, or Gp/G, 
is the direct product of simple groups on the 
following list: L, (q4) (q=0, 3 or 5 (mod 8), 

q > 3), Sz(27" 1), U3(2") (n> 1), and the groups 
of J-R type. Furthermore, AG,/G,>G,/G, 
and AG,/G, is the center of a 2-Sylow sub- 
group of G)/G,. This theorem generalizes 

an earlier result of G. Glauberman (the so- 
called Z*-theorem) which states that if A is a 
strongly closed subgroup of order 2, then the 
image of A in the quotient group G/K by the 
maximal normal subgroup of odd order is a 
normal subgroup and hence is contained in 
the center of G/K. These theorems of Glauber- 
man and Goldschmidt are of fundamental 
importance in the study of finite simple groups 
since they provide an effective tool for showing 
that a given group has a normal Abelian 2- 
subgroup. 

Glauberman obtained a criterion for the 
existence of a strongly closed Abelian p- 
subgroup # {1} for some prime p [15,22]. For 
any prime p, the quadratic group Qd( p) is 
defined to be the semidirect product of the 2- 
dimensional vector space V(2, p) over the field 
of p elements by the special linear group 


151 J 
Finite Groups 


SL(2, p) where the action of SL(2, p) on V(2, p) 
is taken to be the natural one. A finite group G 
contains a strongly closed Abelian p-subgroup 
A# {1} if no section of G is isomorphic to 

Qd( p) (a section is a quotient group of a sub- 
group). Furthermore, if p is odd, then we can 
choose as A a characteristic subgroup of a p- 
Sylow subgroup S of G which is determined 
only by the structure of S. Therefore a finite 
non-Abelian simple group has a section iso- 
morphic to Qd(2)=S, except when it is one of 
the simple groups mentioned in Goldschmidt’s 
theorem. This theorem generalizes an un- 
published result of J. G. Thompson to the 
effect that 3 divides the order of finite non- 
Abelian simple groups except $z(2?"*?). 

Let G be a 2-transitive permutation group 
on n+ 1 letters, and assume that the stabilizer 
H ofa letter contains a normal subgroup K 
which is regular on the remaining n letters. 
Then G contains a normal subgroup N such 
that G is isomorphic to a subgroup of the 
automorphism group of N and either N = 
L,(q), U3(q), Sz(q) or a group of Ree type, 
or else N is 2-transitive on the n+ 1 letters and 
no nonidentity element of N leaves two dis- 
tinct letters invariant (the structure of N in the 
latter case is also known (H. Zassenhaus; — 
Section H). This theorem is proved by E. 
Shult for n even (Illinois J. Math., 16 (1972)) 
and by C. Hering, W. M. Kantor, and G. M. 
Seitz for n odd (J. Algebra, 20 (1972)). Its proof 
depends on the work of many authors who 
considered various special cases, especially the 
work of H. Zassenhaus, W. Feit, N. Ito, and 
M. Suzuki on the classification of Zassenhaus 
groups, and the work of M. Suzuki on the case 
where n is even and H/K is of odd order (Ann. 
Math., 79 (1964)). In this special case which 
Suzuki handles, the stabilizer H is of even 
order and H N H” is of odd order for any ge 
G—H. If a proper subgroup H of an arbitrary 
finite group G has this property, then H is 
called a strongly embedded subgroup. Extend- 
ing the work of Suzuki, H. Bender proved the 
following theorem (J. Algebra, 17 (1971)): If a 
finite group G possesses a strongly embedded 
subgroup, then either (1) a 2-Sylow subgroup 
of G is cyclic or a generalized quaternion 
group, or (ii) G possesses a normal series G = 
Go > G, >G, such that G,/G, and G, are of 
odd order and G,/G, = L,(2”), U3(2"), or 
Sz(2*""1) (n> 1). This theorem generalizes 
another theorem of Suzuki who reached the 
same conclusion under the assumption that 
two distinct 2-Sylow subgroups have only the 
identity element in common. Bender’s theorem 
is of fundamental importance in the classifica- 
tion theory of finite simple groups, since a 
strongly embedded subgroup often appears as 
an obstacle to the proofs of classification 
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theorems. For a generalization of Bender’s 
theorem, see a paper by M. Aschbacher (Proc. 
Amer. Math. Soc., 38 (1973)), which also con- 
tains an alternative proof of Shult’s theorem. 

The theorem of Shult, Hering, Kantor, and 
Seitz may be interpreted as a classification of 
finite groups having a split (B, N )-pair of rank 
!. Let G be a finite group and let B and N be 
subgroups of G such that (1) B and N generate 
G, (ii) T= BNN is a normal subgroup of N, 
and (iii) W= N/T is generated by a set S of ele- 
ments of order 2 such that sBs # B and sBwc 
BwBU BswB for each seS and each we W. 
The subgroups B and N are called a (B, N )- 
pair of G (the quadruplet (G, B, N, S) is called 
a Tits system), and the cardinality of the set 
S is called the rank of the (B, N )-pair. The 
(B, N)-pair is said to be split if B has a normal 
subgroup U such that B= TU and TAU ={1}, 
and is said to be saturated if T= (\„en B". Ifa 
finite group G has a split saturated (B, N )-pair 
of rank 1, and if Z = (\,.¢ BY, then G/Z is a 2- 
transitive permutation group satisfying the 
assumption of the theorem of Shult, Hering, 
Kantor, and Seitz, and information is obtained 
on the structure of G. In general, the simple 
groups of Lie type are characterized as simple 
groups with certain (B, N)-pairs. For (B, N )- 
pairs of rank 2, see papers by P. Fong and G. 
M. Seitz (Inventiones Math., 21 (1973), 24 
(1974)). J. Tits has developed a satisfactory 
theory on finite groups having a (B, N)-pair of 
rank at least 3 (Lecture notes in math. 386, 
Springer). 

Let G be a finite group generated by a 
conjugate class D of elements of order 2, and 
let z be the set of positive integers consisting of 
the orders of the products of two distinct 
elements of D. Furthermore, assume that G 
has no nontrivial solvable normal subgroup. 
B. Fischer proved (Inventiones Math., 13 
(1971)) that if z= {2,3}, then G contains a 
normal subgroup isomorphic to one of the 
following groups: A,, S>,(2), O2,(+1, 2), 
O,(+1, 3), U,(2), and the three Fischer’s simple 
groups (68),, (69),, (69),. For a generalization 
of this theorem, see papers by M. Aschbacher 
(Math. Z., 127 (1972), J. Algebra, 26 (1973)). 
The most powerful result in this direction is 
given by F. Timmesfeld (J. Algebra, 33 (1975), 
35 (1975)): Suppose a consists of 2, 4, and odd 
positive integers. Furthermore, assume that if 
d and e are in D and de is of order 4, then 
(de)? €D. Then G = Ag, U;(3), the Hall-Janko 
group (67),, Lala) (n > 3), Oon+1(q)(n 2 3), 
O2,(+1,q)(n>4), Go(q), *Dalq), Fala), °E6(Q), 
E6(q), E>(q), or Eg(q), where q=2”. 

Let G be a simple group of even order and 
let H be the centralizer of an element of order 
2. Then the order of G is bounded by a func- 
tion f of the order h of H (for example, we can 
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choose f(h)= {h(h+ 1)\!; R. Brauer and K. A. 
Fowler, Ann. Math., 62 (1955)). In particular, 
there exist only finitely many isomorphism 
classes of finite simple groups which contain 
an element of order 2 with a given centralizer 
H. This fact is a ground for Brauer’s program 
of studying simple groups of even order in 
terms of the structure of the centralizers of 
elements of order 2. There are a number of 
important results concerning Brauer’s pro- 
gram [20]. For example, nine sporadic simple 
groups were discovered in related works. Since 
1973, Brauer’s program has been improved 
greatly by M. Aschbacher, D. Gorenstein, and 
others, and the classification of finite simple 
groups was finally completed in February 1982 
[21,23]. 
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A. Definitions 


Let T(M) be the ttangent vector bundle of an 
n-dimensional tdifferentiable manifold M. An 
element of T(M) is denoted by (x, y), where x is 
a point of M and y is a tangent vector of M at 
x. Given a tlocal coordinate system (x!,..., x") 
of M, we can obtain a local coordinate system 
of T(M) by regarding (x!,...,x",y',...,y")= 
(xi, y/) as coordinates of the pair (x, y)e T(M), 
where (xt, ..., x”) are coordinates of a point x 
of M and y= y/0/éx/. A continuous real- 
valued function L(x, y) defined on T(M) is 
called a Finsler metric if the following condi- 
tions are satisfied: (i) L(x, y) is differentiable at 
y £0; (ii) L(x, Ay) =|A|L(x, y) for any element 
(x, y) of T(M) and any real number å; and 

Gii) if we put g(x, y) =(1/2)07 L(x, y)2/dy'dy4, 
the symmetric matrix (g,;(x, y)) is positive de- 
finite. A differentiable manifold with a Finsler 
metric is called a Finsler space. There exists 

a Finsler metric on a manifold M if and only 
if M is tparacompact. We call F(x, y)= L(x, y)? 
the fundamental form of the Finsler space. 
When F(x, y) is a quadratic form of (y’,..., y”), 
L(x, y) is a Riemannian metric, and F(x, y)= 
Xi; 9j(x)y'y!. Therefore a Finsler space is a 
Riemannian space if and only if g;; does not 
depend on y. The matrix g; is also called the 
fundamental tensor of the Finsler space (i, j = 
1,... 7). 

Thus the notion of a Finsler metric is an 
extension of that of Riemannian metric. The 
study of differentiable manifolds utilizing such 
generalized metrics was considered by B. 
Riemann, but he stated that a Riemannian 
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metric is more convenient for the purpose 
since “only nongeometrical results can be 
obtained” by using Finsler metrics [7]. P. 
Finsler initiated the systematic study of Finsler 
metrics and extended to a Finsler space many 
concepts and theorems valid in the classical 
theory of curves and surfaces [5]. 


B. The Finsler Metric 


In a Finsler space, the arc length of a curve 
x=x(t) (a<t<b) is given by f? L(x, dx/dt)dt. 
Therefore a tgeodesic in a Finsler space is 
defined as a tstationary curve for the problem 
of tvariation 6 f? L(x, dx/dt)dt =0, and the 
differential equation of the geodesic is given by 


ax dxi dx* 
arte 4 the See 


where y;,(x, y) is the Christoffel symbol of g,;, 
1.€., 


i 1 ; O9ja OGax OD jx 
a(x, y)=- ja + - ; 
Jad% y) 729 & oxi ax? 


where (g(x, y)) is the inverse matrix of 
(g(x, y)). 

The distance between two points in a Fin- 
sler space is defined, as in a Riemannian space, 
as the infimum of the lengths of curves joining 
the two points. Many properties of Riemann- 
ian spaces as metric spaces can be extended 
to Finsler spaces. The topology defined by the 
Finsler metric coincides with the original 
topology of the manifold. A Finsler space M is 
said to be tcomplete if every Cauchy sequence 
of M as a metric space is convergent. The 
following three conditions are equivalent: (i) M 
is complete; (11) each bounded closed subset 
of M is compact; (iit) each geodesic in M is 
infinitely extendable. In a complete Finsler 
space, any two points can be joined by the 
shortest geodesic. 

A diffeomorphism ¢ of a Finsler space M 
preserves the distance between an arbitrary 
pair of points if and only if the transforma- 
tion on T(M) induced by o preserves the 
Finsler metric L(x, y). Such a transformation is 
called an tisometry of the Finsler space. In the 
tcompact-open topology the set of all isome- 
tries of a Finsler space is a tLie transformation 
group of dimension at most n(n + 1)/2. Ifa 
Finsler space admits the isometry group of 
dimension greater than (n(n — 1)+ 2)/2, itis a 
Riemannian space of constant curvature [8]. 


C. The Theory of Connections 


An important difference between a Finsler 
space and a Riemannian space relates to their 
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properties with respect to the theory of fcon- 
nections. In the case of a Riemannian space, 
the Christoffel symbols constructed from the 
fundamental tensor are exactly the coefficients 
of a connection, whereas in the case of a Fin- 
sler space, the Christoffel symbols yi (x, y) do 
not define a connection, for the fundamental 
tensor g;; depends not only on the points of the 
space but also on the directions of tangent 
vectors at these points. 

When we consider notions such as tensors, 
etc., in a Finsler space M, it is generally more 
convenient to take the whole tangent vector 
bundle T(M) into consideration rather than 
restricting ourselves to the space M. For ex- 
ample, let P be the ttangent n-frame bundle 
over a Finsler space M and Q = p~! (P) be the 
tprincipal fiber bundle over T(M) induced 
from P by the projection p of T(M) onto M. 
We call the elements of fiber bundles asso- 
ciated with Q tensors. In this sense, the fun- 
damental tensor g; in a Finsler space is the 
covariant tensor field of order 2. Therefore it is 
natural to consider a connection in a Finsler 
space as a connection in the principal fiber 
bundle Q. The connection in a Finsler space 
defined by E. Cartan ts exactly of this type [3]. 
Namely, he showed that by assigning to a 
connection in Q certain conditions related to 
the Finsler metric, we can determine uniquely 
a connection from the fundamental tensor so 
that the covariant differential of the funda- 
mental tensor vanishes. 

Cartan’s introduction of the notion of con- 
nection produced a development in the theory 
of Finsler spaces that parallels the develop- 
ment in the theory of Riemannian spaces, and 
many important results have been obtained. 
O. Varga (1941) succeeded in obtaining a 
Cartan connection in a simpler way by using 
the notion of osculating Riemannian space. 

S. S. Chern (1943) studied general Euclidean 
connections that contain Cartan connections 
as a special case. Noticing that the tangent 
space of a Finsler space ts a tnormed linear 
space, H. Rund (1950) obtained many notions 
different from those of Cartan. However, as far 
as the theory of connections is concerned, the 
two theories do not seem to be essentially 
different. The theory of curvature in a Finsler 
space is more complicated than that in a 
Riemannian space because we have three 
curvature tensors in the Cartan connection. 
Using the fact that, in a local cross section of 
the tangent vector bundle of a Finsler space, a 
Riemannian metric can be introduced by the 
Finsler metric, L. Auslander (1955) [2] ex- 
tended to Finsler spaces the results of J. L. 
Synge and S. B. Myers on the curvature and 
topology of Riemannian spaces. A. Lichnéro- 
wicz extended the tGauss-Bonnet formula to 
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Finsler spaces by considering an integral on 
the subbundle of the tangent vector bundle 
satisfying L(x, y)=1 [6]. M. H. Akbar-Zadeh 
studied fholonomy groups and transformation 
groups of Finsler spaces by using the theory of 
fiber bundles. 

Connections of Finsler spaces have been 
investigated by many geometers, but most of 
them used methods considerably different from 
those of the modern theory of connections in 
principal fiber bundles. J. H. Taylor and Synge 
(1925) defined the covariant differential of a 
vector field along a curve. L. Berwald (1926) 
defined a connection from the point of view of 
the general geometry of paths. A curve on a 
manifold satisfying the differential equation 


d?x! Gi dx 0 

a? (x dt ) F 

is called a path. The theory was originated by 
O. Veblen and T. Y. Thomas and generalized 
as above by J. Douglas. Characteristically, 
with respect to a Berwald connection, the 
covariant differential of the fundamental ten- 
sor does not vanish. 

A Finsler space is a space endowed with a 
metric for line elements. As a dual concept, we 
have a Cartan space, which is endowed with a 
metric for areal elements [4]. A. Kawaguchi 
(1937) extended these notions further and 
studied a space of line elements of higher order 
(or Kawaguchi space). 
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153 (IX.7) 
Fixed-Point Theorems 


A. General Remarks 


Given a mapping f of a space X into itself, a 
point x of X is called a fixed point of f if f(x) 
=x. When X is a topological space and f 
is a continuous mapping, we have various 
theorems concerning the fixed points of f. 


B. Fixed-Point Theorems for Polyhedra 


(1) Brouwer Fixed-Point Theorem. Let X be a 
tsimplex and f:X —X a continuous mapping. 
Then f has a fixed point in X (Math. Ann., 69 

(1910), 71 (1912)). 


(2) Lefschetz Fixed-Point Theorem. Let H,(X) 
be the p-dimensional thomology group of a 
‘finite polyhedron X (with integral coeffi- 
cients), T,(X) the ttorsion subgroup of H,(X), 
and B,(X)=H,(X)/T,(X). The continuous 
mapping f: XX naturally induces a homo- 
morphism f, of the free Z-module B,(X) into 
itself. Let x, be the ftrace of f, and A; = 

p-0(—1)?a, (n=dim X). We call this integer 
A, the Lefschetz number f. 

We have the Lefschetz fixed-point theorem: 
(i) Let f, g be continuous mapping sending X 
into itself. If f, g are thomotopic (fœ g), then 
A,=A,. (it) If A, #0, then f has at least one 
fixed point in X (Trans. Amer. Math. Soc., 28 
(1926)). 

The condition A, #0 ts, however, not neces- 
sary for the existence of a fixed point of f. The 
Brouwer fixed-point theorem ts obtained im- 
mediately from (i) and (ii). in particular, if the 
mapping f is homotopic to the identity map- 
ping 1x, then «, is the pth tBetti number of X, 
and A, is equal to the tEuler characteristic 
y(X) of X. Hence, in this case, if y(X) 40, then 
f has a fixed point. 

When X is a compact oriented manifold 
without boundary the Lefschetz number A, of 
f can be interpreted as the fintersection num- 
ber of the graph of f and the diagonal of X. 

More generally, let X and Y be compact 
oriented n-dimensional manifolds without 
boundary. If f and g: X > Y are continuous 
mappings, a point x of X such that f(x)= 
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g(x) is called a coincidence point of f and g. 
The intersection number A,_, of the graph 

of f and that of g is called the coincidence 
number of f and g. If A; „#0, then f and g 
have at least one coincidence point. The coin- 
cidence number Aș „is also expressed as 
Zp=0(—1)"tr(f* o g| H"(X)), where g: 
H?(X)—>H?(Y) is the tGysin homomorphism 
of g. 

Suppose that a finite group G acts on the 
manifolds X and Y. If f: X > Y is a mapping, a 
point x of X such that tf(x)= ft(x) for all te G 
is called an equivariant point of f. When G is a 
group of order 2 and acts on X nontrivially, 
the equivariant point index A sis employed. 
This index was introduced by Nakaoka 
(Japan. J. Math. 4 (1978)), using the tequivar- 
iant cohomology. It has the property that 
A 7 #0, implying that f admits an equivariant 
point. The prototype of this theorem is the 
Borsuk-Ulam theorem (Fund. Math. 20 (1933)), 
which states that a continuous mapping f: 
S"— R" always admits a point xe S” such that 


Sx) = fx). 


(3) Lefschetz Number and Fixed-Point Indices. 
Suppose that |K| is an n-dimensional homo- 
geneous polyhedron (i.e., any simplex of K 
that is not a face of another simplex of K is of 
dimension n), and f:|K|-—|K| is a continuous 
mapping. Then there exists a continuous 
mapping g:|K|—|K| homotopic to f and 
admitting only isolated fixed points {q,,..., 
q,}, each of which is an inner point of an n- 
dimensional simplex of K. The tlocal degree 
2, of a mapping g at q; is called the fixed- 
point index of g at q;. Then J; = L/_, 4; does 
not depend on the choice of g and is equal to 
(—1)"A,. 


(4) Singularities of a Continuous Vector Field. 
Let X be an n-dimensional tdifferentiable 
manifold and F a fcontinuous vector field on 
X that assigns a tangent vector x, to each 
point p of X. A point p is called a singular 
point of F if x, is the zero vector. The vector 
field F induces in a natural manner a continu- 
ous mapping f:X—X that is homotopic to 
the identity mapping 1y. Then a fixed point of 
fis a singular point of F, and vice versa. When 
such a singular point p is isolated, there exists 
a tcoordinate neighborhood N of p that is 
homeomorphic to an n-dimensional open ball 
such that x, is nonzero for every point q in N 
except for g=p. Let N’ be the boundary of N. 
Then we may consider N’=S"~! and a map- 
ping F |N’ from S”! to R"~{0} ~S""?. The 
tdegree of a mapping S"? 4 R"~ {0} = 
is called the index of the singular point p. This 
index is equal to the fixed-point index å, of f 
at p. Hence, when X is compact and has no 
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boundary, the sum of indices of (isolated) 
singular points of F is equal to (—1)"y(X). In 
particular, a compact manifold X without 
boundary admits a continuous vector field 
with no singular point if and only if 7(X)=0 
(Hopf’s theorem, Math. Ann., 96 (1927)). 


(5) Poincaré-Birkhoff Fixed-Point Theorem. In 
certain cases, a continuous mapping f: X >X 
of a finite polyhedron X into itself has fixed 
points even if A, =0. For example, let X be the 
annular space {(r, #)|~<r<f} ((r, 0) are the 
polar coordinates of points in a Euclidean 
plane) and let f: X +X be a homeomorphism 
satisfying the following conditions: (i) there 
exist continuous functions g(9), h(0) such that 
g(P) <8, h(0)> 6, f(a, 0) =(a, g(0)), f(B, 0) = 

(£, h(@)); (ii) there exists a continuous positive 
function p(r, 0) defined for a<r<f such that 


[f pir hdrdo= | | (f(r, 0))drdd 
D D 


for all measurable sets D. 

Then f has at least two fixed points. This 
theorem was conjectured in 1912 by Poincaré, 
who hoped to apply it to solve the frestricted 
three-body problem. The theorem was later 
proved by G. D. Birkhoff (Trans. Amer. Math. 
Soc., 14 (1913); see also M. Brown and W. D. 
Neumann, Michigan Math. J. 24 (1977)) and 
is called the Poincaré-Birkhoff fixed-point 
theorem or the last theorem of Poincaré. 


C. Atiyah-Bott and Atiyah-Singer Fixed-Point 
Theorems 


There is a far-reaching generalization of the 
Lefschetz formula given by Atiyah and Bott 
(Ann. Math., (2) 86 (1967), 88 (1968)). Let M be 
a compact differentiable manifold without 
boundary and f: M—M a differentiable map- 
ping with only simple fixed points; that is, it is 
assumed that det(1 —df,) #0 for each fixed 
point peM of f, where df, is the differential of 
f at the point p. The fixed points of f are finite 
in number. Suppose that an ‘elliptic complex 
over M 


6:01 (Ey) ST (E,) >... <SP(E) 0 


and a sequence of smooth vector bundle 
homomorphisms 9;: f* E;> E; (i=0,..., 1) 

are given such that d; T,= T;,,d; for each i, 
where T;:T(E;)—>T(E;) is defined by T,s(x)= 
0;S(f(x)) for ser (E). The sequence T =(T,) 
induces endomorphisms H'T of the homology 
groups H'(6é) of the elliptic complex £. We 
define the Lefschetz number L(T) by L(T)= 
Di-o(—1)'tr H'T. On the other hand, for a 
fixed point p of f, let y;,,:E;,, 7 Ei, p denote 
the restriction of o; on the fiber E; , of E; over 
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p. Under the circumstances mentioned above 
the Lefschetz number L(T) is given by the 
formula 


M- 


(Itr Pip 


ll 


Alo 


L(T)=y = 
a z |det(1 —df,)| ` 


where the summation is over the fixed points 
of f. 

Here are some examples of the above for- 
mula. First, take as & the tde Rham complex 
and as ọ; the obvious one, i.e., the ith exterior 
power of the transpose of df. In this case, the 
formula reduces to the classical Lefschetz 
formula. As a second example consider the 
*Dolbeault complex 


0A%9(M)SA%!(M)-S ... SA%™(M) 0 


of a compact complex manifold M and a 
holomorphic mapping f: M—M with only 
simple fixed points. In this case the formula 
above reduces to one giving the Lefschetz 
number of the induced endomorphisms 
f'*:H®*(M)—H°-*(M) of the tDolbeault 
cohomology: 


1 
*) — 
Ls”) » aetc(l —df,)’ 
where df, is regarded as a holomorphic 
differential. 

If the assumption that the mapping f has 
only simple fixed points is replaced by the one 
that f is a diffeomorphism of M contained in 
a compact ttransformation group G, then 
there is also a generalized Lefschetz formula, 
given by Atiyah and Singer (Ann. Math., (2) 87 
(1968)). The fixed-point set of such a diffeo- 
morphism is a closed submanifold of M (con- 
sisting of several components). Suppose that 
we are given an elliptic complex & over M and 
a lift of the G-action on M to &. The latter 
implies that, if we define 7;:(E,)>T(E;) by 
T,s(x)= f~'s(f(x)) for seT(E)), then d; T,= 
T;,,4; holds for each i. Under these circum- 
stances, the Lefschetz number L(T) can be 
expressed in the form 
L(T)= 2 v(F;), 
where the summation is over the components 
{ F;\ of the fixed-point set M” of f and where 
the number v(F;) is written explicitly in terms 
of the tsymbol of the elliptic complex & with 
G-action, the characteristic classes of the mani- 
fold F;, the characteristic classes of the normal 
bundle of F, in M, and the action of g=f~' on 
the normal vectors. The formula is essentially 
a reformulation of the tAtiyah-Singer index 
theorem. In fact, L(T) is the tanalytic index of 
& evaluated on g and the number v(F;) is de- 
duced from the ttopological index of & using 
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the localization theorem. The most useful 
elliptic complexes are de Rham complexes, 
Dolbeault complexes, signature operators and 
Dirac operators. Jn the case of Dolbeault 
complexes, f is assumed to be an analytic 
automorphism, and the number v(F;) takes the 
form 


_ [LZE 


Eien 


[F;). 

Here, the normal bundle N of F; has a decom- 
position N =, N (0) into the sum of complex 
vector bundles such that g acts on N (8) as e”, 
and %? is the characteristic class defined by 


1 —e i? 


oi e xi —i8? 
where the *Chern class of N(@) is written as 
c(N(@))=IT (1 +x,); moreover 7 (F;) denotes 
the tTodd class of the complex manifold F,, 
and N* denotes the dual bundle of N (- 237 
K-Theory H). 


D. Fixed-Point Theorems for Infinite- 
Dimensional Spaces 


Birkhoff and O. D. Kellogg generalized 
Brouwer’s fixed-point theorem to the case of 
function spaces (Trans. Amer. Math. Soc., 23 
(1922)), Their result was utilized to show the 
existence of solutions of certain differential 
equations, and has led to a new method in the 
theory of functional equations. 

J. P. Schauder obtained the following theo- 
rem: Let A be a closed convex subset of a 
Banach space, and assume that there exists a 
continuous mapping T sending A to a tcount- 
ably compact subset T(A) of A. Then T has 
fixed points (Studia Math., 2 (1930)). This 
theorem is called the Schauder fixed-point 
theorem. 

A. Tikhonov generalized Brouwer’s result 
and obtained the following Tikhonov fixed- 
point theorem (Math. Ann., 111 (1935)): Let R 
be a locally convex ttopological linear space, 
A a compact convex subset of R, and T a con- 
tinuous mapping sending A into itself. Then 
T has fixed points. 

This theorem may be applied to the case 
where R is the space of continuous mappings 
sending an m-dimensional Euclidean space £" 
into a k-dimensional Euclidean space E* to 
show the existence of solutions of certain 
differential equations. For example, when m= 
k= 1, consider the differential equation 


dy/dx = f(x, y), 


We set T(y)=yo+ fi, f(t, y(t))dt to determine a 
continuous mapping T: R->R. Then the fixed 
points of T are the solutions of the differential 


¥(Xo) = Yo. 
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equation. Now we can apply the theorem of 
Tikhonov to show the existence of solutions. 

On the other hand, when we are given prob- 
Jems of functional analysis, Schauder’s fixed- 
point theorem is usually more convenient to 
apply than Tikhonov’s theorem. 

The following theorem, written in terms of 
functional analysis, is useful for applications: 
Let D be a subset of an n-dimensional Eucli- 
dean space, F the family of continuous func- 
tions defined on D, and T: FF a mapping. 
Suppose that the following three conditions 
are satisfied: (i) For f,, f,eF, 0<A<1 implies 
Afi +1 —A) fre F. (ii) If a series { fp} of func- 
tions in F converges uniformly in the wider 
sense to a function f, then fe F; and further- 
more, the series { Tf,} converges uniformly in 
the wider sense to Tf. (iii) The family T(F) is a 
tnormal family of functions on D. Then there 
exists a function fe F such that Tf= f. 

Let R be a topological linear space and T a 
mapping assigning a closed convex subset T(x) 
of R to each point x of R. A point x of R is 
called a fixed point of T if xe T(x). The map- 
ping T is called semicontinuous if the condition 
X >a, Vy, (y € T(x,)) implies that be T(a). In 
particular, if K is a bounded closed convex 
subset of a finite-dimensional Euclidean space 
R and T a semicontinuous mapping sending 
points of K into convex subsets of K, then 
T admits fixed points (Kakutani fixed-point 
theorem, Duke Math. J., 8 (1941)). This result 
was further generalized to the case of locally 
convex topological linear spaces by Ky Fan 
(Proc. Nat. Acad. Sci. US, 38 (1952)). 
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A. Introduction 


A foliation is a kind of geometric structure on 
manifolds, such as a differentiable or complex 
structure. The study of foliations evolved from 
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investigation of the behavior of torbits of a 
vector field and also of the solutions of ttotal 
differential equations. Through the early 
works of C. Ehresmann, G. Reeb, and A. Hae- 
fliger, together with the development of mani- 
fold theory in 1960s, it became an established 
field of mathematics. Since then, great progress 
has been made in this field, especially in its 
topological aspects. At the same time it turns 
out that many problems in foliation theory are 
deeply related not only to the geometry of 
manifolds but also to various other branches 
of mathematics, such as the theory of differen- 
tial equations, functional analysis, and group 
theory. 


B. Definitions and General Remarks 


A foliation on a manifold can be defined 
within various categories: topological, C’- 
differentiable (1 <r < œ), real analytic, and 
holomorphic. For definiteness, however, we 
restrict ourselves to the C’-differentiable cate- 
gory in what follows. Furthermore, all mani- 
folds are assumed to be paracompact. 

Let M be an n-dimensional C”-manifold, 
possibly with boundary. A codimension q, C’- 
foliation of M (O<q<n,0<r<o)isa family 
F ={L,|xeA} of arcwise connected subsets, 
called leaves, of M with the following prop- 
erties: (i) LL, = Ø if x # x'; (ii) (Jaca La= 
M; (iti) Every point in M has a local coordi- 
nate system (U, y) of class C” such that for 
each leaf L, the arcwise connected components 
of UN L, are described by x” 4*! = constant, 
...,x" = constant, where x!, x?,...,x” denote 
the local coordinates in the system (U, y). 

In particular, every leaf of F is an (n—4q)- 
dimensional fsubmanifold of M. The totality 
of integral submanifolds of a tcompletely 
integrable nonsingular system of * Pfaffian 
equations on R”, œ; = 4;,(x)dx, +a; (x)dx, + 
+ 4;,(x)dx, =0 (i= 1, 2,...,q) forms a co- 
dimension q foliation, and the totality of inte- 
gral curves of a nonsingular vector field of 
class C” on M (r>1) constitutes a codimension 
n—1 C’-foliation. 

Let F be a codimension q, C’-foliation of M 
(r>1). Then M admits a C”! tp-plane field 
consisting of all vectors tangent to the leaves 
of F, and, dually, a C"! q-plane field (p +q = 
n). Denote the former by t(¥) and the latter 
by v( F), and call them the tangent bundle and 
the normal bundle of F, respectively. v(F) is 
isomorphic to the quotient bundle T(M)/t(F). 
F is called transversely orientable if v(F) is an 
orientable vector bundle. A C’-mapping f: 

N- M is said to be transverse to the foliation 
F if the composite mapping T(N)T(M)—> 
T(M)/t(F) is epimorphic at each point of N. 
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In this case, f induces a codimension q, C’- 
foliation f~'(¥) of N whose leaves are the 
arcwise connected components of f~!(L) 
(LEF). 

In particular, if Q is a q-dimensional C’- 
manifold and f: N-@Q is a C’-submersion, f 
induces a codimension q, C’-foliation of N 
whose leaves are the arcwise connected com- 
ponents of f7! (x) (xeQ). 

A C p-plane field X on M is called involu- 
tive if, for any C” vector fields X, Y on M such 
that X., Y.E Z, (xe M), the tLie bracket LX, Y] 
satisfies [X, Y] e.. This condition is known 
as the Frobenius integrability condition for 
X. If X is defined locally by q Pfaffian equa- 
tions œ; =...=w,=0, the above condition is 
equivalent to the condition that there are 
C! 1-forms 0; (i,j=1,...,q) such that dw;= 
X4_, 0,,\@;. A C p-plane field % is said to be 
completely integrable if it is a tangent bundle 
of some foliation. When r>2, Z is completely 
integrable if and only if it is involutive (Frobe- 
nius theorem) (— 428 Total Differential Equa- 
tions). There is a topological obstruction to 
the complete integrability of Z (— Section F). 

A closed C”-manifold M admits a codi- 
mension 1, C’—plane field if and only if the 
Euler number of M vanishes. In 1944, Reeb 
constructed a codimension t, C®-foliation of 
the 3-sphere S° as follows [1]. Let f(x) be a 
C*-even function defined on the open interval 
(—1, 1), such that 


o d® l 
ea aaro. (k=0,1,2,...). 
The graphs of the equations y= f(x)+c (—1 < 
x< 1,ceR) together with the lines x= +1 
constitute a codimension 1, C”-foliation of 
[ —1,1] xR. Then by rotating it around the y- 
axis in R?, we obtain a codimension 1, C®- 
foliation of D? x R, where D? denotes the 
closed 2-disk. The foliation is invariant under 
vertical translations and therefore defines a 
codimension 1, C®-foliation of D? x S'. This 
foliation is called the Reeb component of D? x 
S'. Since S? is a union of two solid tori inter- 
secting in the common toroidal surface, the 
Reeb components of each solid torus con- 
structed above define the so-called Reeb foli- 
ation of S° (Fig. 1). 


Fig. 1 
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Generalizing the above construction into a 
differential topological method, one obtains 
the following results: Every closed 3-dimen- 
sional manifold admits a codimension 1, C1- 
foliation (S. P. Novikov [3], W. Lickorish, J. 
Wood); every odd-dimensional sphere admits 
a codimension 1, C®-foliation (I. Tamura, A. 
Durfee, B. Lawson). On the other hand, every 
open manifold has a codimension 1, C®- 
foliation induced by a submersion over R (— 
Section F). 

Let M be a total space of a C’-bundle over 
a C“-manifold B with fiber F. If F is a C”- 
manifold and the structure group reduces to a 
totally disconnected subgroup of Diff’ F, the 
group of all C’-diffeomorphisms of F, then the 
local sections, which are defined in an obvious 
manner using the local triviality of this bundle, 
fit together to give leaves of a codimension q, 
C’-foliation (q = dim F). In this case, M is 
called a foliated bundle or a flat F-bundle over 
B. Each leaf of this foliation is diffeomorphic 
to a covering manifold of B and transverse to 
the fibers of the bundle M >B. Foliated bun- 
dies exhibit a class of foliations; this is espe- 
cially important in connection with the char- 
acteristic classes of foliations (— Section G). 


C. Holonomy 


The notion of the holonomy of a leaf, given by 
Ehresmann, is a generalization of the tPoin- 
caré mapping in tdynamical systems. Let F be 
a codimension q, C’-foliation of M and L bea 
leaf of F. Let N(L) denote the total space of 
the normal disk bundle of L in M. Choose a 
C’-immersion i: N(L)—M such that i restricted 
to the zero section of N(L) is the natural inclu- 
sion and i maps the fibers of N(L) transversely 
to the foliation F. Then the induced foliation 
i-'(F) of N(L) has the properties that the 
leaves are transverse to the fibers of N(L) and 
the zero section of N(L) is a leaf. If y is an 
oriented loop in L based at a point x€ L, then 
there is a neighborhood U of 0 in the fiber 
over Xv Satisfying the following: for each point 
xeéU there is a curve y,.:[0, 1 ]—N(L) having 
the properties: (i) y,,(0)= x, (11) Im{y,) lies on a 
leaf of i '(F), and (iii) zoy,(t)=)(t) for any 
telO, 1], where z: N(L)>L is the bundle pro- 
jection. The family of curves {y| xe U} gives a 
C’-diffeomorphism H, from U to another open 
set of 1 '(x9), which assigns y,(1) to x. Let G; 
denote the group of tgerms at 0 of all local 
C’-diffeomorphisms of R? which fix 0. The 
germ at 0 of the mapping H, depends only on 
the thomotopy class of y, and, by identifying 

m '(x) with R4, we obtain a homomorphism 
h,:1,(L, Xo) > Gz. h, is determined by L up to 
conjugacy and is called the holonomy homo- 
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morphism, or simply the holonomy, of the leaf 
L. The image of h, is called the holonomy 
group of L. For r>1, by differentiating each 
element of Gj, one has a homomorphism 

dh, : n, (L, Xo) > GL(q; R), called the linear 
holonomy of L. The holonomy of a proper leaf 
(— Section D) completely characterizes the 
foliation of a neighborhood of it (Haefliger). 


D. Topology of Leaves 


Let F be a codimension q foliation of M. The 
leaf topology is a topology of M defined by 
requiring each connected component of the set 
of the form U N L to be open, where U is an 
open set in M and Le F. Leaves are nothing 
but the connected components of M with 
respect to this topology. A leaf Le F is called 
a compact leaf if L is compact in the leaf top- 
ology. In general, L is called proper if two 
topologies on L induced from the original and 
the leaf topologies on M coincide. Any com- 
pact leaf is proper. A leaf L is called locally 
dense if Int L4 @. If a leaf is neither proper 
nor locally dense, it is called exceptional. Since 
we are assuming that M is paracompact, a 
leaf that is a closed subset of M is proper 
(Haefliger). There exists a codimension 1, Ct- 
foliation of the 2-torus T? that contains 
exceptional leaves. But in the C” category 
(r > 2), such a foliation does not exist on T? (A. 
Denjoy, C. Siegel). In higher dimensions, there 
are examples of C”-foliations with exceptional 
leaves (R. Sacksteder). The following result is 
called the Novikov closed leaf theorem [3]: 
Any codimension 1, C’-foliation (r > 2) of a 
closed 3-dimensional manifold M contains a 
Reeb component if either z; (M) is finite or 
1>(M)#0(M 4S! x S?,S' x RP’). In partic- 
ular, every C?-foliation of S° contains a com- 
pact leaf homeomorphic to T?. The question 
of whether every codimension 2, C’-foliation of 
S? has a compact leaf is known as the Seifert 
conjecture. There is a counterexample in the 
C! case (P. Schweitzer [7]), but it remains 
unsolved for r>2(— 126 Dynamical Systems 
N). 

A compact leaf L is said to be stable if it has 
an arbitrarily small open neighborhood that 
is a union of compact leaves. The following 
results are called the Reeb stability theorems: 
(1) Let L be a compact leaf of a C’-foliation 
(r>0). If the holonomy group of L is finite, 
then it is stable (Reeb [1]). (2) Let F bea 
transversely orientable codimension 1, C’- 
foliation (r > 1) of a compact manifold M 
(tangent to the boundary). If there exists a 
compact leaf L with H'(L;R)=0, then M isa 
fiber bundle over S* or [0,1], and the leaves of 
F are the fibers of this bundle. In particular, 
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every leaf is compact and stable (Reeb [1], 
Thurston [8]). 

The generalization of the stability theorem 
for proper leaves has been investigated by T. 
Inaba and P. Dippolito. 


E. Haefliger Structures 


Let Y be the tpseudogroup consisting of all 
C’-diffeomorphisms g of an open subset of R! 
to another open subset of R?. We write I} for 
the set of all germs [g], of g at x, xedomain of 
g,g€9,. The sheaf topology of F} is the to- 
pology whose open base is the family of sub- 
sets of the form |), caomainor g4 L9]x}- With this 
topology and the multiplication induced from 
the composition of 4, T} is a topological 
groupoid, i.e., a ‘groupoid whose multiplication 
and inverse mappings are continuous. If one 
identifies a point x in R? with [idg], r; con- 
tains R’ as a subspace. 

A codimension q, C’—Haefliger structure or 
a [(-Structure # on a topological space X is a 
maximal covering of X by open sets {U,|ieJ}, 
such that for each pair i, jeJ, there is a con- 
tinuous mapping 9,;:U; U,-T", satisfying 


Vilx) = yl) OY, (X) for xe UN UN U,. (*) 


Since y(x) is the germ of the identity mapping 
for xe U,, the correspondence x—y;(x) defines 
a continuous mapping f;:U;>R‘cI%. A co- 
dimension q, C’-foliation of a C”-manifold M 
is the same as a I'7-structure on M such 

that each f; is a C’-submersion (Haefliger [5]). 

If f: YX is continuous and # is a 
Ij-structure on X, there is an induced ry- 
structure f7! W on Y which is defined by 
{S U) yyoflijeJ}. Two I%-structures % 
and .#, on X are said to be homotopic if there 
exists a Ij-structure # on X x [0,1] such that 
H |x xin =H, (t=O, 1). 

Let U(X) be the set of homotopy classes of 
I7-structures on X. There exists a space BT}, 
called the classifying space for l% -structures, 
such that there is a natural one-to-one corre- 
spondence between I7(X) and LX, BIG] for 
any paracompact space X, where [A, B] de- 
notes the set of homotopy classes of continu- 
ous mappings from A to B. By condition (*) 
above, ifr >0, the differentials {dy,(x)|xe 
U;NU,,i,jeJ} define a q-dimensional vector 
bundle v(#) over X, which is called the nor- 
mal bundle of #. The correspondence # — 
v(#) gives a continuous mapping v: BT > 
BGL(q; R) among classifying spaces. If r=0, 
there is also a similar mapping v: BT} > 
BTop,. Let BI’, be the homotopy fiber of the 
mapping v. BI”, is a classifying space for the 
7-structures with trivialized normal bundles. 

There is a tight connection between the 
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topology of BI”, and the group structure 

of Diff’(R*), which is stated below. Let 
Diff,(R*) be the topological group of all C’- 
diffeomorphisms of R‘ that are identities 
outside some compact sets with the C” to- 
pology. Let Diffg ;(R*) denote Diffy(R%) with 
the discrete topology and B Diff?(R%) denote 
the homotopy fiber of the natural mapping 

B Diffk 5(R“) B Diff, (R%. Then there exists a 
continuous mapping B Diff, (R> QBI? 
that induces an isomorphism in the ho- 
mology group with integer coefficient for 
O<r<oo and qg21, where Q! denotes the qth 
‘loop space functor (J. N. Mather [12], Thur- 
ston [10]). Further, it has been proved that 
Diffk.¿(R°) is a tsimple group (if r#q+1) and 
that BI” is t(q + 1)-connected for r#q+1 
(Mather and Thurston, Haefliger [5]). The 
group Homeo, ;R? is facyclic (Mather), and 
hence BI? is contractible. 


F. Existence and Classification of Foliations 


Not every plane field on a manifold is isomor- 
phic to a completely integrable field (R. Bott). 
Thus, in general, the existence of a plane field 
does not guarantee the existence of a foliation 
of a manifold. Haefliger and Thurston solved 
the existence and classification problems in 
foliation in terms of Haefliger structures as 
follows. Two codimension q foliations Ay and 
F, of a C”-manifold M are said to be concor- 
dant if there is a codimension q foliation ¥ on 
M x [0,1], that is transverse to M x {t} (t= 

0, 1) and induces there the given foliation F, 
(t=0, 1). They are said to be integrably homo- 
topic if one further requires that the foliation 
F be transverse to M x {t} for all te[0,1] in 
the definition above. Similarly, two subbundles 
čo» 6; of T(M) are said to be concordant if 
there is a subbundle č of T(M x [0,1]) such 
that ¢| m x1 =¢, for t =0, 1, and they are said 
to be homotopic if one further assumes that 
|m xin is a subbundle of T(M x {t}) for all 
te[0, 1]. The following theorem is of funda- 
mental importance. 

Theorem: Let M be an open (resp. closed) 
C*-manifold. Then for each r=0, 1,..., 00, the 
integrable homotopy classes (resp. concor- 
dance classes) of codimension q, C’-foliations 
of M are in a natural one-to-one correspon- 
dence with homotopy classes of Ij-structures 
H on M together with homotopy classes (resp. 
concordance classes) of subbundles of T(M) 
isomorphic to v(.#). (M. L. Gromov, A. 
Phillips, Haefliger [5], Thurston [9]). 

The following are consequences of the 
theorem: (1) A closed manifold M admits a 
codimension 1, C”-foliation if and only if the 
Euler number of M vanishes. (ii) If a manifold 
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admits a tq-frame field, then the associated q- 
plane field is homotopic to the normal bundle 
of a codimension q foliation of M. (iii) Every 
dimension q plane field on a C*-manifold is 
homotopic to the normal bundle of a C°- 
foliation with C” leaves. 


G. Characteristic Classes of Foliations 


Let BY, be the classifying space of T%- 
structures. An element of the cohomology 
group H*(BT};R) is called a (real) character- 
istic class of codimension q, C’-foliations. If F 
is a codimension q, C’-foliation of M and f: 

M > BT}; is the classifying mapping for F, 
then an element a(¥)= f*x¢ H*(M;R), we 
H*(BY%; R), is called the characteristic class 
of ¥ corresponding to «. The first nontrivial 
characteristic classes of foliations are known as 
the Godbillon-Vey classes (C. Godbillon and 
J. Vey [13]) and can be defined as follows. Let 
F be a transversely oriented, codimension q, 
C®-foliation of M. Then there exists a tg-form 
Q on M such that on a neighborhood U of 
each point of M, Q is written as wf A...A@, 
where œf, ...,œ¥ are linearly independent 1- 
forms that vanish on leaves of F. By the inte 
grability condition, there is a 1-form y such 
that dQ =y AQ. Then the Godbillon-Vey class 
I, of F is the de Rham cohomology class in 
H?4*1(M;R) represented by the closed 2q + 1 
form 4 A (dn). 

The following construction provides a wide 
class of characteristic classes of foliations (Bott 
and Haefliger [14], I. Bernstein and B. Rosen- 
feld [15]). Let J, be the set of tk-jets at 0 of 
local C”-diffeomorphisms of R! keeping 0 
fixed. The set {J,}2o forms an tinverse system 
of tLie groups with respect to the natural 
homomorphism p,:J,,;—J,, and each J, 

(k> 1) contains O(q) as a tmaximal compact 
subgroup. Let P, be the differentiable fiber 
bundle of k-jets at 0 of local diffeomorphisms 
of R? whose domains contain 0. It is a tprin- 
cipal J,-bundle over R. Denote by A(P,,) the 
‘direct limit of the de Rham complexes of 
{P,}~9, and let A be the subcomplex of A(P,,) 
consisting of invariant forms with respect to 
the natural action of %7. A is canonically 
isomorphic to the tcochain complex A(A,) of 
continuous alternating forms on A,, where A, 
is the topological Lie algebra of fformal vector 
fields on R? (— 105 Differentiable Manifolds 
AA). 

Now let F be a codimension q, C®- 
foliation of M. Let P,(F) denote the differ- 
entiable fiber bundle over M whose fiber 
over xe M is the space of k-jets at x of the C”- 
submersion f: UR? from an open neighbor- 
hood U of x to R4, satisfying (i) f(x) =9, (ii) 


154G 
Foliations 


Kerdf =t(F)| y. This is a principal J,-bundle 
over M, and its restriction to U is isomorphic 
to the pullback by f of the bundle P,. Hence 
there are homomorphisms from the set of in- 
variant forms on P, to A(P,(¥)) that induce 
a homomorphism A = A(A,)>lim A(P,(F)). 
This homomorphism is compatible with the 
action of O(q) and hence induces a homomor- 
phism of O(q)-basic subcomplexes. Thus one 
obtains a homomorphism 97: H*(A,;O0(q))> 
lim H*(A(P,(F )); O(g)) = H*(M; R). In fact, 
P; depends only on the 2-jet of the foliation 
¥,and one can think of it as a homomor- 
phism @: H*(A,; O(q)) > H*(BT}; R) (r > 2). 
The elements in Im ọ are called the smooth 
characteristic classes of foliations. 

Let WO, be the differential graded algebra: 


WO, = E(u, uz, sae > Uanig+1y/2]-1) ® R[c,, t. TAF 


where du;=c;, dc; =0, deg(u;)=2i— 1, deg(c;) = 
2i, and E denotes the texterior algebra over 

R generated by the u;s, and R denotes the 
*polynomial algebra of the c;s truncated by 
the tideal generated by elements of degree 

> 2q. There exists a homomorphism of dif- 
ferential graded algebras WO, A(A,; O(q)) 
which induces an isomorphism in cohomology 
(I. M. GePfand and D. B. Fuks). For a codi- 
mension q foliation Z of M, the cohomology 
class determined by c,; corresponds to the ith 
Pontryagin class of the normal bundle v( F) 
of F, and the cohomology class u, c} corre- 
sponds to the Godbillon-Vey class of F. In 
particular, the subring of the cohomology 

ring H*(M;R) generated by the tPontryagin 
classes of v( F) is trivial for degree > 2g (Bott’s 
vanishing theorem). 

Let N1*! be a closed (q + 1)-dimensional 
Riemannian manifold of tconstant negative 
curvature. The total space T, N of the unit 
tangent sphere bundle of N admits a codi- 
mension q, C”-foliation associated with the 
*geodesic flow of T, N (tAnosov foliation). It 
has been shown that the Godbillon-Vey class 
of this foliation is nontrivial (R. Roussarie, 

F. Kamber and P. Tondeur, K. Yamato). It is 

known that many of the smooth characteristic 
classes are also nontrivial (Bott and Haefliger, 
Thurston, J. Heitsch). 

A smooth characteristic class xe H*(BI%; R) 
is called rigid if for any smooth one-parameter 
family {.¥,} of codimension q foliations on 
a C”-manifold M, d(a(¥,))/dt =0 holds. 

The elements in the image of the natural 
homomorphism 


H*(WO,,,)> H*(WO,)> H*(BT%; R) 


are rigid (Heitsch). On the other hand, the 
Godbillon-Vey class is not rigid. In fact, Thur- 
ston constructed a one-parameter family 

of codimension q foliations of a certain | 
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(2q + 1)-dimensional manifold for which the 
Godbillon-Vey class varies continuously. The 
characteristic classes of a simple foliation are 
often trivial. For example, the Godbillon-Vey 
class of a codimension 1 foliation of a closed 
manifold vanishes if it is almost without holo- 
nomy (i.e., no noncompact leaves have non- 
trivial holonomy) (M. Herman [16]; T. Mizu- 
tani, S. Morita, and T. Tsuboi [17]). 


H. Further Topics 


(1) Transverse structures. Let B be a q- 
dimensional manifold and Z a geometric 
structure on it, and let 4, denote the tpseudo- 
group generated by the local diffeomorphisms 
that preserve the structure S. Replacing 9 

by Y, in the definition of a C’-Haefliger struc- 
ture, one obtains definitions of a I -structure 
and a I,-foliation. A T -foliation is called a 
Riemannian foliation, a transversely real ana- 
lytic foliation, or a transversely holomorphic 
foliation if Y is a Riemannian, real analytic, or 
complex structure on R? (C?”), respectively. 
The theories for many such foliations are 
analogous to those for C’-foliations. For 
example, many results are known about the 
characteristic classes of Riemannian or holo- 
morphic foliations. Haefliger showed that there 
is no codimension 1 real analytic foliation on a 
simply connected closed manifold and that the 
classifying space BIT’? for codimension 1 trans- 
versely oriented transversely real analytic 
foliations has the homotopy type of a tK (n, 1)- 
space for some uncountable tperfect group z 
[5]. 

(2) Foliated cobordism. Two closed oriented 
n-dimensional C”-manifolds My and M, with 
codimension q, C’-foliations are said to be 
foliated cobordant if there exist a compact 
oriented (n+ 1)-dimensional C%-manifold W 
with boundary ôW = M, U(— Mo) and a codi- 
mension q, C’-foliation of W which is trans- 
verse to OW and induces the given foliations of 
Mpo and M,. The resulting foliated cobordism 
classes {F } form a group F OQ; , with respect 
to the disjoint union. It is known that FQF , 
= {0} and that the Reeb foliation of S° is 
cobordant to zero. The characteristic classes 
of foliations provide invariants of foliated 
cobordisms. In particular the Godbillon-Vey 
number I [M] is an invariant of F¥Q}5,41., 
(r>2,q21), and a result of Thurston men- 
tioned in Section G implies that the homo- 
morphism ¥Q5,.,.,—7R defined by {F }—> 
[z[M] is surjective. 

(3) Growth of leaves, transverse invariant 
measure. Let Z be a C”-foliation of a com- 
pact manifold. Fix a fRiemannian metric 
on M. Then each leaf L of ¥ has the in- 
duced metric, and one has a function f, (r)= 


608 


vol(D(x,r)), where D(x,r) is the set of points 
yeL whose distance along L from a fixed 
point xe L is not greater than r. The growth 
type of the function f; is determined only by L. 
Many papers have been published that deal 
with the relation between the behavior of 
leaves and their growth types in codimension 1 
foliations (J. Cantwell and L. Conlon, G. Hec- 
tor, T. Nishtmori, N. Tsuchiya). On the other 
hand, as a generalization of the notion of 
asymptotic cycles in dynamical systems, the 
notion of foliation cycles, or equivalently a 
transverse invariant measure, has been defined 
(J. Plante [18], D. Ruelle, D. Sullivan [19)). 
The existence of a transverse invariant mea- 
sure for a foliation is closely connected to the 
growth types of the leaves. The example of 
Denjoy in Section D leads to the study of 
*minimal sets of foliations and the structures 
of foliations (Hector, Cantwell and Conlon). 
The structure of a codimension 1 foliations 
which are almost without holonomy has been 
fairly well investigated (Sacksteder, Hector, 
H. Imanishi, R. Moussou, Roussaire). 

(4) Compact foliation. A foliation whose 
leaves are all compact is called a compact 
foliation. D. Epstein proved that if F isa 
codimension 2, C?~compact foliation of a 
closed 3-manifold, then the leaves of ¥ are the 
fibers of a tSeifert fibration of M. In higher 
dimensions, the situation is more complicated 
(Sullivan, R. Edwards and K. Millett). 

(5) Foliated bundles. There are many results 
on foliated bundles. In particular J. W. Milnor 
[20] and J. Wood obtained a condition for a 
circle bundle & over a closed surface X to have 
a foliation transverse to fibers. More precisely, 
if € and Z are orientable, then € admits such a 
foliation if and only if |X (¢)| < —min{0, 7(2)}, 
where X denotes the Euler number and y the 
Euler-Poincaré characteristic. Kamber and 
Tondeur made an extensive study of charac- 
teristic classes of foliated bundles [21]. 

(6) Transverse foliations. Two foliations F 
and & of M are said to be transverse to each 
other if any two leaves K and L of F and Y 
are transverse to each other. A foliated bundle 
has such foliations. D. Hardorp proved that 
on every orientable closed 3-manifold, there 
exists a triple of codimension 1 foliations 
that are pairwise transverse. Tamura and A. 
Sato classified the codimension | foliations 
that are transverse to the Reeb component 
of D? x S$}. 
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A. Introduction 


Geometry deals with figures. It depends, there- 
fore, on our spatial intuition, but our intuition 
lacks objectivity. The Greeks originated the 
idea of developing geometry logically, based 
on explicitly formulated axioms, without re- 
sorting to intuition. From this intention re- 
sulted Euclid’s Elements, which was long con- 
sidered the perfect model of a logical system. 
As time passed, however, mathematicians came 
to notice its imperfections. Since the 19th cen- 
tury especially, with the awakening of a more 
rigorous critical spirit in science and philoso- 
phy, more systematic criticism of the Elements 
began to appear. Non-Euclidean geometry was 
formulated after reexamination of Euclid’s 
axiom of parallels; but it was also discovered 
that even as a foundation of Euclidean geom- 
etry, Euclid’s system of axioms was far from 
perfect. Various systems of axioms for Eucli- 
dean geometry were proposed by mathemati- 
cians in the latter half of the 19th century, 
among them one by D. Hilbert [1], which 
became the basis of far-reaching studies. 


B. Hilbert’s System of Axioms 


Hilbert took as undefined elements points 
(denoted by A, B,C,...), straight lines (or sim- 
ply lines, denoted by a,b,c,...), and planes 
(denoted by x, f,y,...). Between these ob- 
jects there exist incidence relations (expressed 
in phrases such as “A lies on a,” “a passes 
through A,” etc.); order relations (“B is be- 
tween A and C”); congruence relations; and 
parallel relations. The relations are subject to 
the following five groups of axioms: 

(I) Incidence axioms: (1) For two points A, 
B, there exists a line a through A and B. (2) 
If A +B, the line a through A, B is uniquely 
determined. We write a= AUB and call a the 
join of A, B. (3) Every line contains at least 
two different points. There exist at least three 
points that do not lie on a line. (4) If A, B, C 
are points not on a line, there exists a plane « 
through A, B, C. (We also say that A, B, C lie 
on «.) For every plane a, there exists at least 
one point A on «. (5) If A, B, C are points 
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not on a line, the plane « through A, B, C is 
uniquely determined. We write x= AU BUC 
and call « the join of A, B, C. (6) If A, B are 
two different points on a line a and if A, B lie 
on a plane q, then every point on a lies on «. 
(We say that a lies on « or a passes through 

a.) (7) Ifa point A lies on two planes «, B, 
there exists at least one other point B on « and 
P. (8) There exist at least four points not lying 
on a plane. 

(II) Ordering axioms: (1) If B is between A 
and C, then A, B, C are three different points 
lying on a line; also, B is between C and A. (2) 
If A, C are two different points, then there 
exists a point B such that C is between A and 
B. (3) If B is between A and C, then A is not 
between B and C. 

We define a segment as a set of two different 
points A, B, denoted as AB or BA, and we call 
A and B ends of this segment. The set of points 
between A, B is called the interior of AB, and 
the set of points of A UB that are neither ends 
nor interior points of AB is called the exterior 
of AB. 

(4) Let A, B, C be three points not lying on a 
line. If a line a on the plane AU BUC does not 
pass through A, B, or C, but passes through a 
point of the interior of AB, then it also passes 
through a point of the interior of BC or CA 
(Pasch’s axiom). 

The following propositions are proved 
from the above axioms. Given n points A,, 
A,,...,A, on a line (n> 2), we can rearrange 
them, if necessary, so that the point A, is be- 
tween A; and A, whenever we have | <i<j< 
k <n. There are exactly two ways of arrang- 
ing the points in this manner (theorem of linear 
ordering). Let O be a point ona line a, and let 
A, B be two points on the line different from O. 
Write A ~B when A=B or O is not between A 
and B; write A~ B otherwise. Then ~ is an 
equivalence relation between points on the line 
different from O; from A~ B, A~C it follows 
that B~C. We say that A, B are on the same 
side or on different sides of O on a depending 
on whether A~ B or A ~ B. Two subsets a’ 
and a” of a defined by a’ ={A'|A~ Ah, a" = 
{A"|A~ A”} are called half-lines or rays on 
a with O as the extremity (or starting from 0). 
Denoting by a, for simplicity, the set of points 
on a, we have a=a’U{O} Ua" (disjoint union). 

Using axiom II.4, we can also prove the 
following: Let a be a line on x, and let A, B be 
two points on « not lying on a. If A=B or if 
the interior of the segment AB has no point in 
common with a, we say that A, B are on the 
same side of a on a, and write A ~ B. Other- 
wise, we say that A, B are on different sides of 
aon «and write A+B. Then ~ is an equiva- 
lence relation between points on « not lying on 
a, and from A ~ B, A ~C follows B~ C. The 


610 


subsets «'={A'| A~ A'}, a" ={A"| A~ A} of 

x are called half-planes on x bounded by a. 
Again, denoting by « and a the set of points on 
x and the set on a, respectively, we obtain «= 
a UaUa" (disjoint union). 

(III) Congruence axioms: Two segments AB, 
A'B' can be in a relation of congruence, ex- 
pressed symbolically as AB = A’B’. (Segments 
AB and A’B’ are then said to be congruent. 
Since the segment AB is defined as the set 
{A, B}, the four relations AB = A’B’, BA = 
A'B', AB=B'A', BA=B’'A’ are equivalent.) 
This relation is subject to the following three 
axioms: (1) Let A, B be two different points on 
a line a, and A, a point on a line a, (a, may or 
may not be equal to a). Let a, be a ray on a, 
starting from A,. Then there exists a unique 
point B, on a, such that AB = A, B}. (2) From 
A, B, = AB and A,B, = AB follows A, B, = 
A, B. (Hence it follows that = is an equiv- 
alence relation between segments.) (3) Let 
A, B, C be three points such that B is between 
A and C, and let A,, B,, C, be three points 
such that B, is between A, and C,. Then from 
AB=A,B,, BC=B,C, follows AC=4A,C,. 

Now let h, k be two different lines in a plane 
x and through a point O, and let h’, k’ be the 
rays on h, k starting from O. The set of two 
such rays h’, k’ is called an angle in «, denoted 
by Z(h’,k’) or Z(k’,h’). This angle is also de- 
noted by 4 AOB, where A, B are points of h’, 
k’, respectively. The rays h’, k’ are called the 
sides and the point O is called the vertex of 
this angle. Then h’ is a subset of a half-plane 
on a bounded by k, and k’ is a subset of a half- 
plane on « bounded by h. The intersection of 
these two half-planes is called the interior of 
this angle, and the subset of «—O consisting of 
points belonging to neither the inside nor the 
sides of the angle is called the exterior of the 
angle. Between two angles Z (h’,k’), (hi, ki) 
there may exist the relation of congruence, 
again expressed by the symbol =, as in the 
case of segments, and subject to the following 
two axioms: (4) Let / (h’,k’) be an angle ona 
plane « and h, be a line on a, (xı may or may 
not be equal to x). Let O, be a point on h,, hi 
a ray on q; starting from O,, and g) a half- 
plane on x, bounded by h,. Then there exists a 
unique ray kį starting from O, and lying in 
a’, such that Z'(h’,k’)= 4 (h, ki). Moreover, 
L(K, k)= L(h,k’) always holds. (Hence it 
follows that = is an equivalence relation be- 
tween angles.) (5) Let both A, B, C and A,, B,, 
C, be triples of points not lying on a line. Then 
from AB=A,B,, AC=A,C,, and ¿Z BAC= 
L B A, G, it follows that Z ABC= 7 A,B,C,. 

(IV) Axiom of parallels: Suppose that a, b 
are two different lines. Then it follows from 
axiom I.2 that if a and b share a point P, such 


` a point is the unique point lying on both a and 
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b. In this case we say that a, b intersect at P 
and write aM b= P. On the other hand, if a and 
b have no point in common and if a, b are on 
the same plane, we say that a, b are parallel 
and write a//b. If A, a are on a plane « and A 
is not on a, we can prove (utilizing axioms I, 
II, and IH) that there exists a line b passing 
through A in « such that a//b. The axiom of 
parallels postulates the uniqueness of such a b. 
(V) Axioms of continuity: (1) Let AB, CD be 
two segments. Then there exist a finite number 
of points A,, A,,...,A, on AUB such that 
CD = AA, =A,A,=...=A,_,A, and B is be- 
tween A and A, (Archimedes’ axiom). (2) The 
set of points on a line a (again denoted for 
simplicity by a) is “maximal” in the following 
sense: It should satisfy axioms IJ.1—II.3, HI.1, 
V.1, and the theorem of linear ordering. If 
da is a set of points satisfying these axioms 
such that @>a, then @ should be =a (axiom 
of linear completeness). Hence follows the 
theorem of completeness: the set of points, 
lines, and planes is maximal in the sense that it 
is not possible to add further points, lines, or 
planes to this set with the resulting set still 
satisfying axioms I-IV and V.1. 


C. Consistency 


In formulating the above axioms and proving 
their consistency, Hilbert assumed the consis- 
tency of the theory of real numbers (— 156 
Foundations of Mathematics). To prove con- 
sistency, Hilbert constructed a model for the 
above axioms using the method of analytic 
geometry. He defined points as triples of real 
numbers (x4, X2, X3), lines and planes as sets of 
points satisfying suitable systems of linear 
equations, and relations of ordering, con- 
gruence, and parallelism in the usual way. It is 
easy to verify that such a system satisfies all 
the axioms I-V. Thus the consistency of these 
axioms is reduced to the consistency of the 
theory of real numbers (— 35 Axiom Systems). 
A model for I-IV and V.1 can be obtained 
in the countable field R, of all real talgebraic 
numbers instead of R. Then Rg can be further 
restricted to its subfield P, defined as follows: 
Let F be an arbitrary field. An textension of F 


of the form F(,/1 +4?) with Ae F is called a 
Pythagorean extension of F, and F is said to be 
a Pythagorean field if any Pythagorean exten- 
sion of F coincides with F (e.g., Ry and R are 
Pythagorean). It is easily verified that I-IV are 
satisfied in the “analytic geometry over any 
Pythagorean field.” On the other hand, we can 
construct a minimal Pythagorean field con- 
taining a given field (the Pythagorean closure 
of the field) in the same way as we construct 
the talgebraic closure of a field. The field P, is 
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defined as the Pythagorean closure of the field 
Q of rational numbers. 


D. Independence of Axioms 


In Hilbert’s system, the axioms I and II are 
used to formulate further axioms. On the other 
hand, it can be shown that each of the groups 
III, IV, and V is independent from other 
axioms. 

The independence of IV is shown by the 
consistency of non-Euclidean geometry (— 285 
Non-Euclidean Geometry). The following 
model shows the independence of II.5: In the 
analytic model for I-V, we replace the defi- 
nition of distance between two points 


(X1,%25X3), (V1, Y2; Y3) by 
((x,;—y, +X =y} +(X_—y2)? + (x3 — y3 V”. 


Then II.5 does not hold, while all other 
axioms remain satisfied. The independence of 
V.2 is shown by the geometry over Ry or P}. 
The independence of V.1 follows from the 
existence of the non-Archimedean Pythag- 
orean field: the Pythagorean closure of any 
tnon-Archimedean field (e.g., the field of ra- 
tional functions of one variable over Q with a 
tnon-Archimedean valuation) is such a field. A 
geometry in which V.1 does not hold is called 
a non-Archimedean geometry. 


E. Completeness of the System of Axioms and 
Relations between Axioms 


The tcompleteness of the system of axioms I- 
V can be shown by introducing coordinates in 
the geometry with these axioms and represent- 
ing it as tEuclidean geometry of three dimen- 
sions. Axiom group V is essential for the intro- 
duction of coordinates over R. Moreover, we 
have the following results: 

(i) The geometry with the axioms I-IV can 
be represented as “Euclidean geometry” of 
three dimensions over a Pythagorean field, 
and the geometry with axioms I.1-I.3, I-IV 
can be represented as “Euclidean geometry” of 
two dimensions over a Pythagorean field. 

(ii) The geometry with II, and a stronger 
axiom of parallels IV* (given a line a and a 
point A outside a, there exists one and only 
one line a’ passing through A that is parallel to 
a) can be represented as an taffine geometry 
over a field K that is not necessarily 
commutative. 

(iii) The field K is commutative if and only if 
the following holds: Suppose that in Fig. 1 
A'UB//AUB’, BUC// BUC’. Then it follows 
that A’ UC// AUC’ (Pascal’s theorem). 
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Fig. 1 


(iv) The “two-dimensional geometry” with 
1.1-J.3, I, and 1V* can be embedded in the 
“three-dimensional geometry” with axioms I, 
II, and IV* if and only if the following holds: 
Suppose that in Fig. 2 we have AUB// A'UB’, 
BUC/ BUC’ Then it follows that CU A// 

C’ UA’ (Desargues’s theorem). 


Fig. 2 


(v) From 1.1-1.3, II, IV*, and Pascal’s theo- 
rem follows Desargues’s theorem. 

(vi) Desargues’s theorem is independent of 
T.1-1.3, I, U1.1-111.4, IV*, and V; that is, we 
can construct a non-Desarguesian geometry (a 
geometry in which Desargues’s theorem does 
not hold) in which these axioms are satisfied. 

Axioms I, I, and IV*, as well as the theo- 
rems of Pascal and Desargues, are propositions 
in affine geometry. Each has a corresponding 
proposition in projective geometry, and the 
results concerning them can be transferred to 
the case of projective geometry (— 343 Projec- 
tive Geometry). 


F. Polygons and Their Areas 


Suppose that we are given a finite number of 
points A; (i=0,1,...,r)in the geometry with 
axioms I and II. Then the set of segments (or, 
more precisely, the union of segments together 
with their interiors) A;A;,, (i=0,1,...,r—1) ts 
called a broken line joining A, with A,. In 
particular, if A, = A,, then this set is called a 
polygon with vertices 4; and sides 4;4;,,.A 
polygon with r vertices is called an r-gon. (For 
r= 3,4, 5,6, r-gons are called triangles, quad- 
rangles, pentagons, and hexagons, respectively.) 
A plane polygon is a polygon whose vertices all 
lie on a plane. A polygon is called simple if any 
three consecutive vertices do not lie on a line, 
and two sides 4,A,;4, and A,A;,, (i#j) meet 
only when j=i+1 or i=j+1.In this article, 
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we consider only simple plane polygons, and 
refer to them simply as polygons. 

‘Jordan’s theorem implies that a polygon in 
the sense just defined divides the plane into 
two parts, its interior and its exterior. This 
special case of Jordan’s theorem can be proved 
by I.1-1.3 and II only. A polygon P is divided 
into two polygons P,, P, by a broken line 
joining two points on sides of P and lying in 
the interior of P (Fig. 3). In this case, we say 
that P is decomposed into P,, P, and write P 
=P,+P,. We may again decompose P,, P» 
and thereby arrive at a decomposition of the 
form P=P,+...+ P,. Axiom III is used to 
introduce the congruence relation = between 
polygons. Two polygons P, Q are called de- 
composition-equal if there exist decomposi- 
tions P=P,+...+P,,Q=Q,+...+Q, such 
that P, =Q,,...,P,=Q,. This is expressed by 
PzQ. We call P, Q supplementation-equal if 
there exist two polygons P’, Q’ such that (P+ 
P’\z(Q + Q’), P’zQ’. This will be expressed by 
PeQ. If we assume IV, we can use result (i) of 
Section E. Let K be the ground field of the 
geometry (K is Pythagorean, hence tordered). 
The area of polygon P is defined as the posi- 
tive element m(P) of K assigned to P such that 
m(P+Q)=m(P)+ m(Q), and m(P)=m(P’) if 
P = P'. From PzQ or PeQ, it follows that m(P) 
=m(Q). Under these axioms, it is proved that 
m(P)=m(Q) implies PeQ. If we also assume 
V.1, then m(P)=m(Q) implies PzQ. Thus 
the theory of area of polygons can be con- 
structed without assuming axiom V.2, though 
this result cannot be generalized to higher- 
dimensional cases. For the case of three di- 
mensions, we can construct two solids of the 
same volume that are not supplementation- 
equal [2, 7]. 


Fig. 3 


G. Geometric Construction by Ruler and a 
Transferrer of Constant Lengths 


The geometry with I-IV can be represented 
as 3-dimensional Euclidean geometry over 

a Pythagorean field. Conversely, all these 
axioms are valid in 3-dimensional Euclidean 
geometry over any Pythagorean field. Thus 
the minimal system of “quantities” whose 
existence is assured in geometry with these 
axioms is the field P), the Pythagorean closure 
of Q. Hilbert noticed that the existence of a 
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geometric object under axioms I-IV can be 
expressed as its constructibility by ruler (i.e., an 
instrument to draw a straight line joining two 
points) and a transferrer of constant lengths. 
The latter, for a constant length x, is an instru- 
ment that permits finding the point X on the 
given ray AB such that AX =x. It is not pos- 
sible to construct by ruler and transferrer all 
the points that can be constructed by means 
of ruler and compass (— 179 Geometric Con- 
struction). However, it is possible to construct 
all the lengths Ax, where / is any element of P}. 
Hilbert conjectured that an element of P) can 
be characterized as a ‘totally positive algebraic 
number of degree 2”, ve N. This conjecture was 
proved by Artin [3]. 


H. Related Topics 


While Hilbert’s foundations are concerned 
with 3-dimensional Euclidean geometry, it is 
easy to generalize these results to the case of 
n-dimensional Euclidean geometry (— 139 
Euclidean Geometry). Also, for affine and pro- 
jective geometries, there are well-organized 
systems of axioms (at least for the case of di- 
mensions > 3). Hilbert [1, Appendix HI] 
showed that plane thyperbolic geometry can 
be constructed on a modified system of 
axioms, but for other non-Euclidean geome- 
tries (in particular, ‘elliptic geometries) there 
are no known systems of axioms as good as 
Hilbert’s for the Euclidean case. On the other 
hand, Hilbert [1, Appendix IV] gives another 
method of constructing Euclidean geometry 
in characterizing the group of motions as the 
topological group with certain properties. 

G. Thomsen [4] rewrote Hilbert’s system of 
axioms in group-theoretical language utiliz- 
ing the fact that the group of motions is gen- 
erated by symmetries with respect to points, 
lines, and planes. Finally, Hilbert’s study of 
the foundations of geometry led him to re- 
search in the ffoundations of mathematics. 
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A. General Remarks 


The notion of tset, introduced toward the end 
of the 19th century, has proved to be one of 
the most fundamental and useful ideas in 
mathematics. Nonetheless, it has given rise to 
well-known tpargdoxes. Based on this notion, 
R. Dedekind developed the theories of natural 
numbers [2] and real numbers [3], defining 
the latter as “cuts” of the set of rational num- 
bers. Thus set theory served as a unifying 
principle of mathematics. 

It has been noted, however, that some of the 
most commonly utilized arguments in set 
theory, which are at the same time the most 
useful in mathematics and belong almost to 
the basic framework of formal logic itself, 
resemble very much those which give rise to 
paradoxes. This fact has caused many critical 
mathematicians to question the very nature of 
mathematical reasoning. Thus a new field, 
foundations of mathematics, came into being at 
the beginning of this century. This field was 
divided at its inception into different doctrines 
according to the views of its initiators: logicism 
by B. Russell, intuitionism by L. E. J. Brouwer, 
and formalism by D. Hilbert. In set theory, 
which was the origin of this controversy, it was 
pointed out that the “definition” of set as given 
by G. Cantor was too naive, and axiomatic 
treatments of this theory were proposed (— 33 
Axiomatic Set Theory). 


B. Logicism 


Russell asserted that mathematics is a branch 
of logic and that paradoxes come from ne- 
glecting the “types” of concepts. According to 
his opinion, mathematics deals formally with 
structures independently of their concrete 
meanings. Science of this character has been 
called logic from antiquity. According to him, 
logic is the youth of mathematics, and math- 
ematics is the manhood of logic. To construct 
mathematics from this standpoint, asserted 
Russell, ordinary language is lengthy and 
inaccurate, and some proper system of sym- 
bols should be used instead. Thus he tried to 
reconstruct mathematics using tsymbolic logic. 
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Attempts to reorganize mathematics using 
logical symbols had formerly been made by G. 
Leibniz, who wrote Dissertatio de arte com- 
binatoria in 1666, as well as by A. de Morgan, 
G. Boole, C. S. Peirce, E. Schröder, G. Frege, 
G. Peano, and others. Symbols used by the 
last two authors resemble those of today. 
Russell studied these works and published his 
own theory in a monumental joint work with 
A. N. Whitehead: Principia mathematica (3 
vols., 1st ed. 1910-1913, 2nd ed. 1925-1927), 
in which the theories of natural numbers and 
real numbers as well as analytic geometry are 
developed from the fundamental laws of logic. 

If this work had been completely successful, 
it could have eliminated any possibility of the 
intrusion of paradoxes into mathematics. 
However, the authors were forced to postulate 
an “unsatisfactory” axiom in order to con- 
struct mathematics. They introduced the notion 
of type as follows: An object M defined as the 
set of all objects of a certain type belongs to a 
higher type than the types of the elements of 
M. This serves to eliminate certain paradoxes 
but brings about inconveniences such as the 
following. Suppose that we are trying to con- 
struct the theory of real numbers from that of 
rational numbers. Each real number can then 
be considered a tpredicate about rational 
numbers. If this predicate contains only tquan- 
tifiers relating to variables running over all 
rational numbers, then the corresponding real 
number is said to be predicative, otherwise 
impredicative. According to Russell, the latter 
should have a higher type than the former, 
which makes the theory of real numbers 
exceedingly complicated. To avoid this dif- 
ficulty, Russell proposed the axiom of reduci- 
bility, which says that every predicate can be 
replaced by a predicative one. With this rather 
artificial axiom Russell himself expressed 
dissatisfaction. Russell also postulated the 
taxiom of infinity and the taxiom of choice, 
which are also problematic. After examining 
the philosophical background of the book, H. 
Weyl wrote about Principia mathematica, 
“Mathematics is no more based on logic than 
the utopia built by the logician.” Nevertheless, 
logic as formulated in this book, as well as the 
theory of types as developed by F. P. Ramsay 
in the school of Russell and Whitehead, is still 
an important subject of mathematical logic. 


C. Intuitionism 


The intuitionist claims that mathematical 
objects or truths do not exist independently 
from mathematically thinking spirit or intui- 
tion, and that these objects or truths should be 
directly seized by mental or intuitional activ- 
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ity. The philosophical standpoints of mathe- 
maticians such as L. Kronecker and H. Poin- 
caré in the 19th century or E. Borel, H. 
Lebesgue, and N. N. Luzin at the turn of this 
century can be assimilated to intuitionism, but 
those of the latter three are often said to be- 
long to semi-intuitionism or to French empiri- 
cism. Brouwer took a narrower standpoint, 
strongly antagonistic to Hilbert’s formalism. 
Today the word “intuitionism” is generally 
interpreted in Brouwer’s sense. 

Brouwer sharply criticized the usual way of 
reasoning in mathematics and claimed that 
indiscriminate use of the law of excluded mid- 
dle (or tertium non datur) P v 7P cannot be 
permitted. According to him, the proposi- 
tion “Either there exists a natural number 
with a given property P, or else no such num- 
ber exists” is to be regarded as proved only 
when an actual construction of a natural num- 
ber with the property P is given or when the 
absurdity of the existence of such a natural 
number can be constructively proved. When 
neither of these two results can be shown, then 
one can say nothing about the truth of the 
above proposition. Thus the usual method of 
proof, known as the method of reductio ad 
absurdum, i.e., of proving a proposition P by 
proving its double negation 171P, is not 
generally considered valid. It is a difficult but 
important problem of mathematical logic to 
determine which parts of usual mathematics 
can be reconstructed intuitionistically, though 
it does not seem easy to reconstruct any part 
of mathematics elegantly from this standpoint. 


D. Formalism 


To eliminate paradoxes, Hilbert tried to apply 
his axiomatic method. From Hilbert’s stand- 
point, any part of mathematics is a deductive 
system based on its axioms. In the deductive 
development, however, “logic,” including set 
theory and elementary number theory, is used. 
Paradoxes appear already in such logic. Hil- 
bert’s idea was to axiomatize such logic and 
to prove its consistency. Thus one must first 
formalize the most elementary part of mathe- 
matics, including logic proper. 

Hilbert proved the consistency of Euclidean 
geometry by assuming the consistency of the 
theory of real numbers. This is an example of a 
relative consistency proof, which reduces the 
consistency proof of one system to that of 
another. Such a proof can be meaningful only 
when the latter system can somehow be re- 
garded as based on sounder ground than the 
former. To carry out the consistency proof of 
logic proper and set theory, one must reduce 
it to that of another system with sounder 
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ground. For this purpose, Hilbert initiated 
metamathematics and the finitary standpoint. 

The finitary standpoint recognizes as its 
foundation only those facts that can be ex- 
pressed in a finite number of symbols and only 
those operations that can be actually executed 
in a finite number of steps. Essentially, it does 
not differ from the standpoint of intuitionism. 
The methods based on this standpoint are also 
called constructive methods. 

Metamathematics is also called proof theory. 
Its subject of research is mathematical proof 
itself. Hilbert was the first to insist on its im- 
portance. The theory is indispensable for con- 
sistency proofs of mathematical systems, but it 
may also be used for other purposes. In fact, 
the same idea can be seen in the tduality prin- 
ciple of projective geometry, which dates from 
long before Hilbert’s proclamation of formal- 
ism. This is not a theorem of projective geome- 
try deduced from its axioms; rather, it is a 
proposition about the theorems in projective 
geometry, based on the type of axioms and 
proofs in this subject. 

According to Hilbert’s method, one must 
develop proof theory from the finitary stand- 
point with the aim of proving the consistency 
of axiomatized mathematics. For this purpose, 
one must formalize the mathematical theory in 
question by means of symbolic logic. A theory 
thus formalized is called a formal system. 


E. Some Results of Formalist Theory 


One of the most remarkable results hitherto 
obtained with Hilbert’s method is the con- 
sistency proof of pure number theory by G. 
Gentzen [7]. This consistency proof covers 
the largest domain for which an explicit 
consistency proof has so far been obtained. 
However, the methods of formalist proof 
theory have proved to be most effective in 
studying the logical structure of mathematical 
theories and have led to various results on the 
consistency of formalized mathematical sys- 
tems, on symbolic logic, and on axiomatic set 
theory. We give some examples. 


(1) Gédel’s Incompleteness Theorem. K. Gödel 
[6] showed that if a system obtained by for- 
malizing the theory of natural numbers is 
consistent, then this system contains a tclosed 
formula A such that neither A nor its negation 
“1A can be proved within the system. He 
originally proved this under the assumption 
that the system is @-consistent. This is a stron- 
ger condition for the system than simple con- 
sistency, but J. B. Rosser [13] succeeded in 
replacing this by the latter. This result shows 
the incompleteness not only of the usual 
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theory of natural numbers but of any consist- 
ent theory (from the finitary standpoint) con- 
taining the theory of these numbers. 

At the same time, Gödel also obtained the 
following important result: Let S be any con- 
sistent formal system containing the theory of 
natural numbers. Then it is impossible to 
prove the consistency of S by utilizing only 
arguments that can be formalized in S. This 
means that a consistency proof from the fini- 
tary standpoint of a formal system S inevitably 
necessitates some argument that cannot be 
formalized in S. 


(2) Consistency Proofs for Pure Number 
Theory. Gentzen [7] called pure number theory 
the theory of natural numbers not depend- 
ing on the free use of set theory (differing con- 
sequently from the usual theory of natural 
numbers based on tPeano axioms; — 294 
Numbers) and proved its consistency. W. 
Ackermann [14] proved the consistency of a 
similar theory admitting the use of Hilbert’s 
te-symbol. G. Takeuti [15] showed that Gen- 
tzen’s result can be obtained as a corollary 

to his theorem extending tGentzen’s funda- 
mental theorem on tpredicate logic of the first 
order to a ftheory of types of a certain kind. 

According to the result of Gödel mentioned 
in (1) above, some reasoning outside pure num- 
ber theory must be used to prove its consis- 
tency. In all consistency proofs of pure number 
theory mentioned above, ttransfinite induction 
up to the first te-number £x is used, but all the 
other reasoning used in these proofs can be 
presented in pure number theory. This shows 
that the legitimacy of transfinite induction up 
to £o cannot be proved in this latter theory. A 
direct proof of this fact was given by Gentzen 
[13]. On the other hand, the legitimacy of 
transfinite induction up to an ordinal num- 
ber < £ can be proved within pure number 
theory. 

Again, transfinite induction is not the only 
method by which to prove the consistency of 
pure number theory. Actually, Gödel [17] 
carried out the proof utilizing what he called 
computable functions of finite type on natural 
numbers and what we call primitive recursive 
functionals of finite type. 

By restricting pure number theory further, 
one obtains weaker theories of natural num- 
bers whose consistency can be proved with 
finitary methods without recourse to such 
methods as transfinite induction up to £g. M. 
Presburger [18] proved the consistency of a 
theory in which only the addition of numbers 
is considered an operation. Ackermann [19], 
J. von Neumann [20], J. Herbrand [21], and 
K. Ono [22] proved the consistency of theo- 
ries in which some restrictions are placed 
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on the use of the axiom of tmathematical 
induction. 

On the other hand, K. Schiitte [23] gave a 
consistency proof for number theory including 
what he called “infinite induction” from a 
stronger standpoint than Hilbert’s finitary one; 
he attempted to find a basis that makes such a 
proof possible. 


(3) The Consistency of Analysis. No definitive 
result has yet been obtained from the stand- 
point of formalism, though many attempts are 
being made, among which a recent one by C. 
Spector [24] should be mentioned. 


(4) Axiomatic Set Theory. There are different 
kinds of axiom systems (— 33 Axiomatic Set 
Theory). To give a consistency proof for any of 
these systems is considered a very difficult 
problem today, but many interesting results 
are known concerning the relative consistency 
or independence of these axioms. 


(5) The Skolem-Léwenheim Theorem. The 
. metamathematical Skolem-Lowenheim 
theorem states: Given a consistent system of 
axioms stated in the first-order predicate logic 
whose cardinality is at most countable, there 
always exists an tobject domain consisting of 
countable objects satisfying all these axioms. 
For example, axiomatic set theory is stated 
in predicate logic of the first order, and the 
cardinality of its axioms is countable. Thus 
there exists an object domain consisting of 
countable objects satisfying all these axioms, 
provided that they are consistent. Such a do- 
main is called a countable model of axiomatic 
set theory. On the other hand, from the axioms 
of this theory one can prove that there exists 
a family of sets that is more than countable. 
This should also hold in a model of the theory, 
in which each object represents a set. This 
situation is known as the Skolem paradox. 
This does not imply, however, the inconsis- 
tency of axiomatic set theory. In fact, the term 
“countable” is to be interpreted in its math- 
ematical sense when one says “there exists a 
family of sets that is more than countable,” 
while it should be interpreted in its metamath- 
ematical sense when one speaks of a countable 
model of the theory. It is the confusion of these 
two different interpretations that leads to the 
“paradox.” 


(6) Skolem’s Theorem on the Impossibility of 
Characterizing the System of Natural Numbers 
by Axioms. T. Skolem [27] proved that it is 
not possible to characterize the system of 
natural numbers by a countable system of 
axioms stated in the predicate logic of the first 
order. More precisely, given any consistent 


616 


countable system of axioms satisfied by the 
system of natural numbers, there always exists 
another ‘linearly ordered system satisfying all 
these axioms and yet not isomorphic to the 
system of natural numbers as an ordered 
system. 

Godel’s incompleteness theorem and the 
Skolem paradox, as well as this result, seem 
to indicate that there is a certain limit to 
the effectiveness of the formalist method. On 
the other hand, nonstandard analysis has 
originated in this result. 
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A. Brief History 


It is obvious that four colors are necessary to 
color some geographical maps on a sphere (or 
plane), but are four colors sufficient to color 
every map? This is the so-called four-color 
problem. The precise formulation of this prob- 
lem will be given in Section B. The conjecture 
was made by Francis Guthrie and communi- 
cated through his brother Frederick Guthrie 
to A. De Morgan in 1852 [4]. A. Cayley called 
attention to the problem in 1878. It became 
famous after J. P. Heawood pointed out a 
mistake in the proof by A. B. Kempe (1879). 
Heawood also studied the problem of coloring 
maps on arbitrary surfaces (— Section E). 
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Though there have been various approaches 
to the four-color problem, the main stream of 
investigation has concentrated on obtaining an 
unavoidable set consisting of reducible ar- 
rangements (— Section D) in order to correct 
the mistake made by Kempe. G. D. Birkhoff 
[7] first discovered the simplest nontrivial 
reducible arrangement, nowadays called 
Birkhoff’s diamond (Fig. 1). Using all reduc- 
ible arrangements known up to that time, P. 
Franklin showed that every map with up to 25 
countries must contain a reducible arrange- 
ment, so that such a map is four-colorable [8]. 
This limit has been gradually increased; in 
1975 a reducible arrangement for 25 countries 
was found. 


Fig. 1 
Birkhoff’s diamond. 


H. Heesch invented the method of discharg- 
ing (— Section D), found criteria for reduci- 
bility, and finally conjectured the existence of 
an unavoidable set of reducible arrangements 
with several thousand elements, but this was 
too large to construct by hand. W. Haken and 
K. Appel with J. Koch worked with high- 
speed computers for a total of 1,200 CPU 
hours over a period of four and a half years 
and finally succeeded in constructing and 
checking an avoidable set of reducible ar- 
rangements with 1,834 elements, which proved 
the four-color problem affirmatively (1976; 

[9, 10]). For early investigations of the four- 
color problem — [5]; for results up to the 
1930s — [3]. 


B. Precise Formulation of the Problem 


To formulate the problem precisely, we must 
state the following two conditions: (i) Every 
country on a map is a tconnected domain; a 
connected part of the sea is considered to be a 
country. (it) Two countries sharing boundary 
lines must be colored differently. On the other 
hand, if two countries share only a finite num- 
ber of points, then they may share the same 
color. 

Actually we can modify the map so that no 
more than three countries meet at the same 
point. A map with this condition is called a 
trivalent or cubic map. In the study of the four- 
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color problem, we can restrict ourselves to 
trivalent maps without two-sided countries. In 
the remainder of this article, we shall assume 
that the maps considered have these 
properties. 

In recent investigations, it has become 
customary to convert maps into their fdual 
graphs, where each country is replaced by its 
capital lying inside, and for each adjacent pair 
A, B (Fig. 2), the boundary is replaced by a 
line connecting the representative points of A 
and B in such a way that it meets the bound- 
ary only once. 


Fig. 2 
Dual graph. 


The formal extension of the four-color prob- 
lem to higher dimensions is trivial, since we 
can easily construct arrangements needing 
arbitrary many colors for coloring. By con- 
verting to a dual graph, one can see that this is 
not surprising, since the planarity of a graph 
is a strong restriction, while the condition 
of representability of a linear graph in 3- 
dimensional space imposes no restriction. 


C. Tait’s Algorithm 


Suppose that a planar trivalent map M is four- 
colored. Denote the four colors by 0, 1, 2, and 
3. In Fig. 3, we use A, B, C, and D instead of 0, 
1, 2, and 3, respectively). Here we take the 
operation a ® bc. Represent the integers 

a, b=0, 1, 2, or 3 in the binary (2-adic) number 
system as 2-digit numbers a, ao and b, bo, 
where a; and b; are 0 or 1. For each digit we 
take the operation a; ® b;=c; to be binary 


Fig. 3 

An example of Tait’s algorithm. At the vertices, @ 
stands for + and © for —. Within the domains, A, 
B, C, and D stand for four colors corresponding to 
0, 1, 2, and 3, respectively. 
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addition without carrying, i.e., the operation is 
that of the logical (exclusive) “or.” The number 
C=C,Cy in the binary system represents the 
result of the operation. To each boundary line, 
we give the number corresponding to a@ b, 
where a and b are the colors for the two coun- 
tries meeting at the boundary. Then we have 
the numbering 1, 2, and 3 for each edge, where 
1, 2, and 3 are labeled once and only once for 
each triple of edges emanating from each 
vertex. Such numbering for edges is called Tait 
coloring for edges. Then we give the signature 
+ or — to each vertex, according as the order 
of the edge numbering is counterclockwise or 
clockwise. Then the algebraic sum of the signa- 
tures along the boundary of each country in 
the map is always a multiple of 3. Conversely, 
if we give the signature + or — to each vertex 
in such a way that the algebraic sum of the 
signatures is always a multiple of 3 along the 
boundary of each country, then we get a four- 
coloring of the map by reversing the above 
procedure. This is called Tait’s algorithm, 
which shows that the four-coloring of a planar 
map is an *NP problem. As for five-coloring, it 
is known that an algorithm of polynomial 
complexity exists. 

Tait’s algorithm also shows that the four- 
color problem is equivalent to the following 
apparently elementary geometric proposition: 
“For any convex polyhedron, we can always 
cut near some of its vertices in such a way that 
the resulting polyhedron has only faces such 
that the number of sides is a multiple of 3.” 
Many other equivalent statements of the four- 
color problem are known [1,2]. 


D. Solution of the Four-Color Problem 


For a given planar trivalent map, denote by 
V, the number of the countries with n sides 
(n> 3). From Euler’s theorem on polyhedra, 
we have immediately the relation 

> (6—n)h,= 12. (1) 
n23 

We easily see from this that every planar map 
must contain 3-, 4-, or 5-sided countries. A 
family F of the arrangements of countries with 
the property that every planar map must 
contain at least one arrangement belonging to 
F is called an unavoidable set. The family con- 
sisting of 3-, 4-, and 5-sided countries is the 
simplest example. In order to obtain an un- 
avoidable set, Heesch invented the method of 
discharging. Let us assume the existence of a 
map M that contains no arrangement of a 
family F. We give a signed charge of (6—n) to 
every n-gon in M. Next we divide and move 
the charges so that the pluses and the minuses 
cancel out. If all positive charges disappear, 
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according to the assumption of the existence of 
the map M, then we have a contradiction of 
(1), so we can conclude that no such map M 
exists. 

A 3-sided country can be ignored in four- 
coloring. As for a 4-sided country A, Kempe 
proved (by means of a definite modification 
procedure) that after four-coloring outside A, 
we also have a total four-coloring including A. 
An arrangement of a country or countries A is 
called reducible if, as in the above case, after 
four-coloring outside A, we get a total four- 
coloring including A by suitable modifications. 
If we have an unavoidable set consisting only 
of reducible arrangements, then by definition, 
every planar map will be four-colorable. 

Kempe believed that he had proved the 
reducibility of a pentagon (5-sided country), 
but unfortunately, he missed a particular case. 
Even though the pentagon is not reducible, 
much effort has been given to finding other 
reducible arrangements. Many useful criteria 
for reducibility have also been studied. If we 
have enough reducible arrangements, then 
we can either eventually obtain an unavoid- 
able set, which proves the four-color problem 
affirmatively, or find a “minimal counter- 
example” without reducible arrangements, 
which disproves it. 

Haken obtained an unavoidable set consist- 
ing of arrangements without certain kinds of 
reduction obstructions. (He called these “geo- 
graphically good arrangements.” ) He firmly 
believed that, applying certain “probabilistic 
conjectures” to such arrangements, he would 
conquer the problem by considering arrange- 
ments of up to 14 countries. With repeated 
improvements on the discharging process and 
on the criteria for reducibility, he finally was 
able to conclude that his speculation was 
correct, as recounted in Section A. His inves- 
tigation is not only an example of the use of 
high-speed computation in pure mathematics, 
but also one inviting reassessment of the 
meaning of mathematical proof. 


E. Coloring Maps on Arbitrary Surfaces 


On a ttorus, seven colors are sufficient to color 
any map and that many colors are necessary 
to color some maps. Heawood (1890) inves- 
tigated the problem of coloring maps on 
closed surfaces with tEuler characteristic 7 <2, 
orientable or not. The least number of colors 
sufficient to color any map on a surface is 
called the chromatic number of the surface. 
Heawood proved that the chromatic number 
is less than or equal to 


p=[(7+./49 — 24y)/2] 


(x <2), (2) 


158 
Fourier, Jean Baptiste Joseph 


where [x] means the largest integer a < «. After 
this was proved in various special cases, J. W. 
T. Youngs and G. Ringel finally proved in 
1968 that the number p equals the chromatic 
number except for a Klein bottle (a non- 
orientable surface with y= 0) [6]. Franklin 
proved in 1934 that the chromatic number for 
a Klein bottle is 6. 
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Jean Baptiste Joseph Fourier (March 21, 
1768—May 16, 1830) was born in Auxerre, 
France, the son of a tailor, and was orphaned 
at the age of eight. In 1790, he was appointed 
professor at the Ecole Polytechnique. In 1798, 
Napoleon took him on his Egyptian campaign 
together with G. Monge. On his return to 
France, he was made governor of the depart- 
ment of Isére. With the downfall of Napoleon, 
he lost his position; however, he was later 
appointed to the French Academy of Science 
as a result of his research on the transmission 
of heat. In 1826 he was elected a member of 
the Académie Francaise. 

His research on heat transmission was 
begun in 1800. In 1811, he presented a prize- 
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winning solution to a problem put forth by the 
Academy of Science. He solved the equation 
for heat transmission under various tboundary 
conditions. Fourier also stated (without rigor- 
ous proof) that an arbitrary function could be 
represented by ftrigonometric series (— 159 
Fourier Series), a statement that gave rise to 
subsequent developments in analysis. 
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Throughout this article, we assume that f(x), 
g(x), ... are real-valued functions and that 
integrals are always Lebesgue integrals. 


A. Introduction 


The set of functions 


1/./2n, cos x/,/n, sinx//n,..., 
coskx/,/n, sinkx/./n, ..., 


which is called the trigonometric system, is 
an torthonormal system in (—z, z) (— 317 
Orthogonal Functions). Let f(x) be an ele- 
ment of L,(—7z, n) (i.e., (Lebesgue integrable 
in({—7,7)). We put 


1 nu 
ay, -5| f{(t)}cos kt dt, 


1 n 
| f(t)sinktdt, k=0,1,..., (1) 
T = 


and call a,, b, the Fourier coefficients of f. The 
formal series 


1 œ 
540+ Y (a,cos kx + b, sin kx) (2) 
k=1 


is called the Fourier series of f and is often 
denoted by S(/). To indicate that a formal 
series S(f), as above, is the Fourier series of a 
function f, we write 


1 oO 
f(x)~ 540+ Y (a, cos kx + b, sin kx). 
k=1 


The sign ~ means that the numbers a,, b, are 
connected with f by the formula (1); it does 
not imply that the series is convergent, still less 
that it converges to f. Generally, trigonometric 
series are those of form (2), where the a,, b, are 
arbitrary real numbers. Since the trigono- 
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metric series have period 27, we assume that 
the functions considered are extended for 

all real x by the condition of periodicity 
J(x+2nx) = f(x). The study of the properties 
of the series S( f) and the representation of f 
by S(f) are major objects of the theory of 
Fourier series. Since e’* =cos x +isinx, if we 
set 2c, =a, —ib,, c_,=C, (k=0, 1,2, ...), we 
have 


1 (7 ; 
amz fedt, k=0, +1,.... 


Then ©(f) is represented by the complex form 
E-o Ce, and fe**} (k=0, +1,...) is an 
orthogonal system in (— z, z). In this complex 
form, we take symmetric partial sums such as 
Dr _,c,e'™ (n= 1,2, ...). 

Consider the power series 3a) + Eg (a, — 
ib,)z* on the unit circle z=e!* in the complex 
plane. Its real part is the trigonometric series 
(2), and the imaginary part (with vanishing 
constant term) is 


Ms 


(a, sin kx — b, cos kx), (3) 
k 


H 


which is called the conjugate series (or al- 
lied series) of f and is denoted by S( f). In 
complex form, the conjugate series is 
—id~._.(sgnk)c, ei. 

If f and g belong to L,(—2, 2) and 


flo~ Yo ae, 


k=— œ 


ga~ È de", 
then 

1 2n 00 i 
x| f(x-—dg(jdt~ VY cde. 
2n 0 k=—-@ 


The function 
1 2n 

Jra) | fœ- Daddi 
R Jo 


is called the convolution of f and g. 
If f is tabsolutely continuous, then the de- 
rivative f'(x) satisfies 


= y k(b, cos kx — a, sin kx). 


k=1 
If F(x) is an indefinite integral of f, then 
=, Ck ikx 
F(x)—cox~ C+ 5 —e 
k=-œ ik 


x a,sinkx— b, coskx 
=C+) k k : 
k=1 k 


where C is a constant of integration and the 
symbol ’ indicates that the term k =0 is 
omitted from the sum. 

If fe L,(—7z, n), then the Fourier coefficients 
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c, converge to 0 as n—> œ (Riemann-Lebesgue 
theorem). If f satisfies the tLipschitz condition 
of order « (0<a< 1), then c,=O(n*), and if f 
is of bounded variation, then c, =O(n~'). 
When feL,(—7, 2), 


1 (27 oo 
| Iœ dx=2 $ lel? 
T k=- %0 


0 


1 fo] 
= 503+ $ (a? +02) 
k=1 


which is called the Parseval identity. If È |c,|* 
< œ, then there exists a function fe L,(—7, n) 
which has the c, as its Fourier coefficients. — 
This converse is implied by the tRiesz-Fischer 
theorem (— Appendix A, Table 11.1). 


B. Convergence Tests 


The nth partial sums s,(x)=s,(x; f) of the 
Fourier series S(/) can be written in the form 


TE f(x +D, (dt, 


where 


D,(t)= {sin(n + 1/2)t}/2 sin(t/2). 


The function D,(t) is called the Dirichlet kernel. 


For a fixed point x we set 9,()=f(x+0+ 
f(x—t)—2f(x); then 


so-so] Px(t)D,(t) dt. 
0 
Hence if the integral on the right-hand side 
tends to zero as n> ©, lim,,..,, (x)= f(x). If f 
vanishes in an interval I= (a,b), then S(f) 
converges uniformly in any interval F =(a + 
£, b — £) interior to J, and the sum of G(/) is 
0. This is called the principle of localization. 
Here we give four convergence tests. (1) If f 
is of bounded variation, S( f) converges at 
every point x to the value { f(x+0)+ f(x — 
0)}/2. In addition, if f is continuous at every 
point of a closed interval J, S(/) is uniformly 
convergent in J (Jordan’s test). As a special 
case of this test, bounded functions having a 
finite number of maxima and minima and no 
more than a finite number of points of dis- 
continuity have convergent Fourier series 
(Dirichlet’s test). (2) If the integral [5 | py(t)|/t dt 
is finite, then S( f) converges at x to f(x) 
(Dini’s test). (3) If 


k 
| |p,(t)|dt = o(h) 


0 


and 


ig [O eo 


0, 
0), t 
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then S(f) converges at x to f(x) (Lebesgue’s 
test). Jordan’s and Dini’s tests are mutually 
independent, and both are included in Le- 
besgue’s test, which, although not as conve- 
nient in certain cases, is quite powerful. (4) If 
f(x) is continuous in (a, b) and its modulus of 
continuity satisfies the condition w(d):log(1/d) 
—0 as 6-0 in this interval, then S( f) con- 
verges uniformly in (a+ £, b — €) (Dini-Lipschitz 
test). 


C. Summability 


Let s,(x) be the nth partial sum of the Fourier 
series S( f), and o,(x)=o,(x; f) be the first 
arithmetic mean ((C, 1)-mean) of s(x) (Le., ¢,(%) 
= (So(x) +5, (x) +... +5,(x))/(n+ 1)). Then we 
have 


m 


1 
ad= Px(t)K,(t)dt, 


0 


where 


1 (ane 1 ay 


KA= sna 


The expression K,(t) is called the Fejér kernel, 
and the a,(x) are often called Fejér means. If 
the right and left limits f(x +0) exist, S( f) is 
*(C, 1)-summable at the point x to the value 
(f(x +0)+ f(x —0))/2. If f is continuous at 
every point of a closed interval I, S(f) is 
uniformly (C, 1)-summable in J (Fejér theorem, 
1904). As we explain in Section H, there exist 
continuous functions whose Fourier series 
are divergent at some points. Thus the sum- 
mability of S( f) is more important than its 
convergence. Fejér’s theorem remains true 

if we replace (C, 1)-summability by (C, «)- 
summability (x >0). More generally, if fe 
L,(—72, x), then G(f) is (C, «)-summable for 
a >0 to the value f(x) almost everywhere (H. 
Lebesgue). Since (C, «)-summability (x > 0) 
implies tsummability by Abel’s method, the 
result of Fejér’s theorem is valid for tA- 
summability. However, the direct study of A- 
summability is also important. Let f(r, x) be 
Abel’s mean of S( f); that is, 


1 a 
f(r,x) = 540 + È (a,coskx + b,sinkx)r* 
k=1 


-1f f(x+OP(r, Odt, 
where P(r, t)=(1 —r’)/2(1 — 2rcost+r7), 0< 
r<1. We call P(r, t) the Poisson kernel. The 
function f(r, x) is harmonic inside the unit 
circle and tends to f(x) as r>1 almost every- 
where. Hence f(r, x) gives the solution of the 
*Dirichlet problem for the case of the unit 
circle. 
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D. The Gibbs Phenomenon 


Let f(x) be of bounded variation and not 
continuous at 0: f(0)=0, f(+0)=/>0, f(—0)= 
—1, Then the partial sum s,(x) converges to 
f(x) in the neighborhood of 0, but not uni- 
formly. Moreover, 


lim s, (z) 216: 
n> n 


2 ("sint 


T Jo 


Hence as x tends to 0 from above and n tends 
to oo, the values y=s,(x) accumulate in the 
interval [0,/G], while s,(—x)= —s,(x) in the 
neighborhood of 0. This phenomenon is called 
the Gibbs phenomenon. If f is of bounded 
variation, then S(f) exhibits the Gibbs phe- 
nomenon at every point of simple discontinuity 
of f. However, the (C, 1)-means of S( f) do not 
exhibit this phenomenon. 


E. Conjugate Functions 


For any integrable fe L,(—z, n), the integral 


fx= 


imt | (e+= tang) at 
h0 Tja 2 


exists almost everywhere. The function f(x) 

is called the conjugate function of f(x). The 
conjugate series S(f) is (C, «)-summable («> 
0) to the value f(x) at almost every point, 

and a fortiori summable by Abel’s method. 
Even if feL,(—7z, 2), f does not always be- 
long to the class L,(—7, z). For example, 
D2 cosnx/logn is the Fourier series of a 
function fe L,(—7, n), but its conjugate series 
=~, sinnx/logn is not the Fourier series of a 
function in L,(—2, 2). However, if both f and 
fare integrable, S(f)=S(/f). If fe L, (p> 1), 
then fe L, and |i flp <4pl flip; also, S(f) 

= S(f). If |f|log*|f]| is integrable (such a 
function is said to belong to the Zygmund 
class), then f is integrable and S(f)=G(/f). 
Moreover, in this case there exist constants A 
and B such that 


2n 2n 
| Jaxa | [fllog*|fldx + B. 


0 0 


If f is merely integrable, so is |f|” for any 0< 
p<1, and If p< Bp ISI (0<p<1). If feLipa 
(0<a<1), then feLipa, but the theorem fails 
for «=0 and a=1. The conjugate function is 

important for convergence of partial sums of 
Fourier series. 
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F. Mean Convergence 


The theorems on conjugate functions enable 
us to obtain some results for the tmean con- 
vergence of the partial sums s, of S(/). If fe 
L, (p> 1), then || f—s,||,0; if fe L,, then 
If~Sllp>0, If —S,ll, 70 for every 0<p< 

1. Also, if | f|log*|fleL,, then || f—s,|| 30, 

| f—§,l —0. As a corollary of this result, we 
obtain the following theorem, which is a gen- 
eralization of the Parseval identity: If the 
Fourier coefficients of functions fe L, and ge 
L, (1/p +1/q= 1) are a,, b, and ap, bp, respec- 
tively, we have the Parseval formula 


Ly 1 a , 
-| fgdx= agayt > (a,a,+5,b,), 
T 2 n=1 


0 


where this series is convergent. 


G. Analytic Functions of the Class H, 


Let p>0. A complex function (z) holomor- 
phic for |z|< 1 is said to belong to the class 
H, (Hardy class) if there exists a constant M 
such that 


2n 
fae | |p(re'®)|PdO0< M. 
rl 2n 0 

When g(z)eH,, the nontangential limit (e) 
=lim,..,io@{z) exists for almost all 0. We write 
this as pe) = f(@) + if(0), where f(0), f(0) 
belong to the class L,,. Also, f(0) coincides 
with the conjugate function of f(8@) for p>1. 
For 1 <p< œ, H, is isomorphic to L,, but 

for p=1 and p= œ, H, and L, are different 
classes. Using the theory of functions of H,, we 
can discuss some properties of Fourier series. 
If p(e*) = f(0) + if(0) is of bounded variation, 
then (e) is absolutely continuous and its 
Fourier series converges absolutely. We set 


1 1/2 
g(0)= (| (a notre ar) , 


g*(0)= 


1 2r 1/2 
(| (1—r)dr | lo'(re!® )|? P(r, pat) $ 
o 0 


where P(r, t) is the Poisson kernel. Then g(@)< 
2g*(0), and there exist constants A,, B, C, 
and A, such that 


22 2n 
| wordo, | lọ(e®)|P? d0, = p>d, 


0 0 


0 0 


2n 2n 
| arora, | lp(e")|’d0, p>l1, 


2n 2n 
| an 1d0<B | |p|log*|g|dé+C, 
0 0 
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2n 2n H 
| Ig*(0)|“d0< A, (f ote" d0 
re) 0 


0<u<l. 


Denote by s,(8) and o,(6), respectively, the 
partial sums and arithmetic means of the 
Fourier series of (e), and set 


7 0 — o (0)2\ 12 
o-($ 80 onl i 


n 

Then 04 A, <g*(0)/y(0) < A, # æ. From these 
relations, we can prove that if the indices 

n, satisfy the conditions B>n,,,/n,>«> 

1, s,,(0) converges almost everywhere to 

ole”) for o(e*)eH, (1 <p). If we set A,(0)= 

> ae itl ce”, 5(0)=(Leo |A,(0)|7)", where 
ole”) ~E Zoc,e™, then ||5(A)||,< A, loll, 
(p> 1). If ø(z)e H, (0<p <1), then Yc,e'” is 
(C, p-! —1)-summable to g(e'*) almost every- 
where. These functions and relations were 
introduced mainly by J. E. Littlewood and 

R. E. A. C. Paley and were later generalized by 
A. Zygmund. There are more precise results 
by E. Stein [7], G. Sunouchi [8], S. Yano [9], 
and others. 


H. Almost Everywhere Convergence and 
Divergence 


P. du Bois Reymond (1876) first showed that 
there exits a continuous function whose 
Fourier series diverges at a point, but the 
problem of whether Fourier series of continu- 
ous functions converge almost everywhere 
(the so-called du Bois Reymond problem) re- 
mained unsolved for many years. At last in 
1966, L. Carleson [10] proved that the Fourier 
series of a function belonging to L, converges 
almost everywhere; hence the du Bois Rey- 
mond problem was solved affirmatively. Using 
Carleson’s method, R. A. Hunt [12] proved 
that 


2n Pp Qn 
| (spise) ix<A,| Iœ)? dx, 
0 n 0 


1<p<o, 


which implies that the Fourier series of fe L, 
(1 <p< œ) converges almost everywhere. Hunt 
also proved that 


27n 
| (sup ts dx 
(0) n 


2n 
<A | Lf) log* [f(o)))2dx-+.4. 


0 


Moreover, P. Sjolin proved that if 


Qn 
Í \f\-log* |f|-log* log" |f|dx< œ, 


0 
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then the Fourier series of f(x) converges 
almost everywhere. 

On the other hand, A. N. Kolmogorov 
gave an integrable function with Fourier 
series diverging everywhere (more precisely, 
lim sup,,..,, |San(x)}| = 00 almost everywhere). 
Using this example, J. Marcinkiewicz showed 
that there is an fe L, such that s,(x; f) oscil- 
lates boundedly almost everywhere. Moreover, 
there exists an integrable function with inte- 
grable conjugate and almost everywhere diver- 
ging Fourter series [3]. For any given tnull set 
E, we can construct a continuous function 
whose Fourier series diverges on every xE E. 


I. Absolute Convergence 


The convergence of the series (1) }(|a,|+|5,|) 
implies the absolute convergence of the trig- 
onmometric series (2) a,/2 + X (a, cos nx + 

b, sinnx). Conversely, if the series (2) con- 
verges absolutely in a set of positive mea- 
sure, the series (1) converges (Denjoy-Luzin 
theorem). For the absolute convergence of 
Fourier series, we have the following tests: If 
feLipa («> 1/2), then S(f) converges ab- 
solutely, but for «= 1/2, this is no longer true. 
If f(x) is of bounded variation and belongs to 
Lip « («>0), S(f) converges absolutely. 

Suppose that the Fourier series of a function 
f(x) is absolutely convergent and the value of 
f(x) belongs to an interval (a, b). If g(z) is a 
function of a complex variable holomorphic 
at every point of the interval (a, b), the Fou- 
rier series of ọ { f(x)} converges absolutely 
(Wiener-Lévy theorem). As a corollary we 
obtain that if S( f) converges absolutely and 
f(x) 40, then S(1/f) converges absolutely. The 
converse of the Wiener-Lévy theorem was 
proved by Y. Katznelson [6]. For a given (x) 
defined in (—1, 1], if the Fourier series of 
p{ f(x)} converges absolutely for every f(x) 
with absolutely convergent Fourier series 
(| f(x)| <1), then (z) is holomorphic at every 
point of the interval [ —1, 1). 

Many problems concerning this topic still 
remain unsolved. In particular, the determi- 
nation of the structure of the functions with 
absolutely convergent Fourier series has not 
been completed. 


J. Sets of Uniqueness 


If a,cosnx +b,sinnx converges to 0 on a set 
of positive measure, then a,, b, 0 (Cantor- 
Lebesgue theorem). A point set E c (0, 27) is 
called a set of uniqueness (or U-set) if every 
trigonometric series converging to 0 outside E 
vanishes identically. A set that is not a U-set is 
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called a set of multiplicity (or M-set). G. Can- 
tor showed that every finite set is a U-set, and 
W. H. Young showed that every denumerable 
set is a U-set. It is clear that any set E of posi- 
tive measure is an M-set, but D. E. Men’shov 
showed that there are tperfect M-sets of mea- 
sure 0. Moreover, N. K. Bari showed that 
there exist perfect sets of type U. However, the 
structure problem of sets of uniqueness has not 
yet been solved completely. 

A set E is said to be of type H (or an H-set) 
if there exists a sequence of positive integers 
n; <n, <... and an interval I such that for each 
xeE, no point of {nx}, is in J (mod 27). H- 
sets are sets of uniqueness, a fact given by A. 
Rajchman. I. I. Pyatetskii-Shapiro generalized 
H-sets to H™-sets [3]. 

Lacunary trigonometric series are series in 
which very few terms differ from zero. Such 
series can be written in the form 


(a,cosn,x +b, sinn x)= }, A,,(x). 


ms 


k 


li 
= 
x 

ll 
= 


S. Sidon established some of the characteristic 
properties of such series; he generalized them 
further and obtained the notion of Sidon sets 
(— 192 Harmonic Analysis). We often define a 
lacunary series more specifically as a series for 
which the n, satisfy Hadamard’s gaps; that is, 
Ny +1/M%>q> 1. Then if E£ (a2 + b2) is finite, 
the series Èg; A,,(x) converges almost every- 
where. Conversely, if X>, A,,(x) is convergent 
in a set of positive measure, then E£ (a? + b?) 
converges. This theorem is related to the 
Rademacher series and random Fourier series 


[4]. 


K. Multiple Fourier Series 


Routine extensions to multiple Fourier series 
from the case of a single variable are easy, but 
significant results are difficult to obtain. Re- 
cently, however, there have been several im- 
portant contributions in this field. 
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A. Fourier Integrals 


In this article we assume that f(x) is a 
complex-valued function defined on R = 
{—00, œ) and t(Lebesgue) integrable on any 
finite interval. If the integral 


a B 
| f(xje"™ dx= lim | f(xje ™ dx 
E Bow a 


exists, it is called the trigonometric integral 

or Fourier integral. We have a general result: 
If f(x)e L,(—, œ), K(x) ts bounded on 

(—o0, œ), and f K(x)dx =o0(T) (T> +0), then 
f2 f(x) K (xt) dx exists and 


lim p f(x)K(xt)dx=0. 


t>~+0 
= 


In particular, it follows that if f(x)eL,(— œ, 
oo), then f2 f(x)e"* dx exists and 


lim | fixe dx =0 


(Riemann-Lebesgue theorem). 


625 


B. Fourier’s Integral Theorems 


Suppose that f(x) is of fbounded variation in 
an interval including x, or more generally 
satisfies the assumption for any one of the 
convergence tests for Fourier series (— 159 
Fourier Series B). Then Fourier’s single inte- 
gral theorem 


1 
sO +9 +f —0) 


am fe eee 4) 


= lim — 


AO TE —xX 


holds if one of the following three condi- 
tions is satisfied: (1) {(x)/(1 +|x]) belongs to 
L,(—o, œ); (2) f(x)/x tends to zero mono- 
tonically as x— +00; (3) f(x)/x =g(x)sin(px + 
q), where g(x) tends to zero monotonically 

as x— +00 (S. Izumi, 1934). The right-hand 
side of (1) is called Dirichlet’s integral. 

Let f(x) be of bounded variation in an inter- 
val including x (or satisfy some other conver- 
gence test for Fourier series). Then Fourier’s 
double integral theorem 


1 
zA +0) +f(x—0)) 


T oe] 
-iim | a | f(u)cost(u—x)du (2) 


holds if one of the following three conditions 
is satisfied: (4) f(x)e L; (—, 00); (5) f(x)/(1 

+ |x|)eL,(—oo, 00), and f(x) tends to zero 
monotonically as x +; (6) f(x)/(1+|x|)e 
L,{—, 00) and f(x)=g(x)sin(px + q), where 
g(x) tends to zero monotonically as x > +o. 
If f(x)e L,(—00, œ), the formula 


sin? A(t — x) 


d 
A(t —x)? : 


1 © 
soim f(t) 
holds almost everywhere, and in particular at 
any x where f(x) is continuous. More gener- 
ally, the formula 


(x)= lim af f)K(A(t—x))dt 


= lim | 1(x-4) Koa (3) 


holds at any point x where 4 (x+0) and f(x— 
0) exist, if K(t)e L4(—o0, 00), [*,, K(t)dt= 

1, |K(t)|<M, K(t)=o(t~') as |t| 00, and 
Aiea (—o, œ), or if K(t)e L,(—o, œ), 
{2., K(t)dt=1, and f(x) is bounded. Similarly, 
the formula 


tim f sŠ) axe eok K(x)dx 


holds if M{ f} =lim,.,, T! f f(t)dt exists, 
K(x) is differentiable, |x? K(x)|< I C (1 <x), and 
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x '($|f(O)|dt<D, where C, D are constants 
(Wiener’s formula). 


C. Fourier Transforms (— Appendix A, 
Table 11.11 


Let f(x)eL, (0, 


rom fè |” surcosueae 
0 


is called the (Fourier) cosine transform of f(x). 
Under the same condition as for the validity of 
(2), the inversion formula of the cosine transform 
f(x) =,/2/n|@ F(t)cos xt dt holds where we 
suppose that f(x)=3( f(x +0)+ f(x —0)). If we 
define f(— x)= f(x), then this is equivalent to 
the formula (2). Analogously, 


G(t) AN f(u)sin ut du 


is called the (Fourier) sine transform of f(x). 
Under the same condition as for the validity 
of (2), we get the inversion formula f(x)= 
Janse G(t)sin xtdt. More generally, for any 
f(x)eL,(—, 00), 


eel f(x)e dx 


is called the Fourier transform of f(x). Under 
the same condition as for the validity of (2), the 
Fourier inversion formula 


oo). Then 


f(x)= 


gi lim Foe dt 
holds. The cosine transform and the sine 
transform coincide with the Fourier trans- 
form when f(x)=f(—x) and — f{x)= f(-x), 
respectively. 

If for any f(x), F(t)e L(— œ, 0) (1 <q <œ) 
exists, for which 


© F 
B gzj Oar Pde 


(T=) 
is valid (i.e., (1/,/2n) fr fe -ixt dt converges 
to F(x) as T- œ in the mean of order q), then 
we say that f(x) has the Fourier transform F(t) 
in L,(—, 00). If f(x)eL,(—00, œ) (1 <p <2), 
then f(x) has the Fourier transform F(t) in 
L, (1/p + 1/q= 1), and F(t) has the Fourier 
transform f(—x) in L, (E. C. Titchmarsh). 
Moreover, 


he [F(t)\?dx 


= 00 


1 oo 1/(p—-1) 
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If f(x), G(x)eL,(—, œ) (t <p <2) and their 
Fourier transforms in L, are F(t), g(t), respec- 
tively, then the Parseval identity 


j F()G())dt= | E TT 


holds. The Fourier inversion formula holds, in 
the sense that 


x 1 æ et] 
| rous |" fe ar dt, 


x 1 © ei4_ 1 
|, f()at= sa) TO _ du 


The theory of Fourier transforms is valid for 
cosine and sine transforms. Specifically for p= 
2, if fe L,(0, 00), then \/2/n [2 f(x): cos xt dt 
tconverges in the mean in L,(0, œ) to the 
cosine transform F(t) as T— œ, and con- 
versely, the cosine transform of F(t) in L, is 

J (x). The transforms f(x), F(x) are connected 
by the formulas 


I. Padus i | rT aise | 
0 T Jo t 
i fid f | Mee sear 
0 T Jo u 


| reas | | F(t)|? dt. 


0 0 


The theory of Fourier transforms was gen- 
eralized as follows by G. N. Watson (Proc. 
London Math. Soc., 35 (1933)). We suppose 
that y(x)/xeL,(0, œ) and 
| OO ay =min(x, y). (4) 
If f(x)e L,(0, œ), then there exists an F(t)e 
L,(0, œ) such that 


[Foarm |" ay 


o 0 u 


and the inversion formula 


[rdus [OPO 4 
0 


0 t 


and the Parseval identity 


| ka= | |F(t)|? dt 


0 0 


hold. F(t) is called the Watson transform of 
f(x). For any feL,, equality (4) is necessary 
for the existence of the Watson transform F(t) 
for which the inversion formula holds. S. 
Bochner (1934) generalized this theory further 
to funitary transformations in L, (— 192 
Harmonic Analysis). 
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D. Conjugate Functions 


Corresponding to Fourier’s double integral 
theorem (2), the integral 


1 À ioe) 
im = | a| sin t(u— x) f (u) du 
>on 0 Say 


is called the conjugate Fourier integral of the 
integral in the right-hand side of (2). Formally, 
this is written lim, t" fo (1 — cos åt)t (f(x 
+t)— f(x—t)}dt. If f(x) is a sufficiently regular 
function, the part involving cos At tends to 0 as 
A>. Now let 
oa lim [a 
T Ano ” t 
For any fe L (0, œ), the integral exists almost 
everywhere, and g(x) is called the conjugate 
function or Hilbert transform of f(x). If feL, 
(p> 1), then g(x)e L, also and we have 
jini fm ("Hate 


T Ar w 
670 


dt 


and f2, lgx) dx <M, |”, |f(x)|’dx, where 
M, is a constant depending only on p. In 
particular, 


| Pa- | \g(x)|? dx for p=2. 


~ © — 0 


E. Boundary Functions of Analytic Functions 


Suppose that a complex-valued function f(z) 
(z=x+iy) is tholomorphic for y>0, f(x +iy) 
converges as y—0 for almost all x to f(x) 
(which is called the boundary function), and 
f(x)eEL,(—, œ) (p> 1). Moreover, suppose 
that f(z) is represented by tCauchy’s integral 
formula or *Poisson’s integral of f(x) on the 
real line. If either f(x)e L, (p> 1) and has F(t) 
as its Fourier transform or f(x) is an L,- 
Fourier transform of F (t)e L, (q> 1), then a 
necessary and sufficient condition for the 
function f(x) to be the boundary function of 
an analytic function is that F(t) be 0 almost 
everywhere for t>0(N. Wiener, R. E. A. C. 
Paley, E. Hille, J. D. Tamarkin). 


F. Generalized Fourier Integrals 


Let | f(x)|/(1 +|x|)eL,(—o0, %0) for a positive 
integer k and 


L, = L(t, x)= x y! 
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The function E,(t) or one that differs from 
E,(t) by a polynomial of degree at most k is 
called the kth transform of f(x). We write 
formally f(x)= f|? „ ed‘ E,(t). Actually, if we 
give an appropriate meaning to the integral, 
this formula itself is valid (H. Hahn, Wiener 
Berichte, 134 (1925); S. Izumi, Téhoku Science 
Rep., 23 (1935)). (For the theory and applica- 
tions of kth transforms — [2,ch. 6].) 


G. Applications of Fourier Transforms 


Suppose that f(x)eL,(—0, œ), F(t) is its 
Fourier transform, and f(x)=o(e~%), where 
(x) is positive and increasing. If f7 0(x)x~? dx 
= œ% and F(t) vanishes identically in an 
interval, then F(t)=0 over (—oo, œ). If 

J? 8(x)x 7? dx < oo, then there exists a function 
f(x) such that F(t) vanishes identically in an 
interval, but F(t) does not vanish identically in 
(—20, œ) (Wiener, Paley, N. Levinson). These 
results are applicable to the theory of fquasi- 
analytic functions. 

Let f(x)e L; (—œ, %0). Then a necessary 
and sufficient condition for any function in 
L,(—, œ) to be approximated as closely as 
we wish by linear combinations of the trans- 
lations EX, a, f(x +h,) of f(x) with respect 
to the L,-norm is that the Fourier transform 
of f(x) does not vanish at any real number. 
When f(x)eL,(—00, œ), a necessary and suffi- 
cient condition that an arbitrary function in 
L?(—oo, œ) can be approximated as closely 
as we wish by BX, a, f(x +h,) with respect to 
the L,-norm is that the zeros of the Fou- 
rier transform of f(x) have measure zero 
(Wiener). This result was used by Wiener to 
prove the generalized Tauberian theorem: 
Suppose that g,(x)eEL,(—, œ) and its Fou- 
rier transform never vanishes. Moreover, let 
g2(x)EL,(—~—00, œ) and p(x) be bounded over 
(—œ, 0). Then lim,..,, |2 gı (x —Hplt)dt = 
Aj®,,g,(t)dt implies that lim,..,, J? 921x 
t)p(t)dt =A f€ „ go(t)dt. Another type of 
Wiener theorem is concerned with tStieltjes 
integrals. Suppose that 


W 


$ sup {gi(x)|< a0 


n=— oo ngx<nt+1 


(hence g,(x)€L,) and that the Fourier trans- 
form of g,(x) never vanishes. Moreover, let 


oe 


sup |g2(x)|< 0 


n=- nSx<ntl 
and let [¥*? |da(z)| be bounded. Then 


(0a) 


im | ainda) A | g,(t)dt 


~— oe 
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implies 


lim | g2(x—t)da(t)=A | g(t)dt 

From these general theorems, we can prove 
various tTauberian theorems about the sum- 
mation of series. Also, these results were applied 
to the proof of the tprime number theorem by 
S. Ikehara and E. Landau (Wiener [1]). In the 
general Tauberian theorem, the boundedness 
of p(t) can be replaced by one-sided bounded- 
ness (H. R. Pitt, 1938). In fact, the first form of 
the theorem still holds if we replace the con- 
dition by the following one: g, and g, are 
continuous, g,(x) 20, the Fourier transform of 
g,(x) does not vanish, g,(x) satisfies the con- 
ditions of the second theorem, and p(x)>C. 
(Concerning Fourier transforms on the topo- 
logical groups — 192 Harmonic Analysis; 

and concerning Fourier transforms of the 
distributions — 125 Distributions and 
Hyperfunctions). 


H. Fourier Transforms of Distributions 


The Fourier transform is defined in higher- 
dimension R” by 


Ff) =(,/2n)" | e 5f(x)dx 


XE =X Oy + X62 +.. + XnGn 


fel, 


One denotes it by f(¢). The inverse transform 
is defined by 


D- “le zgd 


Differentiation under the integral sign gives 


DFE =(/2r)" | e 8(— ix)*f(x)dx 


D* =(6/6x 4)" ...(6/0x,,)”, 


under the assumption (1 +|x|)"f(xJeL, (l= 
X +...+4,). Roughly speaking, the decreas- 
ing order of f(x) when |x| 00 is reflected in 
the differentiability of f(¢). In the same way, 


GAE =(,/20)" [emro dx, 


which shows that the differentiability of f(x) 
is reflected in the decreasing order of f(€). 
The same statements can evidently be made 
for the relation between g(é) and its inverse 
Fourier transform. 

Let ¥ be the space of rapidly decreasing 
functions (x), i.e., such that for all positive 
integers k and «>0, (1 + |x|) D* p(x) remains 
bounded (— 125 Distributions and Hyperfunc- 
tions). From the facts above, it follows that the 
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linear mapping f(x)—= ¥(f) is a topological 
isomorphism from Z, onto K. Let S’ be the 
dual space of Z. Usual function spaces in 
classical analysis are contained in ¥’, for 
instance, the L,-space and that of functions 
increasing at most of polynomial order at 
infinity. 

Let Te“. Then the mapping pe. Kr 
T(F p) is continuous, and ¥ Te% is defined 
by ¥ T(~)=T(F p). F is also a topological 
isomorphism from “ onto Z. Take as an 
example the distribution p.v.1/x. This ts no 
longer a function; however, its Fourier trans- 
form can be calculated as follows: 


d 2i 
lim (, / 27)! | PAE = ! 
Phe e<|x|<A x ./2n 


1 
LS, zi — ¢>0, 

a [eta 1 
T. r 
a 7” cs > 


where the limit is taken with respect to the 
topology of F”. 

Using the definition above, classical results 
can be extended to S’ almost automatically. 
For example, ¥(D*T)=(ié)* F(T), F((—ix)*T) 
= D* ¥(T). Moreover, if Teg’, then F T= 
Jn T(e~**). In particular, F ô =(27) "2. 

The Plancherel theorem says that if for 
J(x)eL, one defines its Fourier transform by 


f(Q=l.im. Ja 


xis A 


e f(x)dx, 


then the correspondence f(x) f(E) is a uni- 
tary mapping from L, onto L,, i.e., 


| If)? dx = | IFE dé. 


This result can be extended. For any non- 
negative integer m, the element of H” (— 

168 Function Spaces) is characterized by its 
Fourier transform: f(x)e H” if and only if 
(ESHI OE L2. Furthermore, for arbitrary 
real s, the space H* can be defined as the set of 
all elements of “’ whose Fourier transform 
f(©) satisfies (1+ |é|)f()eL,. H5 and H~S are 
dual to each other. 

For feL,, ge L, or f, ge La, the convo- 
lution makes sense as a function, and it holds 
that F(f *g)=F(f)F (g). This relation can be 
extended to distributions, to state 


F(S*T)=(F SAFT). 


This holds for (S, T)e 8 x F’, 21, xX Dy (Gz, 
is the dual of Pa) etc. 
Fourier transforms are also often defined by 


fley= efas or = ( eafnj 
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In the former case, the inversion formula takes 
the form 


f= fed 


and the Parseval identity becomes 


[irorax=en fiora 
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A. General Remarks 


A group F is called a free group if it is the tfree 
product (— 190 Groups M) of tinfinite cyclic 
groups G,,...,G, generated by a,,...,a,, TE- 
spectively. Then n is called the rank of F. A 
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free product of tsemigroups is defined similarly 
to that of groups, and the free product of 
infinite cyclic semigroups G,;={1,a;,a?,... } 
(i=1,...,n) is called a free semigroup gener- 
ated by n elements a; (i= 1, ..., n). 

Ifa group G is generated by subgroups 
H; (i=1,...,n) isomorphic to G;, then G is a 
homomorphic image of the free product of 
the groups G;. A subgroup # {e} of the free 
product F of groups G; is itself the free prod- 
uct of a free group and several subgroups, 
each of which is conjugate in F to a subgroup 
of some G; (A. G. Kurosh, 1934). Notably, a 
subgroup # {e} of a free group is itself a free 
group (O. Schreier, Abh. Math. Sem. Univ. 
Hamburg, 5 (1927)). A subgroup of index j of a 
free group of rank n is a free group of rank 
1+j(n— 1) (Schreier). 

Let F be the free group generated by n ele- 
ments a,,...,a,, and let G be a group gen- 
erated by n elements b,,...,5,. Then there is a 
homomorphism of F onto G. Let N be its 
kernel. If the class of a tword w(a,,...,a,) 
belongs to N, then we have w(b,,...,5,)=1. 
We call w(b,,...,5,)=1 a relation among 
the generators b,,...,5,. If N is the minimal 
normal subgroup of F containing the classes 
of words w4 (a4, ..., ap) --+5 Wm (Q1,---5@,), then 
the relations w,(b,,...,5,)=1,..-5Wnlby, -5 
b,)=1 are called defining relations (or funda- 
mental relations). If generators a,,...,a, and 
words W (a1, ..., an), ---, Wn(@y5---5 ap) are 
given, then there is a group generated by 
d,,..-,a, with defining relations w,(a,,..., 
a,)= 1, ...,Wm(4,,..-,4,)= 1. In fact, let F be the 
free group generated by a,,...,a, and N the 
minimal normal subgroup containing the 
classes of words w, (a4, ..., anh -<s Win(@q5 -5 
d,). Then the factor group F/N is such a group. 
A free group is a group with an empty set of 
defining relations. In the preceding discussion, 
n and m are not necessarily finite. If both n 
and m are finite, then G is called finitely 
presented. 


B. The Word Problem 


If a finitely presented group G is given, then a 
general procedure has to be determined by 
which it can be decided, in a finite number of 
computational steps, whether a given word 
equals the identity element as an element of G. 
This is called the word problem (— 190 Groups 
M). A solution to the word problem does not 
always exist (P. S. Novikov [8], 1955); in fact, 
there is a group with two generators and 32 
defining relations for which the word problem 
cannot be solved (W. Boone [7]). However, it 
was shown by V. A. Tartakovskii that the 
problem can be solved for a large class of 
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groups. W. Magnus (1931) showed that it is 
solvable for any group with a single defining 
relation. The word problem is an example of 
decision problems (— 97 Decision Problem). 
The word problem for groups is closely related 
to that for semigroups (A. M. Turing, 1937; E. 
L. Post, 1947; A. A. Markov, 1947). Similar 
problems for other algebraic systems can also 
be considered. The problem of determining a 
general procedure by which it can be decided, 
in a finite number of steps, whether two given 
words interpreted as elements of G can be 
transformed into each other by an (inner) 
automorphism of G is called the transforma- 
tion problem. 

Let F be a free group of rank n and F=F, 
>... DF,>F,,,2>... be the tlower central 
series of F. Then F,/F,,, is a tfree Abelian 
group of rank u,(r)=(1/r) D4), u(r/d)n’, where u 
is the tMobius function (E. Witt). The intersec- 
tion of all subgroups of F of finite index is the 
identity element. 


C. The Burnside Problem 


The original problem of Burnside is: If every 
element of a group G is of finite order (but not 
necessarily of bounded order) and G is finitely 
generated, is G a finite group? E. S. Golod [6] 
(1964) showed that this problem for p-groups 
has a negative solution. The following is the 
more usual form of the Burnside problem: If a 
group G is finitely generated and the orders of 
elements of G divide a given integer r, is G 
finite? Let F be a free group of rank n, N be 
the normal subgroup of F generated by all the 
rth powers x” of elements of F, and B(r,n)= 
F/N. Then the problem is the same as the 
question of whether B(r, n) is finite. For r= 

2, 3, 4, 6 the group is certainly finite (I. N. 
Sanov, M. Hall). The restricted Burnside prob- 
lem is the question whether the orders of 
finite factor groups of B(r,n) are bounded. It 
was solved affirmatively for r a prime (A. I. 
Kostrikin [5], 1959). 

A group generated by two generators x, y 
and satisfying the relations x"= y”=(xy)"=1 
(where u, v, w are integers) is infinite if 1/u + 1/v 
+1/w—1<0, and is of order g if 0<1/u+1/v 
+1/w—1=2/g. 

There is also a finitely presented group 
which is isomorphic to its proper factor group 
(B. H. Neumann). 
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The origin of functional analysis can be traced 
to the 1887 work of V. Volterra. He stressed 
the important notion of operations or opera- 
tors, that is, generalized functions of which 
the domains as well as the ranges are sets of 
functions. A typical example is the operator 
assigning to a function f its derivative f’. An 
operator is called a functional if its values are 
numbers, as in the case of the operator assign- 
ing to a function f the value f'(a) or the value 
fe f(t)dt. In 1896, Volterra considered the 
operator mapping a continuous function f to 
a continuous solution ¢ of the integral equa- 
tion f(x)= p(x) — Jz K(x, y)e(y) dy, where 
K(x, y) is a continuous function. Defining 
Io =@ and (K@)(x)=|7 K(x, y) (y)dy, he 
showed that ¢ is given by p=(I—K) 'f= 
f+Kf+K?f4+..., where K"f=K(K"'f). 
Following this lead, I. Fredholm studied in 
1900 the integral equation f(x)= @(x)— 

A{? K(x, y)o(y) dy containing a parameter å. 
He proved the so-called talternative theorem: 
For a given åo, the operator equation (J — 
AoK)p=f, (Ke)(x) =J2 K(x, y)p(y)dy, either 
admits a uniquely determined continuous 
solution ọ for every continuous function f or 
else (I — åo K)o =0 admits a nontrivial con- 
tinuous solution œo #0. D. Hilbert discussed 
(1904-1910) a tcontinuous linear operator K 
defined on the tHilbert space L, with values 
in L3, and he called a complex number A, a 
*spectrum of K if (I — ĉo K) does not have a 
continuous linear inverse. He proved that if 
K is Hermitian, then K admits a *spectral 
resolution with real spectra only. One of his 
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outstanding contributions was the discovery of 
the ‘continuous spectrum. In 1918, F. Riesz 
proved that Fredholm’s alternative theorem 
holds for a tcompletely continuous linear 
operator in the space of continuous functions, 
and later the result was extended to Banach 
spaces. 

In 1932, three important books by S. Banach 
[1], J. von Neumann [2], and M. H. Stone [3] 
were published. These books treated tclosed 
linear operators that are not necessarily con- 
tinuous. The notion of Banach space was 
introduced: a tnormed linear space complete 
with respect to the distance dis(x, y)= ||x — 
y||. By making use of the tBaire-Hausdorff 
theorem and the fHahn-Banach theorem, 
Banach proved, for closed linear operators 
in Banach spaces, the fundamental principle 
consisting of the fopen mapping theorem, the 
‘closed graph theorem, the tuniform bounded- 
ness theorem, the tresonance theorem, and the 
tclosed range theorem. These theorems were 
modified to be applicable in locally convex 
topological linear spaces by the Bourbaki 
group beginning in the late 1940s. 

In 1929, von Neumann proved, as a mathe- 
matical foundation of quantum mechanics, 
that a closed linear operator T in a Hilbert 
space admits spectral resolution with real 
spectra if and only if T is a tself-adjoint opera- 
tor (— also Stone [3]). The condition that a 
closed linear operator is a tfunction of a self- 
adjoint operator was given by von Neumann, 
F. Riesz, and Y. Mimura (1934—1936). K. Frie- 
drichs (1934) proved that a tsemibounded 
linear operator admits a self-adjoint extension. 
T. Kato [4] (1950) proved that a Schrödinger- 
type Hermitian operator is tessentially self- 
adjoint. 

Von Neumann’s tmean ergodic theorem 
in the Hilbert space (1932) was extended to 
Banach spaces by K. Yosida, S. Kakutani, and 
Riesz in 1936. G. D. Birkhoff ’s tpointwise 
ergodic theorem (1931) was extended by N. 
Wiener (1939), Yosida (1940), E. Hopf (1954), 
N. Dunford (1955), R. V. Chacon and D. S. 
Ornstein (1960), and others in various ways. 
The fAbelian ergodic theorems were discussed 
by E. Hille and R. S. Phillips [5] and Yosida 
[6]. 

The notion of *Banach algebra was intro- 
duced by M. Nagumo in 1936. I. M. Gel’fand 
proved that a commutative Banach algebra 
with multiplicative unit e satisfying |/e|| = 1 
(=the normed ring) admits a representation 
by an algebra of complex-valued continuous 
functions (1941). 

The treflexivity as well as the tduality of 
Banach spaces were studied by S. Kakutani 
(1939), V. L. Shmul’yan (1940), and W. T. 
Eberlein (1941). 


631 


The notion of vector lattice (= tRiesz space) 
was introduced in analysis by Riesz (1930). This 
was followed by the work of L. V. Kantrovitch 
(1935) and H. Freudenthal (1936). Kakutani 
gave two standard types of tBanach lattice: 
the tabstract (M) space (1940) and the tab- 
stract (L) space (1941). M. Krein and S. Krein 
(1940) and Yosida and M. Fukamiya (1940) 
discussed the (M)-type vector lattices, and 
Yosida (1941), the (L)-type vector lattices. H. 
Nakano (1940-1941) and T. Ogasawara and 
F. Maeda (1942) studied the spectral resolu- 
tion of the Banach lattice. 

The connection between ‘Brownian motion 
and tpotential theory was clarified by Kaku- 
tani (1942), and extended by J. L. Doob (1956) 
and G. A. Hunt (1957-1958). 

The tone-parameter semigroup of continu- 
ous linear operators in Banach spaces was 
studied by Hille and Yosida, and they gave in 
1948 a characterization of the tinfinitesimal 
generator of such semigroups. Its dual was 
given by Phillips (1955). The one-parameter 
semigroup of nonlinear tcontractive operators 
in Hilbert spaces was studied by Y. KOmura 
(1967), who obtained a nonlinear version of 
the Hille-Yosida theorem. This has been ex- 
tended considerably in Banach spaces by 
many scholars, e.g., Kato (1967), M. G. Cran- 
dall and A. Pazy (1969), Crandall and T. Lig- 
gett (1971), I. Miyadera and S. Oharu (1970), 
H. Brezis (1973), P. Bénilan (1973), and others. 

In 1936, S. Sobolev gave a generalization of 
the notion of functions and their derivatives 
through integration by parts. This general- 
ization has been extended by L. Schwartz 
(1945-) [9] to the notion of tdistributions, 
which are continuous linear functionals defined 
on the function spaces Z(R") and (R"), and 
this extension gives, e.g., a reasonable inter- 
pretation of Dirac’s 6-function. Since 1959, 
Gel’fand [10] has been publishing, with his 
collaborators, books on the distribution the- 
ory pertaining to function spaces other than 
Y(R") and A(R"). M. Sato introduced (1959- 
1960) [11] the theory of thyperfunctions, as a 
generalization of distribution theory. In the 
case of one independent variable, a hyper- 
function f may be defined as a generalized 
boundary value on the real axis R of a holo- 
morphic function F defined in C!—R!. Hyper- 
function theory has been refined to tmicro- 
local analysis and studied extensively by Sato, 
A. Martineau, H. Komatsu, P. Schapira, T. 
Kawai, M. Kashiwara, M. Morimoto, A. Ka- 
neko, and others (— [12]). 
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163 (XIII.16) 
Functional-Differential 
Equations 


A. General Remarks 


In many models, it is assumed that the future 
behavior of a system under consideration is 
governed only by its present state and not by 
past states. However, for various systems 
arising in practical problems we cannot ignore 
the effect of the past on the future. Such a 
phenomenon is often observed in population 
problems, epidemiology, chemical reactions, 
system engineering, and so on. 

The description of such phenomena may 
involve difference-differential equations 


X(t)= f(t, x(t), x(t — h), ..., x(t — hm)), (1) 


or integrodifferential equations 


t 


s=) | S(t, 8, x(t), x(s))ds. (2) 


0 


An enormous variety of equations is discussed 
in the literature, but most of them can be 
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expressed in the form 
X(t) = f(t, x(s)), (3) 


which is usually called the functional differen- 
tial equation; here s varies in an interval I, and 
f depends on the function x of /,. If t is the 
right or left endpoint of the interval J,, then 

(3) is said to be of retarded or advanced type, 
respectively. Most of the results obtained have 
been from work on equations of retarded type. 
In this case, the maximal length of the interval 
I, is called the retardation (delay, lag, deviation, 
etc.), and (3) is often called the retarded dif- 
ferential equation, delay differential equation, 
or differential equation with lag (retardation, 
deviating argument). 


B. Historical Remarks 


Functional differential equations of a sort were 
studied by Johann Bernoulli in the 18th cen- 
tury in connection with the string problem. 
Since then, a good deal of work has been done 
in this field by many people; some of this work 
was done before the beginning of this century. 
Among the investigators, Volterra [1, 2] is 
noteworthy for his systematic study on rather 
general equations related to problems of 
predator-prey populations and viscoelasticity, 
though his results were ignored by his con- 
temporaries. In the early 1940s, Minorsky 
[3], in his famous study of ship stabilization, 
pointed out the importance of delay effects in 
control theory, with many of the modern 
issues first appearing in his work [4]. Mishkis 
[5] studied linear systems extensively, and 
Driver [6] gave a unified representation for 
functional differential equations. Important 
achievements were given by Krasovskii [7] 
and Bellman and Cooke [8]. They laid the 
foundation for the qualitative theory of func- 
tional differential equations. Inspired by these 
works, many books (such as [9-14]) were 
published. Owing to these books together with 
many articles on this field, the theory of func- 
tional differential equations has become an 
important branch in the theory of differential 
equations. 

Equations of advanced type are treated in, 
e.g., [5,8], but qualitative theory for them has 
hardly been established. The equation 


X(t)=ax()+bx(At) on OQO<t<oo (4) 


is of advanced type if 2> 1, and its analytic 
solution has been studied in detail. Equation 
(3), whose right-hand side involves differential 
operators, e.g., X(t) = f(t, x(t), x(t — h), X(t — h)), 
is said to be of neutral type [5, 8, 14], and is 
generally considered to be of neither retarded 
nor advanced type because of its distinctive 
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features. However, Hale [14] originated the 
study of a class of equations of neutral type for 
which the general qualitative theory can be 
developed in the same manner as for equations 
of retarded type. 

For the case of infinite retardation, proper 
choice of the space of functions x contained 
in the right-hand side of (3) is of great signifi- 
cance. A general treatment of the phase space 
for equations with infinite retardation in an 
axiomatic setting is given in [15]. 


C. Phase Space 


For simplicity, let J, always be the interval 
[t—h,t] with a finite retardation h>0.A 
solution of (3) starting at t=7 is a function 
defined on [t — h, a) for a>r such that the 
solution coincides with a preassigned function 
(the initial function) on [t —h, t] and is con- 
tinuously differentiable and satisfies (3) on 

[t, a). Since a solution starting at t =t must be 
continuous for t >t, it is quite natural, as is 
done in much of the literature in order to 
develop a qualitative theory, to choose the 
space C([t—h, t], R”) of continuous R”-valued 
functions on [t—h, t] as a space of functions x 
involved in the right-hand side of (3). Intro- 
ducing a symbol x, which represents an ele- 
ment of C=C([—hA, 0], R”) defined by x,(s)= 
x(t+s) (se[—h,0]), we can rewrite (3) ina 
more convenient form: 


X()=f(t,X,), (E) 


where the function f(t, o) is defined on Rx C 
(or on its subspace). Here C is considered 

to be a Banach space with the norm |/g|| = 
MAX,er—n,0}1(S)|, |: | being a norm in R”. The 
space C is said to be the phase space of (E), 
and the initial condition can be written as 


x=% (1, 0)ERXC. (5) 


D. Initial Value Problem 


Let f(t, o) be a continuous functions defined 
on an open domain DCR x C. The initial value 
problem is to find a solution of (E)—(5) for a 
given (t, č)e D. Many fundamental results hold 
as for ordinary differential equations: (a) There 
always exists a solution of (E)-(5) under the 
continuity of f on D. Here, the solution means 
the one to the right. No general existence 
theorem is given for the solution to the left. (b) 
The solution of (E)—(5) is unique, if f satisfies 
the Lipschitz condition: | f(t, p)— f(,W)|<L\l@e 
—w/|| in a neighborhood of each point of D, 
where L is a constant depending on the neigh- 
borhood. (c) If f varies in the space C(D, R”) 
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equipped with the compact open topology and 
if the solution of (E)—(5) is unique for every 
(t, č)e D, then the solution is continuous as a 
function of (t, é, f). (d) If f is completely con- 
tinuous on D, that is, f is a continuous map- 
ping of bounded subsets of D into bounded 
sets in R”, then every solution can be con- 
tinuable to the right as long as it remains 
bounded or stays away from the boundary of 
D. This assertion is no longer true in the 
absence of complete continuity. 

To prove the existence theorem, Schauder’s 
‘fixed point theorem is utilized; the Picard 
successive method is also effective under the 
Lipschitz condition. Consider the case when 
f(t, p) can be written as f(t, p)=g(t, (0), o), 
where g(t, x, o) is defined for (t, x, p)eR x R” x 
C, and g(t, x, p)=g(t, x, W) if p(s)=w(s) on 
[—h, —6], 6>0. Equation (1) is one such case, 
where h=maxh,, 6=minh,. Then, g(t, x, x,) is 
a function of (t, x) alone on [t,t +ô] under the 
given condition (5), that is, (E) is reduced to 
an ordinary differential equation. This makes 
it possible to find a solution of (E)—(5) by 
matching successively the solutions of ordi- 
nary differential equations on [t,t + ô], [t+ 
6,7+26],.... This is the step-by-step method, 
which is effective even for equations of neutral 
type with the same property. 

Under uniqueness, the solution x(t) of (E) 
induces a mapping T(t, t): G(t, 7) >C, t>t, 
which maps x,€ G(t, t) to xe C, where G(t,t)c 
C is the set of € for which the solution of (E)— 
(5) is continuable up to t. Then 


T(t,s)T(s,t)=T(t,t) for tsr, (6) 


and T(t, t) is strongly continuous in t. If (E) is 
autonomous, that is, f(t, o) is independent of t, 
then there exists a mapping T(t), t>0, satisfy- 
ing T(t—t)= T(t, t), and {T(t)} >o becomes a 
one-parameter semigroup. 


E. Linear System 


When f(t, o) is continuous on J x C for an 
interval J and is linear in ọ, equation (E) is 
said to be a linear system (denoted by (L)). In 
this case, f(t, o) satisfies the Lipschitz con- 
dition with L= L(t) on I x C for a continuous 
function L(t) of J, which also implies that f is 
completely continuous. Thus the solution of 
(L)—(5) uniquely exists over [t, 00)N J for any 
(t, ¢)el x C, and the mapping (the fundamental 
or solution operator) T(t, t) is a bounded 
linear operator on C for any t, tel, t>t, and 
‘compact if t>1+h. T(t,t) corresponds to the 
fundamental matrix for ordinary linear dif- 
ferential equations, but it is not invertible in 
general. 

A continuous function f(t, g) linear in o 
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may be expressed in the form 


ft, o= | Ld.n(t, s)] (s), (7) 
-h 


where n(t, s) is an n x n matrix defined on I x 
[ —h,0], which is measurable in (t, s) and of 
tbounded variation in s with a total variation 
less than L(t), and the integration is that of 
Stieltjes with respect to s. From this fact it 
follows that the range of the initial functions 
can be extended to the space of fpiecewise- 
continuous functions, and the solution x(t) of 
the nonhomogeneous linear system X(t) = 
S(t, x,)+ p(t) through (t, čje I x C can be rep- 
resented by the constant variational formula 


x(t)=[T(t,7)¢] (0) 
F | [T(t s) ](O)p(s)ds, tèrt, 


where I (s)= E (the unit matrix) for s=0, and 
I'(s)=0 (the zero matrix) for s<0. 


F. Autonomous Linear System 


Let (L) be an autonomous linear system 
X(t) = f(x). 


In this case, (7) is not different from the Riesz 
representation theorem f(~)= {°,[dn(s)] 9(s). 
For (AL) the fundamental operator T(t) plays 
an extremely important role. As was seen, 
{T(t)},50 is a one-parameter semigroup of 
bounded linear operators T(t) which is strong- 
ly continuous in t>0 and compact for t>h. 
Thus the asymptotic behavior of the solutions 
of (AL) are determined by the distribution of 
the spectra o(A) of the infinitesimal generator 
A of T(t), which is given by Ag=@ with the 
domain D(A)= {geC| eC and @(0)= f(g)}. 
The properties of T(t) assert that: (a) ¢(A) 
consists of point spectra alone. (b) The number 
of A,={Aea(A)|ReA>«} is at most finite for 
any xe R. (c) For every A4€0(A) the dimension 
of the generalized eigenspace of 4 is finite. (d) 
The spectra of A coincide with the roots (char- 
acteristic roots) of the characteristic equation of 
(AL), 


0 
det E — | e*s ants) =0, 
—h 


together with their multiplicities. (e) e% e 
o(T(t)), t>0, if and only if Aea(A). 

Let P,, xe R, be the linear space spanned by 
the generalized eigenfunctions corresponding 
to a AEA,. Then, (a) P, is invariant under T(t), 
that is, T(t)P, < P, for t 20; (b) the restriction 
of T(t) to P, is invertible and hence extend- 
able over te R; (c) if če P,, then [T(E] (0)= 
Lier, Palt, Ee“, where p,(t, č) are polynomials 


(AL) 
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in t and linear in €; (d) there is a direct-sum 
decomposition C = P,+Q, such that Q, is in- 
variant under T(t), the projection along Q,7z,: 
C— P, is bounded, and there exist positive 
constants K, e for which || T(t)é|| < Ke® * || El, 
t20 if €€Q,. Hence for any čeC the solution 
x(t) of (AL) through € at t =0 satisfies 


SD palt, Taje” |< K(1 + |la,le* "NE, 


t>0, (8) 


where ||z,|| denotes the operator norm of z,. 
The set Q = (\ser Q, may contain an element ë 
other than the zero element, but T(t)ë must be 
identically zero for t 2 hn. 

For a linear system (L) with f(t, g) œ- 
periodic in t, similar conclusions result, with 
T(%œ,0) in place of A; and relation (8) holds, 
where A, is defined similarly by replacing o(A) 
by {(1/o)log z| we o(T(@, 0))} and p,(t, č) are 
polynomials with w-periodic coefficients. This 
corresponds to *Floquet’s theorem for ordi- 
nary periodic linear systems. E0(T(w, 0)) and 
(1/)log u are said to be the characteristic 
multiplier and the characteristic exponent, 
respectively. 


G. Stability Problem 


The concept of stability can be defined and 
studied in the same spirit as for ordinary dif- 
ferential equations, and the Lyapunov second 
method also turns out to be effective. For 
instance, the zero solution of (E) with f defined 
on Dy =[0, ©) x {gEC|\\p|| <H} for H>0 is 
said to be uniformly asymptotically stable if 
there are a constant «>0 and positive func- 
tions 6(¢) and o(¢) of e >0 such that any solu- 
tion x(t) of (E) satisfies 


|x(t)|<« as long as x(t) exists, (9) 


whenever ||x,|| <ô(e) and tèt or ||x,|| <a and 
t>t+o(e) for t>0. In the above definition, if f 
is completely continuous on Dy and if0<ée< 
H, then the phrase “as long as x(t) exists” is 
redundant, and (9) is equivalent to the claim 
that x(t) exists for all t >t and |x(t)| <£. Under 
the Lipschitz condition and the complete 
continuity on f in (E), the zero solution of (E) 
is uniformly asymptotically stable if and only if 
there exists a continuous R-valued function 
(Lyapunov function) V(t, p) defined on Dy, for 
0< HH, <H such that 


(i) = a(le(O)|)< Vit, ~) < bille, 
(ii) D* V(t, x, <—cVit,x,) 


as long as (t, x,)€ Dy, for a solution x(t) of (E), 
where a(r), b(r) are continuous functions with 
a(r)>0 for r>0, b(0)=0, c>0 is a constant, 
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and D* denotes the upper-right Dini deriva- 
tive. Furthermore, the Lyapunov function can 
be endowed with a Lipschitz condition | V(t, o) 
—V(t,w)|<L, ||\g—w|| for some constant L, if 
f in (E) is uniformly Lipschitzian, that is, the 
coefficient L is constant over the domain. The 
Lipschitz condition together with (ii) makes it 
possible for a solution y(t) of the perturbed 
equation 


V(t)= f(t, y) + P(t, ya) (10) 
to satisfy 
DË V(t, y) < —cVit,y)+L,|P(t yl. 


By using this fact, the stability of (10) can be 
discussed. 

However, for functional differential equa- 
tions it is quite a difficult problem to con- 
struct a suitable Lyapunov function for a 
given (E). 

Many of the attempts to improve the suffi- 
ciency condition that have been made up to 
now are such that a stability property can be 
verified by means of a simple Lyapunov func- 
tion. The main effort has been devoted to 
replacing condition (ii) by another type of 
condition. One of them is 


(ii*) D V(t, x) < —e(|x(0)|) 


under the uniform boundedness of f(t, o), 
where c(r) is a continous function with c(r)>0 
for r>0. Another one is for the case when 

V(t, gp) = W(t, p(0)), where W(t, x) is defined for 
(t, x)eR x R”. In this case, (ii) can be replaced 
by 
(ii**) D*V(t,x,)<—cV(t, x,) 

whenever V(t+5,x,,,)<F (V(t, x,)) for se 

[ —h,0], where c>0 is a constant and Fir) is a 
continuous function satisfying F(r)>r for r>0. 
The condition (11**) was given by B. S. Razumi- 
khin and provides an easier way to construct 

a Lyapunov function. 

For a linear system, uniform asymptotic 
stability implies || T(t, t)|| <1/d(1) for t>+7 and 
|| T(t, t)|| < 1/2 for t> t+ a(x/2)+h. These facts 
together with (6) show that the zero solu- 
tion is exponentially stable, that is, |x(t)|< 
Ke" ||x || for t >rt, where v = (log 2)/(o(a/2) + 
h), K = 2/6(1). From (8) it follows that the zero 
solution of (AL) is uniformly asymptotically 
stable if and only if 


{ZE0(A)|ReASO =. (11) 


H. Equations of Neutral Type 


To deal with an equation of neutral type, such 
as 


X(t)= f(t, X, Xs (12) 
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a phase space such as C'=C'([—h, 0], R") of 
continuously differentiable functions should 
be chosen. The solution of (E) will become 
smooth as time elapses if f in (E) is sufficiently 
smooth. However, this property cannot be 
expected for (12), and the solution of (12)—(5) 
need not belong to C! even if €€C' without a 
specific condition on & such as E(0) = f(z, é, È. 
Such a restriction on initial functions is an 
obstruction to the development of the general 
theory. Equations of the form 


{Die x)= fitx) (N) 


cover a fairly general class of equations of 
neutral type, where D(t, o) and f(t, o) are 
defined on an open set D cR x C and a solu- 
tion of (N)—(5) means a continuous function 
x(t) defined on [t —h, a), a>t, which satisfies 
(5) and 


D(t, x)= D(t, €)+ [re x)ds on [t,a). 


(E) corresponds to the case where D(t, g)= 
(0), while X(t)= G(t, x,)+ g(t, x,) is reduced 
to the form (N) if G(t, g) is linear in ọ and 
continuously differentiable with respect to t by 
setting D(t, p) = o(0) — G{t, p) and f(t, g)= 
g(t, p) —(0/6t)G(t, p). 

A continuous function D(t, o) linear in ¢ is 
said to be atomic at 0 if in the representation 


0 
D(t,@)= | [d ult, s)] e(s) 
=h 


(see equation (7)) P(t)= u(t, 0)— u(t, — 0) exists 
and is nonsingular, which is equivalent to 


D(t, p) = P(t)e(0)+ | [d.Ho(t, s) ]e(s) (13) 


with a nonsingular matrix P(t) and a matrix 
Ho(t, s) whose total variation with respect to s 
on [ —o,0] tends to 0 as o0 locally uni- 
formly in t. A nonlinear function D(t, p) is said 
to be atomic at 0 if D(t, p) has a continuous 
*Fréchet derivative D,(t, p) with respect to 
and D,(t, o) is atomic at 0. In equation (N), 
D(t, o) is always assumed to be atomic at 0, 
and many of the fundamental theorems for (E) 
are also valid for (N). Let T(t) be the operator 
solution of the autonomous linear equation 


“ Dix) = f(x). (14) 


Then {T(t)},>o is a strongly continous semi- 
group of bounded linear operators, and the 
corresponding tinfinitesimal generator A has 
the domain D(A)={geC|@eEC and D(ġ)= 
f(@)}. The properties of the spectra o(A) are 
the same as for (AL), except that the character- 
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istic equation is now given by 


0 0 
cet 100+ | ae | canis) |=0 


=h =h 


and the property (b) is true if x> ap, where 
D(ep), f(@) are assumed to be of the forms (13) 
and (7), respectively, without the argument t, 
and where dp is given by 


0 
det] P+ |” edna |=0. 
-h 


Similarly, the decomposition C = P, +Q, is 
possible if x>ap. Hence, to obtain the uniform 
asymptotic stability of the zero solution of (14), 
the condition a, <0 should be assumed in 
addition to (11). The linear function D(¢) is 
said to be stable if ap <0. If D(ọ) is stable, then 
the Lyapunov method is also applicable, as a 
sufficient condition, to the stability problem of 
(N) with linear D(ọ) instead of general D(t, o), 
but in this case condition (i) for V(t, g) should 
be replaced by. 


a(|D(9)|)< V(t, o) < d(|¢|)). 


ap=max {Re A 


I. Infinite Retardation 


There are some basic differences between cases 
of finite retardation and of infinite retardation. 
For example, in the definition of stability, the 
inequality |x(t)|< € in (9) can be replaced by 
lx I| <£ with no difference in the case of finite 
retardation, but this replacement yields a 
different concept deeply connected with the 
choice of the phase space in case of infinite 
retardation. 

There are several ways to define a phase 
space for a functional differential equation 
with infinite retardation. One of those phase 
spaces generally used is a linear space X of 
functions: (—co,0]—R"” with a seminorm ||- ||, 
such that if a function x:(—0, a)—>R" satisfies 
x,€X and it is continuous on [t, a], then 


(i) x,é€X for all tet, a), 
(ii) | x, is continuous as a function [t, a) X, 
(i) m| x(t) S |x My < K max, ep, | X(5)] 

+ M(t—1)|lx,Ilx, 


where m and K are positive constants and 
M(t) is a continuous function. 

If X satisfies the foregoing conditions and if 
f(t, @) is continuous on an open domain Dc 
R x X, then the local properties (a)—(c) in Sec- 
tion D hold, and so does (d) when D=R x X. 
On the other hand, if X has a fading memory, 
namely, 


(iv) M(t)<Me “ in (iii) for positive con- 
stants M, u, 
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then the procedure in Section F is applicable 
by restricting o(A) to o,(A)= {AE 0(A)|Red> 

— u} and, hence the zero solution of (AL) with 
infinite retardation is uniformly asymptotically 
stable under (11). 

For 0<h<@ and 0<y<.o, both the space 
C,,, of continuous functions @:(—h,0]—R’ 
with a finite limit lim, ._,e”@(s) and the space 
M,,, of measurable functions @:(—h,0]—>R" 
with [°,,e”"|@(s)|ds< œ satisfy the foregoing 
conditions, where the norms are given by 
lole, =SUP 1.0121 0(5)| and lolim, ,=100)] 
+ [°,,e”|~(s)|ds. These spaces have a fading 
memory if h< œ or y>0. 

In much of the literature in which the equa- 
tion of infinite retardation has a form like 
(2) or (4), the space CB of bounded continu- 
ous functions or the space Cy of continuous 
functions with compact supports are suffi- 
cient as a phase space under the norm ||ọ|| = 
SUP, <o|P(s)|. However, it is to be noted that 
Co is not complete when CB does not satisfy 
condition (ii). When CB is chosen as the phase 
space, in order to show the existence theorem, 
f(t, x,) should be continuous for any bounded 
continuous function x in addition to the cont- 
inuity in (t, o). This condition is satisfied if J, 
=[ g(t), t] in (3) for a continuous function 
g(t)<t. Equation (2) or (4) with 0< å <1 give 
rise to such a case, but g(t) must be equal to 0 
in (2) and (4) with 1=0. If g(t) œ as t> œ, 
the Razumikhin condition (ii**) D* V(t, x) < 
—cV(t, x) whenever V(s, x,)< F (V(t, x,)) 

(se g(t), £]) for a Lyapunov function is effec- 
tive, but one can conclude only that x(t)—>0 as 
too without uniformity. 
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A. Definition [1-3] 


Let C(X) be the tBanach algebra of all con- 
tinuous complex-valued functions on a com- 
pact Hausdorff space X with pointwise oper- 
ations and funiform norm. A function algebra 
(or uniform algebra) on X is a closed sub- 
algebra A of C(X) containing the constant 
functions and separating the points of X, i.e., 
for any x, ye X, x# y, there exists fe A with 
F(A f(y). If o, xe X, denotes the evaluation 
mapping f—/(x) of A, the correspondence x—> 
P, is a homeomorphism of X into the tmaxi- 
mal ideal space M(A) of A. Since f (¢,) = f(x) 
for any xe X and fe A, the fGel’fand represen- 
tation of A is an isometric isomorphism. By 
identifying x with ọ,, we regard X as a closed 
subset of Mt(A) and the Gelfand transform f 
JeéA, as a continuous extension of f to M(A). 


B. Examples [1,3,4] 

For a compact plane set K let P(K) (R(K)) be 
the subalgebra of all functions in C(K) that 
can be approximated uniformly on K by poly- 
nomials (rational functions with poles off K). 
A(K) denotes the subalgebra of all functions in 
C(K) that are analytic in the interior of K. 
These are function algebras on K and P(K)& 
R(K)¢& A(K). When K is the unit circle T= 
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{|z|=1}, P(T) is called the disk algebra, the 
most typical concrete example. 

The theory of function algebras emerged 
from attempts to solve, by means of functional 
analysis, certain problems in complex analysis, 
especially problems of uniform approximation, 
e.g., when does A(K)= P(K) or A(K)=R(K) 
hold? 

Another important example is given by the 
algebra H,,(U) of all bounded analytic func- 
tions on a bounded plane domain U with the 
supremum norm || f||,, =sup{|f(z)||ze¢U} [5]. 
Since the Gel’fand representation is an iso- 
metric isomorphism of H,,(U) into the algebra 
C(t A,(U))), H,,(U) is viewed as a function 
algebra on the space Nt(H,,(U)). Further 
examples result if we take K or U ina Rie- 
mann surface or in the complex n-space. 

We list a few abstract function algebras that 
reflect certain relevant properties possessed 
by concrete examples. A is called a Dirichlet 
(resp. logmodular) algebra if the set Re A= 
{Re f|feA} (resp. log|A7"|= flog fIlf fe 
A’) is dense in Cy(X), the space of all con- 
tinuous real-valued functions on X. It is called 
hypo-Dirichlet if the closure of Re A has finite 
codimension in Cp(X), and the linear span of 
log|A~*| is dense in Cp(X). 

In the following, A denotes a function alge- 
bra on X unless otherwise specified. 


C. Boundary and Representing Measure 


[1-4] 


A subset E of X is a boundary for A if for any 
feA there exists xe E such that | f(x)|= || fl. 
A closed boundary is a boundary closed in 
X.G. E. Shilov proved that there is a smallest 
closed boundary, ðA, which is called the Shilov 
boundary for A. A positive Borel measure u on 
X is a representing measure for ye 3A) if {(@) 
=| f(x)du(x) for all fe A. Each pe MA) has a 
representing measure supported by 0A. The 
Choquet boundary, c(A), consists of all xe X 
such that the evaluation , at x has a unique 
representing measure. Then c(A) is a bound- 
ary, whose closure is 0A. If X is metrizable, 
c(A) is a G; set in X and supports a represent- 
ing measure for every member of M(A). 

For ge Wt(A), M, denotes the set of repre- 
senting measures for o. It is a tweak* com- 
pact convex subset of the space of measures 
on X. M, is a singleton if A is Dirichlet or log- 
modular. It is finite-dimensional if A is hypo- 
Dirichlet. The case dim M, < +0 has been 
studied in detail. Extensive studies for the case 
dim M,,= +œ have been done only for con- 
crete examples related to polydisks, infinitely 
connected domains, etc. 

The notion of boundary reflects the tmaxi- 
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mum modulus principle for analytic func- 
tions, and representing measures come from 
*Poisson’s integral formula. The most relevant 
maximum principle is H. Rossi’s local maxi- 
mum modulus principle: For any closed set K 
in M(A), we have || f |lk= || llbaxucxnea) for all 
JEA, where bd K is the topological boundary 
of K and |/f||;=sup{|f(s)||seS}. A corre- 
sponding result for function spaces was con- 
sidered by Y. Hirashita and J. Wada. Closely 
related to representing measures are the or- 
thogonal measures for A, i.e., complex Borel 
measures u on X such that f fdu=0 for all 
fA; they are often useful in studying func- 
tion algebras by means of the duality tech- 
nique. The set of orthogonal measures is 
denoted by A+. 


D. Peak Sets [1-3] 


A subset K of X is a peak set for A if there 
exists fe A such that f(x)=1 for xe K and 
|f(y)\<1 for ye X —K. K is a generalized peak 
set if it is the intersection of peak sets. A point 
xeéX is a (generalized) peak point if the set 

{x} is a (generalized) peak set. The set of 
generalized peak points equals the Choquet 
boundary for A. If X is metrizable, the peak 
sets and generalized peak sets coincide. There 
exist X and A such that X is metrizable: X = 
Mt A) =c(A), but A 4 C(X) (B. Cole). A sub- 
set E of X is interpolating for A if for any 
bounded continuous function u on E there 
exists fe A with f | E =u. Then a closed G; set 
K in X is an interpolating peak set if and only 
if ze At implies | 4|(K)=0 (E. Bishop). 


E. Antisymmetric Decomposition [1-3] 


A subset F of X is a set of antisymmetry for A 
if every function in A which is real-valued on F 
is constant on F. Bishop’s antisymmetric de- 
composition then appears as an extension of 
the tWeierstrass-Stone theorem on uniform 
approximation. It is a refinement of Shilov’s 
decomposition and reads as follows: If {E,} is 
the family of maximal sets of antisymmetry for 
A, it is a partition of X into generalized peak 
sets such that fe C(X) with f |E € A|E, for all 
a belongs to A. An interesting connection was 
found by J. Tomiyama between the maximal 
antisymmetric decomposition of X relative to 
A and that of MA) relative to A. 


F. Parts and Analytic Structure [1—4] 


A. M. Gleason defined an equivalence relation 
~ in M(A) by setting p ~y if sup{| f(o)— 
SWI SEA, If | <1} <2. Each equivalence 
class for this relation is a part (or a Gleason 
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part) for A. Two points ọ, y belong to the 
same part if and only if there exist mutually 
absolutely continuous representing measures 
for ọ and v for Y such that c~! <du/dv<c for 
some constant c>0 (Bishop). An analytic 
structure in M(A) is a pair (V, t), with an tana- 
lytic set V in some open subset of C” and a 
nonconstant continuous mapping tT: V>M(A) 
such that fot is analytic on V for all fe A. 
For such a structure, t(V) is always within a 
part. But there can be a nontrivial part with 
no analytic structure, as was shown by G. 
Stolzenberg. A topological characterization 

of parts was obtained by J. Garnett. Sample 
results in the positive direction are: (1) If the 
ideal A,={ feA If(g)= 0} is finitely generated, 
there ts an analytic structure (V, t} such that + 
is a homeomorphism of V onto an open neigh- 
borhood of ø (Gleason); (2) if g has a unique 
representing measure and if the part P of ọ is 
not a singleton, there is a bijective continuous 
mapping t of the open unit disk onto P such 
that f ot is analytic for fe A (J. Wermer, K. 
Hoffman and G. Lumer); (3) if A is hypo- 
Dirichlet and if a part P is not a singleton, P 
can be made into a !-dimensional tanalytic 
space so that each fe A is analytic on P (J. 
Wermer and B. V. O'Neill). Analytic structures 
in tpolynomially convex hulls of curves in C” 
have also been studied [3, 4]. 


G. Abstract Function Theory [1,6-9] 


Let pe Mi(A), and choose me M,, which is 
fixed. The generalized Hardy class H,(m), 0 
<p< œ, associated with A is the closure 
(weak* closure, if p= œ) of A in the İL, space 
L,(m) on the measure space (X, m). Under 
suitable restrictions on A, œ, or m, we can 
recapture some of the important classical facts, 
most of which have their origins in the works 
of A. Beurling, R. Nevanlinna, F. and M. 
Riesz, and G. Szegö. In this area, H. Helson 
and D. Lowdenslager came up with a powerful 
method using orthogonal projections in Hil- 
bert space and gave together with S. Bochner’s 
remark, a strong influence for subsequent 
development. The modification argument was 
then devised by Hoffman and Wermer, in- 
spired by F. Forelli. After Hoffman’s detailed 
study of logmodular algebras, Lumer observed 
that most results remain valid when ọ has 

a unique representing measure. T. P. Srint- 
vasan and J.-K. Wang (— [8]) introduced the 
notion of weak* Dirichlet algebra and showed 
that some major theorems are mutually equiv- 
alent and are in fact measure-theoretic. With 
the Hoffman-Rossi complement, their result 
now states that for fixed me M, the following 
are equivalent: (i) A+ A is weak* dense in 
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L,,(m), i.e., A is a weak* Dirichlet algebra on 
(X, m); (il) if we M, is absolutely continuous 
with respect to m, then u =m; (iii) if a closed 
subspace M of L,(m) is simply invariant in the 
sense that A, M&M and A,M is not dense in 
M, then M =qH,(m) with qe L,,(m), |q|=1 
a.e.; (iv) the set log|H,,(m)~'| coincides with 

L (m; R), the set of real-valued elements in 

L (m); (v) for every we L; (m), w>0, we have 
inf{f |1 — f| wdm| fe Ap} =exp({log w dm); (vi) 
the linear functional h>{hdm on H,,(m) has a 
unique positive extension to L,.(m). Further 
extensions were subsequently made by use of 
the conjugation operator (— Section K) by 

T. Gamelin, H. K6nig, Lumer, K. Yabuta, 
and others. Some more properties of weak* 
Dirichlet algebras have been obtained by T. 
Nakazi. 


H. Generalized Analytic Functions [1,10] 
Let I be a dense subgroup of the additive 
group R of the reals with discrete topology, 
and let G be the character group of I’. Each 
aeT, as a character of G, defines a continuous 
function x, on G. Let A be the closed sub- 
algebra of C(G) generated by {y,|aeT,a>0}. 
A is a Dirichlet algebra but is far more difficult 
to describe than the disk algebra. The study of 
this algebra, especially that of invariant sub- 
spaces, has evolved from papers by Helson 
and D. Lowdenslager. Let c be the tHaar 
measure of G and H,(¢) the closure of A in 

L (e). A closed subspace M of L,(o) is called 
invariant if y,M S M for all acl, a>0. M is 
called doubly invariant if y, M <M for all ae 
T. Otherwise, it is called simply invariant. In 
fact, only the latter is interesting. Let e,€ G 
with teR be the character of I defined by 
e,(a)=e'"*. Then the mapping te, is a faith- 
ful representation of R into G. A cocycle on 

G is defined to be a Borel function B on G x 
R such that (i) | B(x, )|= 1, (ii) B(x +e,, 0 = 
B(x, s) B(x, s+) for xeG and s, teR. Two 
cocycles are identified if they differ only on 

a null set in G x R. For a cocycle B, let Mp 

be the set of feL,(c) such that B(x, t) f(x + 
ee H,(dt/(1+ t7)) for almost all xe G, where 
H,(dt/(1 + t)) is the closure, in the space 
L,(dt/(1+t7)) on R, of the set of boundary 
value functions on R of bounded analytic 
functions on the upper half-plane. Then the 
mapping B— M; is a bijection from the set of 
cocycles onto the set of simply invariant sub- 
spaces M of L,(a) such that M=(\{y,M|ae 
I',a<0}. Moreover, Mg =qH,(c) for some 

qe L,,(0), |q|=1 ae. if and only if B(x, t)= 
q(x) q(x +e,), i.e., B is a coboundary. When T # 
R, there is a cocycle that is not a coboundary. 
Further studies have been done by Helson, 
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Gamelin, J. Tanaka, and others. On the other 
hand, F. Forelli observed that tflows in com- 
pact spaces give rise to a kind of analyticity. 
In the above case, the algebra A consists of all 
f¢C(G) that are analytic with respect to the 
flow T,(x)=x-+ e, for xeG and teR. General- 
ized analytic functions induced by flows in 
general have been studied by Forelli and P. S. 
Muhly. 


I. The Unit Disk [1,6,7, 11, 12] 


Let A be the disk algebra P(T), D the open 
unit disk {|z|< 1}, and m the normalized Le- 
besgue measure on T. The algebra A has been 
the most important model in the theory of 
function algebras, and we find here the origin 
of many abstract results. Some typical results: 
(i) every orthogonal measure for A is abso- 
lutely continuous with respect to m (F. and M. 
Riesz); (ii) a closed set E in T ts an interpolat- 
ing peak set for the tGel’fand transform A 

of A if and only if m(E)=0 (W. Rudin and L. 
Carleson); (iti) A is maximal among the closed 
subalgebras of C(T) (Wermer); (iv) a function 
feC(CID) belongs to A if f is analytic at every 
zeD with f(z) 40 (T. Rado). 

The generalized Hardy class H,(m), 0<p< 
œ, associated with A is viewed as the set of 
nontangential boundary value functions of 
elements in the classical *Hardy class H,(D). 
Here we find the origin of invariant subspace 
theorems: A closed subspace M (+ {0}) of 
H,(m) is invariant, i.e., AM € M, if and only 
if M =qH,(m) with qe Ha (m), |q|=1 a.e. 
(Beurling). 

The algebra H,, = H,,(m) is a weak* Diri- 
chlet algebra, whose Shilov boundary is identi- 
fied with the maximal ideal space X of L, (m). 
We have L,,(m;R)=log|(H”) "|, and a fortiori 
H, is logmodular on X. The mapping z>¢, 
embeds the disk D in M(H) as an open set. 
The structure of M(H) was studied in detail 
by I. J. Schark, Hoffman, and others. We finish 
with three remarkable results: (1) D is dense in 
M(H) (Carleson). This is the corona theorem 
and was proved in the following equivalent 
form: For any f,,...,f,¢€H,,.(D) with | f,|+...+ 
| f,|2e>0 on D, there exist g4, ..., g E Ho (D) 
with figi +... + faga = 1. A simple proof was 
discovered by T. Wolff [12]. (ii) The convex 
combinations of tBlaschke products are uni- 
formly dense in the unit ball of H (D) (D. 
Marshall) [12]. (iii) Every closed subalgebra B 
between H„ and L (m) is a Douglas algebra, 
i.e., B is generated by H,, and the complex 
conjugates of a family of inner functions (S.-Y. 
Chang and Marshall) [11]. The proof of (iii) is 
an interesting application of the theory of 
tbounded mean oscillation. 
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J. Rational Approximation [1,2] 


The problem of rational (polynomial) approxi- 
mation on a compact plane set K asks when 
A(K)=R(K) (A(K)= P(K)) holds. An impor- 
tant tool for such problems is Cauchy’s trans- 
form of measure u: fi(f)=|(z—¢) * du(z). Using 
this, one can show, for instance, the following: 
(1) A(K)= P(K) if and only if K has a con- 
nected complement (S. N. Mergelyan); (2) fe 
C(K) belongs to R(K) if each point ze K has a 
closed neighborhood V with flave RIKANO V) 
(Bishop); (3) A(K)= R(K) if the diameters of 
the components of C— K are bounded away 
from zero (Mergelyan); (4) C(K)= R(K) if and 
only if almost all points of K are peak points 
for R(K) (Bishop). The last result cannot be 
extended much because there is a set K such 
that A(K)#R(K), while A(K) and R(K) have 
the same peak points (A. M. Davie). 

A complete characterization for A(K)= 
R(K) was obtained by A. G. Vitushkin: (5) 
The following are equivalent: (i) A(K)= 
R(K); (ii) for any bounded open set D in C, 
a(D —K)=a(D— K°); (iti) for any zebd K, 
there exists r>1 such that limsup, ,. «(A(z; ô) 
~ K°)/a(A(z;rd)— K)< +00, where A(z; ô) is 
the disk {weC||w—z|<6} and a(E), for any 
bounded set E in C, ts the continuous ana- 
lytic capacity of E, which is the supremum of 
| f ‘(o0)| for all continuous functions f on the 
Riemann sphere CU {æ} such that |f|<1, 
f(%)=0, and f is analytic off a compact sub- 
set of E. As for uniform or asymptotic ap- 
proximation on noncompact closed subsets 
in C, Carleman’s classical study has recently 
been extended by N. U. Arakelyan, A. A. 
Nersesyan, A. Stray, and others in an interest- 
ing way. 

In connection with rational approximation, 
we should note detailed studies on pointwise 
bounded approximation in H,,(U), U being 
a bounded open set in C, by O. J. Farrell, L. 
Rubel and A. Shields, Gamelin and J. Garnett, 
and Davie. 


K. Further Topics 


(1) Conjugation operator [9, 13]. Take any 

me M,, pe WA). The conjugation operator 
then associates with each ue Re A the unique 
element *u in Cp(X) such that u+i*ue A and 
{*udm=0. After the classical inequalities of 
Kolmogorov and M. Riesz, we consider the 
following conditions with constants c, and d,: 
(K) (f |*ul?dm)"’? <c, flu|dm for 0<p<1;(M) 
(f|*ul? dm)" <d (f lu|? dm)" for 1 < p< œ. Let 
m be arbitrary. Then the inequality (K) is valid 
for ue Re A, u>0. The inequality (M) is valid 
for all ue Re A if p is an even integer > 2; it is 
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valid for all ue Re A, u>0, if p is not an odd 
integer. All remaining cases have counter- 
examples. On the other hand, M, always con- 
tains an m such that log] /(ọ)|< flogi f|dm for 
J€A (Bishop). Such a representing measure is 
called a Jensen measure for o. If m is Jensen, 
(K) is valid for all we Re A and all O<p<1, 
and (M) is valid for all ue Re A and all 1 < 
p<. 

(2) Riemann surfaces. For a compact bor- 
dered Riemann surface R, let A(R) be the 
algebra of functions, continuous on CIR and 
analytic on R. Then A(R) is hypo-Dirichlet on 
the border bd R of R, and many results for the 
disk algebra are extended to A(R). Most of the 
basic results are described in [14]. The maxi- 
mality of A(R) in C(bd R) was obtained by H. 
Royden; extreme points of related Hardy 
classes were discussed by Gamelin and M. 
Voichick; and invariant subspaces were deter- 
mined by Forelli, M. Hasumi, D. Sarason, and 
Voichick. A further extension to infinitely 


connected surfaces has been obtained by C. W. 


Neville, Hasumi, and M. Hayashi in the case 
of open Riemann surfaces R of Parreau- 
Widom type, which is defined as follows: Let 
G(a, z) be tGreen’s function for R, and let 
B(a,&), «>0, be the first tBetti number of 
the domain {ze R|G(a, z)> a}; then R is of 
Parreau-Widom type if | B(a, «)d«< +00. For 
such surfaces the situation looks favorable: 
For instance, the Cauchy-Read theorem is 
valid, and the Brelot-Choquet problem con- 
cerning Green’s lines is solved affirmatively. 

As for the generalization of approxima- 
tion theorems of Mergelyan and Arakelyan, 
we refer to the work of Bishop and L. K. 
Kodama for compact sets and to that of S. 
Scheinberg for noncompact closed sets. 

(3) Higher-dimensional sets. Much attention 
has been paid to algebras of analytic functions 
on domains in C”, n> 2, e.g., polydisks, unit 
balls, and general pseudoconvex domains. 
Polydisk algebras and ball algebras have been 
studied extensively by W. Rudin, P. Ahern, 
Forelli, and many others [15, 16]. Approxi- 
mation theorems of Mergelyan type were ob- 
tained by G. M. Henkin, N. Kerzman, and I. 
Lieb for strictly pseudoconvex domains with 
smooth boundary and by L. Hormander and 
Wermer and L. Nirenberg and R. O. Wells, Jr., 
in the case of totally real manifolds. Further 
improvements have been obtained by R. M. 
Range, A. Sakai, and others. 
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Functions 


A. History 


Leibniz used the term function (Lat. functio) in 
the 1670s to refer to certain line segments 
whose lengths depend on lines related to 
curves. Soon the term was used to refer to 
dependent quantities or expressions. In 1718, 
Johann Bernoulli used the notation øx, and by 
1734 the modern functional notation f(x) had 
been used by Clairaut and by Euler, who 
defined functions as analytic formulas con- 
structed from variables and constants (1728) 
[1]. tCauchy stated (1821) [2]: “When there is 
a relation among many variables, which deter- 
mines along with values of one of them the 
values of the others, we usually consider the 
others as expressed by the one. We then call 
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the one an ‘independent variable,’ and the 
others ‘dependent variables.’” tDirichlet consi- 
dered a function of xe[a, b] in his paper (1837) 
[3] concerning representations of “completely 
arbitrary functions” and stated that there was 
no need for the relation between y and x to be 
given by the same law throughout an interval, 
nor was it necessary that the relation be given 
by mathematical formulas. A function was 
simply a correspondence in which values of 
one variable determined values of another. 


B. Functions 


Today, the word “function” is used generally 
in mathematics in the same sense as a tmap- 
ping (— 381 Sets C) or, which is the same 
thing, a tunivalent correspondence (— 358 
Relations B). But this word is sometimes used 
in a wider sense, to mean a general (not nec- 
essarily univalent) correspondence, called a 
many-valued (or multivalued) function; in that 
case a univalent correspondence is called a 
single-valued function. 

Specialists in each branch of mathematics 
have their respective ways of using the word. 
In analysis, values of a function are often 
considered real or complex numbers; such 
functions are called real-valued functions or 
complex-valued functions, respectively. Fur- 
thermore, if the domain of the function is also 
a set of real or complex numbers, then it is 
called a real function or a complex function, 
respectively (— 131 Elementary Functions; 84 
Continuous Functions; 198 Holomorphic 
Functions). If the domain of a real- or 
complex-valued function is contained in a 
function space, the function is often called a 
functional; the tdistribution is an example. In 
algebra we often fix a ‘field, tring, etc., and 
consider functions whose domains and ranges 
are in such algebraic systems. Special names 
are given to functions having special prop- 
erties, which can be defined according to the 
structures of the domain and the range. For 
example, when both domain and range of a 
function f are sets of real numbers, fis called 
an even function if {(t)= f(—t), and an odd 
function if f(t)= —f(—1). A function f that pre- 
serves the order relation between real num- 
bers, i.e., such that t; <t, implies f(t,)<f(t,), 
is called a tmonotone increasing function. 

A mapping from a set I to a set F of func- 
tions, p: I—> F, is called a family of functions 
indexed by J (or simple family of functions), 
and is denoted, using the form f} instead of 
pA), by { fa}aey OF {f,}(AET). In particular, if 
I is the set of natural numbers, the family is 
called a sequence of functions. 
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C. Variables 


A letter x, for which we can substitute a name 
of an element of a set X, is called a variable, 
and X is called the domain of the variable. An 
element of the domain of a variable x is called 
a value of x. In particular, if the domain is a set 
of real numbers or complex numbers, the 
variable is called a real variable or a complex 
variable, respectively. On the other hand, a 
letter that stands for a particular element is 
called a constant. 

When the domain and range of a function f 
are X and Y, respectively, a variable x whose 
domain is X is called the independent variable, 
and a variable y whose domain is Yis called 
the dependent variable. Then we say y is a 
function of x, and write y= f(x). When a con- 
crete method is given by which we make a 
value of y correspond to each value of x, we 
say that y ts an explicit function of x. When a 
function is determined only by a tbinary re- 
lation such as R(x, y)=0, we say that y is 
an implicit function of x (— 208 Implicit 
Functions). 

Given functions f, g with an independent 
variable t, suppose that y is regarded as a 
function of x defined by relations x = f(t), 
y=g(t). Then we say that y is a function of x 
with the variable t as a parameter. A function 
whose range is a given set C with variable t as 
its independent variable is often called a para- 
metric representation of C by t. 

If the domain of a function fis contained in 
a Cartesian product set X; x X, X... X X,, 
the independent variable is denoted by 
(X;,X2,...,X,), and fis often called a function 
of n variables or a function of many variables 
(when n 22). 


D. Families and Sequences 


A function whose domain is a set J, ọ:I— X, is 
called a family indexed by 7 (or simply family), 
and I is called the index set. In the case ~(A)= 
x,(4e1), the family is denoted by {x,},., or 
{x,}(Ae1). If the range X of a function ọ is a 
set of points, a set of functions, a set of map- 
pings, or a set of sets, then the family {x,},~, is 
called a family of points, a family of functions, 
a family of mappings, or a family of sets, res- 
pectively. If the set J is a tdirected set, the 
family is called a directed family. Generally, if 
J is a subset of I, the family {x} } ze; is called a 
subfamily of {x,},.,. In particular, if J is a 
finite or infinite set of natural numbers, the 
family indexed by / is called a finite sequence 
or infinite sequence, respectively. Sequence is a 
generic name for both, but in many cases it 
means an infinite sequence, and usually we 
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have [=N. Then the value corresponding to 
néN is called the nth term or, generally, a 
term. For convenience, the Oth term is often 
used as well. If each term of a sequence is a 
number, a point, a function, or a set, the se- 
quence is called a sequence of numbers, a se- 
quence of points, a sequence of functions, or a 
sequence of sets, respectively. A sequence is 
usually denoted by {a,}. If it is necessary to 
show the domain of n explicitly, the sequence 
is denoted by {a,},,.,. If J is a subset of I, a 
sequence {a,},<, 1s called a subsequence of the 
sequence {a,},-;. And if I =N, the composite 
{ar} Of {a,} and a sequence {k,,} of natural 
numbers with k; <k,<k,...is usually called a 
subsequence of {a,}. 
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Variation 


A. Monotone Functions 


A function (or mapping) f from an tordered 
set X to another ordered set Y is called a 
monotone increasing (monovone decreasing) 
function if 


xı <x, implies 


A(X) <f(X2) 
F(X) 2 f(x). (1) 


A monotone increasing (decreasing) function is 
also called a nondecreasing (nonincreasing) 
function. In either case, the function f is called 
simply a monotone function. If X and Y are 
totally ordered sets and the inequality < (>) 
holds in (1) instead of < (>), then f is called a 
strictly (monotone) increasing (strictly (mono- 
tone) decreasing) function. In either case, f is 
called simply a strictly monotone function. 

In particular, when X and Y are subsets of 
the real line R, a monotone function is contin- 


(x;<x, implies 
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uous except for at most a countable number 
of points. Hence it is Riemann integrable in a 
finite interval provided that it is bounded. A 
continuous real function f(x) defined on an 
interval in R is tinjective if and only if it is 
strictly monotone. In such a case, the range of 
the function f(x) is also an interval, and the 
inverse function is also strictly monotone. 
Furthermore, a differentiable real function f 
defined on an interval is monotone if and only 
if its derivative f’ is always >0 (monotone 
increasing) or always <0 (monotone decreas- 
ing). If f’>0 (<0), f is strictly monotone 
increasing (decreasing). 


B. Functions of Bounded Variation 


Let f(x) be a real function defined on a closed 
interval [a, b] in R. Given a subdivision of the 
interval a=xy<x,<x,<...<x,=b, we de- 
note the sum of positive differences f(x;)— 
J(x;-,) by P and the sum of negative differ- 
ences f{(x,)—f(x;-,) by — N. Then we easily 
obtain 


P—N=f(b)—f(a), 
P+N=) f(x) —f(x;-1)I- 


The suprema of P, N, and P +N for all pos- 
sible subdivisions of [a,b] are called the posi- 
tive variation, the negative variation, and the 
total variation of the function f(x) in the inter- 
val [a,b], respectively. If any of these three 
values is finite, then all three values are finite. 
In such a case, the function f(x) is called a 
function of bounded variation. Every function 
of bounded variation is bounded, but the con- 
verse is not true. The positive and negative 
variations z(t), v(t) of the function f(x) in the 
interval [a, t] are monotone increasing func- 
tions with respect to t, and we have 


F(x) — fla) = n(x) v(x) (2) 


if f(x) is a function of bounded variation. 
Hence every function of bounded variation has 
both left and right limits at every point. A 
monotone function is a function of bounded 
variation, and the sum, the difference, or the 
product of two functions of bounded variation 
is also a function of bounded variation. Hence 
J(x) is a function of bounded variation if and 
only if it is the difference of two monotone 
functions. The representation (2) (representing 
a function of bounded variation as the differ- 
ence of two monotone increasing functions) 

is called the Jordan decomposition of the func- 
tion f(x). A function of bounded variation is 
Riemann integrable, continuous except for at 
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most a countable number of points, and dif- 
ferentiable talmost everywhere. 

A continuous function defined on a closed 
interval is bounded but not necessarily of 
bounded variation (e.g., f(x)=x sin(1/x) (x€ 
(0, 1]), 0 (x =0)). A discontinuous function 
may be a function of bounded variation on 
a closed interval (e.g., sgn(x)). However, an 
tabsolutely continuous function, a differenti- 
able function with bounded derivative, or a 
function satisfying the *Lipschitz condition is a 
function of bounded variation on a closed 
interval. 

The notion of functions of bounded vari- 
ation was introduced by C. Jordan in connec- 
tion with the notion of the length of curves (— 
246 Length and Area). 


C. Lebesgue-Stieltjes Integral 


Let f(x) be a right continuous function of 
bounded variation on a closed interval [a, b], 
and f(x)=2(x)— v(x) the Jordan decomposi- 
tion of f(x). Then z(x) and v(x) are monotone 
increasing right continuous functions and hence 
define bounded measures dz(x) and dv(x) on 
[a,b], respectively (— 270 Measure Theory L 
(v)). The difference dx — dv of these two mea- 
sures is a tcompletely additive set function 

on [a,b] which is often called the (signed) 
Lebesgue-Stieltjes measure induced by f, written 
df. For every function g integrable with respect 
to the measure du=dz + dy, we define {,.g df 

to be equal to [,gdn—|,gdv and call it the 
Lebesgue-Stieltjes integral. In this case, h(x) 

= fia,}}g df is of bounded variation on [a, b] 
and the Lebesgue-Stieltjes measure dh(x) 

is denoted by g(x)df(x). If fi and fọ are of 
bounded variation on [a, b], then so is the 
product f = f, fa, and we have 


Af (x)= fi (x £0) dfa(x) + fo(x FO) df, (x). (3) 


The Lebesgue-Stieltjes integral for a continu- 
ous integrand is often called the Riemann- 
Stieltjes integral, because it can be defined in 
an elementary way similar to the definition of 
the Riemann integral. 

The notion of bounded variation can also be 
defined for interval functions on R” and set 
functions on an abstract space (— 380 Set 
Functions). 
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167 (XIV.7) 
Functions of Confluent Type 


A. Confluent Hypergeometric Functions 


If some singularities of an ordinary differential 
equation of tFuchstan type are confluent to 
each other, we obtain a confluent differential 
equation whose solutions are called functions 
of confluent type. The equations that appear 
frequently in practical problems are the con- 
fluent hypergeometric differential equations 


d?w 


R dw = 
aye (y ye aw = (1) 


and related equations. Equation (1) corre- 
sponds to the thypergeometric differential 
equation for which a tregular singular point 
coincides with the point at infinity and is an 
tirregular singular point of class 1. For (1), 
z=O0is a regular singular point, and a series 
solution (radius of convergence 00) is given by 


F(a, y; z)= F(a, y; z) 


7 aul Es 2 2" Aies 2) 
nzo nty(y+1)...(p+n—1) 
where y is not a nonpositive integer. The func- 
tion ‚F, in (2) is a tgeneralized hypergeometric 
function due to Barnes and is called a hyper- 
geometric function of confluent type or Kum- 
mer function. If y is not equal to a positive 
integer, the other solution of (1) independent of 
(2) is given by z’’F(1+a—y, 2—y;z) (= Ap- 
pendix A, Table 19.I). 


B. Whittaker Functions 


Equation (1) with w=e7? z7”? W, y —20=2k, 
y? —2y=4m?—1 reduces to Whittaker’s dif- 
ferential equation 


d? W ( 1 
— +| —-+ 


TEELLA, 3 
dz? a Ss as: ©) 


If 2m is not equal to an integer, (3) has two 
series solutions for any finite z: 

Mg. m(Z) = 2°) e= FS 4m—k, 1 +2m; 2), 
My. -m(Z) =z "e= F} —m—k, 1 —2m; 2). 


If 2m is an integer, since the functions M, ,, 
and M, -m are linearly dependent, E. T. Whit- 
taker considered a solution of the form 


1 
Wie, m(Z) = -zp Kta m)z"e 


(0+) t\k-(2)+m 
«| (aarti +£) edt. 
Z 


io) 
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If k—4—m is equal to a negative integer, this 
integral does not exist. The function 


k ,—2/2 æ 
= axe \ pk (/2)-+m 
2 


t k—-(1/2)+m 
x (: +£) e ‘dt 
z 


for Re(k —4—m) <0 is defined for any m, k, 
and for any z except when z is a negative real 
number. We call M, m and W, m the Whittaker 
functions. Bessel functions are particular cases 
of these functions, and the relation 


z7 +2 


nO =mi) 


Mo, n(2iz) 

is satisfied. In Whittaker’s differential equation, 
since W_, m(— zZ) is also a solution and 

W, m(Z)/ W_ 4. m( — Z) is not equal to a constant, 
W, m(z) and W_, ,,(—z) can be considered a 
pair of fundamental solutions (— Appendix A, 
Table 19.1). 


C. Parabolic Cylinder Functions 


Putting x =(č? —n?)/2 and y= ën, the curves 
corresponding to =constant and to y = 
constant, respectively, constitute families of 
orthogonal parabolas. The curvilinear coordi- 
nates (č, 7, z) in three dimensions are called 
parabolic cylindrical coordinates. By using 
parabolic coordinates, separating variables in 
Laplace’s equation into the form f(é)g(n)e'™, 
and making a simple transformation, we find 
that f and g satisfy a differential equation of 
the form 


ar 


+(n+45—427)F =0. (4) 


By means of the Whittaker function W, m(z), a 
solution D,,(z) of (4) is represented by 


-1/2 2 
D (z) = 202 0 K Wri2+c1jay,-1/4(22 ). 


Equation (4) is called Weber’s differential 
equation or the Weber-Hermite differential 
equation, and D,(z) the Weber function. An- 
other solution of (4) is D_,,_, (iz) or D_,-,(—iz). 
The solutions of (4) are called parabolic cyl- 
inder functions. In particular, if n is equal to 

a nonnegative integer, then 


H,(z)=2°"? exp(4z2)D,(./2 2) 


is the tHermite polynomial of degree n. Solu- 
tions of differential equations for harmonic 
oscillators in quantum mechanics are of this 
form. 

In general, suppose that three regular sin- 
gular points are confluent to the point at infin- 
ity, and that they are reduced to an irregular 
singular point of class 2. Suppose further that 


644 


there are no other singularities. Then differen- 
tial equations of order 2 with these conditions 
are transformed into the form (4), whose solu- 
tions are represented by parabolic cylinder 
functions. Differential equations of the form 
(4) are reduced to confluent hypergeometric 
differential equations if z? is chosen as an 
independent variable (— Appendix A, Table 
20.1TT). 


D. Indefinite Integrals of Elementary 
Functions 


Since exponential and trigonometric functions 
can be represented by particular types of 
Kummer functions, their indefinite integrals 
that cannot be represented by elementary 
functions, e.g., fincomplete T -functions and 
the error function Erfz={§ exp(—t*)dt, can be 
represented by Kummer or Whittaker func- 
tions. They are included in a family of tspecial 
functions of confluent type. The functions 
defined by 


Cj | Coed ie 


are called Fresnel integrals, which are also 
represented in terms of the Whittaker function 
as 


C(z)—iS(z) 


1-i e 78 
4 —22/2 3 
= 1 Ju zil e z?/ W_14, 1⁄4(iZ) . 
T 


Fresnel integrals first appeared in the theory of 
the diffraction of waves. More recently they 
have been applied to designing highways for 
high-speed automobiles. Furthermore, the 
functions 


ctu | cos($s* as, 

apie 

Sa, sin eas 
0 2 


(obtained by a change of variables z = xu?/2) 
are also called Fresnel integrals. Numerical 
tables are available for them. The curves x= C 
and y=S with a parameter z or u are called 
Cornu’s spiral (Fig. 1). The functions 


x dt x et 
Lix=| —, Eix= Pas 


where a tprincipal value must be taken at t=0 
if x >0, 


. *sint 
se ("Ma 


0 


: “cost 
and cix=—( — dt 
P | 
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Fig. 1 


are called the logarithmic integral, exponential 
integral, sine integral, and cosine integral, or 
integral logarithm, integral exponent, integral 
sine, and integral cosine, respectively. They 
satisfy the relations 


Eix=Lie*, Eiix=Cix+iSix+(n/2)i. 


They have important applications: Ei x in 
quantum mechanics, Six and Cix in electrical 
engineering, and Lix in estimating the number 
of tprimes less than x (— 123 Distribution of 
Prime Numbers). Lix is also denoted by li x 
(— Appendix A, Table 19.1). 


E. Stokes’s Equation 


Consider a linear differential equation of the 
second order with five regular singular points 
including the point at infinity such that the 
difference of the characteristic indices at every 
singularity is equal to 1/2. Such equations are 
called generalized Lamé’s differential equa- 
tions. F. Klein and M. Bécher have shown 
that every linear differential equation that is 
commonly treated in mathematical physics is 
represented by a confluent type of generalized 
Lamé’s equation. Among these equations, if 
all five singularities are confluent to the point 
at infinity, the resulting equation is called 
Stokes’s differential equation, which is applied 
to the investigation of diffraction. This is re- 
duced to tBessel’s differential equation of order 
1/3 by suitable transformations of the inde- 
pendent and dependent variables. 
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A. General Remarks 


It is a general method in modern analysis to 
consider a set X of mappings of a space Q into 
another space A as a space (— 381 Sets) and 
its elements (namely, mappings of Q into A) as 
points of the space X, and to investigate them 
as geometric objects. In particular, it is impor- 
tant to consider the case where Q is a ftopo- 
logical space, a ‘measure space, or a fdiffer- 
entiable manifold and X is a set of real- or 
complex-valued functions defined on Q and 
satisfying certain conditions, such as continu- 
ity, measurability, and differentiability. Such 
spaces are generally called function spaces; 
they usually form ttopological linear spaces 
(— 37 Banach Spaces; 197 Hilbert Spaces; 424 
Topological Linear Spaces). 


B. Examples of Function Spaces 


The following are important examples of func- 
tion spaces. Throughout this section, all func- 
tions are real- or complex-valued, and two 
functions on a measure space are identified 
whenever they are equal to each other ‘almost 
everywhere. 


(1) The Function Spaces C(Q), C,,(Q), and 
C,(Q). The totality of continuous functions 
f(x) defined on a compact tHausdorff space Q 
is denoted by C(Q). Let f+g and of (a a real 
or complex number) be the functions f(x)+ 
g{x) and af(x), respectively. Then C(Q) forms 

a ‘linear space. Furthermore, define the norm 
of f by |f| =supxeo| J (x)|. Then C(Q) becomes 
a tBanach space since it is complete in (the 
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metric defined by) the norm. The norm is 
called the supremum norm or uniform norm 
because limpo || f, —/|| =O means that f(x) 
converges to f(x) uniformly on Q as n> œ. 
Define f-g to be the function f(x)g(x). Then 
clearly || f-g|| < |f| |lg|]. Hence C(Q) is also a 
*Banach algebra. 

Suppose that a subset R of C(Q) satisfies the 
following three conditions: (i) R is an talgebra 
over the complex number field with respect to 
the addition and multiplication defined above 
and contains the function identically equal to 
one. (it) For any two distinct points x and y 
of Q, there exists a function fe R satisfying 
f(x) # f(y). Gii) For any fe R, there exists an 
f*eR such that f*(x)= f(x) on Q. Then R is 
dense in C(Q) with respect to the supremum 
norm (namely, in the sense of uniform conver- 
gence). This fact is known as the Weierstrass- 
Stone theorem (or Stone-Gel’fand theorem). 

A subset E of C(Q) is tprecompact (i.e., any 
sequence of functions in E contains a sub- 
sequence that converges uniformly on Q) if and 
only if E is tuniformly bounded and fequicon- 
tinuous (Ascoli-Arzela theorem). Since C(Q) is 
not a freflexive Banach space except for trivial 
cases (— Section C), precompact sets and 
relatively compact sets are different in the 
tweak topology. For important characteriza- 
tions of the latter sets — [2, 5]. 

When Q is a topological space that is not 
necessarily compact, the totality of bounded 
continuous functions on Q (denoted by BC(Q)) 
is also a Banach space with respect to the 
supremum norm || f || =sup,.9| f(x)|. Let Q be 
a locally compact Hausdorff space. Then the 
space C(Q) of all continuous functions on Q is 
endowed with the topology of uniform conver- 
gence on the compact sets, i.e., the flocally 
convex topology defined by the tseminorms 
Sup,<x|f(x)| as K ranges over the compact sets 
in Q. C(Q) is always a complete locally convex 
space. It is a tFréchet space if Q is c-compact 
(i.e., Q is the union of a countable family of 
compact sets). We denote by C,,(Q) the sub- 
space of all functions f(x)e C(Q) that converge 
to zero as x tends to infinity (i.e., given an €> 
0, there is a compact set K such that | f(x)|< 
e for x K). C,,(Q) is a Banach space with the 
norm sup,.<o|f(x)|. It can be regarded as a 
closed linear subspace of BC(Q). 

The totality of continuous functions with 
compact support is denoted by C,(Q) or #(Q), 
where the support (or carrier) of a function f 
is the closure of the set {x| f(x) 40} in Q and 
is usually denoted by supp f. If Q is not com- 
pact, C)(Q) is not complete with respect to the 
supremum norm, but when Q is o-compact 
C)(Q) is complete with respect to the strongest 
locally convex topology with the property 
that for each compact set K in Q the embed- 
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ding in C,(Q) of the linear subspace X% of all 
functions with support in K equipped with 
the supremum norm is continuous. Usually 
X (Q) denotes the space C)(Q) equipped with 
this topology. 


(2) The Lebesgue Spaces L,,(Q) (0< p< œ). Let 
(Q, u) be a tmeasure space. We denote by L,(Q) 
the totality of tmeasurable functions f(x) on Q 
such that | f(x)|? is integrable. A function f de- 
fined on (Q, u) is called square integrable if fe 
L,(Q, n). If QO is the interval (a, b) equipped 
with the Lebesgue measure, it is sometimes 
denoted by L,(a,b). We define the norm || f ||, 
by 


1/p 
ist=tt,=( is) Pduts) 


If 1 <p< œ, then L,(Q) is a Banach space. The 
triangle inequality for this norm is precisely 
the tMinkowski inequality. If 0<p<1, the 
norm no longer satisfies the triangle inequal- 
ity but does satisfy the quasinorm inequality 
If +glp £201 flp + lgl), and if X | Alle < 
oo, then È f, converges unconditionally in 

L (9). Hence L,(9),0<p<1, is a tquasi- 
Banach space. If lim,.... lJa — f |p =0, we say 
that the sequence { f,} converges to f in the 
mean of order p (or in the mean of power p), 
and write lim... Ja = f. If { fa} converges to 
f in the mean of order 2, we simply say that 

{ fa} converges to f in the mean. (The nota- 
tion l.i.m. means the limit in the mean and is 
used mostly when p=2.) For any f, ge L,(Q), 
(£9) =Saf(x)g(x) du(x) is well defined, by the 
*Schwarz inequality, and has the properties of 
the tinner product. Hence, L,(Q) is a Hilbert 
space. If 1<p<o, then L,(Q) is a tuniformly 
convex Banach space and is in particular 
treflexive. Deepest results on L,(R"), 1<p<o, 
are often derived from the Littlewood-Palay 
theory due to J. E. Littlewood and R. E. A. C. 
Paley, A. Zygmund [6], and E. M. Stein [7]. 
Its starting point is the inequality 


Apl S lp S lal < 4f lp» 
where g( f) is the function 
%0 1/2 
ano-{| |grad, u(x, prar) 
0 


obtained from the Poisson integral 
u(x, t) 


_P(n+1)/2) 


rina | ta? +x— yl) OP fy)dy, 
R” 


where |x|=(x?+...+x2)!?. 

The L, spaces, 1 < p< œ, are generalized 
in the following way. Let ®(s) be a convex 
and nondecreasing function on [0, 00) satisfy- 
ing ®(0)=0 and @(s)/s— oo as sooo. Denote 
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by Lg(Q) (L3(Q)) the set of all functions f(x) 
such that ®(| f(x)|) is integrable (®(k| f(x)|) is 
integrable for some k > 0). La(Q)= L§(Q) if 
@(2s) < C(s). L§(Q) is a Banach space, called 
the Orlicz space, under the norm 


IfI =intfa>0 


fou ISOJI aux) < i 


L,(Q), 1 <p < œ, is the Orlicz space for (s) 
=s?, 


(3) The Function Space M(Q)= L (Q). Let Q 
and yu be as in (2). A measurable function f(x) 
on Q is said to be essentially bounded if there 
exists a positive number « such that | f(x)|<« 
almost everywhere on 2. The infimum of such 
a is called the essential supremum of f, denoted 
by esssup,.9|f(x)|. The totality of essentially 
bounded measurable functions on Q (denoted 
by M(Q)) is a Banach space with respect to 
the norm || {|| = || fll. =€88SUP seal f). If 
u(Q) < œ, then M(Q)c L,(Q) for any p>0, and 
ISl =m; Ifill, for any fe M(Q). From 
this point of view, M(Q) is also denoted by 

L (Q) even when p(Q)= œ. This is also the 
reason why the notation ||: || is used for the 
norm in M(Q). 


(4) The Lorentz Spaces Li, p(Q) (O<p,q< 0). 
The Lebesgue spaces L,(Q), 0<p< œ, are 
rearrangement invariant. Namely, define for a 
measurable function f on a measure space 

(Q, u) the distribution function j1,(s)= {xe 
Q|| f(x)|>st, s>0, and the rearrangement 
f*(t)=inf{s>0| u,(s)<t}, t>0. Then feL,(Q) 
if and only if f*eL,(0, ©) and || f\|,= If * lp 
Another important class of rearrangement in- 
variant spaces are the Lorentz spaces L, ,)(Q), 
0<p, q< oo (G. G. Lorentz, 1950; R. A. Hunt 
[8]), which is defined to be the quasi-Banach 
space of all measurable functions f on Q such 
that 


IF ling = lt? F*Ollag < 00, 


where L* is the L,-space on (0, œ) relative 

to the measure dt/t. Lip, »(Q)=L,(Q) with 
equal norms. If 1<p<oo and 1 <q < œ, then 
L4p,q(Q) is a Banach space under the equiva- 
lent norm |t? Jof*(s)ds|lps. Except for 
these cases, Lip, p(Q) is not equivalent to 

a normed space in general [8]. If ¢gg<q,, then 
Lipa (82) € Ly, q,() with continuous embed- 
ding. In case p(Q)< œ, Lip, 4,)(Q)< Lip, gpl) 
for po >p; and any qo, g;. The Lorentz spaces 
play an important role in interpolation and 
approximation theory (— 224 Interpolation of 
Operators). 


(5) The Function Space S(Q). Let (Q, u) be a 
measure space with p(Q)< oo. Denote by S(Q) 
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the totality of measurable functions on Q that 
take finite value almost everywhere. Then || || 
= fall FONA +1 £00) du) for fe S(Q) has 
the properties of the fpseudonorm and S$(Q) 
is a fFréchet space (in the sense of Banach) 
that is not locally convex in general. We have 
lim, If, —S || =0 if and only if 


lim p({x| fa) S00 >2})=0 


for any positive number £. Convergence of this 
type is called convergence in measure (or as- 
ymptotic convergence) and is the same notion 
as tconvergence in probability of a sequence 
of frandom variables (— 342 Probability 
Theory). If { f,}¢L,(Q) converges to fe L,(Q) 
in the mean of order p, then { f,} converges to 
f asymptotically, but the converse is not true 
in general. If a sequence { f,} €S(Q) converges 
to feS(Q) almost everywhere, then { f,} con- 
verges to f asymptotically. Any sequence { f,} 
that converges to f asymptotically contains a 
subsequence { f, } that converges to f almost 
everywhere. 


(6) The Sequence Spaces c, co, |, (0< p< œ), 
m=l,,, and s. The totality c (resp. co) of se- 
quences x = {č} that converges (resp. con- 
verges to zero) as noo forms a Banach space 
with respect to the norm ||x||=sup|€,]|. co 
(resp. c) is the space C,,(Q) (resp. C(Q)) when Q 
is (resp. the one-point compactification of) the 
discrete locally compact space {1, 2, 3, ... }. The 
sequence space |, 0<p< œ (resp. m=1,,), is 
defined to be the spaces L,(Q) (resp. M(Q)), 
where Q is the space {1,2,3,...,n,...}, of 
which each point has unit mass, while s de- 
notes the space S(O), where Q={1,2,...,n,...} 
provided with the measure assigning mass 
1/2” to the point n. s is the set of all sequences 
equipped with the topology of pointwise con- 
vergence (s is also used to denote the space of 
rapidly decreasing sequences; — Section (16)). 
Assume that the space L,(Q) mentioned in 
(2) is tseparable and that {¢,} is a tcomplete 
orthonormal set in L,(Q). Then putting 


? 


g= | OE TE 


(tFourter coefficients) for any fe L,(Q), we have 
fén EL and Y~,|&,|7=||f||*. Conversely, for 
any {€,}el, there exists an f =E] €,0,€ 
L,(Q) whose Fourier coefficients are the given 
€,, (Riesz-Fischer theorem). By means of this 
correspondence, separable spaces L (Q) and 
l, are mutually isomorphic as Hilbert spaces. 
Sometimes we denote by /,(Q) the function 
space L,(Q), where Q is an arbitrary set en- 
dowed with the measure assigning mass 1 to 
each point. 
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(7) The John-Nirenberg Space BMO. A locally 
integrable function f(x) on R” is said to be of 
bounded mean oscillation if 


iF lawo=spist | | f(x) — fs|dx < æ, (1) 
s 


where the supremum is taken over all (solid) 
spheres S in R”, fs is the mean |S| fs f(x)dx, 
and |S| denotes the measure of S (F. John and. 
L. Nirenberg, 1961). The set BMO(R") of all 
functions on R” of bounded mean oscillation 
forms a Banach space under the norm men- 
tioned above if two functions f and g are iden- 
tified whenever f —g is equal to a constant 
almost everywhere. Condition (1) is equivalent 
to 


\/p 
sup( isi" | 1/00) (1rd) <% 
S 


for any 1 <p< œ. There are constants B and 
K >0 such that 


ItxeS|| f(x) —fs1> 2} 
< B|S|exp(—KA/|| fll amo) 


for any sphere S and 2>0. BMO is a slightly 
larger space than L, (e.g. log|x|¢ BMO) 

and has better properties. For example, the 
*Calderon-Zygmund operators are bounded in 
BMO(R"). We have 


BMO(R")=L,,(R")+ 3 R;L a(R"), 


where R; are the tRiesz transforms [9]. This is 
called the Fefferman-Stein decomposition. The 
Riesz transforms can be replaced by more 
general families of singular integral operators 
(A. Uchiyama, Acta Math. (1982)). 


(8) The Hardy Spaces H, (0< p< œ). The 
classical theory of Hardy classes (— 159 
Fourier Series G) has been reconstructed by 
the real-analysis method and extended to 
higher-dimensional cases by E. M. Stein, G. 
Weiss, C. Fefferman, and others. According to 
their terminology the elements of the Hardy 
space H, are (the complex linear combinations 
of) the real parts of the boundary values of 
holomorphic functions of the Hardy class. 

Let fe #'(R") be a ttempered distribution. 
For a ge F(R") define the radial maximal 
function M, f and the nontangential maximal 
function M% f relative to by 
M; f(x) =suplo, * f(x), 


t>0 


MŠJ(x)= sup lo* f(y), 

|x—yl<t 
where ¢,(x)=t "(x/t) and * denotes convo- 
lution. Then the Hardy space H,(R"), 0<p< 
œ, is defined to be the space of all tempered 
distributions f which satisfy the following 
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equivalent conditions: (i) Mj feL, fora ọ 
with f p(x)dx #0; (ii) M* fe L, for a ọ with 
Jodx 40; (iii) MZ feL, for any g; (iv) Mž fe 
L, for any; ọ. If a distribution f satisfies (one 
of) these conditions, then its Poisson integral 
u(x, t)= ,* f(x) is a function, where g(x) = 
const(1 + |x|?) "+02, and its radial maximal 
function u*(x)=sup,, 9 |u(x, t)| and nontangen- 
tial maximal function u*(x)=sup),_y<,|u(y, £) 
both belong to L,. Conversely if u(x, t) is a 
harmonic function on the upper half-space 
t>0 and if either u* or u* belongs to L,, 
then its boundary value f=u(-,0) exists in the 
sense of tempered distribution and f satisfies 
the above conditions. Define the norm of an 


feH,(R’) by |\u*||,,. Then H,(R”) becomes 


a quasi-Banach space. If 1<p<, then 
H,(R")= L,(R") with equivalent norms. H,(R") 
is a Banach space strictly smaller than L,(R"). 
An feL,(R") belongs to H,(R") if and only if 
all the Riesz transforms R,f are in L,(R"), and 
IS lz, is equivalent to || fl], +£ | Rif lı. Simi- 
lar characterizations of H,(R") are also known 
for p>0 (Fefferman and Stein [9]). 

Let Ij, j=1,...,m, be tproper convex open 
cones in R” such that the tpolars I cover 
R". Then an fe./'(R") belongs to H,(R") if 
and only if there are holomorphic functions 
F(x +iy) on R"+iI; such that sup{||F,(- + 
iv)I,|yeT} <% and f=) F(- +1750) (D. L. 
Burkholder, R. F. Gundy, and M. L. Silver- 
stein for n=1 and L. Carleson for n> 1). Let 0 
<p<İ. A measurable function a on R” is said 
to be a p-atom if there is a sphere S such that 
suppacS and |ala <|S|~'”” and if [a(x)x*dx 
=0 for all multi-indices « with |a|<n(p !—1). 
Here a multi-index « is an n-tuple (%4, ..., p) 
of nonnegative integers, |a|=a,+...+4, and 
x*=x,"'...x,’". A distribution f belongs to 
H,(R"), 0<p<1, if and only if there are a 
sequence of p-atoms a; and a sequence of 
numbers 4;>0 in /, such that f= å;a; in the 
sense of distributions, and the norm If lla, is 
equivalent to the infimum of ll, (R. R. Coif- 
man for n=1 and R. H. Latter for n> 1). The 
theory of H, and BMO has been generalized 
to more general situations (— Coifman and 
Weiss, Bull. Amer. Math. Soc. (1977)). 


From now on we assume that Q is a domain 
in the n-dimensional Euclidean space R” (or 
more generally a differentiable manifold). D* 
stands for D,“ ... D,*", where D;=6/0x;. 


(9) The Function Spaces C'(Q) and C}(Q) (= 
0,1, 2,..., 00). The totality of /-times continu- 
ously differentiable functions in Q (namely, 
differentiable functions of fclass C' in Q) is 
denoted by C'(Q). We say that a sequence 

{ f,} of functions in C'(Q) converges to 0 in 
C'(Q) if |D*f (x)| converges to 0 uniformly on 
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every compact subset of Q for every « satisfy- 
ing 0<|a|<1(0<|a|<o if l= œ). CQ) is a 
Fréchet space. The totality of functions in 
C'(Q) whose supports are compact subsets of 
Q is denoted by C§(Q). We say that a sequence 
{ f,} of functions in C} (Q) converges to 0 in 
C}(Q) if supp f, (v= 1, 2,...) is contained in a 
compact subset of Q independent of v and { f,} 
converges to 0 in C'(Q). C4(Q) is an t(LF)- 
space. 

When Q is a locally closed set in R" (or a 
differentiable manifold with boundary), we 
denote by C'(Q) the totality of functions f(x) 
on Q together with their continuous formal 
derivatives D* f(x), |a|<1(|a|< æ if l= œ) on 
Q such that for every « and m with |a|<m<l 
(m<oo if l=), 


Df- E PSOE- 
=o(|x— y|") 


locally uniformly in Q as |x— y| tends to 0. 
Convergence in C!(Q) is defined in the same 
way as above. 

If Q is a closed set in R” with a finite number 
of connected components in each of which . 
two points x and y are connected by an arc of 
length < C|x— y| with a constant C indepen- 
dent of x and y, then every f in C!(Q) can be 
extended to an f in C!(R”) (Whitney’s extension 
. theorem). 


(10) The Lipschitz Spaces A‘. Let s>0, and 

let k be the least integer greater than s. The 
Lipschitz (or Hélder) space A*(R") is the totality 
of functions f(x) on R” which satisfy 


list <e. 


When s<1, this is exactly the tLipschitz (or 
Holder) condition of order s. But when s=1, it 
is strictly weaker than the tLipschitz condition. 
A function fe A! is said to be smooth in the 
sense of A. Zygmund [6]. Suppose 0<h<s is 
an integer. Then a function f belongs to A‘ if 
and only if it is h times continuously differenti- 
able and all the derivatives D*f of order h are 
in AS", A‘(R") is a Banach space of functions 
modulo the polynomials of degree <k—1. 

Suppose that 1<q<oo and f is a measur- 
able function on R” such that 


k 


k ' 
2 PEU +jy) 


j=0 


lf llas=sup 


Xy 


1/4 
supinr( isie Í f= Plata) <0, 
S P S 


where the supremum is taken over all spheres 
in R” and the infimum over all polynomials of 
degree <s. Then f(x) is equal to a function 

fi (x) in A(R”) almost everywhere and the su- 
premum is equivalent to the norm || f llas. 
Conversely every f € A‘(R") satisfies the above 
inequality (S. Campanato). 
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(11) The Sobolev Spaces W,(Q), H'(Q), and 
H}(Q) (1=0, 1,2, ...,1<p<oo or ~œ <l< %0, 
1<p<oo). Let />0 be an integer and 1 <p< 
oo. The Sobolev space W;(Q) is the totality of 
functions f(x) such that for all satisfying 

jal <l, the derivatives D*f(x) in the sense of 
distribution (— 125 Distributions and Hyper- 
functions) belong to L,(Q) with respect to 
Lebesgue measure in Q. W}(Q) is a Banach 
space with the norm 


I= Ifi =( È 


O<lal<I JO 


1/p 
|D*f rax) . 


Clearly Wp (9) = L,(Q). W(Q) is a Hilbert 
space with respect to the inner product 


D=] $ D'S) D%g(x)dx. 

Q0 <la<i 
Sometimes W3(Q) is denoted by H'(Q). Its 
closed linear subspace obtained as the comple- 
tion of CF (Q) is denoted by H}(Q). We have 
Hp (Q) = WP(Q)= L,(Q). However, if 1> 1, we 
have Hj(Q)< Wi(Q), and identity does not 
hold unless Q=R’. 

The definition of Sobolev spaces has been 
extended to those with fractional and also 
negative order —o0 <s < oo in many different 
ways. When 1< p< œ, it is natural to define 
W,(R") to be the space of all tempered distri- 
butions f on R” such that (1—A)*?*f=F 1((1 
+E) Ff)eL,(R”). If s>0, then W;(R") thus 
defined coincides with the space of all fe 
L,(R") whose Poisson integral u(x, t) satisfies 


ioe) 1/2 
I Paatu Dde) 
0 


for some (and any) integer k > s/2. 

The Poisson integral can be replaced by 
other regularizations, and thus the definition 
of Sobolev spaces of fractional orders is ex- 
tended to arbitrary open set Q with the cone 
condition (T. Muramatu [13]). Here Q is said 
to satisfy the cone condition if there are a 
bounded and uniformly continuous mapping 
Y:R”>R” and an e>0 such that for any xeQ 
the convex hull of the -ball with center at x + 
(x) and {x} is included in Q. 


<0 
P 


(12) The Besov Spaces B}, (—œ <s<%,1<p, 
q <œ). The effort to make the Sobolev em- 
bedding theorem [10] more precise led S. M. 
Nikol’skii and O. V. Besov [11] to the other 
classes of “Sobolev spaces” of fractional order. 
Let s>0 and 1 <p, q< œ. The Besov space 

By, q(R”) is the totality of functions fe L,(R”) 


such that 
k /k i q dy 1/4 
s = TPC +y) oar] 
If les, (| p> C) f JY ply + 
<% 
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for some (and any) integer k> s. In terms of 
the Poisson integral an fe L,(R") belongs to 
By.q(R") if and only if 
q 1/p 
a) <% 
P 


(es 
( (249471 
0 


for some (and any) integer k>s/2. The first 
definition is obviously extended to an arbi- 
trary domain Q in R”. If Q satisfies the cone 
condition, then the functions in By, ,(Q) are 
characterized similarly as above by using a suit- 
able regularization u(x, t) of f(x) (Muramatu 
[13]). Let Q be a domain with the cone condi- 
tion. When 0<h<s is an integer, an f belongs 
to BY, ,(Q) if and only if fe W2(Q) and all 

the derivatives D*f of order h are in BS #(Q). 
By, (Q) is a Banach space with the norm || f |l, 
+| fps, If p=2, then BS (Q) coincides with 
the Sobolev space W;(Q). But in the other 
cases, BY, ,(Q) is different from W3(Q) for any q. 
However, By, ,(Q) or By, (Q) is often called a 
Sobolev space of order s and denoted by 
W;(Q). Clearly we have BE, o = La NAS. 

The Besov space B¥, ,(Q) of order s <0 is de- 
fined to be the totality of distributions f which 
can be represented as f= Èa <; D*f, with 
Jac BS'*(Q) for some (and any) integer k 
such that s+ k>0. The norm is defined to be 
inf È, || fall astro. In terms of the Poisson in- 
tegrals or other regularizations the same char- 
acterization holds as above. 

If Q is a domain in R” with the cone con- 
dition, then the restriction B} ,(R")— By, ,(Q) 
(resp. W>(R")—> W3(Q) for 1<p<a)isa 
bounded linear surjection with a bounded 
right inverse [13]. 

From now on we denote by By, ,, etc., 

By, q(Q), etc. for a domain Qc R” with the cone 
condition. If g <r, then B,,,< B,,,. If 1<p<2, 
then B; p= W; By». If 2<p<oo, then Bh 4 
W, = Bp,p Ifs>t, then B}, o € Bh 1 

Sobolev-Besov embedding theorems: (i) Let p 
<p’ and s—n/p=s'—n/p’. Then BS. c Bs. ,, 
Ba, p = Wp, W; BS, p and, if p' < œ, then Wp c 
W; . (it) Let 0<s<n/p and n/p'=n/p—s. 

Then By .g = Lipa and WP Lip, pl E Lp). (ii) 
Let s=n/p. Then B},ı € BC for any p and B¥, ,, 
c BMO for p< œ. (iv) Let O<n’<nand0< 
s’=s—(n—n’)/p. Then there is a bounded trace 
operator Tr: BS, ,(R")> Bs, ,(R”) that extends 
the restriction mapping S(R")>. A(R"). Tr 

is surjective and has a bounded linear right 
inverse [11-13]. 


Afu(-, t) 


(13) The Function Spaces 2, &, G,,,» 2B, and F. 
The spaces of infinitely differentiable functions 
Cò (Q) and C”(Q) are also denoted by 2(9) 
and 6(Q), respectively. The totality of func- 
tions f(x) in C*(Q) such that, for all «, D*f(x) 
belongs to L (Q) with respect to tLebesgue 
measure is denoted by 2, 9). The neighbor- 
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hood V,,, of 0 in 2, (Q) is defined to be the 
totality of functions f(x) such that ||D*f||,<e 
for any « satisfying |x| <l. In particular, 2, (Q) 
is also denoted by A(Q). (A(Q) is also used to 
denote the space of hyperfunctions.) 

A function f(x) is called a rapidly decreas- 
ing C®-function if it belongs to C”(R") and 
satisfies 


a |x|*|D*f(x)|=0 


for any « and any integer k>0. The totality of 
rapidly decreasing C®-functions is denoted by 
SF. The neighborhood V,,, of 0 in Z is defined 
to be the totality of functions f(x) such that 
(1+ |x]?)'|D%f(x)| <e for any « satisfying |a| <l. 
The spaces in this section are tnuclear except 
for 2, and &, and are employed in the theory 
of distributions [14] (— 125 Distributions and 
Hyperfunctions). When Q =R”, we usually 
omit (R”); for example, A(R") and %,,(R") are 
denoted by 2 and &, ,, respectively. 


(14) The Function Spaces 7, y M, Za,» Em} 
óm,» and = C°. Let {M, be a sequence of 
positive numbers satisfying the logarithmic 
convexity My < Mp-ı Mp+1- &m,,\(Q) (resp. 

é m,)(2)) denotes the totality of C®-functions 
f on Q such that for any compact set K in Q 
there are constants k and C (resp. for any k>0 
there is a constant C) satisfying 


| D*f(x)| < CKM gj xeK, |a| 20. 


ép} (Q) is the totality of treal analytic func- 
tions on Q and is denoted by (Q) or C°(Q). 
If {M,} satisfies the Denjoy-Carleman condi- 
tion & M,/Mp+1 < œ, then Dia, (Q)= AQ) 
ém, (9Q) and Ay )(Q)= HYN Ey, (Q) are 
dense i in BQ). Conversely, if Dry „Qi is differ- 
ent from {0}, then {M,} satisfies the Denjoy- 
Carleman condition. In this case an fe 
ĉr m,}(Q) (resp. & M») is sometimes called 
an ultradifferentiable function of class { M,} 
(resp. (M,)). The most important is the case 
where M, =p"! for an s>1. Then an fe 
ĉ; m, (Q) (resp. & mp) is called a function 
of Gevrey class {s} (resp. (s)). The topological 
properties of 2 (Q) (resp. im (Q), etc.) have 
been discussed by A. Marine (resp. H. 
Komatsu). 

For function spaces of S type — 125 Distri- 
butions and Hyperfunctions. 


(15) The Function Spaces (9), ©,(Q)= A,(Q), 
and A(Q). Let Q be an open set in C”. The 
totality of tholomorphic functions on Q is 
denoted by O(Q). @(Q) is a tnuclear Fréchet 
space with the topology of uniform conver- 
gence on compact sets. It is a closed linear 
subspace of C(Q) and also of C”(Q). 

For any pÈ 1, the totality of functions f 
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holomorphic in Q and satisfying | f(z)? dx dy 
<œ (z=x+iy;dxdy is Lebesgue mea- 

sure), denoted by @,(Q) or A,(Q), is a Ba- 

nach space with respect to the norm || f||,= 
(fal fl? dx dy)”. In particular, it is a Hil- 
bert space when p=2. 

The totality of functions bounded and con- 
tinuous on the closure of Q and holomorphic 
in Q (denoted by A(Q)) is a Banach space 
with respect to the norm || f'|| =sup,-9| f(z)|.- 


(16) The Kéthe Spaces () 2(«) and E 2” (a). 
Let {x} be an increasing sequence of se- 
quences «™ =(a), a, ...) of positive numbers. 
The echelon space () 4(*“) of G. Köthe [15] 
is the totality of sequences x =(€,,&,,...) 

such that p(x) = Xia |é; <œ for any k. 

It is a Fréchet space with the topology deter- 
mined by the seminorms p“. The co-echelon 
space X 4* (a) is the totality of sequences y= 
(n1, N2,- ) such that |7,|< Co for some C 
and k. The space s of rapidly decreasing se- 
quences and the space s’ of slowly increasing 
sequences are the echelon space and the co- 
echelon space for the sequence a!) = i*. If «= 
k', then we obtain the space of power series 
with infinite radius of convergence and the 
space of convergent power series. More gener- 
ally, let x = («;) be a sequence of positive num- 
bers. The echelon spaces for «f = exp(ka,) and 
exp(—k~'a,) are called the infinite type power 
series space and the finite type power series 
space and are denoted by A,,(«) and A, (a), 
respectively. Echelon spaces and co-echelon 
spaces have been employed to construct exam- 
ples and counterexamples in the theory of 
tlocally convex spaces by K othe [15], Grothen- 
dieck, Y. and T. Komura, E. Dubinsky, D. 
Vogt, and others. 


C. Dual Spaces 


When Q is a compact Hausdorff space, any 
bounded linear functional ® on C(Q) is ex- 
pressed by the *Stieltjes integral 


®(f)= | F(x) de(x), fe C(Q), (2) 
Q 


with a t*Radon measure o, i.e., a (real- or 
complex-valued) tregular fcountably additive 
set function defined on the Borel sets in Q. 
Since ọ is of bounded variation, the totality of 
such @ is denoted by BV(Q). Conversely, any 
ye BV(Q) gives a bounded linear functional on 
C(Q) defined by (2), and ||®|| =the total varia- 
tion of g over Q. Hence the dual space of C(Q) 
is isomorphic to the Banach space BV(Q) with 
the norm ||®||. 

Let Q be a locally compact Hausdorff space. 
Then the dual space of C,,(Q) is again the 
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Banach space BV(Q) of Radon measures of 
bounded variation. On the other hand, the 
dual space of C,(Q) is the space of all Radon 
measures on Q. N. Bourbaki takes this fact as 
the definition of measure. 

Any bounded linear functional ® on L, (Q) 
is expressed as 


D(f -Í S(x)o(x)du(x), feL (Q), (3) 
Q 


with a suitable pe M (Q); and ||®jj = lollo. 
Conversely, any o € M (Q) defines a bounded 
linear functional on L, (Q) by means of (3). 
Accordingly, the dual space of L, (Q) is isomor- 
phic to M(Q). 

The dual space of L,(Q) (1 <p < œ) is iso- 
morphic to L,(Q), where q is the real number 
defined by (1/p)+(1/q)=1 (accordingly, 1 <q < 
oo) and is called the conjugate exponent of p. 
Any bounded linear functional on L,(Q) is 
expressible by the formula in (3) (where fe 
L,(Q)) with ge L,(Q), and ||! = lll: 

The dual space of M(Q) is isomorphic to the 
normed linear space of all (real- or complex- 
valued) finitely additive set functions g defined 
on all measurable sets in Q, of bounded vari- 
ation over Q, and absolutely continuous with 
respect to the measure u given in Q (i.e., u(N)= 
0 implies p(N)=0). If l1<p<oo and l<q< 
oo, then the dual space of Lip, (Q) is isomor- 
phic to Liya (Q), where p’ and q’ are conjugate 
exponents of p and q, respectively. 

If Q is nonatomic (i.e., Q has no set of posi- 
tive measure that cannot be decomposed into 
two subsets of positive measure), then no con- 
tinuous linear functionals exist other than 
zero on S(Q) and on L,(Q) and L,,, 4)(Q) for 
O<p<il. 

The sequence spaces Co, I, (1 <p < œ), m, 
and s (the space defined in Section B (6)) are 
special cases of C,,(Q), L,(Q), M (Q), and S(Q), 
respectively. Hence their dual spaces can be 
described explicitly. For example, the dual 
space of cy (resp. /,) is 1, (resp. m), and if 1< 
p< œ, the dual space of l, is 1, (where (1/p)+ 
(1/q)=1). The coupling of x =(€,) and y= 
(y,) 1s given by È énn, The dual space of s is 
the totality of sequences (y,,) such that 4, =0 
except for a finite number of n. 

Let VMO(R") denote the closure of C)(R”) in 
BMO(R’). Then the dual space of VMO(R") is 
identified with H, (R”). This can be considered 
to be a generalization of the tF. and M. Riesz 
theorem. On the other hand, the dual space of 
H,(R") is BMO(R") [9]. Hardy spaces H,(R"), 
0<p<l1, are not locally convex but have suffi- 
ciently many bounded linear functionals, and 
their dual spaces are identified with Lipschitz 
spaces A‘(R"), where s=n(p ' —1). 

Let 1<p, q< œ. Then the dual space of 
W,(R") is isomorphic to W,,*(R") and that of 
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By, a(R”) to B,*,-(R"), where p’ and q’ are conju- 
gate exponents of p and q, respectively. The 
dual space 2’ of 2 is defined to be the space of 
tdistributions; the dual spaces of &, 2, and 
fF are (algebraically) linear subspaces of 2’. 
Similarly, the dual spaces of 2; m) and Dy 
are called the spaces of tultradistributions 
of classes {M,} and (M,), respectively. The 
dual space of æ is identified with the space of 
thyperfunctions with compact support (— 125 
Distributions and Hyperfunctions). A continu- 
ous linear functional on @(Q) is called an 
analytic functional. For each analytic func- 
tional ® there is a compact set L <Q such that 
ID( f)|<C sup,.,|f(z)|. A compact set K is 
called a porter of ® if every compact neighbor- 
hood L of K satisfies this condition. Porters 
are similar to supports of generalized func- 
tions, but an analytic functional does not 
necessarily have a smallest porter. 

The dual space of an echelon space is the 
corresponding co-echelon space. 
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A. General Remarks 


By a function-theoretic null set we mean an 
exceptional set that appears in theorem as the 
one asserting that a certain property holds 
with a “small exception.” We give below some 
of the more important examples of exceptional 
sets. For simplicity, we limit ourselves to the n- 
dimensional Euclidean space R” (n> 2). 


B. Sets of Harmonic Measure Zero 


Denote by x, the characteristic function of 

a set E on the boundary ôD of a bounded 
domain D in R". We call the thypofunction 
H, and thyperfunction H, (— 120 Dirichlet 
Problem) the inner and outer harmonic mea- 
sures of E (with respect to D), respectively. 
When they coincide, the function is called the 
harmonic measure of E. A necessary and suffi- 
cient condition for E to be of inner harmonic 
measure zero is that u<0 hold in D whenever 
a tsubharmonic function u bounded above in 
D satisfies limsupu(P)<0 as P tends to any 
point of ôD — E. This theorem implies the 
following uniqueness theorem: If h is bounded 
and harmonic in D, if E is a set of inner har- 
monic measure zero on OD, and if h(P)0 as P 
tends to any point of ôD — E, then h=0. A 
necessary and sufficient condition for E to be 
of outer harmonic measure zero is that there 
exist a positive tsuperharmonic function v in D 
such that v(P)— oo as P tends to any point of 
E. (Concerning the existence of a limit for a 
subharmonic or tharmonic function at every 
boundary point except those on a set of har- 
monic measure zero, — 193 Harmonic Func- 
tions and Subharmonic Functions.) 


C. Sets of Capacity Zero 


Although there are many kinds of capac- 
ity (— 48 Capacity), here we consider only 
tlogarithmic capacity and «-capacity (a > 0). 
Let K be a nonempty compact set in R”. Set 
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W(K)=inf, f {PQ *du(P)du(Q), where p runs 
through the class of nonnegative tRadon mea- 
sures of total mass 1 supported by K, and 
write C,(K)=(W(K))~ 1". Define C,(@) =0 

for an empty set @. For a general set ECR", 
define the inner capacity by supy_,C,(K) and 
the outer capacity by the infimum of the inner 
capacity of an open set containing E. When 
the inner and outer capacities coincide, the 
common value is called the a-capacity (or 
capacity of order x) of E and is denoted by 
C,(E). We denote the logarithmic capacity of E 
by Co(E). In order that Cy(K)=0 (n= 2) or the 
*Newtonian capacity C,_,(K)=0 (n>3) fora 
compact set K, it is necessary and sufficient 
that the harmonic measure of K with respect 
to G—K vanish for any bounded domain G 
containing K. Then K is removable for any 
harmonic function that is bounded or has a 
finite tDirichlet integral in G— K. In general, 
K is said to be removable for a family F of 
functions if for any domain G containing K 
and fe F defined in G— K, there exists ge F 
defined in G such that g = f in G—K. Let K be 
a compact set in R? with C,(K)=0 and G be a 
domain containing K. Let f be a holomorphic 
function defined in G—K for which every 
point of K is an tessential singularity. Then the 
set of texceptional values for f at every point 
of K is of logarithmic capacity zero. If f is 
tmeromorphic in |z|<1 and 


re 
acre aya? 


then f has a finite limit in any angular domain 
with a vertex at every point of |z|= 1 except for 
those belonging to a set of «-capacity zero 
(logarithmic capacity zero if «=0). 


|z|)*dxdy<oo (O<a<1), 


D. Hausdorff Measure 


Let «>0 and a set E be given in R”. Denote by 
c a covering of E by a countable number of 
balls with radii d,,d,,..., all of which are 
smaller than e (>0). As e>0, inf, >, d? in- 
creases. The limit is called the Hausdorff mea- 
sure of E of dimension « and is denoted by 
A,(E). In order for a compact set K to be 
removable for the family of harmonic func- 
tions defined in a bounded domain and satisfy- 
ing the tHölder condition of order a, it is 
necessary and sufficient that A,_,,,(K)=0. 
Next, suppose that K is a compact set ina 
plane and the complement G of K with re- 
spect to the plane is connected. Set || fll, = 

(f fol f| dxdy)"" for f holomorphic in G and 
q, 1 <q < %, and || fl o =supglf|. Denote by 
H” the family of f#0 with || f|, < œ. If p is 
defined by i/p+1/q=1, then A,_,(K)<0o 
implies H4= @ for q,2<q< œ, and A,(K)=0 
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implies H” = @. Moreover, H? = Ø if and 
only if C,(K)=0, and H? = Ø implies C,_,(K) 
=0 for q,2<q<o [1]. 


E. Null Sets Defined with Respect to Families 
of Functions 


Conversely, L. V. Ahlfors and A. Beurling 
characterized the size of sets in a plane by 
means of families of functions [2]. Let D be a 
domain, and let f represent a holomorphic 
function in D. Fix a point zo in D. Set 


B={flIfl< 1}, 


>= iri || I Pardyeal, 


€={ f|the area of R°>z}, 


where R° is the complement of the trange R 
of (f(z)—f(z9))71. Denote by SB, SD, SE 
the families consisting of constants and tuniva- 
lent functions in 8, D, ©, respectively. Use 
the notation % to represent any one of these 
six families, and define Mg = Mg(Zo; D) by 
sup{|f'(Zo)|| fe}. Then My = Mg> My= 
Mee 2 Mog= Mop, and Mg(Zo; D)=0 im- 
plies M.(z; D)=0 for any zeD. 

Denote by N; the class of compact sets K 
such that the complement K“ of K is con- 
nected and M,(z; K*)=0. We call Ke Nga 
null set of class Nx. In order for K to be re- 
movable for B or D, it is necessary and suffi- 
cient that K € Ng or € Np, respectively. We in 
general have 


If A,(K)=0, then Ke Ng. There exists a set 

K € Ng with A,(K)>0(A. G. Vitushkin, Dokl. 
Akad. Nauk SSSR, 127 (1959); J. Garnett, Proc. 
Amer. Math. Soc., 21 (1970)). When K is a 
subset of an analytic arc A, K e Ng implies 
A,(K)=0, Ny is equal to Neg, and K be- 
longs to Np if and only if C,(A)=C,(A — K). 

If an tanalytic function has an essential sin- 
gularity at every point of K € Ng, then any 
compact subset of the set of exceptional values 
at every point of K belongs to Ng. A necessary 
and sufficient condition for K € N, is either 
that the complement of any one-to-one fcon- 
formal image of K‘ be of plane measure zero 
or that any tunivalent analytic function in K“ 
be reduced to a linear fractional function. If 
the union of at most a countable number of 
Ng or Ny sets is compact, it belongs to the 
same class. But it not true for Neg sets [3]. 


F. Analytic Capacity 


For a compact set K, let Dg be the unbounded 
connected component of K°. The quantity 
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Mgl, Dx) is called the analytic capacity of K 
and is denoted by a(K). If «(K)>0, there exists 
an extremal function fo(z)=a(K)z '+..., 
called the tAhlfors function, that maps Dx 
onto a covering surface of the unit disk. In 
general, «(K) is not greater than the logarith- 
mic capacity C,(K). If K is a continuum, then 
a(K)=C)(K), and «(K) is attained by and only 
by fo(z), which maps Dx onto |w|< 1 conform- 
ally and z= œ to w=0. For a linear set K, 
a(K) is equal to a quarter of its length (C. 
Pommerenke, Arch. Math., 11 (1960)). The 
concept of analytic capacity is basic for the 
theory of rational approximation on compact 
sets. 
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A tcontinuous mapping f from the interval 
I={t|0<t<1} into a topological space Y is 
called a path connecting the initial point f(0) 
and the terminal point f(1). In particular, a 
path satisfying f(0)= f(1)= yo is called a loop 
(or closed path) with y, as the base point. For a 
path f, the inverse path f of f is defined by f(t) 
= f(1— t). When the terminal point of f and 
the initial point of g coincide, the path F de- 
fined by F(t)= f(2t) for O<t<1/2 and F(t)= 
g(2t—1) for 1/2<t<1 is called the product 

(or concatenation) of f and g, and is denoted 
by f: g. With [f] standing for the equivalence 
class of a path f under the relation of fhomo- 
topy relative to 0, 1 (EJ) (i.e., by homotopy 
with 0 and 1 being fixed), the inverse [f] += 


[f] and the product [ f]-[g]=[f-g] are 
defined. In particular, in the set of homotopy 
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classes of loops with one common base point 
Yo, the product is always defined, and the set 
forms a group 7,(Y, yo). This group is called 
the fundamental group (or Poincaré group) (H. 
Poincaré, 1985) of Y (with respect to yo). If Y is 
tarcwise connected, then z; (Y, yo) &7 (Y, y) 
for an arbitrary pair of points yọ, y,, and the 
structure of the group is independent of the 
choice of the base point. This group is de- 
noted simply by z,(Y). A continuous mapping 
¢:(Y, Vo) >(Y’, yo) induces a homomorphism 
Px: T1(Y, Vo) >n (Y, yo) by sending [f] to 
?xLf]=Leof], and (p'o p) =P% Py holds 
for the composite gy’ o of mappings. Thus 
m,(Y) is a ttopological invariant of Y. If z, (Y) 
consists of only one class (the class of the 
constant path), we say that Y is simply con- 
nected. For example, cells and spheres S” 
(n22) are simply connected. The famous 
*Poincaré conjecture states that a simply con- 
nected 3-dimensional compact tmanifold is 
homeomorphic to the 3-dimensional sphere. 
We have van Kampen’s theorem: Let P be a 
connected polyhedron, P, and P, be its con- 
nected subpolyhedra such that P, N P, is con- 
nected, and P= P, UP,. Then z, (P) is isomor- 
phic to the group (famalgamated product) 
obtained from the tfree product of z,(P,) and 
7,(P,) by giving the relations that the images 
of each element of z; (P, N P3) in 2,(P,) and in 
m,(P,) are equivalent. Also, the fundamental 
group of the tproduct spaces is the direct 
product of the fundamental groups of the 
spaces involved. Any group is the fundamental 
group of some {CW complex. The Abelization 
m,/[2,,2,] of the fundamental group z, = 
n,(Y)(Y arcwise connected) is isomorphic to 
the !-dimensional integral thomology group 
H,(Y). For example, the fundamental groups 
of a circle S! and a ttorus T” are an infinite 
cyclic group and a free Abelian group of rank 
n, respectively; the fundamental group of a 1- 
dimensional CW complex is a free group; and 
the fundamental group of an orientable 2- 
dimensional closed surface of tgenus p is a 
group having 2p generators {d,,...,d,,b,,..., 
bp} and a relation a,b,a;'b;'...a,b,a,'b, ' = 
1. If xo is a fixed point of the circle $+, then 
the fundamental group can be defined as the 
set of all homotopy classes of continuous map- 
pings f:(S', xo) (Y, yo). 

Extending the definition of the fundamental 
group by replacing I, S! with I”, S", we obtain 
the n-dimensional homotopy group (— 202 
Homotopy Theory; 91 Covering Spaces). 
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Galois, Evariste 


Evariste Galois (October 25, 1811—May 31, 
1832) was born in Bourg-la-Reine, a suburb of 
Paris. In 1828, while still in junior high school, 
he published a paper on periodic tcontinued 
fractions. Although he published four papers, 
his most important works were submitted to 
the French Academy of Science and either lost 
or rejected. He was unsuccessful in his attempt 
to enter the Ecole Polytechnique and instead 
entered the Ecole Normale Supérieure in 1829. 
Active in political affairs, he was expelled from 
school, imprisoned, and died in a duel soon 
after his release. 

The night before the duel, he left his re- 
search outline and manuscripts to his friend, 
A. Chevalier. These were published by J. 
Liouville in J. Math. Pures Appl., first series, 
11 (1846). The contents include the concept 
of groups and what essentially became the 
tGalois theory of algebraic equations. The 
manuscript also contained expressions such as 
“theory of ambiguity,” which seems to indicate 
that Galois intended to study the theory of 
algebraic functions along the same lines. 
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A. History 


After the discovery of formulas giving the 
general solutions of algebraic equations of 
degrees 3 and 4 in the 16th century, efforts to 
solve equations of degree 5 remained unsuc- 
cessful. Early in the 19th century P. Ruffini 
and N. H. tAbel showed that a general alge- 
braic solution is impossible. Shortly after- 
ward, E. tGalois established a general prin- 
ciple concerning the construction of roots of 
algebraic equations by radicals. The principle 
was described in terms of the structure of a 
certain permutation group (the Galois group) 
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of the roots of the equation. Even in this 
original form of the theory (Galois theory), 
Galois not only completed the research started 
by J. L. Lagrange, P. Ruffini, and Abel, but 
also made an epochal discovery that opened 
the way to modern algebra. J. W. R. Dedekind 
(Werke III, 1894) interpreted this result as a 
duality theorem concerning the automorphism 
groups of a field. It was shown later that 
Galois theory plays an important role in the 
general theory of commutative fields estab- 
lished by E. Steinitz. In the 1920s, W. Krull 
generalized the idea of Dedekind, using the 
concept of topological algebraic systems, and 
obtained the Galois theory of infinite alge- 
braic extensions (Math. Ann., 100 (1928)). The 
Galois theory gives a model for a successful 
theory summarizing the essentials of separable 
algebraic extensions and has led to analo- 
gous theories for other algebraic systems. For 
example, E. R. Kolchin constructed an analo- 
gous theory for differential fields where the 
Galois groups are algebraic groups (— 113 
Differential Rings, Galois theory of differential 
fields). Another line of development of this 
theory, also originated by Dedekind (Werke 
III, 1876/77), led to the Galois theory of rings, 
an object of active research by N. Jacobson 
(Ann. Math., 41 (1940)), T. Nakayama, and 
others since the 1940s. Also, recently, the 
Galois theory for some general algebraic sys- 
tems containing inseparable fields has been 
constructed by M. E. Sweedler, U. S. Chase, 
and others in which Galois groups are re- 
placed by Hopf algebras or bialgebras (— 203 
Hopf algebras). 


B. Definitions 


Given a group G of tautomorphisms of a given 
tfield L, the subfield F(G)={aeL|a’=a,ceG} 
is called the invariant field associated with G. 
For any extension fields L, L’ of K, an isomor- 
phism of L into L’ whose restriction to K is 
the identity is called a K-isomorphism. If L is a 
tnormal extension of K, any K-isomorphism 
of L into L’ is a K-automorphism of L. If an 
algebraic extension L of K is normal, the 
group G(L/K) of all K-automorphisms of L is 
called the Galois group of L/K. A tseparable 
normal algebraic extension of K is called a 
Galois extension of K. Let L/K be a finite 
normal extension; then there exist intermedi- 
ate fields M and N of L/K such that M/K is a 
Galois extension and N/K is a tpurely tnsep- 
arable extension and L= M @, N (— 277 
Modules J). Further, G(L/K)= G(L/N) = 
G(M/K) and the order of G(L/K) is equal to 
the tseparable degree of L/K, i.e. [M: K]. A 
necessary and sufficient condition for L/K to 
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be a Galois extension is that the invariant field 
associated with G(L/K) be K. A Galois exten- 
sion L/K is called an Abelian extension or a 
cyclic extension when G(L/K) is tAbelian or 
teyclic, respectively (— 149 Fields). 


C. Fundamental Theorem of Galois Theory 


Let L/K be a finite Galois extension and G its 
Galois group. Then there exists a fdual lattice 
isomorphism between the set of intermediate 
fields of L/K and the set of subgroups of G, 
under which an intermediate field M of L/K 
corresponds to the subgroup H = G(L/M); 
conversely, a subgroup H of G corresponds to 
M =F(H). The degree of extension [L: M] is 
equal to the order of the corresponding sub- 
group H (in particular, [L: K] is the order 

of G), and [M : K] coincides with the index 
(G:H). If subfields M and M’ are tconjugate 
over K, then the corresponding subgroups 
G(L/M) and G(L/M’) are conjugate to each 
other in G, and vice versa. In particular, M/K 
is a Galois extension if and only if the sub- 
group H corresponding to M is a tnormal 
subgroup of G, and in this case, the Galois 
group G(M/K) is isomorphic to the factor 
group G/H. 


D. Extensions of a Ground Field 


Let L/K be a finite Galois extension, K’/K any 
extension, and L’ the tcomposite field of L and 
K’. Then L’/K’ is also a Galois extension, and 
its Galois group is isomorphic to G(L/LN K’) 
by the restriction map. 


E. Normal Basis Theorem 


Let L/K be a finite Galois extension with 
Galois group G. Then there exists an element u 
of L such that {u° |o eG} forms a basis for L 
over K called a normal basis. If we denote by 
KEG] the *group ring of G over K, a *K[G]- 
module structure can be introduced in L by 
the operation X a,a(x)= a,x’; the existence 
of a normal basis implies that L is isomorphic 
to K[G] itself as a K[G]-module, or in other 
words, that the K-linear representation of G 
by means of L is equivalent to the tregular 
representation of G. 


F. Examples of Galois Extensions 


(1) tCyclotomic Fields. Let m be a positive 
integer not divisible by the tcharacteristic of K; 
€ a tprimitive mth root of unity, and L= K(@). 
Then L/K is an Abelian extension, and its 
Galois group is isomorphic to a subgroup of 
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the treduced residue class group (Z/mZ)*; 

in particular, if K =Q, the subgroup coin- 
cides with (Z/mZ)*, by the irreducibility of 
tcyclotomic polynomials. Hence the degree 
[Q(4):Q] is equal to (m), where ¢ is fEuler’s 
function. 


(2) Finite Fields. A ‘finite field K has nonzero 
characteristic p, and the number q of elements 
of K is a power of p. Also, K is uniquely deter- 
mined by q (up to isomorphism), hence it is 
denoted by GF(q) or F,. Thus GF(q”) is the 
only extension of GF(q) of degree n; moreover, 
it is a cyclic extension. 


(3) Kummer Extensions. Assume that K con- 
tains a primitive mth root ¢ of unity and the 
characteristic of K is 0 or is not a divisor of m. 
Denote by K* the multiplicative group of K. 
An extension L of K can be expressed in the 
form L=K(2/a,, sas Gs) (a;¢ K) if and only if 
L/K is an Abelian extension and all oe G(L/K) 
satisfy o” = 1; in this case L/K is called a Kum- 
mer extension of exponent m. There exists a 
one-to-one correspondence between Kum- 
mer extension L of exponent m over K and 
finite subgroups H/(K*)” of the factor group 
K*/(K*)", given by the relations H = L"(Q K*, 
L=K (2/H ). Moreover, there exists a canon- 
ical isomorphism between H/(K*)” and the 
tcharacter group of G(L/K), so that H/(K*)” is 
isomorphic to G(L/K). Let L= K(0) be a cyclic 
Kummer extension of degree m of K, and let o 
be a generator of the Galois group G(L/K). 
Then the Lagrange resolvent (¢, 0) = 6+ €0° 
+...4+67 10%" satisfies (C,0)7 =" (6, 0), 

(C, #)"e K, and @ and its conjugates can be 
expressed in terms of (¢, 0). In particular, L is 
generated by (¢, 0) over K. 


(4) Artin-Schreier Extensions. Assume that K is 
of characteristic p #0. For any element a of an 
extension of K, we denote by Za the element 
a’ —aand by (1/P)a a root of PX —a=0. 

A finite extension L of K is of the form L= 
K((1/P)a,,...,(1/P)a,) (a;€ K) if and only if 
L/K is a Galois extension whose Galois group 
is an Abelian group of texponent p; in this 
case, L/K is called an Artin-Schreier extension. 
There exists a one-to-one correspondence 
between Artin-Schreter extensions L over K 
and finite subgroups H/AK of the additive 
group K/AK, given by the relations H = 
PLANK, L=K((1/P)H); moreover, H/PK is 
isomorphic to the character group of G(L/K) 
(therefore also to G(L/K) itself). More gen- 
erally, for Abelian extensions L of exponent p” 
(i.c., Galois extensions whose Galois groups 
are Abelian groups of exponent p”), we obtain 
similar descriptions by using the additive 
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group of fWitt vectors of length n instead of 
K (— 449 Witt Vectors). 


G. Galois Group of an Equation 


L/K is a finite Galois extension if and only if L 
is a tminimal splitting field of a fseparable 
polynomial f(X) in K[X]. In this case, we call 
G(L/K) the Galois group of the polynomial 
F(X) or of the algebraic equation {(X)=0. The 
equation f(X)=0 is called an Abelian equa- 
tion or a cyclic equation if its Galois group is 
Abelian or cyclic, respectively, while f(X)=0 is 
called a Galois equation if L is generated by 
any root of f(X) over K. Generally, G(L/K) 
can be *faithfully represented as a permutation 
group of roots of f(X)=0. If this group is 
tprimitive, then f(X)=0 is called a primi- 

tive equation. The index of the group in the 
group of all permutations of roots ts called 

the affect of the equation f(X)=0; if the affect 
is 1, the equation f(X)=0 is called affect- 

less. Let u,,...,u, be talgebraically indepen- 
dent elements over K. Then for the polyno- 
mial F,(X)=X"—u,X""'+...+(—1)"u, in 
K(u,,...,u,)_X], the equation F,(X)=0 is 
called a general equation of degree n. The 
Galois group of F,(X)=0 is isomorphic to the 
tsymmetric group ©, of degree n, and if K is 
not of characteristic 2, then the quadratic 
subfield corresponding to the talternating 
group A, is the field K(./D) obtained by 
adjoining the quadratic root of the *discrimi- 
nant D of F(X). 


H. Solvability of an Algebraic Equation 


Assume that K is of characteristic 0, f(X)e 
K[X], and L is the minimal splitting field of 
f(X). We say that the equation {(X)=0 is 
solvable by radicals if there is a chain of sub- 
fields K=Lyo>cL,c...cL,=L such that L;= 
Li (2/a;) with some a;€ L;—,, and this is the 
case if and only if the Galois group of f(X) 

is fsolvable (Galois). In particular, Abelian 
equations are solvable by radicals. Cyclic 
equations are solved by using the Lagrange 
resolvent, and theoretically the general solv- 
able equation can be solved by repeating this 
procedure. Since S, is solvable if and only if 
n<4, it follows that a general equation of de- 
gree n is solvable only if n=1, 2, 3, 4 (Abel). 
(For a method of solving these equations 

— 10 Algebraic Equations D.) Also, a poly- 
nomial is solvable by square roots if and only 
if the order of the Galois group is a power of 
2. This fact enables us to answer some ques- 
tions concerning geometric construction prob- 
lems such as trisection of an angle or division 
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of a circumference in equal parts (— 179 
Geometric Construction). 


I. Infinite Galois Extensions 


If a Galois extension L/K ts infinite, then its 
Galois group is an infinite group. Let {M,} be 
the family of intermediate fields of L/K that 
are finite and normal over K, and put H, 

= G(L/M,). Then by taking {H,} as a tbase of 
a neighborhood system of the unity element, G 
becomes a ‘topological group. This topology is 
called the Krull topology (Krull, Math. Ann., 
100 (1928)). G is then isomorphic to the tpro- 
jective limit of the family of finite groups 
{G/H} and is ttotally disconnected and *com- 
pact. There is a one-to-one correspondence 
between the set of intermediate fields of L/K 
and the set of closed subgroups of G given by 
the map (Galois group) (invariant field), and 
thus we have a generalization of Galois theory 
for finite extensions as described in Section C. 
Various theories, including the theory of Kum- 
mer extensions, can be generalized to the case 
of infinite extensions (— 423 Topological 
Groups). 


J. Galois Cohomology 


Let L/K be a finite Galois extension and G its 
Galois group. Then both the additive group 

L and the multiplicative group L* have G- 
module structures. The tcohomology groups 
of G with coefficient module L are 0 for all 
dimensions because of the existence of a 
normal basis {— 200 Homological Algebra). 
As for the multiplicative group L*, we have 
H°(G, L*)= K*/N(L*) (N is the tnorm N, x), 
H'(G, L*)=0 (Hilbert’s theorem 90 or the 
Hilbert-Speiser theorem). Jn particular, if G is a 
cyclic group with generator o, then every 
element a such that N(a)=1 can be expressed 
in the form a=b' *. H?(G, L*) is isomorphic 
to the *Brauer group of fcentral simple alge- 
bras over K which have L as a ‘splitting field. 
In the case of number fields, a number of 
G-modules arise, such as tprincipal orders, 
tunit groups, fideal groups, tidele groups, and 
so on, whose cohomological considerations 
are important (— 6 Adeles and Ideles; 59 Class 
Field Theory). Further, let A be a group (not 
necessarily commutative), and suppose that G 
acts on A. We denote by 7a the action of ce G 
on a€ A. A is called a G-group if °(ab) ="a°%b 
for all a, be A and o€G. Then, in the same way 
as for G-modules, we can define the Oth coho- 
mology group H°(G, A) to be the subgroup A° 
of A consisting of all elements of A left fixed by 
G. The map a:ora, from G into A such that 
a,,=4,°d, (o, tE G) is called a 1-cocycle with 


661 


values in A. Denote by Z1(G, A) the set of 

the 1-cocycles of G with values in A. Two 1- 
cocycles a and a’ in Z! (G, A) are called co- 
homologous if there exists an element be A 
such that a, =b ‘a,°b for all o€G. This is an 
equivalence relation in Z'(G, A) and the 1- 
cohomology set H+ (G, A) of G with values in 

A is defined to be the set of cohomologous 
classes of Z! (G, A). H'(G, A) does not have the 
structure of a group in general, but there does 
exist an element called an identity, namely, the 
cohomologous class containing the trivial 1- 
cocycle. In general, a set X with an element p 
of X is called a pointed set; for any two pointed 
sets (X, p) and (Y, q), a map f: X + Y is called 

a morphism of pointed sets if f(p)=g. For 
pointed sets (X, p), (Y, q), and (Z,r), a sequence 
of morphisms X 4,Y4Z of pointed sets is 
called exact if Im f = Kerg. The 1-cohomology 
set H'(G, A) with the identity can be regarded 
as a pointed set. Therefore, exact sequences of 
these sets make sense and possess some of the 
properties of cohomology groups in the com- 
mutative case. For example, let 1-+A—>B- 
C1 be an exact sequence of G-groups (and 
A is central in B); then 


H°(G, A)> H°(G, B) > H°(G, C) 
+ H1(G, A)> H! (G, B) > H' (G, C) (> H2(G, A)) 


is an exact sequence of pointed sets. For 

a Galois extension L/K with Galois group 
G, a linear algebraic L-group defined over 
K (— 13 Algebraic Groups) has naturally 
the structure of a G-group, and we have 
H'(G,GL,(L))=0. Applying the above exact 
sequence to 1>SL,(L) >GL,(L)4$.L* 1, 

we have H'(G, SL,(L))=0. Also, we have 
H'(G, Sp,,(L))=0 for any integer n>1. It is 
difficult to define higher cohomology sets 
naturally, but various methods to define them 
have been obtained. 

In many cases, we are more concerned with 
the tcategory of Galois extensions of K with 
K-isomorphisms between them than with a 
single extension L/K. In other words, we con- 
sider a tfunctor L> (L) of the category of 
Galois extensions of K into the category of 
(Abelian) groups, and study the cohomology 
related to G(L/K)-module or G(L/K)-group 
structures derived from (L). In the case of 
infinite algebraic extensions, we consider the 
inductive limit of cohomology of subfields 
of finite degrees, making use of continuous 
cocycles of Galois groups relative to the Krull 
topology [8,9]. 

Let L/K be a Galois extension with Galois 
group G. Consider objects X, Y defined over K 
on which the extension of the ground field is 
defined (such as K-linear spaces with certain 
tensors on them, algebras over K (associative 
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or not), K-varieties, or K-algebraic groups 
defined over K). When X and Yare isomor- 
phic over L, Y is called a L/K-form of X. Let 
E(L/K, X) be the set of all isomorphism classes 
of X over L. Let A be the group of all auto- 
morphisms over L of X. For a L/K-form Y 
of X and an isomorphism f: X > Y over L, 
since G acts on X and Y, one can define the 
isomorphism °f: X -Y over L that satisfies 
*(f(x)) =°f(°x). In particular, A is a G-group 
by this action. Further, for o eG, define a, 
=f !-%eA, then a:o a, is a 1-cocycle of G 
with values in A that corresponds to a L/K- 
form Y of X. This induces a bijection from 
E(L/K, X) onto H1(G, A). Thus L/K-forms 
of X can be classified if one can determine 
H'(G, A). For example, H' (G, Sp,,(L))=0 is 
equivalent to saying that an L/K-form of a 
skew-symmetric bilinear form on a K-linear 
space of dimension 2n is unique up to K- 
isomorphisms. L/K-forms of a semisimple 
Lie algebra g over K can be classified by the 
1-cohomology set of the algebraic group 
Aut(g @ gL). (For the L/K-forms of algebraic 
groups — 13 Algebraic Groups M.) This de- 
scent theory can also be discussed for more 
general categories. 


K. Galois Theory of Rings 


The theory of tcentralizers in simple algebras 
can be interpreted as the theory of a certain 
Galois correspondence with respect to inner 
automorphism groups. Also, by using tcrossed 
products, we can deduce from the theory of 
centralizers in simple algebras the Galois 
theory of commutative fields. On the other 
hand, Jacobson obtained a Galois theory of 
‘division rings with respect to finite groups of 
outer automorphisms that is similar to the 
commutative case. Since then, many alge- 
braists have proceeded with investigations that 
aim either at unifying these two theories by 
admitting inner automorphisms in the group 
of automorphisms, at extending the theory 
from division rings to general rings such as 
simple rings, tprimitive rings, or tsemiprimary 
rings, or at weakening the finiteness con- 
ditions. One principal method in these theories 
lies in considering first the endomorphism ring 
Hom,(S, S) for an extension S/R and then the 
roles of endomorphisms (or derivations) in it 
[6] (— 29 Associative Algebras). 

Let K be a field of characteristic p >0 and 
L/K be a finite purely inseparable extension 
such that L” c K. The set of all tderivations of 
L/K forms a restricted Lie algebra D(L/K) 
over K. Then there exists a one-to-one and 
dual lattice isomorphic correspondence be- 
tween the set of intermediate fields M of L/K 
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and the set of restricted Lie subalgebras H of 
D(L/K), given by the relations H = D(L/M) 
and M = {aeL|d(a)=0, de H} (Jacobson [7]). 
Namely, in the case of purely inseparable ex- 
tensions, derivations or higher derivations play 
the role of automorphisms, and the bialge- 
bras defined by such derivations correspond 
to Galois groups of Galois extension. Now, 
the group algebra KG of a group G over K 
and the (restricted) universal enveloping alge- 
bra of a (restricted) Lie algebra over K both 
have the structure of a *Hopf algebra. From 
this fact, unifying the Galois theory of Galois 
extensions and Jacobson’s theory for purely 
inseparable extensions and using bialgebras 
or Hopf algebras, one can construct Galois 
theories of more general objects containing 
certain nonseparable field extensions (see M. 
E. Sweedler, Ann. Math., 87 and 88 (1968); S. 
U. Chase and Sweedler, Lecture notes in math. 
97, Springer, 1969). 
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A. Introduction and Historical Highlights 


Game theory consists of mathematical models 
used in the study of decision making in situ- 
ations involving conflict and cooperation. A 
conflict arises when each player in a game 
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selects from a list of alternatives one which, 
possibly together with chance and random 
events, leads to various outcomes over which 
the players have different preferences; thus the 
behavior of one player aiming at his own 
favorable outcome might induce unfavorable 
outcomes for others. Although the potential 
outcomes usually bring about conflicts among 
the players, there may be room for cooper- 
ation among some of them. Game theory 
attempts to extract that which is common and 
essential to such situations, to handle them by 
means of mathematical methods, and to pro- 
vide a normative guide to rational behavior 
for cach of the players. Game theory thus goes 
beyond classical theories of probability and 
decision making that are sufficient to solve 
games involving just one player and chance. 
Modern game theory started in 1944 with 
the publication of the monumental book by 
von Neumann and Morgenstern [1]. These 
authors presented many logical classifications 
of games, including a distinction between two- 
person and n-person games, between constant- 
sum (zero-sum) and general-sum games (de- 
pending upon whether the sum of payoffs 
to the players is constant (zero) or not), and 
between noncooperative and cooperative games 
(depending upon whether any collaboration 
among the players is prohibited or allowed). 
The second edition included a remarkable 
expected utility theory, which has become a 
mainstay of game theory. Von Neumann and 
Morgernstern also gave three representations 
of games. The first representation is the so- 
called extensive form; this representation has 
been slightly modified by Kuhn [2]. The sec- 
ond representation is the normal-form game; 
this is the form which the minimax theorem 
for two-person zero-sum games was estab- 
lished [3]. This theorem was generalized to n- 
person general-sum noncooperative games by 
Nash [4]. Finally, the characteristic-function 
form was the one in which the authors devel- 
oped the theory of stable sets. Several other 
solution concepts, such as the core, the Shap- 
ley value, and the bargaining set, have since 
been defined for games in characteristic- 
function form. Historically, the development of 
game theory has been closely related to vari- 
ous areas of pure mathematics, such as analy- 
sis, topology, geometry, and the foundations of 
mathematics. A survey of game theory up to 
1957 was presented by Luce and Raiffa [5]. 
In the 1950s and early 1960s, several major 
papers appeared in five issues of the Annals of 
Mathematics Studies [6]. Lucas [7] has pre- 
sented a good survey of developments to 1972, 
and Shubik [8] has surveyed the development 
of the field through 1981. Current articles 
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appear mainly in the International Journal 

of Game Theory, and also in the journals in 
fields such as operations research, manage- 
ment science, economics, political science, and 
psychology. 


B. The Extensive Form 


An n-person game in extensive form is repre- 
sented by a game (tree (i.e., a connected graph 
with no cycles (— 186 Graph Theory)) having 
the following properties: There is one special 
vertex corresponding to the starting point of 
the game. Each nonterminal vertex corre- 
sponds either to a move of one of the n players 
or to a chance move. The edges ascending from 
a vertex denote the alternatives of the player at 
this vertex. For each terminal vertex there is 
an n-dimensional vector whose components 
represent the payoffs to each player. The state 
of a player’s information at any stage can be 
described by certain subsets of the set of all his 
vertices, called information sets. At each of his 
moves, player i knows which information set 
he is in, but not which vertex he occupies 
within this set. A local strategy for player i is a 
tprobability distribution over the set of all 
alternatives at each of his information sets. A 
behavior strategy for player i is a function 
which assigns a local strategy to all of his 
information sets. A pure strategy is a special 
behavior strategy that assigns a particular 
choice to each information set. Kuhn [2] 
showed the existence of pure optimal strategies 
for n-person general-sum noncooperative 
games with perfect information (i.e., games in 
which all information sets contain a single 
vertex) as a generalization of the result for 
two-person zero-sum games given in [1]). 
Kuhn also proved the existence of equilibrium 
behavior strategies for games with perfect 
recall (i.e., games in which player i at any of his 
information sets remembers all his prior moves 
but is not aware of the prior choices of the 
other players). The definition of “equilibrium 
strategy” is given in the next section. Because 
of the complexity of n-person general-sum 
games, research on them has been minimal 
since the work of Kuhn. Recently a new attack 
on them has begun; for example, the Nash 
equilibrium point in extensive forms has been 
reexamined by Selten [9]. 


C. The Normal Form (or Strategic Form) 


An n-person game in normal form is specified 
by {N, {Xien {Fo en}, where N = {1, SRy n} 
is a set of players, X' is the set of player i’s 

strategies, and F' is a real-valued function on 
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the Cartesian product []?_, X‘, called player i’s 
payoff function. Chance is incorporated by 
invoking an extra set X° of chance moves and 
a probability distribution on X°. A two-person 
zero-sum game or matrix game is the simplest 
case in which the existence of an equilibrium 
point has been established. Equilibrium fol- 
lows from von Neumann’s minimax theorem: 
Let M'={1,...,m,}, M?7=({1,...,m,} be the 
sets of pure strategies that may be chosen by 
two players. Let a;; be player 1’s payoff when 
strategies i and j are taken by players 1 and 2. 
A mixed strategy for player i is a probability 
distribution on M'. Sets of mixed strategies 
for players 1 and 2 are thus given by X! = 
{xe R" |E x;=1, x,;20ieM'} and X? = 
{xeER™| DM, x,=1, x, 20 ie M?}. If players 

1 and 2 use the mixed strategies x! and x’, 
respectively, the expected payoff for player 1 
is F*(x",x*)=%, 2x} a,x}, which is a pay- 
off function for player 1. Von Neumann [3] 
proved 

max min F!(x!,x?)= min max F}(x!, x”). 
xiex! x?ex? x2eX2? xiex! 

We say that a pair (£t, 2?) satisfying the 
foregoing minimax theorem is an equilibrium 
point. This pair (<', $?) turns out to be a saddle 
point (— 292 Nonlinear Programming A) of 
F!(x!, x”). The duality theorem (— 255 Linear 
Programming B) is mathematically equivalent 
to this minimax theorem. A generalization of 
this equilibrium to n-person general-sum non- 
cooperative games was presented by Nash [4]. 
An n-tuple (£+, ..., $") (Że X) is a Nash equilib- 
rium if, for each i, 


a 81, A EL gm 
SPR cp xi $t, 2") forall xte Xi. 


That is, no player can improve his payoff by 
changing his strategy if all other players con- 
tinue to use the same strategies. Nash demon- 
strated the existence of this equilibrium for n- 
person general-sum noncooperative games 
with finite pure strategies. The existence of 
Nash equilibria for wider classes of nonco- 
operative games is proved by means of fixed- 
point theorems (— 153 Fixed-Point Theorems). 
Recently, multistage games, such as super- 
games and stochastic games in which games 
are played repeatedly, have become a major 
research topic of noncooperative game theory 
(with work being done on both extensive and 
normal forms). In two-person general-sum 
games or bimatrix games, there arise possibil- 
ities of cooperation (or bargaining) between 
two players. A solution concept for such situa- 
tions, the Nash bargaining solution, was pro- 
posed by Nash [10]; this solution was sub- 
sequently discussed by Luce and Raiffa [5]. 
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D. The Characteristic-Function Form (or 
Coalitional Form) 


An n-person cooperative game in characteristic- 
function form is given by a pair (N,v), where N 
={1,...,n} is a set of players and v is a real- 
valued function on a set of all subsets of N 
with v(@)=0, called a characteristic function. 
Sometimes v is assumed to be superadditive, 
i.e., (SUT) >v(S)4+ 0(T) “S, TEN with SN 
T#@. v(S) represents the worth or the power 
achievable by the subset (coalition) S when 

its members cooperate regardless of the be- 
havior(s) of the players in the complement of 
S. An n-dimensional vector x =(x,,...,X,) is 
said to be an imputation if it satisfies (i) x; > 
v({i}) “ie N (individual rationality) and (ii) 
Dien X; =V(N) (group rationality). The set of all 
imputations, denoted by A, represents all 
reasonable or realizable ways of distributing 
the available gains among the n players. An 
imputation x is said to dominate another im- 
putation y if there is some nonempty coalition 
S such that (i) x;> y; “ie S and (ii) Xes x; < 
v(S) (the effectiveness of S with respect to x). 
The original solution concept for games in 
characteristic-function form given by von 
Neumann and Morgenstern [1] is now called 
the stable-set or von Neumann-Morgenstern 
solution. A subset K of A is a stable set if (i) 
no dominance relation exists between any 
two elements of K (internal stability) and (ii) 
any imputation outside K is dominated by 
some imputation of K (external stability). The 
existence of stable sets was settled negatively 
by the ten-person example of Lucas [11]. This 
example, however, is rather specialized, and 
thus the existence of stable sets may yet be 
proved for a large class of games. Stable sets 
have been characterized for several classes 

of games, and these sets accurately reflect 

the coalition-forming processes among the 
players. Thus stable-set theory remains a ma- 
jor research topic in the theory of games in 
characteristic-function form. Many other 
solution concepts have been developed since 
that of the stable set. One of these is the core, 
defined by Gillies in [6] (its naive idea had 
already appeared in [1]). The core is the set 
C={xeA|Yie5x;20(S) SSN}. This says that 
no coalition can protest against or block an 
imputation in the core on grounds that the 
coalition can expect more. For superadditive 
games the core coincides with the set of un- 
dominated imputations, and thus the core is a 
subset of any stable set if both exist. The con- 
dition for nonemptiness of the core was de- 
rived by Shapley [12] using the duality theo- 
rem. Another solution concept, defined by 
Shapley in [6], is known as the Shapley value. 
The Shapley value is a function ¢(v) =(9, (v), 
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.--5@,(v)) sending an arbitrary characteristic 
function v on N into an n-dimensional Eucli- 
dean space satisfying the conditions (i) ~,,(zv) 
= ;(v), where z is any permutation of N and 
mv(mS)= v(S) “SEN; (ii) Dies p0) = v(S) "SEN 
such that o(T)=v(SN T) ‘TEN; and (iii) 

ilv + w) = @,(v) + 9,(w) “ie N and for any two 
games v and w. These three axioms uniquely 
determine the value 


igs 
p= Z SIE (o(8) (5 i) 
sO 


for each i, where s is the number of players in 
S. The Shapley value is an imputation, and 
can be viewed as a fair-division solution since 
the three axioms are desirable properties for 
any equitable allocation scheme. From the 
above formula, the Shapley value can also be 
interpreted as the average of the marginal 
contributions of the players in a coalition. 

A bargaining-set concept was proposed by 
Aumann and Maschler in [6]. This set de- 
scribes what payoffs are stable once a partic- 
ular coalition structure (a tpartition of N) has 
formed. Briefly, a payoff associated with a 
coalition structure is stable or in a bargain- 
ing set if there is no objection to it from any 
player; or, even with an objection, if there 
exists a counterobjection to such an objection 
from other players. For details — [6]. Since 
there are many different ways to define objec- 
tion and counterobjection, there are various 
types of bargaining sets. Some of them are 
known to be nonempty. Two additional solu- 
tion concepts, the kernel and the nucleolus, 
derive from investigations into particular 
bargaining sets. The kernel, introduced by 
Davis and Maschler [13], is always a non- 
empty subset of its bargaining set. More impor- 
tant is the nucleolus defined by Schmeidler 
[14], which is a unique imputation in the 
kernel and thus in the bargaining set. It is 
also in the core if the latter is nonempty. The 
“excess” of any nonempty SSN for an imputa- 
tion x ts defined by e(x, S)=v(S)—X;-5x;. The 
excess represents the dissatisfaction or the 
complaint of the coalition S with respect to x. 
The nucleolus is the imputation which mini- 
mizes the largest excess. If we have a tie, i.e., 
if the maximum excess attains a minimum 

at several imputations, then the next largest 
excess is to be compared, and so on. That 

is, the nucleolus is the tlexicographical mini- 
mum in the ordering of these arrangements. It 
can be computed by solving a series of linear 
programs. 

Several generalizations and variations of 
the classical von Neumann-— Morgenstern for- 
mulation of games in characteristic-function 
form have also been investigated, such as the 
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games without side payments studied by 
Aumann and others (— e.g., M. Shubik (ed.), 
Essays in Mathematical Economics: In honor of 
Oskar Morgenstern, Princeton Univ. Press, 
1967); the games in partition-function form 
proposed by Thrall and Lucas (R. M. Thrall, 
and W. F. Lucas, Naval Res. Logistics Quart., 
10 (1963), 281-298); and the games with in- 
finitely many players, in connection with which 
the Shapley value theory has become a major 
research topic [18]. 


E. Applications; Related Areas of Mathematics 


Game theory has been applied to many fields, 
such as economics, political science, manage- 
ment science, operations research, information 
theory, and control theory, as well as to pure 
mathematics. Games in extensive form are 
now important tools for analyzing the effects 
of information, and thus for solving many 
decision problems with uncertainty. Non- 
cooperative games in normal form and Nash 
equilibria have been used in the study of many 
phenomena, including oligopolistic markets 
(Friedman [15]), bidding processes, electoral 
competition, resource allocation, and arms 
control. Cooperative games have been success- 
fully applied to economics, and the relation 
between the core and competitive equilibria 
sheds further light on the theory of competi- 
tive economy. It is generally true that com- 
petitive equilibria are contained in the core. 
Debreu and Scarf [16] demonstrated that 

if the number of players approach infinity 

in a certain manner, the core shrinks to the 

set of competitive equilibria. By working in 
measure-theoretic terms, Aumann [17] was 
able to identify the core with the set of com- 
petitive equilibria. It was also demonstrated 
by Aumann and Shapley [18] that the set of 
competitive equilibria (and hence also the 
core) converges to the Shapley value under 
such formulations, provided certain conditions 
are satisfied. Another major application of 
cooperative game theory has arisen in pcittical 
science, wherein value-type solutions, such as 
the Shapley value, are widely used as indices of 
the power of each participant in various voting 
situations. Major applications are to problems 
of cost allocation for public goods such as 
water resources [19], public transportation 
systems, and telephone systems; in such appli- 
cations the core, the Shapley value, and the 
nucleolus have all been employed. The books 
[8, 20-22] are good references to the most 
recent applications of game theory. 

Game theory also has many close relations 
with various areas of pure mathematics. The 
following are typical examples. The study of 
certain types of games in extensive form has 
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increased our understanding of the axiom of 
choice (— 34 Axiom of Choice and Equiva- 
lents A) and other foundational questions. 
Nash equilibrium is closely related to fixed- 
point theorems (— 153 Fixed-Point Theorems) 
and to separation theorems (— 89 Convex Sets 
A). Cooperative game theory also has many 
connections to functional analysis and to 
convex analysis. 

Finally, we mention that the study of dif- 
ferential games (— 108 Differential Games) is 
also highly developed and widely used in areas 
such as economics and control theory. 
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A. The Gamma Function 


The function T(x) was defined by L. Euler 
(1729) as the infinite product 


1 œ 1\* x\ t! 
ZT hha) 


Legendre later called it the gamma function or 
Euler’s integral of the second kind. The latter 
name is based on the fact that for positive real 
x, we have 


| ee Tae 


0 


This function satisfies the functional relation 
I'(x+ 1)=xI(x), 


and hence for positive integral x, we have 
I'(x+1)=x!. C. F. Gauss denoted the function 
I(x +1) by J](x) or x!, even when x is not a 
positive integer. The function x! is also called 
the factorial function. The gamma function 
can also be defined as the solution of the func- 
tional equation I (x+ 1)=xT (x) satisfying the 
conditions 


T(x+n) D 
T(n)n* 


T(1)=1, lim 
Furthermore, we have 


1 = x 
= Cx 1+ — “xin 
F xe I ( + =) e 


This expression is known as the Weierstrass 
canonical form. Here C is Euler’s constant 


1 1 
Cotim( 1454. += togn 
2 n 


ro 


=0.57721566490153286060651209 ..., 
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which is conjectured to be ftranscendental, 
but as yet even its irrationality remains un- 
proved. However, it is known, that if C were 
rational, the numerator and the denominator 
would be integers of more than 30,000 digits 
(R. P. Brent, 1980). 

The numerical value of C was calculated 
by Adams (1878) to 260 decimal places, and 
recently it has been calculated to more than 
20,000 decimal places by means of an elec- 
tronic computer. Seven thousand digits have 
been computed by W. A. Beyer and M. S. 
Waterman (Math. Comp., 28 (1974)), and 
20,000 digits have been computed by Brent 
et al. (1980). 

T(x) is tholomorphic on the complex x- 
plane except at the points x=0, —1, —2,..., 
where it has simple tpoles. When Rex >0, we 
have Hankel’s integral representation 


1 
r(x)= -> | ear 
2isinzx Jo 


x # integer, 


where the contour C lies in the complex plane 
cut along the positive real axis, starting at oo, 
going around the origin once counterclock- 
wise, and ending at œ again. 

Among various properties of this function 
(— Appendix A, Table 17.1), the following two 
formulas are especially useful for numerical 
calculations: Binet’s formula 


“UE log2x 
logIT (x)= x5 logx—x+ 
S tan(t 
+2 | aA) Rays 
o e“™—i 


and the tasymptotic expansion formula that 
holds when |argx|<(7/2)— ô (6 > 0), 


log 2x 
2 


1 
loa 0)~( x5 )logx—x4 


æ Aa Bz, 
+ ee ee ee 
È 2n(2n—1)x?"7! 


(Stirling’s formula), where the B, are tBernoulli 
numbers. This last formula can be rewritten as 


P(x+1)=x!~x*e"*,/2nx, 


which is used for large positive integers x. 
The integrals 


A œ 
| ett dt, | Be dt, 
0 à 


are known as the incomplete gamma functions 
and are used in statistics, the theory of molec- 
ular structure, and other fields. The texponen- 
tial integral and the terror function (— 167 
Functions of Confluent Type D) are special 
cases of the incomplete gamma function. 


Rex>0, 
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B. Polygamma Functions 


The derivatives of the logarithm of the gamma 
function are named the digamma function (or 
psi function) (x) =dlog I'(x)/dx; the trigamma 
function w’(x); the tetragamma function y"(x); 
the pentagamma function (x), etc. These 
functions are called polygamma functions. In 
particular, w(x) is the solution of the functional 
equation 


yx+1)—y(x)=1/x, wh=—-C, 
lim (f(x +n)— (1 +n))=0. 


C. The Beta Function 


Euler’s integral of the first kind 


1 
sœ» | t= (1—4)! dt, 


0 


Rex>0, Rey>0, 


is called the beta function and is an analytic 
function of two variables x, y. This function is 
related to the gamma function as follows: 


gya O 


r(x+y) 
If the upper limit 1 in the integral is replaced 
by «, the result is called the incomplete beta 
function B,(x, y). 
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Carl Friedrich Gauss (April 30, 1777- 
February 23, 1855) was born into a poor 
family in Braunschweig, Germany. From 
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childhood, Gauss showed genius in mathe- 
matics. He gained the favor of Grand Duke 
Wilhelm Ferdinand and under his sponsorship 
attended the University of Gottingen. In 1797, 
on proving the tfundamental theorem of alge- 
bra, he received his doctorate from the Univer- 
sity of Halle. From 1807 until his death, he 
was a professor and director of the Observa- 
tory at the University of Gottingen. 

On March 30, 1796, he made the discovery 
that it is possible to draw a 17-sided tregular 
polygon with ruler and compass, which moti- 
vated his decision to devote himself to mathe- 
matics. The publication of his Disquisitiones 
arithmeticae in 1801 opened an entirely new 
era in number theory. In pure mathematics, 
he did excellent research on tnon-Euclidean 
geometry, thypergeometric series, the theory of 
functions of a complex variable, and the whole 
theory of felliptic functions. 

In the field of applied mathematics he made 
outstanding contributions to astronomy, ge- 
odesy, and electromagnetism; he also studied 
the tmethod of least squares, the theory of 
surfaces (— 111 Differential Geometry of 
Curves and Surfaces), and the theory of tpo- 
tential. He considered perfection in papers for 
publication to be of utmost importance; thus 
his published works are few relative to his 
amount of research. However, the scope of his 
work can be seen in his diary and letters, some 
of which are included in his complete works, 
comprising 12 volumes. He ts generally consid- 
ered the greatest mathematician of the first 
half of the 19th century. 


References 


[1] C. F. Gauss, Werke I-XII, Königliche 
Gesellschaft der Wissenschaften, Göttingen, 
1863-1933. 

[2] C. F. Gauss, Disquisitiones arithmeticae. 
German translation, Untersuchungen über 
höhere Arithmetik, Springer, 1889 (Chelsea, 
1965); English translation, Yale Univ. Press, 
1966. 

[3] F. Klein, Vorlesungen über die Entwick- 
lung der Mathematik im 19. Jahrhundert I, II, 
Springer, 1926-1927 (Chelsea, 1956). 


176 (XVII. 13) 
Gaussian Processes 


A. Gaussian Systems 


A system X={X,(@)|AEA} of real-valued 
random variables on a probability space 
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(Q, B, P) is said to be Gaussian, or X is a Gauss- 
ian system, if any finite linear combination 

of elements X, of X is a ‘Gaussian random 
variable. 

Any subsystem of a Gaussian system is 
again a Gaussian system. In particular, the 
joint distribution of (X,,X,,...,X,,) for any 
finite subsystem {X,|1<j<n} of a Gaussian 
system X is a multidimensional Gaussian 
distribution. This distribution is supported on 
the whole of R” or on a hyperplane (at most 
(n—1)-dimensional). Let m, be the expectation 
of X;, and let V=(V, ,) be the (positive definite) 
‘covariance matrix of {X;} given by 


Vin =E (Xm) Xm) 1<j, k<n. 


Suppose that the rank of the matrix Vis r. 
Then the distribution of (X,, X,,...,X,,) is 
concentrated on an r-dimensional hyperplane 
of R”: 


m+VR", m=(m,,m),...,m,)- 


When V is nondegenerate, that is, when r=n, 
the distribution is supported by the whole of 
R” and has a density function of the form 


(27) "Py exp] —3(0—m)¥“o—m'], 


where x =(X,,X2,...,X,)ER", |V| and V~! are 
the determinant and the inverse, respectively, 
of V, and where (x— mY denotes the (column) 
vector transpose to the (row) vector (x —m). 
The above expression is the general form of 
the density of an n-dimensional Gaussian 
distribution. The characteristic function @(z) of 
this distribution is given by 


o(z)=exp im. z) -5z a| A 


Z=(Z1, Z2, v5 Z JER; 


where ( , ) denotes the inner product on R”. 
This distribution is denoted by N(m, V). If, in 
particular, m=0 and if V is the identity matrix, 
then it 1s called the n-dimensional standard 
Gaussian distribution. 

For a general Gaussian system X= {X, | 
AeA} we are given the mean vector m} = 
E(X,), A€ A, and the covariance matrix V, „= 
E{(X,—m,)(X,—m,)}, 4, we A, which is posi- 
tive definite; for any n> 1, complex numbers 
Xis Xz,- XE C, and /,,/,,...,4,€A, we have 


n 
D AVi 4,29. 
j,k= 


1 

Conversely, given m,, EA, and a positive 
definite V=(V,_,,|A, we A), there exists a Gaus- 
sian system X = {X,|Ae A}, the mean vector 
and the covariance matrix of which coin- 

cide with (m,) and V=(Vj_,), respectively. If 
there exists another system X’ with the same 
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property as X, then X and X’ have the same 
distribution. 

A necessary and sufficient condition for the 
X, in a Gaussian system to be independent is 
that V, „=0 for every pair 2# u. For a Gauss- 
ian system X = {X,|n>1}, the tconvergence 
in probability of the sequence {X,,} is equiv- 
alent to tconvergence in the mean square. 
The limit of the sequence in this case is again 
Gaussian in distribution. Also for this system, 
the talmost sure convergence of the sequence 
implies the convergence in mean square. 

There are many characterizations of Gauss- 
ian distributions and of Gaussian systems. 

(i) A necessary and sufficient condition for a 
distribution to be Gaussian is that cumulants 
(tsemi-invariants) y, of all orders exist and 
satisfy y, =0 for all k > 3. (ii) Gaussian distri- 
butions have a self-reproducing property: If 
X and Y are independent Gaussian random 
variables, then their sum S= X + Y is also 
Gaussian. A converse to this property holds: If 
the sum is Gaussian, then, assuming that they 
are independent, X and Y are both Gaussian 
as well (P. Levy, H. Cramér). (iti) If X, and X, 
are independent and if Y, =aX,+5X, and 

Y, =cX,+dX, are also independent, then 
both X, and X, are Gaussian except for the 
trivial case b=c=0 or a=d=0. (iv) If for X 
and Y there exist U independent of X and V 
independent of Y satisfying Y=aX + U, X= 
bY+V for some constants a, b, then there are 
only three possibilities: (1) (X, Y) is Gaussian, 
(2) X and Y are independent, (3) there is an 
affine relation between X and Y. (v) Suppose 
that a distribution has a finite mean m and a 
density function of the form f(x —m). If the 
tmaximum likelihood estimate of the mean is 
always given by the arithmetic mean of the 
samples, then the distribution is Gaussian 

(C. F. Gauss). 

For any element X, of a Gaussian system X 
and for a subsystem X’ of X, the conditional 
expectation E(X,/B’) is the orthogonal projec- 
tion of X, onto X’, where B’ is the smallest ø- 
field with respect to which all the X, in X’ are 
measurable and where X’ is the closed linear 
subspace of L?(Q, P) spanned by X’. 

A system X= {X,|AeEA}, X,=(X},...,X9, 
of d-dimensional random variables is said 
to be Gaussian if the collection {Xj|ZeA, 

1 < j<d} is Gaussian. 


B. Complex Gaussian Systems 


Let Z be a complex-valued random variable 
with mean m, and denote it in the form Z = 

X +iY¥+m,i=,/—1, X, Y real. If X and Y 
are independent and have the same Gaussian 
distribution with zero mean, then Z is called a 
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complex Gaussian random variable. A system 
Z={Z,|AEA} of complex-valued random 
variables is said to be complex Gaussian if any 
finite linear combination £ c;Z ap GE C, is 
complex Gaussian. A complex Gaussian sys- 
tem has properties similar to those of a Gaus- 
sian system discussed above. For instance, two 
complex Gaussian random variables are inde- 
pendent if and only if they are uncorrelated. 
Convergence properties are also similar. Fur- 
thermore, one can define complex Gaussian 
systems consisting of higher-dimensional ran- 
dom variables. 


C. Gaussian Processes 


A real-valued stochastic process {X,} is called 
a Gaussian process or a normal process if it 
forms a Gaussian system. If {X,} is a complex 
Gaussian system, it is called a complex Gauss- 
ian process. The most important example 

of a Gaussian process ts Brownian motion 
{B,|t>0} with the properties E(B,)=0 and 
E(B,— B)? =|t—s|. There are several gener- 
alizations of this process, among them: (i) 
‘Brownian motion with a d-dimensional time 
parameter, (ii) a Wiener process with d- 
dimensional time parameter, which is a Gaus- 
sian system {X,|a=(a,,..., 4,4), all a;>0} with 
E(X,)=0 and E(X,:X,2)=[],min {a} , a7}, 
a'=(ā},..., đa) i= 1, 2. 

Since a multidimensional Gaussian distri- 
bution is completely determined by its tmean 
vector and the tcovariance matrix, a Gaussian 
process is strongly stationary if it is weakly 
stationary (— 395 Stationary Processes); and 
such a process is called a stationary Gaussian 
process. The mean value m and the spectral 
measure F(dA) are associated with a weakly 
stationary process. The measure F (dA) is sym- 
metric with respect to the origin since each X, 
is real-valued. Conversely, given such F(dA) 
and m, we can construct a real weakly station- 
ary process {X,} with mean value m and spec- 
tral measure F(dA). Generally, such a process 
{X,} is not determined uniquely; however, if 
{X,} is Gaussian, then there exists only one 
stationary Gaussian process with given m and 
F (dA). In view of this fact, stationary Gaussian 
processes can be regarded as being typical 
among weakly stationary processes. 

The discussions so far on stationary Gauss- 
ian processes are generalized to the case of 
stationary complex Gaussian processes, for 


which symmetry of F (då) need not be assumed. 


The trandom measure {M(A)} in the ‘spectral 
decomposition of a complex Gaussian process 
{X,} again forms a complex Gaussian system. 
Weakly stationary Gaussian random distri- 
butions can also be introduced. 
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D. White Noise and Gaussian Random 
Distributions 


A real-valued weakly stationary process with 
discrete parameter is called a white noise if the 
mean m=0 and the tcovariance function p(t) 
= 1 (t=0); =0 (t #0). Obviously the tspectral 
measure F (dA) is the tLebesgue measure dA. 

In the continuous parameter case, a weakly 
stationary random distribution (— 407 Sto- 
chastic Processes C) is called a white noise if m 
=0 and p=6 (6 is fDirac’s delta function: 6(¢) 
= ¢(0)). The spectral measure is, therefore, 

the Lebesgue measure. In both cases Gauss- 
ian white noise {X,} or {X,} is determined 
uniquely. It has independent values at every 
point in the following sense: In the discrete 
parameter case, X, , X,,,...,X, are mutually 
independent for any mutually distinct t4, 
t>,...,¢,. For a continuous parameter Gauss- 
ian white noise, often called simply white 
noise if no confusion occurs, X, ,Xy,,.-.,X 
are mutually independent if the supports of 
Y1,P2,---,Q, are disjoint. In the latter case 
Gaussian white noise is realized by taking the 
derivative of a Brownian motion. The tcharac- 
teristic functional c(@) is given by 


1 
ew)=exp| -3101 |, 


where || || is the L?(R!)-norm. 

Characteristic functionals of general Gauss- 
ian random distributions are expressed in the 
form 


Pn 


, 
c(p) =exp imo —3 Ko, o|, 


where m is the mean functional and K is the 
covariance functional. 


E. Representations of Gaussian Processes 


The family of tstochastic integrals X, = 

fa F(t,u)dB,, t>a (t>a if a= —oo), based 

on Brownian motion, defines a Gaussian 
process {X,|t>a}. The converse problem 

is, however, not obvious. Given a Gaussian 
process {X,|t>a} with E(X,)=0, the problem 
is to find a Brownian motion {B,} and a kernel 
F that give a representation of the above form 
(P. Lévy [10]). A more specific problem dis- 
cussed below is important. If, in addition, the 
representation is formed in such a way that 
B,(X)=B,(B) holds for every t, then the repre- 
sentation ts called canonical, where B,(X) is the 
smallest o-field with respect to which all the 
X, S<t, are measurable. The canonical repre- 
sentation, if it exists, is unique up to equiva- 
lence, i.e., F(t, u)* is unique. The existence of 
the canonical representation is, however, not 
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always guaranteed. Mean continuous, zero 
mean, purely nondeterministic stationary 
Gaussian processes have tbackward moving 
average representations, which are nothing but 
the canonical representations. Once the canon- 
ical representation of {X,} is obtained, the 
kernel, together with known properties of {B,}, 
tells us important properties of the given pro- 
cess, ¢.g., sample function properties, Markov 
properties, and so forth. 

Let {X,|t> a} be a general Gaussian process 
with E(X,)=0, and let M,(X) be the subspace 
of L?(Q, P) spanned by the X,, s<t. Assume 
that (i) {X,} is separable, i.e., M „ (X) is sepa- 
rable and (ii) {X,} is purely nondeterministic 
in the sense that (\,M,(X)= {0}. Then X, is 
expressed as a sum of stochastic integrals: 


xX=5 | F(t, u) dBi, 
J Ja 


where the {Bi}, j= 1,2,...,N (N can be oo), 
are mutually independent tadditive Gaussian 
processes. In addition, the B/ can be taken in 
such a way that the measures dv/(u) = E(dB/)?, 
j=1,2,...,N is decreasing in j (i.e., dv! >> 

dv? >>...) and that B,(X)=\/,B,(B) holds for 
every t. Such a representation is called a gen- 
eralized canonical representation of {X,} [3]. It 
always exists under the above assumptions (i) 
and (ii), although it is not unique when N >2. 
The number N, called the multiplicity of {X,}, 
is independent of the choice of a generalized 
canonical representation. It is a good question 
to ask when {X,} has a simple unit multiplicity 
or when {X,} has the canonical representation. 
No interesting answer to this question has 
been given so far except for stationary Gauss- 
isn processes. 


F. Gaussian Markov Properties 


A tsimple Markov Gaussian process {X,|t >a} 
has the canonical representation if it is sepa- 
rable and purely nondeterministic and if its 
covariance function never vanishes. It is of the 
form 


x, -so | g(u) dBiu), 


t 


f(t)40, [ctu duo for t>a. 


a 


A generalization of the simple Markov prop- 
erty of a Gaussian process is given in the 
following manner (suggested by P. Lévy [10]). 
If the tconditional expectations E (X,,/B,(X)), 
for any choice of distinct t,,t,,...,tysj>t 
(j20) span an N-dimensional subspace of 
L?(Q, P), then the process {X,} is called an N- 
ple Markov Gaussian process. If the canonical 
representation of an N-ple Markov Gaussian 
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process exists, then it is expressed in the form 
N t 

X,=) sof g(u) dB, 
j=1 a 


where det(f;(t;)) never vanishes for any choice 
of distinct t; (>a), 1<i<.N, and where the 

g; are linearly independent vectors in L?([a, t]) 
for any t >a. Conversely, a Gaussian process 
given by the above expression is N-ple Mar- 
kov. A stationary N-ple Markov Gaussian 
process has a fspectral density function f(A) of 
a specific form, namely, it is expressed in the 
form [3] 


f(A)=|Q(iA)/P(iA)/’; P, Q polynomials, 
deg P=N and degQ<N 


(rational spectral density function). Y. Okabe 
[14] proved that the roots of P(x)=0 are all 
real, and introduced a multiple Markov prop- 
erty of a stationary Gaussian process {X,} in 
a much wider sense to prove that {X,} enjoys 
this property if and only if it has a rational 
spectral density. 

A somewhat restricted definition of multiple 
Markov properties for a Gaussian process 
uses differential operators. Assume that X, is 
(N — 1)-times differentiable with respect to the 
L?(Q, P)-norm. If there is an Nth order dif- 
ferential operator L= Xio a,(t)D™ * with 
D=d/dt such that 


LX,=B,, _B,=dB,/dt white noise, 


then {X,} is called an N-ple Markov Gaussian 
process in the restricted sense. Such a process is 
naturally N-ple Markov, and the canonical 
representation always exists. The kernel of the 
representation is the tRiemann function of the 
differential operator L that was used in the 
definition. The spectral density function corre- 
sponding to this process is of restricted form, 
namely, it is expressed in the form 1/| P(iA)|?, 
where P is a polynomial of degree N, and 
P(i4)=0 has no root in the lower half-plane. 
Many attempts have been made to define 
a Markov property of a Gaussian random 
field, namely, a Gaussian system with a multi- 
dimensional time parameter; however, only 
two significant approaches are mentioned here. 
Let {X,|aeR*! be a Gaussian random field 
with d-dimensional time parameter. H. P. 
McKean [12] gave a Markov property in the 
following manner. Let A, be the o-field gen- 
erated by the X,, xeU, U(c Rô) open. For 
a closed set C, we set F-=(), Fc, where C, is 
the e-neighbourhood of C. If, for any open 
set U, Fy and Fy: become independent under 
the assumption that Fy is known, then {X,} 
is called a Markov Gaussian random field in 
the McKean sense. It has been proved that 
a Brownian motion with d-dimensional time 
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parameter is Markov in this sense if d is odd, 
while it is not Markov if d is even. Further 
detailed investigations of Markov properties 
for Gaussian random fields have been given 
by L. Pitt, S. Kotani, Y. Okabe [17], and 
others. 

Another definition of Markov properties 
of Gaussian trandom fields, in fact those of 
Gaussian random distributions, has been given 
by E. Nelson [13] in connection with Eucli- 
dean fields (invariant under Euclidean mo- 
tion) in quantum theory. Let X, be a Gauss- 
ian random field. For a smooth surface Tin 
R? let F, be the a-field generated by Xa, D= 
Y ® 6(n), Y being a smooth function with 
compact support and n being normal to T. If 
Fp and Fp become independent under the 
assumption that Fp, I = ôD, is given, then 
{X} is called a Markov Gaussian random 
field in the Nelson sense. Given a Gaussian 
Euclidean field which is Markov in the Nelson 
sense, then under some reasonable assump- 
tions, one can form a field (in the sense of 
tquantum field theory) satisfying the tWight- 
man axiom by changing imaginary time to 
real time. 


G. Sample Function Properties 


Some finer results about the smoothness of 
sample functions have been obtained for station- 
ary Gaussian processes. G. A. Hunt proved that 
if the spectral measure F (dA) of a stationary 
Gaussian process {X,} has finite moment of 
order «, then almost all its sample functions 
satisfy the following tHolder condition of 
order « [5]: For every finite interval J and 
every constant C, there exists a sufficiently 
small hy =hg(I, C) such that 


[Xt+n— Xi < Cl h (log 1/14)? 


for any tel and |h| <ho. The following result is 
due to Yu. K. Belyaev: Sample functions of a 
stationary Gaussian process are with proba- 
bility 1 either continuous or unbounded on 
every finite interval. 

Other conditions for continuity of sample 
functions are given in terms of the fcovariance 
function. Let v(s, t) be the covariance func- 
tion of a Gaussian process {X,|te[0, 1]}. Ifv 
satisfies 


|v(s,,t)—v(sy,t)|<e|s,—s|" 


(0<a<I, v(0,s)=0, s,,s,€[0,1]), 
then 
xX, —X, 
P( lim sup oem Crea eee A T 
òjo -elsalti — talt lloglti— tall 


sudne 1, 
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where c, is a constant. This result is due to Z. 
Ciesielskt. 

Conditions for continuity of sample func- 
tions have also been given for Gaussian pro- 
cesses with multidimensional time. parameters. 
For {X,|te[0, 1]"}, set d(s, t)=E[(X,—X,)7]"”. 
If there exists a function @ monotone increas- 
ing on some interval 0<u<a such that 


d(s,t)< g(iis—tll), 


where || || is the metric in R”, and such that 


+a 
| p(e*)dx< a, 


then almost all sample functions are continu- 
ous (X. Fernique [2]). Conversely, if there 
exists a monotone increasing function ¢ satis- 
fying d(s, t)= ọ(||s—t||) and if sample functions 
are continuous with positive probability, then 
the above integral should be finite. 

Suppose that {X,} is a stationary Gaussian 
field. R. M. Dudley [11] proved that if there 
exists a number q > 1 and a neighborhood 
V=V(0) in R” such that 


Sq -*(log N(V,q-*))"? < 00, 
k 


where N (V, £) is the minimum number of £- 
balls (relative to the metric d) that cover V, 
then almost all sample fields of {X,} are con- 
tinuous, and Fernique [18] proved that the 
converse statement is also true. Thus the 
problem of continuity of the sample fields of 
stationary Gaussian fields has been completely 
solved. 


H. Gaussian Measures 


Let {X,,teT}, T an interval, be a Gaussian 
process. It gives a probability distribution on 
the measurable space (R’, 2), where 2 is the 
o-field of Borel subsets of RT. Such a mea- 
sure P is called a Gaussian measure. Let P$, 
i= 1, 2, be Gaussian measures induced by 
Gaussian processes {X}, te T}, i=1, 2, respec- 
tively. Then for these measures concepts such 
as tabsolute continuity, fequivalence, and 
‘singularity are introduced in the usual man- 
ner. The most significant property for Gauss- 
ian measures is that two measures P! and P? 
are either equivalent or singular. A powerful 
criterion for testing this dichotomy is the 
tentropy of P? with respect to P', given by 


P?(A)) 
P*(A))) 


H(P?|P*)= sup), {Pr adtog 


where the sup is taken over all finite parti- 
tions a= {A,,...,A,}, A;eB, |] A;=R" (Yu. 
A. Rozanov [15]). The measures P* and P? 
are equivalent (resp. singular) if and only if 
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H(P?|P')< (resp. = œ). This statement can 
be paraphrased in terms of the Hilbert spaces 
H(X), i=1, 2, spanned by {X$}, i= 1, 2, respec- 
tively. For simplicity, assume that E(X/)=0. 
Then, P! and P? are equivalent if and only if 
the following two conditions hold: (i) A map- 
ping A determined by AX} = X? is extended 
to an invertible linear transformation from 
H(X') to H(X”); (ii) There exists a symmetric 
operator B of tHilbert-Schmidt type such that 
(Ag, An). =((T — B)&, n), where (č, 7); = E(čn) is 
the inner product in H(X’). Next, suppose that 
{Xt}, i=1, 2, are not centered. Set E(X} =mi, 
i=1, 2. Then P! and P? are equivalent if and 
only if {X}, with X? = X! — mi satisfy the fol- 
lowing condition: (itt) in addition to conditions 
(i) and (11) above, there exists an element č in 
H(X') such that m} —m? = E[E,X/], te T. 
Given equivalent Gaussian measures P! 
and P? on (R7, 2), T=[0, to], it is in general 
not easy to obtain the fRadon-Nikodym de- 
rivative; however, if one of them, say Pt, is the 
tWiener measure, then one can obtain ọ = 
dP'/dP? explicitly (M. Hitsuda [4]). Assume 
that P? is derived from {X,}. By assumption, 
X, has the tcanonical representation 


x,=8,-| ‘|, Ks.) dB, +a) ds, teT, 
o 0 


where {B,! is the *Brownian motion from 
which Pt is assumed to be derived and where 
le L?(T), ae L?(T). With this notation, we can 
write 


o= =o] | f I(s, 1)dB, +a) dB, 
1 fto s 2 
-Í (| ls u)dB, +a) as| 
2Jo \Jo 


This result yields a remark about general 
Gaussian measures: If P! and P? are equiva- 
lent, {X},te T} has a canonical representation 
if and only if {X7,teT} does. 


I. Nonlinear Functionals 


Start with nonlinear functionals of a Brownian 
motion {B,,teR'}, and call them Brownian 
functionals. If {¢,} is a complete orthonormal 
system in L*(R'), then the collection {X, = 
Jo,(u)dB,} of tstochastic integrals forms a 
Gaussian system of independent standard 
Gaussian random variables. A Fourier- 
Hermite polynomial based on {¢,} is a poly- 
nomial in X, expressible as a finite product in 
the form 


cf] H,(Xj/./2), c a constant. 
J 


The sum n= È n; is the degree of this polyno- 
mial. If n> 0, its mean is zero and the variance 
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is [e]? T1(n;! 2"). Taking c to be [],(n! 2") "1, 
the collection of all Fourier-Hermite polyno- 
mials based on {¢,} forms a complete ortho- 
normal system in the Hilbert space (L?) con- 
sisting of all Brownian functionals with finite 
variance. Denote by #, the subspace of (L?) 
spanned by all the Fourier-Hermite polyno- 
mials of degree n. Then the Hilbert space (L?) 
admits a direct-sum decomposition: 


(L=) OH, 
n=0 


which is called the Wiener-Itô decomposition 
[7]. The subspace #%, is eventually indepen- 
dent of the choice of a complete orthonormal 
system {p}. A member X of #%, is referred to 
as a multiple Wiener integral of degree n and is 
expressed in the form 


=|. [io.s2.s59dB, dB, dB, 
R” 


where f is a symmetric L?(R")-function. In ad- 
dition, E(X?)=n!||f\|?, || || being the L?(R")- 
norm. 

Brownian functionals can be expressed as 
functionals of white noise, so that the Hilbert 
space L?(j:) is viewed as a realization of (L?), 
where y is the probability distribution of the 
Gaussian white noise introduced in Section D. 

Nonlinear functionals of a general Gaussian 
process can be dealt with in a similar but 
somewhat more complicated manner. If the 
process has a canonical representation, then 
the functionals can easily be rewritten as 
Brownian functionals. 


J. Applications to Prediction Theory 


If a Gaussian process {X,|t >a} has the canon- 
ical representation 


x= | F(t,u)dB,, 
then the best tpredictor E(X,/B,(X)), s<t, is 
given by 


3 
| F(t, u)dB,,. 
a 
This is in agreement with the best tlinear 
predictor. 

No systematic approach for nonlinear pre- 
diction theory has been established so far. We 
illustrate this theory only in a typical case that 
arises from a stationary Gaussian process. Let 
{X,} be a real-valued stationary Gaussian 
process. Without loss of generality, we can as- 
sume that E(X,)=0. We consider the continu- 
ous parameter case, since the discrete param- 
eter case is easier and can be inferred from it. 

Let M,(X) be the subspace of L?(Q, P) de- 
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fined in the same way as in Section E, and let 
L,=L?(Q,B,(X)). If the process is purely non- 
deterministic in the sense that { \,M,(X)= {0}, 
then X, has the backward moving average 
representation 


X= |  flt—s)dB, 


where {B,} is a Brownian motion. Further- 
more, this representation is canonical, 1.e., 
B,(X)=B,(B). Since B is assumed to be \/,B,, 
every Y in (L°) can be expressed in the form 


Y= X | al | F815 S25 
p=0 J -a ~O 4 -0O 


...55,)4B,, dB,,...dB, , 


in terms of multiple Wiener integrals. The 
predictor of Y based on the known values 
X,,5<0, that is, the projection of Y on Lg is 
given by truncating the domain of the integral 


00 0 S3 S2 
X | a | Joksi: 52 
p=0 J -o0 -0J -0 


.-,5,)4B, dB, ... dB,- 


K. Extrapolation and Interpolation 


Extrapolation of a stationary Gaussian pro- 
cess is used to find the best estimate of the 
unknown values of a given process when we 
have been given the only some of the past 
values of the process. M. G. Krein [8] ob- 
tained some results by using the theory of the 
inverse spectral problem discussed by I. M. 
Gelfand and B. Levitan. 

In contrast to extrapolation is the interpola- 
tion of a stationary Gaussian process. There 
is an important contribution to this prob- 
lem by H. Dym and H. P. McKean [1] who 
developed the Kreïn theory by applying the 
de Branges method to the theory of entire 
functions. 
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A. General Remarks 


A power series g(t)= Èpo a,t” in t that con- 
verges in a certain neighborhood of t=0 de- 
fines the sequence of numbers {a,}. The func- 
tion g(t) is called the generating function of 
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the sequence. Similarly, the series K(x, t)= 
Eco f(x)t” that is convergent for x and t 

in a certain domain in (x, t)-space is called the 
generating function of the sequence of func- 
tions { f,(x)}. Sometimes the function g(t) = 
Dr ola,/n!)t” is called the exponential generat- 
ing function of the sequence {a,'. For exam- 
ple, the generating functions of the tbinomial 
coefficients and tLegendre polynomials 

are (1 +4)" and (1 —2tx +17)", respectively. 
When a generating function of {a,' or { f,(x)} 
is given, we can obtain a, or f(x) by integral 
expressions; for example, in the latter case we 
have 


K(x, 0) 
dt, 
Int) “2ni =a, at 


where the contour C is a sufficiently small 
circle going counterclockwise around the 


origin. A generating function can be continued 


beyond the domain of convergence of the 
power series. Simple generating functions are 
known for many important orthogonal sys- 
tems of functions (— 317 Orthogonal Func- 
tions). Generating functions are widely used 
because they enable us to derive analytically 
the properties of sequences of numbers or 
functions. For a system of numbers or func- 
tions depending on a continuous parameter 
mstead of the integral parameter n, we define 
the generating function in the form of a tLa- 
place or tFourter transform. In particular, 
for a probability ‘distribution function F(x), 
the exponential generating function f(t)= 
fee dF (x) for the moments {a,} of F(x) 
is called the moment-generating function of 


F(x). There is another kind of generating func- 


tion for the sequence a: 2. a,n * in the form 
of tDirichlet series; it is frequently used in 
number-theoretic problems. 


B. Bernoulli Polynomials 


A system of polynomials 


Bo)= > G) B,(0)x" = 
k=0 


is defined by the generating function 


B,(x) is called the Bernoulli polynomial of 
degree n. Since B,(0) is the coefficient of 
t*/k! in 


t ie t” 
= Y B,0)—, 
1 2 ( I 


e'— 


te xe. 


we have Bo(0)= 1, B,(0)= 


n> 1. The nth Bernoulli number B, (— Ap- 


—1/2, By(0)= 1/6, ...: 
B,,4,(0)=0 for n> 1; and (—1)""' B,,,(0)>0 for 
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pendix B, Table 3) is usually defined by |B,,(0)]. 
Sometimes other definitions, such as B, = B,(0) 
or B,=B,,(0), are used. Bernoulli polynomials 
satisfy the relations 

B,(x+1)—B,(x)=nx""|, 
dB,(x)/dx a nB,- (x), 


which are used in tinterpolation. For example, 
a polynomial solution of the tdifference equa- 
tion Ly f(x)=xd"_, a,x" is given by 
fx) = 9 a, Bay 0d /(n + 1) + (arbitrary con- 
stant). In e we have 1°+2"+...+p"= 
(Ba (p+ 1)— Bri (Dnt 1). 


C. Euler Polynomials 
A system of polynomials 


E= (ex 


is defined by the generating function 


2e” ā £ 
=}, En 


e e+ n=0 


TA 


We call E,,(x) the Euler polynomial of degree n. 
Here a, is defined by a, = E,(0) and 


=F a 


e =, n=0 


t” 


so that we have ag = 1, a, = —1/2, a3 = 1/4, ...; 
a>, =0 for n> 1. The nth Euler number is 
sometimes taken as a,, but more often it is 
defined by 


n n 
E,=(—1)" 2 
Seri (Ja 


i.e., by 


% E, : 
Baa aa 1) a x 

(— Appendix B, Table 3). All the E,, are inte- 
gers, E,,,4;=0(m=0,1,...) and E,,,>0 (m= 
0,1,...); in the decimal expressions for E,, the 
last digit is 5 for E,,,(m>1) and 1 for E4m+2 
(m>0). Sometimes Ez, is denoted by E„. We 
have the relations 


E, (xX) + E,(x + )=2x", 
E,(1 —x)=(—1)"E, (x), 


dE,(x) 


dx = nEn- (x), 
and in particular, 
—1" +2" —3"+4"—...+(—1} p" 


=((—1) E (p + 1)— E„(1))/2. 
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D. Application to Combinatorics 


Let p(n) denote the number of ways of dividing 
n similar objects into nonempty class. This 

is called the number of partitions of n. Euler 
noticed that the following formula is valid for 
the generating function of p(n): 


t+ Š porst| Fax] 


whence we obtain p(n) = p(n—1)+ p(n—2)— 
p(n—5)—p(n—7) + p(n— 12) + p(n—15)— 
Die (- 1)" [p(n — (3k? — k)/2) + p(n— (3k? + 
k)/2)] (— 328 Partitions of Numbers). Let 
B(n) denote the number of ways of dividing n 
completely dissimilar objects into nonempty 
classes. These are called Bell numbers. The 
first few are 1, 1, 2, 5, 15, 52, 203, 877, 4140, 
21147, ... [2]. For this sequence with B(0)=1, 
the ional generating function 1°, B(n)x" 
does not converge except at x =0. However, 
the exponential generating function g(x)= 

2. B(n)(n!)"1 x" is convergent for all complex 
numbers x and is equal to e® '. The differen- 
tial equation g' =e*g gives rise to a recursive 
formula 


B(n+1)= EC Jz% 
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178 (VII.6) 
Geodesics 


A. General Remarks 


In the study of global differential geometry, 
geodesics play an essential role because the 
behavior of geodesics on a complete Riemann- 
ian m-manifold (m2) M without boundary 
heavily influences its topological structure. A 
significant merit of using geodesics is that one 
can make elementary and intuitive observa- 
tions on M (as in Euclidean geometry), which 
often yields fruitful results. 

Riemannian structure induces in a natural 
way a distance function d on M, and hence M 
is a metric space. A smooth curve y of constant 
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speed is by definition a geodesic if and only if 
every point on y is contained in a nontrivial 
subarc whose length realizes the distance be- 
tween its endpoints. M is equipped with the 
Levi-Civita connection V (— 80 Connections) 
by means of which a geodesic is characterized 
as an autoparallel curve. In local coordinates, 
y is obtained as a solution of an ordinary 
differential equation of order two (— 80 Con- 
nections L). It follows from the properties of 
solutions of the equation for geodesics that 
every point pe M has a neighborhood in which 
every two points can be joined by a unique 
shortest geodesic that depends smoothly on 
both of the endpoints. A classical result of J. 
H. C. Whitehead (Quart. J. Math., 3 (1932)) 
states that every point pe M has a metric ball 
B centered at p such that every two points in B 
are joined by a unique shortest geodesic whose 
image is contained in B. Let M, be the tangent 
space to M at p, and let M,<M, be the set 

of all vectors ve M, such that the solution of 
geodesic y with initial condition »(0)= p, y(0) = 
v is well defined on [0, 1]. M p contains an 
open neighborhood of the origin and is fstar- 
shaped with respect to the origin. The ex- 
ponential mapping exp,: M,—M is defined to 
be exp, v=y(1). This mapping exp, is smooth 
and has the maximal rank at the origin. Small 
balls around p are obtained as the image 
under the exponential mapping of the corre- 
sponding balls in M, centered at the origin, 
and the restriction of exp, to these balls are 
diffeomorphisms. Thus the topology of M asa 
metric space is equivalent to the original one 
of M. The fundamenta! Hopf-Rinow theorem 
(Comment. Math. Helv., 3 (1931)) states that (1) 
M is complete as a metric space if and only if 
M,=M, for some pe M; (2) M is complete if 
and only if every closed metric ball is compact; 
and (3) if M is complete, then every two points 
can be joined by a shortest geodesic, namely, a 
geodesic with length realizing the distance 
between the endpoints. 

In the following discussion M is assumed to 
be connected, complete, and without boundary. 
Let y:[0,1]—>M be a geodesic. A piecewise 
smooth 1-parameter variation V along y is a 
continuous mapping V:[0, 1] x (—«,6)>M 
with a finite partition 0=t)<t,<...<t,=1 
such that V |[t;, ¢;.; ] x (—é, £) is smooth and 
V(t, 0)= y(t) for all te[0, 1]. Then the first and 
second variation formulas for V are 


L'(0)=<Y, 9) [o/LQ) 


and 


L"Q)\= > ji (KY, YO — CRG PY, Yat 


+ Vajas Y,, ý> re t |w 
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respectively, where Y,(t)=0V(t, s)/@s|,-, for 
té[t;_,,t,] is the variation vector field as- 
sociated with V|[t,;_,,t;] x (—e, €), L(s) is the 
length of the variation curve t> V(t, s), and 
Y; = V; Y; and R is the curvature tensor. 

A smooth vector field J along a geodesic 
y:[0,1]—>M is called a Jacobi field if and only 
if it satisfies J” + R(J,¥)~=0. The set of all 
Jacobi fields along y forms a vector space iso- 
morphic to Mo) x Myo by the natural corre- 
spondence J (J (0), J’(0)). Every Jacobi field 
is associated with a l-parameter geodesic varia- 
tion V along y. Namely, every variation curve 
of V is a geodesic, V is smooth, and J(t)= 
OV(t, s)/s| s-o for all te [0, 1]. Conversely, 
the variation vector field of any 1-parameter 
geodesic variation V along y is a Jacobi field. 
Especially, if A is a tangent vector to M, at 
(0), then d(exp,)..9) A=J(1), where J is the 
Jacobi field associated with the 1-parameter 
geodesic variation V(t, s)=exp,t(j(0)+ 5A), 
and A is identified under the canonical parallel 
translation in M,. A point y(f9) is said to be 
conjugate to p=7(0) along y if and only if there 
exists a nontrivial Jacobi field along y that 
vanishes at 0 and tọ. A theorem of Morse and 
Schoenberg states that if the sectional curva- 
ture K of M satisfies 0< A < K < B< œ, then 
every unit speed geodesic y:[0,]]—M has a 
point conjugate to (0) if l>n//A and has no 
point conjugate to (0) if 1< n//B . Especially, 
every geodesic y on M with K <0 contains no 
conjugate point on it, and exp, is locally regu- 
lar for any pe M. Thus a complete and simply 
connected M with K <0 is diffeomorphic to 
R” (Hadamard and Cartan). 

Let y be a unit speed geodesic with y(0)= p. 
If t, >0 satisfies d(p, y(t))=t for all te[0,t,) 
and d(p, p(t))<t for all t>¢,, then the point 
y(t,) is called a cut point to p along y. It ap- 
pears no later than the first conjugate point to 
p along y. The set C(p) of all cut points to p is 
called the cut locus of p. Let U be the set of 
vectors tve M,, 0<t<1, where exp,v is the cut 
point to p along y(t)=exp, tv. U is a nonempty 
open set and exp,|U: U>M — C(p) is a diffeo- 
morphism, where M — C(p) is diffeomorphic 
to an m-disk. The cut locus possesses essential 
information on the topology of M. 

The metric comparison theorems of H. 
Rauch (Ann. Math., (2) 54 (1951)) and M. Ber- 
ger (Illinois J. Math., 6 (1962)) are stated as 
follows: Let infy K =d > —œ and sup,, K=A< 
œ, and let y:[0, 1] M be a nontrivial geo- 
desic. Let M(c) be a complete and simply 
connected space form of constant curvature 
c. Fix geodesics y5, ya: [0, 1] M(6), M(A), re- 
spectively, with the same speed as y. If J, J;, 
and J, are Jacobi fields along y, ys, and y4 
respectively such that J(0)=J,;(0)=J,(0)=0 
and ||J"(O)|| = |175(0)| = || 44(0)||, then ||J4(2)|| < 
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J (t)l| < |[Js(t)|| holds for all te[0, t,], where 
Yalta) is the first conjugate point to y,(0) along 
ya (Rauch). If the Jacobi fields satisfy || J(0)|| = 
I ¥5(0)|| = || J,(0)|| and J’(0) = J5(0) = J4,(0) =0, 
then || J,4(0)|| < JON < || J5(t)|| holds for all 
te[{0,t,], where t4 is the first zero point of 
J, (Berger). They are often applied to compare 
curve lengths as follows. Let c:!>R" be a 
piecewise smooth curve with ||c(t)|| <t,. Fix 
points pe M, p€ M (ô), and pae M(A) with 
fixed isometric identifications of the tan- 
gent spaces with R”. Then L(exp,,oc)< 
L(exp,oc)<L(exp,,oc). If P, P;, and P, are 
unit parallel fields along corresponding geo- 
desics o, 65, ca: IM, M(6), M(A), respec- 
tively, which have the same speed, and if 
<P, =<P;, 65> = <P}, G>, then for every 
piecewise smooth function f:1—>[0, t4] the 
curves u(s)=exp,,,f(s)P(s) and u,(s), ua(s) de- 
fined in the same way on M (ô), M (A), respec- 
tively, have lengths L(u,)< L(u) < L(u,). They 
play important roles in the theory of geodesics. 

A geodesic triangle is a triple (yo, Yı, Y2) of 
shortest geodesics (for convenience they are 
parametrized on [0, 1]) such that y,(1)=y;.,(0) 
for all i=0, 1, 2 with mod 3. The vertices po, 
Pı» P2 and the angles 0o, 0}, 8, of a geodesic 
triangle (yo, y1; %2) are defined by p;,, =y;(0) 
and 0;.,=cos~"{<},(0), — Yi- (D/P: lP- I}, 
where angles are always taken in [0, z]. The 
triangle comparison theorem of Toponogov 
(Amer. Math. Soc. Transl., 37 (1964)) is stated | 
as follows. Let K >6> —œ be satisfied on 
M. For any geodesic triangle (yọ, y1, 72) on M 
there is a geodesic triangle (fo, 94,92) on M (ô) 
such that L(},)= L(y,) and the angles 8, of this 
triangle satisfy 0,<0, for i=O, 1, 2. It turns 
out that if 6>0 then the circumference of 
any geodesic triangle on M does not exceed . 
2n/./6. 

For details of the basic facts stated above we 
refer the reader to [1-3]. 


B. Curvature and Fundamental Groups 


The fundamental group z,(M) of M(— 170 
Fundamental Groups) is influenced by the 
curvature of M. A basic idea, going back to 
Hadamard (J. Math. Pures Appl., 4 (1898)) 
and Cartan, states that every nontrivial ffree 
homotopy class 6€2,(M) of loops on a com- 
pact M contains a closed geodesic of minimal 
length whose preimage under the covering 
projection z:M—M in the universal Riemann- 
ian covering M is either closed (when ô is of 
finite order) or a straight line (when 6 is of 
infinite order) that is translated along itself by 
the tdeck transformation 6. 

(1) By using the second variational formula, 
an even-dimensional compact M with K >0 
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either is simply connected or has z,(M)=Z, 
(Synge, Quart. J. Math., 7 (1936)). A beautiful 
result of Myers (Duke Math. J., 8 (1941)) states 
that if the Ricci curvature Ric of M is bounded 
below by 6>0, then the diameter d(M) is not 
greater than n/./5 and hence M is compact 
and 7,(M) is finite. It is proved in [3] that if 
K 20, then there exists a finite normal sub- 
group ® of z,(M) such that 2,(M)/® is a 
Bieberbach group (— 92 Crystallographic 
Groups). The splitting theorem (— Section 
F(2)) is used in the proof. 

(2) When K <0 is satisfied on M, M is dif- 
feomorphic to R”, and hence z;(M)=0 for all 
i>2. A basic tool used in the study of z,(M) in 
this case is that the displacement function X—> 
d(X, 5(%)), Xe M, of every isometry ô on M is 
convex [4], where a function on M is said 
to be convex if and only if its restriction to 
every geodesic ts convex. A classical result of 
Preissmann (Comment. Math. Helv., 15 (1943)) 
states that every nontrivial Abelian subgroup 
of z,(M) of a compact M with K <0 is an 
infinite cyclic subgroup. It is proved in [4] that 
if K <0 holds on M and if every deck trans- 
formation on M translates some geodesic 
along itself, then z; (M) is a disjoint union of 
infinite cyclic subgroups, and any two com- 
muting elements belong to the same cyclic 
subgroup. Moreover, if M is compact and 
K <0 and if z,(M) is a direct product I, x I, 
such that z,(M) is tcenterless, then M is iso- 
metric to the Riemannian product M, x M,, 
and z,(M;)=T; holds for i=1, 2 [5,6]. It is 
shown in [7] that the fundamental group of 
M with K <0 occurs as that of an (m+ 1)- 
dimensional M’ with K <c <0, which is diffeo- 
morphic to M x R. The warped product [4] 
is used to construct such a metric on M x R. 


C. Cut Locus 


The injectivity radius i(M) of M is defined to 
be the infimum of the continuous function 
x—d(x, C(x)), xe M. As is seen in Section D, 
the estimate of injectivity radii provides many 
fruitful results on the topology of Riemannian 
manifolds. It follows from Synge’s result that 
an even dimensional compact and orientable 
M with 0<K <1 has its injectivity radius 
i(M)>z [3]. However, such an estimate can- 
not be obtained in odd dimensions. The ex- 
amples discussed in [8] show that there are 
infinitely many homotopically distinct homo- 
geneous Riemannian 7-manifolds of posi- 

tive curvature. The injectivity radii of such 
examples are estimated precisely in [9], ac- 
cording to which there is no positive lower 
bound for them. The sphere theorem of Kling- 
enberg (Comment. Math. Helv., 35 (1961)) 
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states that if M is compact and simply con- 
nected and if 1/4<K <1, then i(M)2>7, and 

M is a topological sphere. This extends the 
pioneering work by H. Rauch (Ann. Math., (2) 
54 (1951)). A rigidity theorem of M. Berger 
(Ann. Scuola Norm. Sup. Pisa, 14 (1962)) states 
that if dim M is even, M is simply connected, 
and 1/4<K <1, then M is homeomorphic to 
S” (if d(M) > n) or isometric to one of the sym- 
metric spaces of compact type of rank 1 (if 
d(M)=72). A slight generalization of the trian- 
gle comparison theorem is used to prove that 
for given positive constants d, V, and S, there 
exists a constant C,,(d, V, S)>0 such that if K, 
d(M) and the volume v(M) satisfy |K|<S, 
d(M)<d, and v(M)> V, then i(M)> C,,(d, V, S) 
[10]. 


D. Finiteness Theorem 


Finiteness theorems are a natural extension 

of the sphere theorem. They provide a priori 
estimates for the number of various topolog- 
ical types of manifolds (for instance, homology, 
homotopy, homeomorphism, diffeomorphism 
types, etc.), which admit certain classes of 
Riemannian metrics characterized by geo- 
metric quantities. The basic idea of the esti- 
mates is to find a constant c>0 that depends 
only on geometric information which char- 
acterizes the class of manifolds so that if M be- 
longs to the class, then i(M)>c. Then a number 
N is found from the information given a priori 
such that every element M of the class has an 
open cover of at most N balls whose radii are 
all less than c. It is proved in [11] that for 
given d€(0, 1) and m, there exists a number 
N(6,m) such that there are at most N (ô, m) 
homotopy types for the class of all simply 
connected 2m-dimensional manifolds with 
d6<K <1. Furthermore, for given positive 
numbers d, V, and S, there are at most finitely 
many homeomorphism (or diffeomorphism) 
types for the class of all m-manifolds M, each 
of which has the property that d(M) <d, v(M) 
>V, and |K|<S [10]. This result is applied to 
obtain a result in [12], which states that there 
exists for given m, V>0 and an e>0 such that 
if M is compact, —1—e< K < —1, and v(M)> 
V, then M admits a metric of constant nega- 
tive curvature. 

For an ¢>0, M is said to be -flat if and only 
if supy K-d(M)* <c is satisfied for M. Every 
compact flat manifold is ¢-flat for all ¢>0. M is 
called a nilmanifold if and only if it admits a 
transitive action of a tnilpotent Lie group. It 
is shown in [13] that for a compact M with 
dim M =m, there exists a number é(m)>0 such 
that if M is e(m)-flat, then (1) there is a maxi- 
mal nilpotent normal divisor N c z, (M), (2) 
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the order of 2,(M)/N is bounded by a constant 
which depends only on m, and (3) the finite 
cover of M which corresponds to N is diffeo- 
morphic to a nilmanifold. Moreover, if M is 
e(m)-flat then M is diffeomorphic to R”. If M is 
e(m)-flat and if z,(M) is commutative, then M 
is diffeomorphic to a ‘torus. 


E. Uniqueness Theorem 


Uniqueness of topological structures (as in the 
sphere theorem) of certain classes of compact 
Riemannian manifolds is discussed here. 

The discovery of texotic 7-spheres by J. 
Milnor (Ann. Math., (2) 64 (1956)) gave rise to 
the question of whether in the sphere theorem 
the conclusion (homeomorphism to the sphere) 
could be replaced by diffeomorphism. Since 
the number of differentiable structures on a 
topological sphere depends on its dimension 
(— 114 Differential Topology), it has been 
thought that in order to get the standard 
sphere in the sphere theorem the best possible 
restriction for the curvature might also depend 
upon dimension. The appropriate differenti- 
able pinching problem is to find a sequence 
{Am} so that if M is compact and simply con- 
nected and if ô< K <1 for some 6>A,,, then 
M is diffeomorphic to S”, and A,, is the least 
possible with this property. D. Gromoll and Y. 
Shikata proved independently [14, 15] that A,, 
<1 holds for all m>2. Later it was proved 
that there exists a 6) €(1/4, 1) such that A,, < ôo 
holds for all m>2 [16]. The diffeotopy theo- 
rem, which plays an essential role in the proof 
of finding such a do, provides a sufficient con- 
dition for a diffeomorphism on S”~' to be 
tisotopic to the identity mapping. 

A different idea is put forth in [17], which 
imitates the Gauss normal mapping of a 
closed convex hypersurface in R™*?. It is 
proved that if the curvature operator is suffi- 
ciently closed to the identity, then M is diffeo- 
morphic to S”. This idea is used to obtain a 
diffeomorphism between M and a spherical 
space form. A generalization of the sphere 
theorem states that if M is compact and if 
(d(M)/x)*inf,, K > 1/4, then M is a topological 
sphere [18]. 


F. Noncompact Manifolds 


Let M be noncompact. It is due to the nature 
of noncompactness that through each point on 
M there passes a ray y:[0, 0)>M, e.g., y is a 
unit speed geodesic any of whose subarc is 
minimizing. 


(1) Busemann Functions. A Busemann function 
F,:M—>R with respect to a ray y is defined by 
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F,(x)=lim,.,,. [t—d(x, y(t))]. The original defi- 
nition of it was used by H. Busemann to define 
a parallel axiom on straight G-spaces (— Sec- 
tion H). F, has the property F,'((—00,t,])= 
{xe F,((—00, tI) d(x, OF, *((—00, t2])) > 
t,—t,} for any t, >t,, and hence F,(x)=t, — 
d(x, 6F, '((—00, t,])) for all t, and x with 
F,(x)<t,. It has been proved, by using the 
second variation formula, that if Ric>0, K > 

0 [19] or if, in the case where M is Kählerian, 
the holomorphic bisectional curvature [20] 

is nonnegative, then F, is subharmonic (sh), 
convex or ‘plurisubharmonic (psh), respec- 
tively, where the holomorphic bisectional 
curvature is defined as R(X, JX, JY, Y) for the 
complex structure J and orthonormal vectors 
X and Y. If the holomorphic bisectional curva- 
ture [20] is positive, then F, is strictly psh. If 
K >0, then the triangle comparison theorem 
implies that F =sup{F,|y(0)=p} is convex and 
is an exhaustion [21], where the sup is taken 
over all rays emanating from a point pe M, 
and a function f is said to be an exhaustion if 
and only if f~'((—0oo, a]) is compact for all 
aER. A well-known theorem of H. Grauert 
(Math. Ann., 140 (1960)) states that if M admits 
a strictly psh exhaustion function, then M is a 
*Stein manifold (— 21 Analytic Functions of 
Several Complex Variables). In this context, 
various conditions for curvatures on Kahler 
manifolds under which they become Stein 
manifolds have been found [20]. 

Let H be complete and simply connected, 
and K <0. Busemann functions on H are 
differentiable of class C?, and the thorosphere 
F,"({t}) is a C?-surface [22]. On a parabolic 
visibility manifold (— Section F(3)) the nega- 
tive of every Busemann function is C!-convex 
without minimum [7]. 


(2) Ends and Splitting Theorems. Ends of a 
noncompact M are defined as follows: If A, 
and A, are compact subsets of M with A, c 
A,, then any component of M — A, is con- 
tained in a unique component of M—A,. An 
end is the limit of an inverse system {compo- 
nents of M — A; A} directed by the inclusion 
relation as indicated above and indexed by 
{A|A compact}. 

It has been shown that if Ric >0 [23] (or 
20 [19]), then M has exactly one end (or at 
most two ends). A visibility manifold has at 
most two ends if it is not Fuchsian [7]. If M 
admits a locally nonconstant convex function, 
then M has at most two ends [24]. 

If M has more than one end, then there 
exists a straight line y:R-M which is by 
definition a nontrivial geodesic any of whose 
subarcs is minimizing. A classical result of 
Cohn-Vossen (Mat. Sb., 1 (1936)) states that if 
dim M =2, K >0 and if M contains a straight 
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line, then the total curvature is 0 and hence M 
is isometric to either a plane or a cylinder S! x 
R. Generalizations of this result state that if 
M admits k independent straight lines and if 
K>0(Toponogov, Amer. Math. Soc. Transl., 
37 (1964)) (or if Ric>0 [19]), then M is iso- 
metric to the Riemannian product N x R*. 


(3) Structure Theorems. The structure theorem 
of 2-dimensional M with K >0 was proved by 
Cohn-Vossen (Compositio Math., 2 (1935)). It 
has been shown that M is diffeomorphic to R” 
if K >0 [23] and that if K >0, then there 
exists a compact totally geodesic submanifold 
S without boundary such that M is homeo- 
morphic to the total space of the tnormal 
bundle over S [21]. Here, homeomorphism 
can be replaced by diffeomorphism, and if 
K >0 outside a compact set of M, then M 
is diffeomorphic to R” [26]. 

For a ray y and a point x on H, if o is a 
ray with o(0)=x and if d(y(t), o(t)), t >0, is 
bounded above, then ø is called an asymptotic 
ray (a ray asymptotic) to y. The asymptotic 
relation is an equivalence relation on the set 
of all geodesics on H. A point at infinity of H 
is an equivalence class on geodesics on H, 
and the set of all points at infinity of H is 
denoted by H(co). With a suitable topology, 
the set of all points at infinity of H constitutes 
a bounding sphere such that H = HU H(«) 
is a closed m-cell. For a properly discontin- 
uous group D of isometries acting on H, a 
closed D-invariant limit set L(D) is obtained 
in H(o) as the set of all accumulation points 
of 6(p), ĝe D. M = H/D is a visibility manifold 
if and only if H satisfies the following axiom: 
any two distinct points on H(0o) can be joined 
by at least one geodesic. If K <c <0 is satisfied 
on M, then M is a visibility manifold [7]. 
By investigating limit sets, visibility manifolds 
are Classified into three types [7]. (1) M is 
parabolic; M is diffeomorphic to N x R and 
is characterized by the fact that it has a convex 
function without minimum. (2) M is axial; M 
is a vector bundle over St, and hence it is 
diffeomorphie to either S! x R™~! or the prod- 
uct of a Möbius strip with R™~”. (3) M is 
Fuchsian; M has more than two ends, and a 
strong algebraic restriction is imposed on the 
fundamental group of M. 


(4) Convex Functions. Some elementary prop- 
erties of C” convex functions on M have 
been investigated and it was proved in [4] 
that if M admits a C” convex function with- 
out minimum and if K <0, then M is diffeo- 
morphic to N x R, where N is a level hyper- 
surface. When K >0, F can be replaced by a 
strongly convex exhaustion function. Namely, 
for every compact set Ac M there is a d6>0 
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such that the second difference quotient along 
every geodesic at any point on A is bounded 
below by ô [20]. A standard tconvolution 
smoothing procedure yields smooth convex 
approximations in neighborhoods of compact 
sets for every strongly convex function such 
that their second derivatives along every geo- 
desic is positive. Global approximations can 
be constructed [27]. Let @: M—R be a convex 
function which is not constant on any open 
set. It has recently been proved that (1) if b> 
infy > —oo, then there is a homeomorphism 
H:o7'({b}) x (infy p, 00) M — {minimum 

set of ọ, if any} such that go H(y, «) =a for all 
yeo'({b}) and for all «e(infy p, 00); (2) if @ 
takes a minimum, then the continuous exten- 
sion H:[inf,, p, 00) M of H is proper and 
surjective [24]. 


G. Manifolds All of Whose Geodesics Are 
Closed 


As is well known, every symmetric space of 
compact type of rank 1 (it is abbreviated 
CROSS as in [28]) has the property that all 
geodesics are simply closed and of the same 
length. Let M be a compact Riemannian mani- 
fold with the property that all geodesics on M 
are simply closed and that they have the same 
length (SC property). The problem discussed 
here is whether such an M is isometric (or at 
least the topology of such an M is equivalent) 
to some CROSS, or if it is possible to classify 
all such manifolds. 

An example of a nontrivial SC-manifold 
was first constructed by Zoll (Math. Ann., 57 
(1903)) on S? as a surface of revolution in R?, 
which is not isometric to a standard sphere. 
Blaschke conjectured that every SC-structure 
on PR? is the standard real projective space. 
This has been solved affirmatively by L. W. 
Green (Ann. Math., 78 (1963)). In higher di- 
mensions it has been proved that every in- 
finitesimal deformation of the standard SC- 
structure on PR” is trivial [29]. A general 
result for a SC-manifold M states that the 
volume of M (with dim M =m) with period 22 
is the integral multiple of the volume of the 
standard unit m-sphere and this integer is a 
topological invariant [30]. For a point pe 
M, if every geodesic segment with length / 
emanating from p is a simple geodesic loop at 
p, then M is called a SC?-manifold. It has been 
proved that the tintegral cohomology ring of 
every SC?-manifold is isomorphic to one of the 
CROSS [31], which is a generalization of 
Bott’s theorem (Ann. Math., 60 (1954)). 

An essential difference between the metrics 
of a Zoll surface and a standard sphere is seen 
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on the cut locus and the conjugate locus of a 
point. The (tangent) cut locus of any fixed 
point of a CROSS coincides with the (tangent) 
first conjugate locus; however, this does not 
hold on a Zoll surface. This observation gives 
rise to the definition of a Blaschke manifold: 
For distinct points p, ge M, let A(p,q)< M, be 
the set of all unit vectors tangent to the mini- 
mizing geodesics from q to p. M is said to be 
a Blaschke manifold at a point p if for every 
qeC(p), A(p, q) is a great sphere of the unit 
hypersphere in M, centered at the origin. M is 
called a Blaschke manifold if it is so at every 
point of it. The following statements are equiv- 
alent [28]: (1) M is a Blaschke manifold at 

p, (2) the tangent cut locus C, at p is spher- 
ical in M, (3) along every geodesic starting 
from p, the first conjugate point p appears 

at a constant length and the multiplicity of 

it is independent of the initial direction. Then 
such an M can be exhibited as DU, E, where 
D is a closed ball with center p, E a C®-closed 
k-disk bundle over an (m—k)-dimensional 
C”-compact manifold with boundary dE dif- 
feomorphic to S”1, and a:6D->@E an tattach- 
ing diffeomorphism. Conversely, if M is ex- 
hibited as DU, E then there exists a metric g 
on M such that (M, g) is a pointed Blaschke 
manifold at p, where p is the center of D. A 
generalized Blaschke’s conjecture states that 
every Blaschke manifold is isometric to a 
CROSS. A partial solution to this conjecture 
has been obtained by M. Berger and states 
that if (S”, g) is a Blaschke manifold, then it is 
isometric to a standard sphere [28]. 


H. G-Spaces 


G-spaces were created by Busemann to show 
that many global theorems of differential 
geometry are independent of the Riemannian 
character of the metric and also of smooth- 
ness. This approach naturally led to novel 
questions and results. (Many facts hold even 
when the distance is not symmetric; — [34].) 
The symbol G suggests that the principal 
property of G-spaces is the existence of geo- 
desics with the same properties as in complete 
Riemannian manifolds without boundary 
except for differentiability. A G-space X is 
defined as follows: (1) X is metric with (sym- 
metric) distance d; (2) every bounded infinite 
set has at least one point of accumulation; (3) 
given two distinct points p, re X, there is a 
point qe X such that p#q#r and d(p,qg)+ 
d(q,r)=4d(p,r); (4) to each point xe X a posi- 
tive number p, is assigned such that for any 
p, qE X with d(p, x)<p,, d(q, x) <p, there is 

a point r with d(p, q)+ d(q, r)=d(p, r); (5) if 
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d(p, q)+4(q,r)=d(p,r) and if d(p, q)+ d(q,r')= 
d(p,r’), then d(q,r)=d(q, r) implies r=r’. 

From (2) and (3) it follows that any two 
points on X can be joined by a curve called a 
segment whose length realizes the distance 
between them. A geodesic arc (or for simplicity 
geodesic) y:[a,b]—X is a curve such that any 
subarc contained in a p,-ball around some 
point xe X is a segment. From (4) and (5) it 
follows that every geodesic arc has a unique 
infinite extension to both sides. If a geodesic 
arc is extended infinitely to both sides, then it 
is called a geodesic line. 

The absence of smoothness causes an essen- 
tial difference in the fact that the distance 
function to a fixed point on X is not necessar- 
ily convex in a small ball around it, in contrast 
to the Whitehead theorem (— Section A) for 
Riemannian manifolds (or tFinsler spaces). 
Here a function on X is called convex if and 
only if its restriction to every arc length— 
parametrized geodesic line is convex. 

Every two points on X have neighborhoods 
which are homeomorphic to each other. Thus 
the topological dimension of X(— 117 Dimen- 
sion Theory) is well defined. A 1-dimensional 
X is either a circle S! or the whole real line. If 
dim X =2 [33] or if dim X =3 [36], then X isa 
topological manifold. 

It should be noted that G-space theory not 
only proves known Riemannian theorems 
under weaker conditions, but also leads to 
many facts which were either not considered 
previously or well understood in (or thought 
to be very different from) the Riemannian case. 
Among many fruitful topics discussed by 
Busemann [33], only two are included below. 


(1) G-Surfaces. For each point p on a 2-dimen- 
sional G-space Z, which is called a G-surface, 
let S,={xeS|d(x, p)=p}, where 0<p<p,/4 is 
a fixed number. S, is homeomorphic to a circle 
and bounds a 2-disk. An angle A at p is the set 
of all segments emanating from p and passing 
through the points on a connected subarc of 
Sp. An angular measure |:| for the angles at 

p is a nonnegative function which satisfies: 

(1) |A|=7 if and only if p is the midpoint of 
the segment joining two points on S, which 
bounds A; (2) if the intersection‘of two angles 
A, and A, is a unique segment, then | A, U A,| 
=|A,|+|A,]|. If a triple of points (po, p,, p2) on 
F are contained in a sufficiently small ball, 
then the three segments joining them define 

a triangle, and each vertex p; has the angle 
|Pi-1 PiP;+1| determined by the two segments. 
An excess £(po p; P2) of a triangle (po, p4, p2) is 
defined to be &(poP1 P2)=|PoP1P2|+1P1P2Pol 
+|P2Pop,|—7. A degenerate triangle has _ 
excess 0. If a triangle (a, b, c), which consists 

of geodesic arcs, is simplicially decomposed 
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into a finite number of triangles (a;, bi, c;), i= 
1,...,k, then e(abc) = £$- e(a;b;c;), and this 

is independent of the choice of finite decompo- 
sition. If Y is compact and orientable, the 
total excess (S) of Z is defined by e(Y)= 
Et e(a;b;,c,)), where ¥ is simplicially decom- 
posed into nondegenerate finite triangles 
(a;,b;,¢;), i=1,...,k. Then e(Y)=22x(S), 
where y() is the tEuler characteristic of /. 
The results obtained by Cohn-Vossen [35] on 
the ttotal curvature of complete open surfaces 
have also been generalized to G-surfaces with 
angular measure uniform at z. Although no 
angular measure leading to a true analog of 
the tGauss-Bonnet theorem exists even in G- 
surfaces [34], many of its applications, in 
particular the results by Cohn- Vossen, can be 
proved by using angular measures which are 
uniform at z [33]. 

If every two points on X can be joined by a 
unique geodesic line, then all geodesic lines on 
X are simultaneously either straight lines or 
circles of the same length /. If in the latter case 
dim X > 1, then X has a two-fold universal 
covering space X whose geodesic lines are all 
circles of the same length 2/, and they have the 
property that all geodesics through a point on 
X meet at the same point at length /. For the 
Riemannian case it is a Blaschke manifold (— 
Section G). If every two points on Z can be 
joined by a unique geodesic line, then ¥ is 
either a plane or a projective plane. 


(2) G-Spaces with Nonpositive Curvature. Let 
(Po, P1; P2) be a triple of points on X which are 
contained in a small ball. X, by definition, ts of 
nonpositive curvature if and only if for any such 
triple of points l 


d(Pi, Pi+ı)>2d(pi, Piz) for i=0,1,2,  (*) 


where p; is the midpoint of the unique segment 
joining p;-; to p;.,. If the above inequality is 
strict for any nondegenerate small triangle, 
then X is said to be of negative curvature. X of 
curvature Q is defined in the same way. It 
should be noted that a G-space of curvature 0 
is a locally Minkowskian space [33]. The 
sectional curvature of a Riemannian manifold 
M is nonpositive if and only if (*) holds for all 
small triangles on it. Because of d(po, Po) < 
A(Po, Pi) + 4(P1 Po) < {4( Po. P1)+ (Dos Pa) }/2s 
the distance function to a point xe X of non- 
positive curvature is convex on a small ball 
around x. X is called straight if and only if all 
nontrivial geodesic lines on X are straight 
lines. If X has nonpositive curvature, then 

the universal covering space X is straight. 
Moreover, for any two geodesic lines y, ¢:R> 
X, the function td(a(t), y(t)) is convex. In 
particular, the distance function to every fixed 
point on X is convex, and (*) holds for any 
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triangle on X. Thus X is contractible. Every 
nontrivial element of the homotopy class of 
loops on X with base point xe X contains a 
geodesic loop at x which has the minimum 
length. This fact and the strict convexity of 
t->d(o(t), y(t)) for any two arc length para- 
metrized geodesic lines y, o on the universal 
covering X of X with negative curvature imply 
that if X is compact, then every Abelian sub- 
group of the fundamental group 7, (X) of X is 
infinite cyclic. This is a generalization of the 
Preissmann theorem (— Section B). 
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A. General Remarks 


A geometric construction problem is a problem 
of drawing a figure satisfying given conditions 
by using certain prescribed tools only a finite 
number of times. If the problem is solvable, 
then it is called a possible construction problem; 
if it is unsolvable, even though there exist 
figures satisfying the given conditions, then it 
is an impossible construction problem. If there 
does not exist a figure satisfying the given 
conditions, then we say that the problem is 
inconsistent. 

Among problems of geometric construction, 
the oldest and the best known are those of 
constructing plane figures by means of ruler 
and compass. In this article, we call these 
problems simply problems of elementary geo- 
metric construction. The following are some of 
the more famous problems of this kind; the 
first four are possible construction problems. 

(1) Suppose that we are given three straight 
lines }, m, n and three points P, Q, R in a plane. 
Draw a triangle ABC in such a way that ver- 
tices A, B, C lie on l, m, n and sides BC, CA, 
AB pass through P, Q, R (Steiner’s problem). 

(2) Suppose that we are given a circle O and 
three points P, Q, R not lying on O. Draw a 
triangle ABC inscribed in O in such a way that 
the sides BC, CA, AB pass through P, Q, R 
(Cramer-Castillon problem). 

(3) Draw a circle tangent to all of three 
given circles (Apollonius’ problem). 

(4) Suppose that we are given a triangle. 
Draw three circles inside this triangle in such a 
way that each is tangent to two sides of the 
triangle and any two of the circles are tangent 
to each other (Malfatti’s problem). 

(5) Let n be a natural number. For the divi- 
sion of the circumference of a circle into n 
equal parts (consequently, the construction of 
a tregular n-gon) to be a possible construction 
problem, it is necessary and sufficient that 
the representation of n as a product of prime 
numbers take the form n=2*p, ... p,, where 
AZO, p1,..-,p, are all different prime numbers 
of the form 2" +1 (tFermat number) (C. F. 
Gauss, 1801). 

(6) The following are three famous impos- 
sible construction problems of Greek origin: (i) 
divide a given angle into three equal parts 
(trisection of an angle); (ii) construct a cube 
whose volume is double that of a given cube 
(duplication of a cube or the Delos problem); 
and (iii) construct a square whose area is that 
of a given circle (quadrature of a circle). P. L. 
Wantzel (1837) proved that problems (i) and 
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(ii) are impossible except for the special cases 
of 2/2, 7/4, etc. for (i); and C. L. F. Lindemann 
(1882) proved the impossibility of (iii) while 
proving that the number z is ftranscendental. 


B. Conditions for Constructibility 


A problem of elementary geometric construc- 
tion amounts to a problem of determining a 
certain number of points by drawing straight 
lines that pass through given pairs of points, 
and circles having given points as centers and 
passing through given points. Let (a,,5,), 

(az, 65), ...,(a,, bn) be rectangular coordinates 
of given points, and let K be the smallest 
tnumber field containing the numbers a,, 
...,0,. Straight lines that join given pairs of 
points and circles that have given points as 
centers and that pass through given points are 
represented by equations of the first or second 
degree with coefficients belonging to K. Con- 
sequently, the coordinates of points of intersec- 
tion of these straight lines and circles be- 

long to a quadratic extension K’=K (Vd) of 
K. Let A be the set of coordinates of the points 
that are to be determined. Then the problem is 
solvable if and only if any number « in A is 


contained in a field L=K(/d,,./d2, canes a 
where d,,,€K(./dy,...,/d;) (i=0,1,...,7—1). 
Thus L is a normal extension field of K whose 
degree over K is a power of 2. Using this 
theorem we can prove the impossibility of 
trisection of an angle and duplication of a 
cube. 

Since the 18th century, besides the problem 
of construction by ruler and compass, prob- 
lems of construction by ruler alone or by 
compass alone have also been studied. We 
state here some of the more notable results: 

(1) If by drawing a straight line we mean the 
process of finding two different points on that 
line, then we can solve all the problems of 
elementary geometric construction by means 
of compass alone (G. Mohr, L. Mascheroni). 
(2) If by drawing a circle we mean the process 
of finding its center and a point on its circum- 
ference, and if a circle and its center are given, 
then we can solve any problem of elementary 
geometric construction by means of ruler 
alone. (3) It is not possible to find the center of 
a given circle by ruler alone (D. Hilbert). (4) It 
is impossible to bisect a given segment by ruler 
alone. (5) When two intersecting circles or 
concentric circles are given, we can find the 
centers of these circles by ruler alone. When 
nonintersecting and nonconcentric circles are 
given, it is not possible to find their centers by 
ruler alone (D. Cauer). 

Cases have been considered in which the 
radius of a circle we can draw by compass or 
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the length of a segment we can draw by ruler 
is required to satisfy certain conditions. Also, 
various considerations have been made con- 
cerning cases in which we can use tools other 
than ruler and compass. For example, it is 
known that although not all possible elemen- 
tary geometric construction problems are 
solvable by ruler alone, all these problems are 
possible if we have either a pair of parallel 
rulers, a square, or a triangle having a fixed 
acute angle. If we use a square and a compass, 
then the trisection of an angle and the dupli- 
cation of a cube are possible (L. Bieberbach). 
Also, when a conic section other than a circle 
is given, the trisection of an angle and the 
duplication of a cube become possible by ruler 
and compass (H. J. S. Smith and H. Kortum). 
By ruler and ttransferrer of constant lengths, 
we can solve Malfatti’s problem but not Apol- 
lonius’ problem (Feldblum). 

Even when a problem is possible, the 
method of construction may be rather com- 
plicated and impractical. In these cases, vari- 
ous methods of highly accurate approximate 
construction have been investigated. 
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A. General Remarks 


Geometric optics is a mathematical theory of 
light rays. It is not concerned with the prop- 
erties of light rays as waves (e.g., their wave- 
length and frequency), but studies their prop- 
erties as pencils of rays that follow three laws: 
the law of rectilinear propagation, the law 

of reflection (i.e., angles of incidence and re- 
flection on a smooth plane are equal (Euclid)), 
and the law of refraction (i.e., if 0 and @’ are 
angles of incidence and refraction of a light 
ray refracted from a uniform medium to a 
second uniform medium and if n, n’ are the 
refractive indices of the first and the second 
medium, respectively, then nsin ĝ =n sinb (R. 
W. Snell, Descartes)). These three laws follow 
from Fermat’s principle, which states that the 
path of a light ray traveling from a point A’ to 
A in a medium with refractive index n(P) at P 
is such that the integral f4. n(P)ds attains its 
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extremal value, where ds is the line element 
along the path. This line integral is called 

the optimal distance from A’ to A. Therefore 
Fermat’s principle can be taken as a foun- 
dation of geometric optics and is, in a way, 
similar to the tvariational principle in particle 
dynamics (Maupertuis’s principle), 


f, /2h—2U(P)ds=0, 


which is satisfied by the path of a particle of 
unit mass having constant energy h passing 
through a field of tpotential U(P). The quan- 
tity ./2h—2U corresponds to the refractive 
index n. 

In an optical system, express the position 
of a point on the path of a light ray by ortho- 
gonal coordinates (x, y, z), and define the tLa- 
grangian L=n./14+x?+ 9? (x =dx/dz, p= 
dy/dz), optical direction cosines p = 0L/0X, q = 
OL/éy, and the fHamiltonian H = xp + ýq — 
L=—./n* —p*—q?. Then the canonical 
equations of the path are obtained as in parti- 
cle dynamics; x, y and p, q are called tcanon- 
ical variables. Due to the variational prin- 
ciple, the integral of the linear differential form 
pdx+qdy—Hdz=a, along a light path is a 
function S(A’, A) of the endpoints A’, A of the 
path, and the optical direction cosines and the 
Hamiltonians of the system at A and 4’ are 
given by 
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Hence we obtain tHamilton-Jacobi differential 
equations 
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As a corollary to these relations we obtain 
Malus’s theorem, which states that a pencil of 
light rays perpendicular to a common surface 
(locally) at a given moment is also perpendic- 
ular to a common surface (locally) after an 
arbitrary number of reflections and refractions. 
Suppose that light rays travel from an ob- 
ject space into an image space through an 
optical apparatus. If all the rays starting from 
any one point of the object space converge to 
a point of the image space and if the mapping 
given by this correspondence is tbijective, then 
we say that this imaging is perfect. Examples 
of perfect imaging systems are realized by 
optical apparatus such as Maxwell’s fisheye 
(having refractive index n(r)=a/(b+r7), where 
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r denotes the distance from the center of the 
system) and Luneburg’s lens (n(r) = ./a—r7*). 
Perfect imaging conserves optical distance, 
yields the relation n(A’)ds' =n(A) ds, and gives 
a tconformal mapping, with the magnification 
inversely proportional to the refractive index. 


B. Gauss Mappings 


Consider an optimal system with a symmetri- 


cal axis of rotation, its optical axis. A ray of 


light that is near the optical axis and has a 
small inclination to the axis is called a paraxial 
ray. A mapping realizable by paraxial rays 
where the canonical variables x, y, p, q can be 
considered to be infinitesimal variables whose 
squares are negligible, is called a Gauss map- 
ping (Gauss map). When the positions of an 
object point and its image under a Gauss 
mapping are represented by homogeneous co- 
ordinates, the mapping is represented as a 
linear transformation, i.e., a tcollineation, 
which maps a point to a point and a line toa 
line. A point in one space corresponding to 
the point at infinity in the other space is called 
a focus. If we take a focus as the origin of a 
coordinate system in each space and use the 
thomogeneous coordinates x; such that x = 
X1/X4, Y= X2/X4, Z= X3/X4, then a Gauss 
mapping can be represented as x, =x}, 

X2 =X}, X3 = fX}, f'x4 =x. The ratio of x to 
x’, ie., the lateral magnification, is x/x’ =z/f = 
f'/z’, where x’ is the length of an object ortho- 
gonal to the axis and x is the length of its 
image. The distance f between a focus and a 
point where the lateral magnification is 1 (such 
a point is called a principal point) is called 
focal length in each space. The telescopic 
mapping, i.e., X4 =X), X2 =X, X3=aXx, 

x4 = bx4, is also a Gauss mapping, in which 
the lateral magnification is constant. 


C. Aberration 


When a mapping is realized not only by par- 
axial rays but also by rays having larger incli- 
nations, a departure from the Gauss mapping 
arises. This departure is generally called aber- 
ration. Suppose that a light ray that passes 
through the point (x’, y’,z’) of a plane perpen- 
dicular to the optical axis at a fixed z’ and has 
optical direction cosines p’, g’ is transformed 
by the optical apparatus into a light ray that 
passes through the point (x, y, z) of a plane 
perpendicular to the optical axis at a fixed z 
and has optical direction cosines p, q there. 
Then by the variational principle, pdx + qdy— 
p dx’ ~q' dy =dW (dW is an texact differen- 
tial). Therefore the transformation (x’, y’, p’, q’) 
—>(x, y, p,q) is a canonical transformation. The 
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mapping can be described by 
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in terms of W and can also be represented in 
terms of V=W+p’'x'+q'y or U=W+ p’x’+ 
q'y —px—qy. For a given optical system, 

one of these functions W, U, V (called a char- 
acteristic function or eikonal) can be used to 
estimate the aberration. By developing such a 
characteristic function in power series of 
canonical variables and observing its terms 

of less than the fifth power, we can single out 
five kinds of aberration: spherical aberration, 
curvature of image field, distortion, coma, and 
astigmatism in a rotationally symmetric opti- 
cal system. To eliminate these aberrations an 
optical system must satisfy Abbe’s sine con- 
dition (the elimination of spherical aberration 
and coma), Petzval’s condition (the elimination 
of astigmatism and curvature of image field), 
and the tangent condition (the elimination of 
distortion). 

The path of a charged particle in an electro- 
magnetic field can be treated in the same way 
as the path of a light ray. Let e represent the 
specific charge of the particle, h the energy, 
Ap the electrostatic potential, and A,, A,, A, 
vector potentials. Then the index of refrac- 
tion is ./2(h—€Ag) + €(A, dx/ds+ A,dy/ds + 
A,dz/ds). In this case, the index of refraction 
shows the anisotropy caused by the existence 
of the magnetic field. The paths of paraxial 
rays are determined by a set of linear differ- 
ential equations of the second order, and the 
Gauss mapping is realized as in geometric 
optics. 
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181 (VI.1) 
Geometry 


The Greek word for geometry, which means 
measurement of the earth, was used by the 


181 
Geometry 


historian Herodotus, who wrote that in an- 
cient Egypt people used geometry to restore 
their land after the inundation of the Nile. 
Thus the theoretical use of figures for practical 
purposes goes back to pre-Greek antiquity. 
Tradition holds that Thales of Miletus knew 
some properties of congruent triangles and 
used them for indirect measurement, and that 
the Pythagoreans had the idea of systematiz- 
ing this knowledge by means of proofs (— 24 
Ancient Mathematics; 187 Greek Mathemat- 
ics). tEuclid’s Elements is an outgrowth of this 
idea [1]. In this work, we can see the entire 
mathematical knowledge of the time presented 
as a logical system. It includes a chapter (Book 
V) on the theory of quantity (i.e., the theory of 
positive real numbers in present-day termi- 
nology) and chapters on the theory of integers 
(Books VII-IX), but for the most part, it treats 
figures in a plane or in space and presents 
number-theoretic facts in geometric language. 

Geometry in today’s usage means the 
branch of mathematics dealing with spatial 
figures. In ancient Greece, however, all of 
mathematics was regarded as geometry. In 
later times, the French word géomètre or the 
German word Geometer was sometimes used 
as a synonym for mathematician. In a fragment 
of his Pensées, B. Pascal speaks of the esprit de 
géométrie as opposed to the esprit de finesse. 
The former means simply the mathematical 
way of thinking. 

Algebra was introduced into Europe from 
the Middle East toward the end of the Middle 
Ages and was further developed during the 
Renaissance. In the 17th and the 18th cen- 
turies, with the development of analysis, geom- 
etry achieved parity with algebra and analysis. 

As R. Descartes pointed out, however, fig- 
ures and numbers are closely related [2]. 
Geometric figures can be treated algebraically 
or analytically by means of fcoordinates (the 
method of analytic geometry, so named by 
S. F. Lacroix [3]); conversely, algebraic or ana- 
lytic facts can be expressed geometrically. Ana- 
lytic geometry was developed in the 18th cen- 
tury, especially by L. Euler [4], who for the 
first time established a complete algebraic 
theory of tcurves of the second order. Previ- 
ously, these curves had been studied by Apol- 
lonius (262-200? B.c.) as tconic sections. The 
idea of Descartes was fundamental to the 
development of analysis in the 18th century. 
Toward the end of that century, analysis was 
again applied to geometry. For example, G. 
Monge’s contribution [5] can be regarded as a 
forerunner of tdifferential geometry. 

However, we cannot say that the analytic 
method is always the best manner of dealing 
with geometric problems. The method of treat- 
ing figures directly without using coordinates 
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is called synthetic (or pure) geometry. In this 
vein, a new field called tprojective geometry 
was created by G. Desargues and B. Pascal in 
the 17th century. It was further developed in 
the 19th century by J.-V. Poncelet, L. N. Car- 
not, and others. In the same century, J. Steiner 
insisted on the importance of this field (— 343 
Projective Geometry). 

On the other hand, the taxiom of parallels 
in Euclid’s Elements has been an object of 
criticism since ancient times. In the 19th cen- 
tury, by denying the a priori validity of Eu- 
clidean geometry, J. Bolyai and N. I. Loba- 
chevskii formulated non-Euclidean geometry, 
whose logical consistency was shown by 
models constructed in both Euclidean and 
projective geometry (— 285 Non-Euclidean 
Geometry). 

In analytic geometry, physical spaces and 
planes, as we know them, are represented as 
3-dimensional or 2-dimensional Euclidean 
spaces E’, E*. It is easy to generalize these 
spaces to n-dimensional Euclidean space E”. A 
“point” of E” is an n-tuple of real numbers (x,, 
..., X,), and the distance between two points 
(Xis 2225 Xah (Yis -3 Yn) 18 (Yi — x1) + +n 
—x,)*)"?. The geometries of E?, E> are called 
plane geometry and space (or solid) geometry, 
respectively. The geometry of E” is called n- 
dimensional Euclidean geometry. We obtain 
n-dimensional projective or non-Euclidean 
geometries similarly. F. Klein [7] proposed 
systematizing all these geometries in group- 
theoretic terms. He called a “space” a set S on 
which a group G operates and a “geometry” 
the study of properties of S invariant under the 
operations of G (— 137 Erlangen Program). 

B. Riemann [6] initiated another direction 
of geometric research when he investigated n- 
dimensional tmanifolds and, in particular, 
*Riemannian manifolds and their geometries. 
Some aspects of Riemannian geometry fall 
outside of geometry in the sense of Klein. It 
was a Starting point for the broad field of 
modern differential geometry, that is, the geom- 
etry of fdifferentiable manifolds of various 
types (— 109 Differential Geometry). 

The reexamination of the system of axioms 
of Euclid’s Elements led to D. Hilbert’s tfoun- 
dations of geometry and to the axiomatic ten- 
dency of present-day mathematics. The study 
of algebraic curves, which started with the 
study of conic sections, developed into the 
theory of algebraic manifolds, the algebraic 
geometry that is now developing so rapidly (— 
12 Algebraic Geometry). Another branch of 
geometry is topology, which has developed 
since the end of the 19th century. Its influence 
on the whole of mathematics today is con- 
siderable (— 114 Differential Topology; 426 
Topology). Geometry has now permeated all 
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branches of mathematics, and it is sometimes 
difficult to distinguish it from algebra or anal- 
ysis. The importance of geometric intuition, 
however, has not diminished from antiquity 
until today. 
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A. History 


H. Minkowski introduced the notions of lat- 
tice and convex set in the talgebraic theory of 
numbers. He developed a simple yet powerful` 
method of arithmetic investigation using these 
geometric notions to simplify the analytic 
theory of tDiophantine approximation, which 
had been developed by P. G. L. Dirichlet and 
C. Hermite. His theory, the geometry of num- 
bers, has continued its development and con- 
tributed to various fields of mathematics (— 
83 Continued Fractions). 


B. Lattices 


Let E” be an n-dimensional Euclidean space 
identified with the linear space R”. For a point 
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P in E”, we denote the corresponding vector in 
R” by v(P)="(x,,.-.,X,). A subset A of E” is 
called an n-dimensional (homogeneous) lattice 
if there exists a basis {v,,...,v,} of R” such that 
A={PEeE|v(P)= 2X7, 4,0,, AiE Z}. The set of 
points {X,,...,X,} such that v(X;)=0, (i= 
1,...,m) is called a basis of the lattice A. A 
typical example of a lattice is the point set 
corresponding to Z” in R”. The tfree module 
generated by v; (i=1,...,n) is denoted by A* 
and is called the lattice group of A. We have A* 
= {veR"|v=v(P), PEA}. If {u,,...,u,} is an- 
other basis of the free module A*, then there 
exists an element («;;) of GL(n, Z) (ie., aj€Z 
and |det(«;)|= 1) such that u;= L7_, yti. 
Hence the quantity |det(v,, ...,v,)| is inde- 
pendent of the choice of the basis {v,,...,v,}. 
We denote this quantity by d(A) and call it the 
_ determinant of the lattice. We denote the mini- 
mum distance between the points belonging to 
A by d(A). 

A subset L of the space E” is called an in- 
homogeneous lattice if there exists a homo- 
geneous lattice A in E” and a point P, in E” 
such that L={PeE"|v(P)—v(Py)eA*}. Thus 
an inhomogeneous lattice is obtained from a 
homogeneous lattice by translation. In this 
article we restrict ourselves to the case of 
homogeneous lattices and henceforth omit the 
adjective “homogeneous.” 

Suppose we are given a sequence of lattices 
A,, Ag,..., in E” with bases {X$9}, {XP}... 
If the sequence of points X converges to 
X; (i= 1,...,n) and the set {X,,..., X,} forms 
a basis of a lattice A, we call A the limit of 
the sequence {A,}; we also say that the se- 
quence {A,} converges to the lattice A. In this 
case we have d(A,)—>d(A), 6(A,)6(A). The 
notion of convergence of lattices gives rise to a 
topology of the space M, of all the lattices in 
E. A sequence {A,} of lattices is said to be 
bounded if there exist positive numbers c and 
c’ such that d(A,)<c, 6(A,)>c’ for all v. A 
bounded sequence of lattices has a convergent 
subsequence. 

Let S be a subset of the space E”. A lattice A 
is called S-admissible if we have AN S'= {0}, 
where S' is the interior of S and O is the origin. 
We denote the set of S-admissible lattices by 
A(S). Given a closed subset M of M,, we put 
A(S\M)= inf. asyamyq@(A) if A(S)A M is non- 
empty, while if A(S)N M is empty, we put 
A(S\M)= œ. When M = Mp, we write A(S\M) 
= A(S) and call it the critical determinant of S. 
Generally, a lattice A in M is said to be critical 
in M with respect to S if Ac A(S) and d(A)= 
A(S\M). Suppose that we have 0<A(S\M)< 
oo. Then for a lattice critical in M with respect 
to S to exist it is necessary and sufficient that 
there exist a bounded sequence {A,} such that 
A, E MNA(S) and d(A,)>A(S\M). 
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C. Successive Minima and Minkowski’s 
Theorem 


A subset S of the space E” is called a bounded 
star body (symmetric with respect to the 
origin) if there exists a continuous function F 
defined on the space E satisfying the following 
four conditions: (i) F (0)=0; (ii) if X #0, then 
F(X )>0; (iii) for an arbitrary real number t 
and a point X, we have F(tX )=|t|F(X); (iv) 
S={X|F(X)<1}. A bounded closed tcon- 
vex body that is symmetric with respect to the 
origin is a bounded star body. If we are given 
a star body S, the associated function F, and a 
lattice A, there exist a set of points {P,,..., Pa} 
in A and a set of positive numbers {p,,...,p,} 
satisfying the following four conditons: (1) 
v(P,), ..., v(P,) are linearly independent; (2) 
F(P,)=p; (i= 1, -..,1); 3) Di < «.. <pys (A) if P is 
a point in A and v(P) is not contained in the 
subspace spanned by {v(P,), ...,v(P,,-,)}, then 
F(P)>p,. The set {p,,...,p,} is uniquely 
determined by S and A. The numbers p; are 
called the successive minima of S in A; the 
points P; are the successive minimum points of 
Sin A. 

Minkowski’s theorem: Let A be a lattice in a 
Euclidean space E” and S a bounded subset of 
E”. Then we have the following: 

(I) If the volume V(S) is larger than d(A), 
then there exist points X, and X, in S such 
that X, # X, and v(X,)—v(X,)eA*. Suppose, 
moreover, that S is convex and symmetric with 
respect to the origin. Then, if V(S)>2"d(A), 
there exists a point X in SNA different from 
the origin. Hence we have 2"A(S) > V(S) (n= 
dim E”). 

(II) Let S be a bounded closed convex body 
that is symmetric with respect to the origin, 
and let p,,...,p, be the successive minima of S$ 
in A. Then we have p; ... pa` V(S)<2"d(A). 


D. Minkowski-Hlawka Theorem 


Suppose that we are given a subset S of the n- 
dimensional Euclidean space E” such that the 
characteristic function y(X ) of S is tintegrable 
in the sense of Riemann. Then we have the 
Minkowski-Hlawka theorem: (i) If n>2 and S 
is open, then A(S)< V(S), and (ii) if, moreover, 
S is symmetric with respect to the origin, 

then 2A(S) < V(S); (iii) if S is a symmetric 

star body with respect to the origin, then 
2¢(n)A(S) < V(S), where {(n) is the tRiemann 
zeta function. 

A proof for the theorem was given by E. 
Hlawka (1944); (tii) and (iv) were conjectured 
by Minkowski. C. L. Siegel obtained another 
proof (1945), and C. A. Rogers simplified the 
original proof by Hlawka (1947). There are 
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results concerning the estimation of A(S)/V(S) 
for various subsets S. 


E. Siegel’s Mean Value Theorem 


In an attempt to obtain a proof for the latter 
half of the Minkowski-Hlawka theorem, Min- 
kowski observed the necessity of establishing 
the arithmetic theory of the linear transfor- 
mation groups. Siegel was inspired by this ob- 
servation and obtained the following theorem, 
Siegel’s mean value theorem, which implies the 
Minkowski-Hlawka theorem: Let F be a tfun- 
damental region of the group SL(n, R) with 
respect to a discrete subgroup SL(n, Z). Let œw 
be the tinvariant measure on SL(n, R) such 
that f dc =1(— 225 Invariant Measures). Let 
f be a bounded Riemann integrable function 
with compact tsupport defined on the space 
R”. Note that the lattice Z” is stabilized by the 
subgroup SL(n, Z). We have 


> fa ndoto=| f(x)dx, 
F xez" R” 


x 


where the right-hand side of the equation is 
the usual Riemann integral of the function f. 
A. Weil considered this theorem in a more 
general setting (Summa Brasil. Math., 1 (1946)). 


F. Diophantine Approximation 


Minkowski initiated the notion of Diophan- 
tine approximation in reference to the prob- 
lem of estimating the absolute value | f(x) of a 
given function f, where x varies in Z or ina 
given ring of talgebraic integers. (A fDiophan- 
tine equation is an equation f(x)=0, where x 
varies in Z.) Today Diophantine approxima- 
tion (in the wide sense) refers to the investiga- 
tion of the scheme of values f(x), where x 
varies in a suitable ring of algebraic integers. 
The geometry of lattices is a powerful tool in 
this investigation. A typical problem in this 
field of study is that of approximating irra- 
tional numbers by rational numbers; here 
tcontinued fractions play an important role (— 
Section G). For the problem of uniform distri- 
bution considered by H. Weyl, the analytic 
method, especially that of trigonometric series, 
is useful (— Section H). Dirichlet’s drawer 
principle (to put n objects in m drawers with 
n>m, it is necessary to put more than one 
object in at least one drawer) is one of the 
basic principles used in the theory of Diophan- 
tine approximation. Recently, the theory has 
been applied to the theory of ttranscendental 
numbers and the theory of tDiophantine 
equations. 
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G. Approximation of Irrational Numbers by 
Rational Numbers 


Given an irrational number 0, we have the 
problem of finding rational integers x (> 0) 
and y such that |0 — y/x|]<eé/x, where € is a 
given positive number. Suppose that we are 
given a positive integer N. Using Dirichlet’s 
drawer principle we can show the existence of 
x (<N) and y such that |6—y/x|<1/xN. Let 
M(@) be the supremum of positive numbers 
M such that the inequality |@—y/x|<1/Mx? 
holds for infinitely many pairs of integers x, y. 
We have 1 < M(0) (< œ). Two irrational num- 
bers 6 and @ are said to be equivalent if there 
exists an element (a;)€ GL(2, Z) such that 

0 =(a,,0+4,,)/(a,,;9+a,,). In this case we 
have M(0)=M'(@). 

If the irrational number 0 satisfies the qua- 
dratic equation aĝ? +b0+c=0 (a,b,c are 
rational integers), then we have M(0)= 
k! /b* —4ac, where k= min{ax? + bxy+ 
cy*|x, yeZ, x #0, y 0}. In general, for an 
irrational number 0 of degree two, we have 
M(0)>./5. The equality M(0)=./5 holds if 0 
is equivalent to 0, =(1+,/5)/2. If @ is not 
equivalent to 0,, then M(6)>./8; the equality 
holds if @ is equivalent to 0, = 1 + ,/2. Simi- 
larly, we have 63, 04, ...; and M(@,)3 (n> œ). 
If M(@) <3, there exists a @, such that @ is 
equivalent to @,. The set of irrational numbers 
0 satisfying M(0)=3 is uncountable (A. A. 
Markov [13]). We have no information about 
M (06) for the general algebraic irrational num- 
ber 0. Let (0) be the supremum of real num- 
bers u such that the inequality |9 — y/x|<1/x* 
holds for infinitely many pairs of integers x, y. 
Given a number x > 2, it can be shown that the 
tLebesgue measure of the set of real numbers 0 
such that u(0) 2 x is zero. If @ is a real alge- 
braic number of degree n, then p(0)<n (J. 
Liouville). Concerning u(0), results have been 
obtained by A. Thue, Siegel, A. O. Gel’fond, 
and F. J. Dyson. K. F. Roth (1954) proved that 
u(0)=2 (Roth’s theorem [12]), which settled 
the problem of (8). Roth’s theorem means 
that if x is larger than 2, then there exist only a 
finite number of pairs x, y satisfying |0 — y/x| 
<1/x*. This can be generalized to the case of 
the approximation of an element @ that is 
algebraic over an A-field k by an element of 
the field k (S. Lang [9]). (An A-field is either an 
algebraic number field of finite degree or an 
algebraic function field in one variable over a 
finite constant field.) 

In 1970 W. M. Schmidt [20] obtained 
theorems on simultaneous approximation 
which generalize Roth’s theorem. Thus, if 
Xj -..,%, are real algebraic numbers such that 
1, %,,...,%, are linearly independent over the 
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field of rational numbers, then for every e>0 
there are only finitely many positive integers q 
with 

lazil- aan lg <1, 


where || é || denotes the distance from a real 
number é to the nearest integer; in particular, 
we have 


la;,—p/gi<q ETDE, i=1,...,7, 


for only finitely many n-tuples of rationals 


P1/q,---,P,/q- A dual to this result is as follows. 


Let «,;,...,4,, € be as before. Then there are 
only finitely many n-tuples of nonzero integers 
dis -<--> n With 


eS Ps Fann lg gnl E<. 


This last theorem can be used to prove that if 
a is an algebraic number, k a positive integer, 
and ¢>0, then there are only finitely many 
algebraic numbers w of degree at most k such 
that |x— w| < H(w) * '-*, where H(c) denotes 
the height of œ. See also [16,22]. 

The work of Thue, Siegel, and Roth had the 
basic limitation of noneffectiveness. A. Baker 
(1968) succeeded in proving that for any alge- 
braic number @ of degree n>3 and any k> 
n, there exists an effectively computable 
number c=c(6, «)>0 such that |8 — y/x|> 
cx "exp(log x)’/* for all integers x, y (x>0) 
[15]. This result is an immediate consequence 
of the following effective version of a classical 
theorem on binary Diophantine equations 
(Thue, 1909): Let f= f(x, y) be an irreducible 
binary form of degree n >3 with integer coeffi- 
cients, and suppose that «x >n. Then for any 
positive integer m, all integer solutions x, y of 
the equation f(x, y)=m satisfy max(|x], |y] < 
cexp(logm)*, where c > 0 is an effectively com- 
putable number depending on n, x, and the 
coefficients of f. Baker obtained this result by 
making use of his theorems which give effec- 
tive estimates of moduli of linear forms in the 
logarithms of algebraic numbers with alge- 
braic coefficients. A typical theorem reads as 
follows: Let «,,...,%, be nonzero algebraic 
numbers with loga,,...,loga, linearly inde- 
pendent over the rationals, and let Bo,..., Ba be 
algebraic numbers, not all 0, with degrees and 
heights at most d and H, respectively. Then for 
any k>n+ 1, we have |o + fi loga +... + 
B, loga,,| >cexp(—(log H)*), where c>0 is an 
effectively computable number depending only 
on n, K, loga,,...,loga,, and d [14, 19]. 

Results of this kind have many important 
applications in number theory. For instance, 
we obtain a generalization of the Gel’fond- 
Schneider theorem on transcendental num- 
bers. Furthermore, the imaginary quadratic 
number fields of class number 1 can be com- 
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pletely determined on the basis of Baker’s 
result. This was actually done by Baker (1966) 
and independently by H. M. Stark (1966) 

(— 347 Quadratic Fields). 

Refinements and generalizations of Thue’s 
theorem on the finiteness of solutions of bi- 
nary Diophantine equations have been ob- 
tained by Baker and his collaborators. (— 
[17], and also [16]). Also, p- and p-adic ana- 
logs of Baker’s results are known [18]. 

There are a number of unsettled problems 
on the irrationality of particular numbers, 
such as the tEuler constant C, 2°, or ((2k +1) 
with k a positive integer. R. Apéry (1978) 
proved that there exist a positive number € 
and a sequence of positive integers {q,} such 
that 0< ||q,0(3)||<e ™ for all n>1, so that €(3) 
is irrational. 


H. Uniform Distribution 


Let 0 be a real number, x a positive integer, 
and [0x] the maximum integer not larger than 
0x. We write (@x)= 0x —[0x] =6x(mod 1). 
Jacobi showed that if 0 is irrational, then 

the set {(0x)|xeN} is densely distributed in 
the interval (0, 1) (N is the set of positive in- 
tegers). In general, let f be a real-valued func- 
tion defined on N. We say that f(x) (mod 1) is 
uniformly distributed in the unit interval, or 
f(x) is uniformly distributed (mod 1), if the 
following condition is satisfied: Let «, B be an 
arbitrary pair of real numbers such that 0< 
a<f<1,and let N be a given positive integer. 
Let T(N) be the number of positive integers x 
such that x < N, a<(f(x)) <P, where (f(x))= 
SQx)—Lf(x)]. Then limy.,,, T(N)/N = B—a. In 
order for f(x) (mod 1) to be uniformly dis- 
tributed, it is necessary and sufficient that 
limy_., NIEN e?" =0 for any non- 

zero integer h (Weyl’s criterion, 1914). Weyl 
proved that if 0 is an irrational number, then 
6x (mod 1) is uniformly distributed. 

The following theorem, given by J. G. van 
der Corput, is often useful: Let f be a real- 
valued function defined on N. Consider the 
function f,(x)= f(x + h)— f(x) for an arbitrary 
positive integer A. If f,(x) (mod 1) is uniformly 
distributed (mod 1) for all such h, then f(x) 
(mod 1) ts also uniformly distributed (mod 1). 

Utilizing this theorem, it can be shown that 
if f(x)=0,x" + 0,.,x"' +...4+6,x, where at 
least one of the coefficients 8; is irrational, then 
f(x) (mod 1) is uniformly distributed. 

The notion of uniform distribution of se- 
quences of real numbers has an analog in 
compact Hausdorff spaces and in various top- 
ological groups. A systemmatic treatment of 
such generalized notions of uniform distri- 
bution can be found in [21]. 
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Mathematics that treats, by using functional 
analytical techniques, various problems con- 
cerning the tcalculus of variations, tsingular- 
ities, infinite-dimensional Lie groups, or tnon- 
linear partial differential equations, such as 
equations of fluids or of gravitation in general 
relativity, may be called global analysis if it 
uses as a main tool an infinite-dimensional 
version of differential geometry and topology 
analogous to that for finite-dimensional mani- 
folds. The term global analysis therefore has no 
precise definition. However, it can be said that 
it is analysis on manifolds, and the concept of 
tinfinite-dimensional manifolds is the central 
abstract idea in it. 

Suppose one considers a nonlinear differen- 
tial operator on a finite-dimensional manifold. 
Then by using functional analytical techniques 
it often happens that its domain is neither a 
linear space nor its open subset but an infinite- 
dimensional manifold, and that such a non- 
linear operator can be regarded as a tdifferenti- 
able mapping between infinite-dimensional 
manifolds. The tdifferential at a point in that 
source manifold is called a linearized operator, 
to which one can apply various theories of 
linear functional analysis. 

“Global analysis” seems to have first ap- 
peared in the literature in the late 1960s [1,2]. 
However, the phrase “infinite-dimensional 
manifold” has been used widely since about 
1960. By that early date, local theories of 
infinite-dimensional manifolds, sometimes 
called “general analysis,” such as definitions of 
differentiability, the timplicit function theorem, 
tTaylor’s theorem, the existence and unique- 
ness of tordinary differential equations, and 
the tFrobentus theorem, had already been 
established in Banach spaces [3]. Therefore, 
with these regarded as a local theory, the con- 
cept of infinite-dimensional manifolds could be 
defined naturally, and the concept of infinite- 
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dimensional Lie groups as well [4]. A few 
years later, R. Palais and S. Smale [5] and J. 
Eells and J. Sampson [6] showed that such 
concepts are useful in the calculus of varia- 
tions, and R. Abraham and J. Robbin [7] re- 
marked that transversality theorems initiated 
by R. Thom can be easily proved by using an 
infinite-dimensional version of tSard’s theorem 
[8]. 

The so-called tAtiyah-Singer index theorem 
[9], announced in 1963, gave impetus to the 
field as people sought the theorem’s most 
natural expression; the work finally resulted in 
the theorem classifying separable tHilbert 
manifolds by homotopy type (— 105 Differen- 
tiable Manifolds). 

After the appearance of these theorems an- 
nouncing the nonexistence of differential topol- 
ogy on separable infinite-dimensional Hilbert 
manifolds, global analysis moved toward 
more concrete problems and applications to 
various branches of mathematics. However, 
many of these applications are formulated not 
on tBanach or Hilbert manifolds, but on mani- 
folds modeled on Fréchet spaces (*Fréchet 
manifolds) or on nuclear spaces. For such 


situations we have in general no local theories. 


Neither the implicit function theorem nor the 
Frobenius theorem holds on such manifolds. 
However, since these theorems are crucial for 
nonlinear problems, various kinds of sufficient 
conditions for the validity of these theorems 
are being studied by many people (— 286 
Nonlinear Functional Analysis). 

As for the calculus of variations, Yamabe’s 
problem is being studied extensively, for this 
seems to be a typical problem not satisfying 
the so-called *Condition C. The original paper 
of H. Yamabe [13], insisting that every com- 
pact Riemannian manifold can be conformally 
deformed into a manifold of constant scalar 
curvature, contains a serious gap, and the 
problem is still open, though many cases are 
known where the statement holds (— 364 
Riemannian Manifolds H). The harmonic 
mappings defined in [6] are also being studied 
extensively (— 195 Harmonic Mappings). 

In differential geometry or the fgeneral 
theory of relativity definitions of various cur- 
vatures, such as Riemannian, Ricci or scalar, 
or Gauss, can be sometimes regarded as non- 
linear differential equations on manifolds. 
Proof of the global existence of solutions to 
these equations has long been sought, and 
several theorems have recently appeared [14- 
16]. 

Infinite-dimensional groups such as GL(E), 
or GL,(E) with uniform topology, are called 
tBanach-Lie groups, and they have been 
studied in the operator calculus. On the other 
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hand, infinite groups studied by S. Lie and 
E. Cartan were in fact infinite-dimensional 
germs of transformation groups defined on a 
neighborhood of a point in a manifold. These 
were not groups in the strict sense. Recently, 
H. Omori [17] has given a definition of ab- 
stract infinite-dimensional Lie groups that in- 
cludes Banach-Lie groups and many infinite- 
dimensional transformation groups studied 
by Cartan. An application of these groups to 
fluid dynamics can be found in [18]. More- 
over, tunitary representation theories of these 
groups are now being constructed [19, 20]. 
Though global analysis consists at pre- 
sent of a rather disorganized combination of 
many nonlinear problems in analysis on mani- 
folds and in mathematical physics, it is never- 
theless one of the most active branches of 
mathematics. 
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Gödel, Kurt 


Kurt Gödel (April 28, 1906—January 14, 1978) 
was born in Brno, Czechoslovakia (at that 
time Briinn, Austria-Hungary). He studied 
mathematics and physics at the University 

of Vienna, where he took the Ph.D. degree 

in 1930. After he had taught mathematics at 
the University of Vienna from 1933 to 1938, 
he was invited to the Institute for Advanced 
Study at Princeton, where he became professor 
in 1953. In 1976, he was named professor 
emeritus; he died in Princeton in 1978. 

Gödel contributed important fundamental 
results covering all aspects of mathematical 
logic. Among his famous works are the proof 
of the fcompleteness of the first-order predi- 
cate calculus, the incompleteness of the con- 
sistent axiomatic system containing Peano’s 
arithmetic (incompleteness theorem), and the 
tconsistency of the axiom of choice and the 
generalized continuum hypothesis. He also 
introduced the notion of frecursive functions 
and found the Gödel solution of Einstein’s 
equations of relativity. In addition to mathe- 
matical works, he left philosophical papers on 
set theory and the foundations of mathematics. 
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185 (1.8) 
Godel Numbers 


A. General Remarks 


K. Gödel [1] devised the following method to 
prove his incompleteness theorems (— Section 
C). 

Let S be a tformal system. In this article, we 
call its basic symbols, tterms, tformulas, and 
formal proofs the “constituents” of S. Let g be 
an tinjection from the constituents of S into 
the natural numbers satisfying the following 
two conditions: (1) Given a constituent C, we 
can compute the value g(C) in a finite number 
of steps. (2) Given a natura! number n, there 
exists a finitary procedure to find out whether 
there exists a constituent C of S such that 
g(C) =n; furthermore, when such a C exists, it 
can actually be specified in a finite number of 
steps. 

If such a mapping g is given for the system 
G, then the mapping g is called a Gödel num- 
bering and the number g(C) is called the Gédel 
number of the constituent C (with respect to g). 


B. An Example of Godel Numbers 


(1) Let w),%,,... be the basic symbols of ©. 
With each «; we associate a distinct odd num- 
ber q;: glx) =q; (i=9, 1, ...). (2) Let F be a con- 
stituent of S. If F is constructed from any 
other constituents Fo, F,,..., F of S by a rule 
peculiar to S (for convenience we write this 

F =(Fo, F,,..., F,)), and if, for each F;, g(F;) is 
already defined, then we put g(F)=<g(F), 
g(F;), ..., 9(F,)>, where (ao, ay, ..., apò 
denotes the number pæ pî! ... pt (p; is the . 
(i+1)st prime number). For example, suppose 
that S contains 0, =, v; (variables), and 

“| (negation) among the basic symbols, and let 
their Gödel numbers be 7, 9, 11/71, and 13, 
respectively. Since the formula 7 (O=v,) can 
be analyzed in the form (7, (0, =, v,)), its 
Gödel number is <13, (7,9, 111+} >>. For 
details — [1,4]. 


C. Gédel’s Incompleteness Theorems 


By means of a Göde! numbering any meta- 
mathematical notion about the constituents of 
a formal system S can be interpreted as a 
number-theoretic notion. For example, the 
notion “formula” is interpreted as the number- 
theoretic predicate Form(x) which means 

that x is the Gödel number of a formula. The 
provability of a formula is interpreted as the 
number-theoretic predicate Prov(x), which 
means that x is the Godel number of a prov- 
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able formula; accordingly, for any formula A 
of S, the proposition Prov(g(A)) means that A 
is provable. This interpretation is called the 
arithmetization of metamathematics. 

Let the formal system S include formal 
elementary number theory. Then many meta- 
mathematically useful number-theoretic predi- 
cates can be expressed by the respective for- 
mulas of ©; for example, there exist formulas 
Form(x) and Prov(x) expressing the predicates 
Form(x) and Prov(x), respectively. Further- 
more, Gödel proved the existence of a closed 
formula U such that the formula 
71Prov(g(U))< U is provable. This closed 
formula U is one of the so-called formally 
undecidable propositions, and in fact it is shown 
that neither U nor TIU is provable in G if S is 
consistent in a strong sense. This result is 
called Gédel’s first incompleteness theorem. 

By use of the formulas Form(x) and Prov(x) 
the consistency of the formal system © is 
expressed by the formula Consis which is an 
abbreviation of 4x(Form(x)A 7 Prov(x)). 
Godel obtained the result that the formula 
Consis is not provable in S if S is consistent, 
on the basis of the following three facts: (1) U 
is not provable in © if S is consistent; (2) 
Consis— 1Prov(g(U)) is provable in S; (3) 
1Prov(g(U))- U is not provable in S. This 
result is called Gédel’s second incompleteness 
theorem. 

The method of arithmetization is important 
and useful in the study of mathematical logic. 
The notion of Godel numbers of recursive 
functions is one of its applications (— 356 
Recursive Functions). 


D. Tarski’s Theorem Concerning Truth 
Definitions 


Let a consistent formal system S and a tmodel 
of S be given. By means of a Gödel number- 
ing, the truth notion of closed formula is inter- 
preted as the number-theoretic predicate 

Tr(x) which means that x is the Gödel number 
of a true closed formula; accordingly, for any 
closed formula A of S, the proposition 
Tr(g(A)) means that 4 is true. 

In relation to the foregoing fact, if there 
exists a formula Tr(x) of a single variable and 
the formula Tr(g(A))< A is provable for every 
closed formula A, then that formula Tr(x) is 
called a truth definition. A. Tarski proved the 
fact that there is no truth definition in © if the 
formal system S is consistent. 
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186 (XVI.12) 
Graph Theory 


A. Overview of Graph Theory 


Two aspects of graphs are the object of graph 
theory. One is that a graph expresses a binary 
relation over a set V, and the other is the fact 
that a graph ts a CW-complex of 1 dimen- 
sion, an object of study in algebraic topology. 
Because of its special natural as an object of 1 
dimension, we can consider various concrete 
properties in detail. Hence graph theory has 
close connection with other areas, such as 
network theory, system theory, automata 
theory, and the theory of computational pro- 
cesses, and it has many useful applications. 
The notion of a “graph” as currently used in 
graph theory was first discussed by L. Euler 
[1]. It is said that J. J. Sylvester coined the 
word “graph” as we presently understand it. 
However, up to now, there were few unifying 
principles, and graph theory seemed like a 
large collection of miscellaneous problems and 
ad hoc techniques. The terminology has not 
yet been standardized; different usages prevail 
in different schools. The resulting confusion 
can be seen in references books such as [2]- 
[5]. Recently, infinite graphs have also been 
studied. But here we restrict ourselves to finite 
graphs, since only they give a typical theory. 


B. Definition of Graph 


The notion of a graph G=(V,E,d*,07)isa 
composite notion of two finite sets V and E 
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and two maps 07 :E>V and d:E>V. An 
element of V is called a vertex, and an element 
of E is called an edge. The maps ô+ are called 
incidence relations. The terms point, or node 
instead of vertex, and arc, line, or branch in- 
stead of edge have also been used (sometimes 
with slightly different meanings). For an edge 
ecE, tee V is called the initial vertex and 

0” e€V is called the terminal vertex. Both are 
called the end vertices of e. The inverse map 6+ 
of ðt: V2 is defined by tv = fec E|dte= 

v} and has the following properties: (i) If 

v#v, then d*vNd*v'=6 vNd v = Ø. (ii) 
ver O*v=| yey 0 v= E. Conversely, if the 
maps 6*:V-+2¥ have the properties (i) and (ii), 
then there exist corresponding maps ô+. For 

a vertex ve V, |5* v] is called the outdegree or 
positive degree, |d v| is called the indegree or 
negative degree, and the sum |6*v|+|6d v] is 
called the degree. We always have the relation 
Lev lO vl=Zrev 1d vf =lE|, and Epey l(lô* v| + 
|S v])=2|E£|. The number of vertices with 

odd degree is always even. Two end vertices of 
an edge are called mutually adjacent, and two 
edges with at least one common end vertex are 
also called mutually adjacent. An edge satisfy- 
ing ô+ e= 07e is called a self-loop, and a vertex 
satisfying ĝ*v =ô“ v= Ø is called an isolated 
vertex. 

When permutation groups P, operating 
over V and P; operating over E are given, we 
can naturally define the permutation (zy, zp) 
over the graph G(zyé Pp, nge Pp). A graph is 
classified into equivalence classes by Pp, Pr. 
When P,, Pg are all the permutations of V, E, 
respectively, each equivalence class is called an 
unlabeled graph, whereas when P,, Pg consist 
only of an identity transformation, G is called 
a labeled graph. 

For a given graph G=(V,F,6*,6 )anda 
subset E’ c E, we can define the reoriented 
graph of edges in E’ by G'=(V, E, 8'*, 8'7), 
where 6’ *e=6*e if e¢E' and ð'te=0Fe if 
ec E’. We have an equivalence relation by 
identifying the reoriented graphs. Each class 
by this equivalence relation is called an un- 
directed graph or an unoriented graph. In con- 
trast, the graph in the original sense is called a 
directed graph or an oriented graph. 


C. Examples of Special Graphs 


(1) A complete graph is a graph such that there 
exists one and only one edge with end vertices 
v and v for every different pair of vertices v 
and v’. (2) A bipartite graph is a graph with a 
partition V’ UV~ =V such that VE NV =Ø 
and for every ee E, we have 0*eeV* and 

ô eeV . If there always exists an edge e with 
À 


ĝ*e=v, 6 e=v' for every pair ve V* and 
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v'eV , it is called a complete bipartite graph. 
(3) A regular graph is a graph whose degree 

at each vertex is the same. (4) A partial graph 
is defined as follows: Let F’ be a subset of E 

in a graph G=(V,E,0*,é7). The graph G' = 
(V’,E’,0'*, 6" ) is called a partial graph of 

G, where V’'=6* E'U E’, and 6’ are the 
restrictions of 6* on E’, respectively. When 
E’=E, the partial graph is the graph obtained 
by deleting all isolated vertices from G. Simi- 
larly, for a subset V’ of V, the graph G” = 

(V", E",6"* a"), defined by E”=5*V"U6" V” 
and with 6”* being the restriction of d+ on E”, 
is called a section graph of G. 


D. Representation of a Graph 


When we want to process by computer a 
problem concerning a graph, we must repre- 
sent a graph in a suitable form (— 96 Data 
Processing). A commonly used method is the 
following list representation: Let V= {v,, v3, 
.,Uy} and E={e,,e,,...,e,}. For each vE V 
(a= 1,...,M), we arrange the edges in 6*v, 
and in 6 v, in suitable orders. Then (i) for 
each v,€V, record |6*»,|,|6-v,|, B}, BY, B3, 
and B; , where B and B+ are the first and 
the last index of the edges in 6*v,, respectively. 
We define the corresponding values to be 0 if 
the 6*v, are empty. (ii) For each e, € E, record 
ô, ôr, Ei, E} , E} ,and Ez, where d+ are 
the number of vertices 6*e,, and Et, E+ are 
the number of edges immediately after or 
before e, among the edges with the same initial 
(for +) or terminal (for ~) vertex as that of e,. 
If e, is the last or the first among such edges, 
we define the corresponding values to be 0. 


E. Deformation of a Graph 


Let G=(V,E,0*,0-) bea graph. A graph ob- 
tained by opening an edge e is the graph G’ = 
(V, E— {e}, 0'*, 8"), where 0’* is the restric- 
tion of ô+ on E—{e}. The graph obtained by 
deleting all isolated vertices from G’ is a partial 
graph of G over E— {e}. A graph obtained by 
shortening an edge e is the graph G” =(V", E — 
{e},0"*, 0"), where V" =(V—{d*e, 0 e})U 
{ô}, 6 being a new vertex not contained in V, 
and 6”* =@-0+, gp: V>V’” being defined by 
p(v)=v for v#0*e, and =6 when v=" e, 

or v= e,. A graph obtained by opening 

and shortening several different edges is simi- 
larly defined, and the result is independent 

of the order of opening or shortening process. 
A graph obtained by opening edge(s) or by 
shortening edge(s) or by both processes is 
called a subgraph, a contraction, or a sub- 
contraction, respectively. 
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F. Connectedness 


Let G=(V,E,0*,é ) be a graph, and let v, and 
vg be two vertices. A path from v, to vg of 
length / is a sequence P=(v,=0,,,€1€«, Va,» 


Ezka eo EEx Va = Ug), Where £; are +1 or —1, 

and for every i=1,...,, we have 0° ep, = Va > 
Bit ide tent aye ee 

O eki = Va if E= +1, and 6° ep, = Vap O Ck T Vni, 

if &= —1. When v,= vg, it is called a closed 


path considering P as a cyclic sequence. If 

in the sequence P of a path no edge appears 
more than once, it is called a simple path. 
Similarly, if no vertex appears more than once, 
it is called elementary. If all ¢;= +1, it is called 
a direct path. Direct closed paths, etc., are 
similarly defined. 

If we define v~v by the existence of a path 
from v to v’, this ~ is an equivalence relation. 
Let us denote the equivalence classes by V,, 
...,V,. The section graph G;=(V,, E;, 6; , 0; ) 
determined by V, is called a connected com- 
ponent, or simply a component, of G. The sets 
E,,...,E, are mutually disjoint and the union 
is E. If we denote by vv’ the existence of a 
direct path from v to v’, the relation —> is a 
tpseudo-order. The relation vv defined by 
vv’ and v >v is an equivalence relation in V, 
and the equivalence classes V,,..., V or the 
section graphs G,=(V,, E,, 6*, ô; ) determined 
by V, are called strongly connected components 
of G. E,,..., E, are mutually disjoint, but their 
union is not always E. Among the sets V,, 

..., V, we can define an order relation V,->V; 
by the existence of ve Ñ, ve V,, v—v’, Classi- 
fication by © is a refinement of that by ~. 
When s= 1, the graph G is called connected, 
and when t= 1, G is called strongly connected. 

For a different pair v, v'e V, a set SoV— 
fv, v'} is called a separator of v and v if every 
path from v to v’ contains at least one vertex of 
S. When every separator S for any pair v, v’ 
has at least k elements, the graph G is called k- 
connected. For k > 3, there are several vari- 
ations of the definition of k-connectedness. 1- 
connectedness is equivalent to connectedness 
in the sense defined above. 

A simple path containing all edges of a 
graph is called an Euler path. Euler direct 
paths, etc., are similarly defined. A graph with 
an Euler path is called an Euler graph. A graph 
is an Euler graph if and only if (i) it is con- 
nected, and (ii) the number of vertices with 
odd degrees is 0 or 2 (Euler’s unicursal graph 
theorem [1]). Similar results are known for 
Euler closed paths or Euler direct paths. An 
elementary path containing all vertices of 
a graph is called a Hamilton path. Hamil- 
ton closed paths. etc., are similarly defined. 
The criteria for the existence of a Hamilton 
path for any given graph is unknown. This is 
known to be an tNP-complete problem (— 71 
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Complexity of Computations). Various suffi- 
cient conditions or necessary and sufficient 
conditions for special graphs have been given 
(> e.g., [6]). 


G. Tieset, Cutset, Tree, and Cotree 


For a graph G=(V,E,0",0 ), we define its 
incidence matrix [D% (a=1,..., M (=|V|),«= 
1,...,n(=|E]|))] by defining D% to be 1 if v,= 
ôte #0 ep, —1 if v, = e,4#0*e,, and 0 if 
te, =0 e, Or e fv, UÂ v, Similarly we 
define its adjacement matrix [T (2, f=1,..., 
M)] by defining T4 to be 0 if v, and v, are not 
mutually adjacent and 1 if v, and v, are mutu- 
ally adjacent. 

A set of edges forming a closed path is called 
a tieset. A set of edges of the form {e|d* ee W, 
0 eeV—W}Uf{eld ee W,0'eeV—W} fora 
partition (W, V — W) of V is called a cutset. A 
maximal subset of edges containing no tiesets 
is called a tree, and a maximal subset of edges 
containing no cutsets is called a cotree. A tree 
is sometimes called a spanning tree of G. Every 
tree is the complement (with respect to E) of a 
cotree, and vice versa. 

The number of elements of a tree is always 
the same and equal to the trank m of the in- 
cidence matrix. This value m is called the rank 
of the graph G. Similarly, the number k of 
elements of a cotree is always the same, which 
is called the nullity or the cyclomatic number of 
the graph G. Always, k=n—m. 

Let K be a field, K”, K™ be vector spaces 
over K of dimensions n and M, respectively, 
and K"™, K™* be the tdual spaces of K”, K™. 
The incidence matrix defines two mutually 
contragradient linear mappings ô: K">K™ 
and 6:K™*-— K" with respect to their canon- 
ical bases. A minimal set among the family of 
supports (in E) of nonzero vectors in the kernel 
Ker 0 is a tieset corresponding to an elemen- 
tary closed path. Similarly, a minimal set 
among the family of supports (in E) of nonzero 
vectors of the image Im6 is a minimal cutset. 

Let T (c E) be a tree and T=E—T bea 
cotree. Let us renumber the edges so that T= 
{e4,...5€mbs T= {emt -En FOr each emtpE 
T, there is a unique vector R*(p=1,...,k(=n— 
m)) in Ker ô whose support is in {€em+p UT 
and whose e,,,,- equals +1. Similarly, for 
each e,€ T (a=1,...,m), there is a unique vec- 
tor Dé (a=1,...,m) in Im ô whose support is in 
{e,} U T and whose e,- component equals +1. 
{Ri,..., Rg} is a basis of Ker ð, and {D}, ..., 
Dz} is a basis of Im ô. Both matrices R¥(k x n) 
and D?(m x n) are totally unimodular, i.e., every 
minor determinant is 0, +1, or —1. Further- 
more, the following relations hold: R?*?= 
OF (p,q=1,...,k), Dp = 69 (a,b=1,...,m), R+ 
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Din+p = 9 (a=1,...,m; p=1,...,k). The matrices 
Rj, and Dé are called the fundamental tieset 
matrix and the fundamental cutset matrix, 
respectively, with respect to the tree-cotree 
pair (T, T). The minor of the matrix R¥ con- 
sisting of all rows and k columns k,,..., Ky 

is +1 or —1 if and only if {e,,,...,e,,} isa 
cotree, and 0 otherwise. Similarly, the minor 
of the matrix D? consisting of all rows and m 
columns k,,...,,,is +1 or —1 if and only if 
{exs ...5&,,} IS a tree, and 0 otherwise. 


H. Planarity of a Graph 


Let there be a natural one-to-one correspon- 
dence between the sets of edges of two graphs 
G,=(V,, E; 67 , 6; ) (i=1, 2), and suppose that 
under the correspondence a tree T, in G, cor- 
responds to a tree T, in G, and that the fun- 
damental cutset matrices D, and D, with re- 
spect to (7;, E;— T;) are mutually equal. Then 
G, and G, are said to be 2-isomorphic. The 
definition is equivalent to the one given by 
the equality of fundamental tieset matrices. 
2-isomorphism is an equivalence relation. If 
G, and G, are 2-isomorphic, the families of 
trees, cotrees, tiesets and cutsets are mutu- 
ally corresponding. The coincidence of one 
of the families is a sufficient condition for 
the 2-isomorphism of G, and G, as undi- 
rected graphs. A 3-connected graph has no 
2-isomorphic graph other than itself. 
Similarly, when a tree T, of G, corresponds 
to a cotree T, of G, and if the fundamental 
cutset matrix of G, with respect to (T,, E, — T,) 
is equal to the fundamental tieset matrix of G, 
with respect to (E,— T,, T,) as matrices, then 
we say that G, is dual to G,. In this case, G, is 
dual to G, also. If G, is dual to G, and G, is 
dual to G}, then G, and G, are 2-isomorphic. 
The duality is a relation among the equiva- 
lence classes of graphs by 2-isomorphisms. 
Any graph G=(V,E,0',0 ) can be “drawn” 
in 3-dimensional Euclidean space in the fol- 
lowing sense: Each vertex is a (distinct) point, 
and an edge e is an arc connecting two points 
é*eand 0 e with a direction pointing from 
0* e to 6” e in such a way that no two arcs 
intersect. A graph representable on a plane or 
on a 2-dimensional sphere in the above sense 
is called a planar graph. A graph G is planar if 
and only if it has a dual graph (H. Whitney 
[7]). Another necessary and sufficient con- 
dition is that as an undirected graph, neither 
the complete five-point graph K, nor the bi- 
partite complete graph of three-three points 
K, 3, appear in any subcontraction of the 
graph G. (This is a version of the criterion of 
C. Kuratowsk: [8].) 
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I. Coloring of Graphs 


A coloring of the vertices of a graph G= 
(V,E,é*,0-)is a mapping wy from V to the set 
of integers N satisfying the condition w(v) 4 
wv’) for all adjacent vertices v and v’. »(G)= 
min{|w(V)||w is a coloring of the vertices of 
G} is called the chromatic number of G. If 

the graph G is drawn over a 2-dimensional 
closed surface with a 1-dimensional ‘Betti 
number b, we have »(G)<[(7+ ./1+246)/2 |, 
where | | denotes the integral part. Except 
for the tKlein bottle (with b= 2), where y(G)< 
6, this is the best possible, i.e., there exists a 
graph whose y(G) equals the upper bound on 
the right-hand side. As for b> 1, the inequality 
was first proved by P. J. Heawood (1890), 

and final results were established by J. W. T. 
Young and G. Ringel [10]. When b =0, »(G)< 
5 was shown by A. B. Kempe (1879), but the 
four color conjecture: “y(G)< 4?” has remained 
unsolved for more than a hundred years. The 
conjecture is believed to have been solved 
affirmatively recently through checking a 
huge number of cases on a large computer 
[11,12]. 

A subset Wc V is called an independent set 
or an internally stable set if no two vertices in 
W are mutually adjacent. «(G)=max{|W|| W 
is an independent set of G} is called the num- 
ber of independence of G. A subset W of V is 
called a dominating set or externally stable 
set of G if every vertex ve V is either ve W or 
adjacent to a vertex of W. B(G)=min{|W||W 
being a dominating set of G} is called the num- 
ber of domination of G. For every graph G, 
we have a(G) > B(G) and y(G)-a(G)>|V}I. 


J. Decision Problems and Graphs 


There are many interesting topics in decision 
problems concerning graphs, especially from 
the standpoint of tcomplexity of computa- 
tions (— e.g., [13]). The following are some 
typical problems: (i) Problems for which algo- 
rithms of polynomial order are known: Is 

the given graph k-connected, strongly con- 
nected, a Euler graph, or a planar graph? (ii) 
*NP-complete problems: Is the given graph a 
Hamilton graph? Do we have a(G)=k, B(G)= 
k, or y(G)=k? 

Let G, and G, be two graphs. The problem 
of whether they coincide as unlabeled graphs 
is called the isomorphism problem, and has 
been studied for many years in connection 
with problems concerning the structure of 
chemical compounds. Unfortunately, no al- 
gorithm of polynomial order is known; nor 
do we know whether this is an NP-complete 
problem. As for the isomorphism problem for 
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planar graphs, algorithms of polynomial order 
are known. 


K. Perfectness Theorem 


Let the number of independence and the chro- 
matic number of a graph G=(V, E,é*,07) be 
a(G) and »(G), respectively. Let V,,..., V, be 
a disjoint decomposition of V. We denote by 
@(G) the minimal number of r such that every 
section graph of G over V; contains a com- 
plete graph. Furthermore, we denote by œ(G) 
the maximum value of |W| for a subset Wc 
V such that the section graph of G over W 
contains a complete graph. We always have 
a(G)<6(G) and »(G)<@(G). G is called a- 
perfect if every section graph H of G satisfies 
a(H)=0(H). Similarly, G is called y-perfect 

if y(H)= (A) for every section graph H of 

G. The conjecture that y-perfectness and «- 
perfectness are equivalent has recently been 
solved affirmatively [14]. This is called the 
perfectness theorem. 
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Greek Mathematics 


It is generally believed that theoretical mathe- 
matics originated with the Greeks. The Greeks 
learned the arts of land surveying and com- 
mercial arithmetic from earlier civilizations; 
but they developed theoretical mathematics 
themselves, toward the middle of the 4th cen- 
tury B.c. The creation of a mathematics that 
transcends practical purposes was one of 

the most remarkable events in the history of 
human culture, and one that had an immense 
impact on the development of all branches of 
science. We owe the reestablishment of impor- 
tant Greek mathematical texts and the re- 
construction of the development of Greek 
mathematics to the historians of the 19th 
century (the oldest extensive exposition of the 
history of mathematics before Euclid is due to 
Proklos (Proclus) (410—485)). 

The earliest known Greek mathematicians 
are Thales of Miletus (c. 639-546 B.c.) and 
Pythagoras of Samos (fl. 510? B.c.). Both were 
Tonians, but the latter went to what is now 
southern Italy and founded a semireligious 
school whose members called themselves 
Pythagoreans. Their motto was “Everything 
is number”; their studies were called mathema 
(“what is learned”) and consisted of music, 
astronomy, geometry, and arithmetic (the 
subject group called the quadrivium, which 
formed the core of medieval and later higher 
education) “for the purification of soul.” Their 
research delved into the theories of proportion 
(in relation to music) and tpolygonal numbers 
(triangular numbers, square numbers, etc.), 
and more generally into the theory of numbers 
and geometric algebra. It is said that they 
knew of the irrationality of Ja though no 
evidence of this has been found. Even after the 
demise of the Pythagorean school, its followers 
continued to promote mathematics in col- 
laboration with the Academy of Plato. 

Another significant school was the Eleatic. 
Among its members Zeno (c. 490—c. 430 B.C.) is 
especially important. tZeno’s paradoxes are 
arguments leading to absurdity. Some see 
within them the origin of logical reasoning 
and, consequently, of theoretical mathematics 
[3]. It is chronologically difficult to attribute 


187 
Greek Mathematics 


to Zeno the consideration of the continuum 
and irrational numbers, but we can find in him 
the impetus toward atomistic reasoning. The 
computation of the volume of pyramids (by 
dividing them into “atomistic” laminae) by 
Democritus (fl. 430? B.c.) and the atomistic 
calculation of the area of circles by Antiphon 
(fl. 430 B.c.) came shortly after the time of 
Zeno. 

The middle decades of the 4th century B.c. 
are known as the Age of Pericles, the Golden 
Age of Athens. The ttrisection of an angle, the 
tduplication of a cube, and the tquadrature of 
a circle, known at that time as the “three big 
problems” (— 179 Geometric Construction), 
were studied by the Sophists. Hippias of Elis 
(fl. 420 B.c.), Hippocrates of Chios (fl. 430 B.c. 
in Athens), Archytas of Taras (c. 430-365 B.c.), 
Menaechmus (fl. 350 8.c.), and his brother 
Dinostratus (fl. 350 B.c.) solved these problems 
using conic sections and the quadratrix (a 
transcendental curve whose equation is y= 
xcot(zx/2)). 

By 400 B.C. Athens had lost its political 
influence, but it remained the center of Greek 
culture. It was during this time that Plato’s 
Academy flourished, and Plato (427-347 B.C.) 
and his followers laid particular importance 
on mathematics. Archytas, Menaechmus, and 
Dinostratus belonged to or were closely as- 
sociated with this school. During the first fifty 
years of the Academy, research in the follow- 
ing fields was pursued: methodology of mathe- 
matics or science in general (i.e., dialectics, 
analysis, synthesis); geometric reconstruction 
of Mesopotamian algebra; the theory of irra- 
tionals in relation to the geometrization of 
algebra (Theodorus of Cyrene (Sth century 
B.c.), who was Plato’s teacher, as well as Theai- 
tetus of Athens (4157-369 B.C.) contributed to 
this study, and the general theory of propor- 
tion by Eudoxus of Cnidos (c. 408~c. 355 B.C.) 
also belongs to this field); the method of ex- 
haustion (by Eudoxus); and studies of the 
“three big problems” and conic sections. It was 
this school in which the term mathema came 
to be used in its present sense of “mathemat- 
ics” rather than in the sense of disciplines in 
general. 

The conquests of Alexander the Great ac- 
celerated the already considerable cultural 
influence of Athens. Later, during the Ptole- 
maic period, the center of culture moved to 
Alexandria. The Mouseion at Alexandra, the 
combined library and university, is said to 
have possessed hundreds of thousands of 
volumes. 

At Alexandria, Euclid (c. 300 B.c.) com- 
piled his Elements, which became a model 
for scientific works for centurtes to come— 
Newton’s Principia as well as Spinoza’s Ethics 
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are modeled on it. Most historians say that 
Euclid’s method derives from Aristotle (384— 
322 B.c.), who, after studying at Plato’s Acad- 
emy, founded a new school, the Peripatetics, 
whose doctrines are at many points opposed 
to those of the Academy. Others, however, see 
the origin of Euclid’s axiomatic method in 

the Eleatics [5]; we can find prototypes of 
some parts of the Elements in both Oenopides, 
who lived during the time of Zeno, and in 
Hippocrates. 

The third century B.c. was the Golden Age 
of Greek mathematics. Archimedes of Syracuse 
(c. 282—212 B.c.) was the greatest mathema- 
tician, mechanic, and technician of antiquity. 
He did important work in mathematics, study- 
ing the exact quadrature of the parabola. 
According to his ephodos (method), he would 
obtain a result by mechanical experiments and 
then prove it by the method of “exhaustion.” 
He also computed the value of z; studied 
spirals and other curves, spheres, and circular 
cylinders; contributed to the development of 
statics and optics and their application; and 
had a profound influence on later mathema- 
ticians. During the same period, Apollonius of 
Perga (fl. 210 B.c.) wrote Konikon biblia (Books 
on Conics) in eight books, of which the last 
has been lost. The geometric theory of tconic 
sections contained in this work is not much 
different from the one we know today; it had a 
great influence on 17th-century scientists es- 
pecially. Other mathematicians of this period 
worth noting are Eratosthenes (c. 275-195 
B.c.), who conceived the tsieve method of find- 
ing prime numbers and who measured the 
earth, and Hipparchus (fl. 150 B.c.), called the 
father of astronomy, who made a table of 
sines. 

Hellenistic influence began to decline in the 
first century B.c., and the influence of Alex- 
andria decreased. The Mouseion burned in 
48 B.C., but was rebuilt. Among the mathe- 
maticians of this time, we may count Heron 
(fl. 60? A.p.); Menelaus (fl. 100 a.D.), who wrote 
Sphaerica; Theon of Smyrna (fl. 125 A.p.); 
Ptolemy (fl. 150 A.p.), the author of Almagest; 
Nicomachus (50?—150? a.b.), the author of 
Arithmetike eisagoge; Diophantus (fl. 250? 
A.D.), whose career is not fully known but who 
wrote Arithmetika, of which six of the original 
thirteen books remained to influence tFermat; 
and Pappus (fl. 300 a.p.), the last creative 
mathematician in Greece, who left eight books 
of the Synagoge, which influenced *Descartes 
and which still exist today. 

The period following the fall of the Western 
Roman Empire was a difficult one for Greco- 
Egyptian science. The Mouseion was de- 
stroyed for the second time in 392 a.D. Theon 
of Alexandria (fl. 380) and his daughter, Hy- 
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patia (c. 370—415), were at that time working 
on commentaries on the classics. Among the 
few remaining works of the period is Proclus’ 
(410-485) commentaries on the first book of 
Euclid’s Elements. The Athenian Academy was 
closed in 529 by order of the Emperor Justi- 
nian; the last director was Simplicius, who 
commented on Aristotle. Soon afterward, 
Alexandria fell into the hands of the Moors, 
and many scholars fled as refugees to Constan- 
tinople, the capital of the Eastern Empire. 
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A. General Remarks 


Green’s functions are usually considered in 
connection with tboundary value problems 
for ordinary differential equations and also 
with telliptic and tparabolic partial differen- 
tial equations. For example, consider bound- 
ary value problems for the tLaplacian in 3- 
dimensional space: L[u]=(07/6x{ + 67/0x3 + 
07/dx%)u. Let D be a bounded domain with a 
smooth boundary S and the boundary con- 
dition B on S be either u(x)=0 (x eS) (the first 
kind) or éu/én+ Bu=0 (xe S) (the third kind), 
where n is the outer normal of unit length, 
B(x)>0, and B(x)#0. We say that the function 
g(X1,X2,X35,, 65, €3) is the Green’s function of 
L (or the partial differential equation L[u]=0) 
relative to the boundary condition B, when (1) 


` g(x, č) satisfies L [g(x, ¢)] =0 except for x = £; 


(ii) g(x, €)= —1/4nr + w(x, ë), where r=(X7_, (x; 
—€,)*)'? and w(x, ë) is a regular function, i.e., 
of class C” for a suitable value v; (111) g(x, €) 
satisfies the boundary condition B, i.e., g(x, č) 
=0, xes (the first kind), or (6/6n+ $) g(x, ë) 
=0, xe S (the third kind). Conditions (i) and 
(ii) mean that g(x, č) is a fundamental solution 
of L, i.e., L,.[.g(x, €)] = 6(x — &), where 6(x — č) 
is *Dirac’s measure at the point x = €. Note 
that if g(x, č) is a fundamental solution, then 
by adding any solution u of the equation L[u]| 
=0 to g, we obtain another fundamental 
solution g +u. Thus Green’s function is the 
fundamental solution that satisfies the given 
boundary condition. To be more precise, in 
the boundary value problem, if the boundary 
condition is of the first kind, g(x, €) can be 
obtained by adding to a fundamental solution 
—1/4nxr the solution a(x, č) of the following 
Dirichlet problem: A w(x, ¢) =0, w(x, 2)= 
1/4ar (xe S). We remark that there are slightly 
different definitions for Green’s function. For 
example, there are cases where g(x, č) is de- 
fined by g(x, €)= 1/4ar+ (x, č) or by g(x, = 
1/r+ (x, č). 

In the case in the previous paragraph, 
Green’s functions satisfying the boundary 
conditions are uniquely determined. In gen- 
eral, if we are given a Green’s function, then 
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for any regular function v(x) the function 
=f g(x, Š)\o(č)dč 
D 


represents the solution of L[u] =v with the 
boundary condition B. More precisely, if v(x) 
satisfies the tHolder condition |v(x)—v(x’)|< 
L\|x—x’'|* (0<a<1)(L, « positive constants), 
then u(x) is of class C?. Conversely, if u(x) 
satisfies the equation L[u]=v and the bound- 
ary condition B, it is represented by the for- 
mula for u(x). This means that if we denote 
the operator that assoctates u to v by G, then 
G is the inverse operator of the Laplacian L 
with the boundary condition B, and Green’s 
function is the tintegral kernel of the operator 
G. Using this property, the boundary value 
problem relative to L can be reduced to a 
problem of tintegral equations. For example, 
the differential equation with the boundary 
condition B containing the complex parameter 
A, L[u]+Au=f, is equivalent to the integral 
equation u + åG[u}]=G[/f], which is obtained 
by letting G act from the left on the above dif- 
ferential equation. In this way, the problem 
can be simplified. 

In the case of general boundary value 
problems for higher-order elliptic operators, 
Green’s functions are defined in the same way 
as before (— 189 Green’s Operator). The im- 
portant case is when L and the boundary 
condition B define a jself-adjoint operator. 

In this case, Green’s function is symmetric 
(g(x, č)=g({ć, x)). To obtain Green’s function 
is not easy in general. However, in some cases 
such functions can be obtained fairly easily 
(— Appendix A, Table 15.VI). 


B. Self-Adjoint Ordinary Differential 
Equations of the Second Order 


Consider the operator L[u] =(p(x)u'Y + q(x)u 
(p(x) >0) defined in the interval a <x <b, with 
boundary conditions of the form au’ + Bu=0 
at the two endpoints. Then Green’s function 
g(x, č) is defined in the following way: (i) For 
x#€, LL g(x, 2)]=0; (ii) [ég(x, 6)/0x}x=375 = 
1/p(€); (iii) for & fixed, g(x, č) satisfies the homo- 
geneous boundary conditions at x =a and 
x=b. 

Conditions (i) and (ti) mean that L[g(x, €)] 
=ô(x— č). We can construct g(x, č) in the 
following way: Let u,(u,) be the solution of 
L{u]=0 satisfying the boundary condition at 
x=a({(at x=b). If u; and u, are linearly inde- 
pendent, we can satisfy p(u,u,—u,u5)=1 by 
choosing the constants suitably. Then Green’s 
function g(x, č) is defined by g(x, €)=u,(x)u,(€) 
for x <é, and g(x, č)= u; (€)u,(x) for č <x. If u; 
and u, are linearly dependent, there exists no 
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Green’s function that satisfies conditions (i), 
(11), and (iii). However, by modifying the defi- 
nition, we can get a generalized Green’s func- 
tion playing a similar role [2]. This method 
can be applied to the case of ordinary dif- 
ferential equations of higher order. 


C. The Laplace Operator 


When the domain D is the n-dimensional 
sphere of radius a with center at the origin, 
Green’s function of the Laplacian relative to 
the boundary condition u=0 is obtained in 
the following way. Let E(r) be the following 
fundamental solution of the Laplacian: E(r)= 
(2x)! logr for n=2 and E(r)= —((n—2)- 
o,r” 7)! for n> 3, where w, =22"?T(n/2) is 
the (n— 1)-dimensional surface area of the n- 
dimensional unit sphere. Then Green’s func- 
tion g(x, č} is defined by 


g(x, ) = E(r)— E(pr'/a), 


where p= (È= 8 ae r= (Pa (x;— cl ue 
č=(a/p} či. 


D. Helmholtz’s Differential Equation 


Let D be an exterior domain with a smooth 
boundary S in R?. In mathematical physics, 
the boundary value problem of finding a solu- 
tion u(x) of Helmholtz’s differential equation 
(A+k?)u(x) = f(x) (k>0) satisfying u(x)=0 
(xéS) is of particular interest. In this case, 
concerning the behavior of u(x) at infinity, 

we usually assume Sommerfeld’s radiation 
condition: 


When |x|— +0, 


‘ae 
(uiu) 


where ĝ/ðr is the derivative along the radial 
direction. It is known that this condition en- 
sures the uniqueness of the solution (Rellich’s 
uniqueness theorem). We can construct Green’s 
function G(x, €) for any k(>0) so that for 
smooth f(x) with bounded tsupport, 


u(x) = O(|x|~"), 


=o(|x|~*), 


|x|=r 


u= | G(x, €) f(¢)d¢ 


represents the solution satisfying u(x)=0 (xe 8S) 
and Sommerfeld’s radiation condition. Then, 
with 

ik|x—é| 


G(x, č)= ~ 4nix—e| 


+ K(x, č), 


where 


3 1/2 
x-ë=( È ise) > 


i= 


A 


701 


K.{x, é) can be obtained by solving an integral 


equation of tFredholm type [3, 4]. In this case, 


there exists no Green’s operator in L, space, 
and G(x, č) can be considered to be a general- 
ized Green’s function [4]. 


E. Stokes’s Differential Equation 


Let D be a bounded domain in R? with 
smooth boundary S, and consider Stokes’s 
differential equation in D 


Op 3 ‘Ou, 
Au,=———pX;,, i=1,2,3, Y —*=0, 
pon Ox; pe ' a Ox; 


where p, p are positive constants. In hydro- 
dynamics, we consider the boundary value 


problem of finding solutions (u; (x), uz (x), 43 (x), 


p(x)) of Stokes’s equation satisfying the bound- 
ary condition u,(x)=0, xeS (i= 1, 2, 3). In this 
case, Green’s tensors G,,(x, č), g(x, č) can be 
constructed, and for smooth functions X;(x) 
(i= 1, 2, 3) the unique solution of this bound- 
ary value problem is represented by 


u(x)= o> | G(x, C)X (6) dé, 


3 
no=e Ù | g(x, SX (Ede 


j=l 
[5]. 
F. Parabolic Equations 


Consider the boundary value problem (the 
initial boundary value problem): 


3 a2 
LW 5 = flo t>0, a<x<h, 
u(x, 0)= (x), 


where at x =a and x =b, u(x, t) satisfies 
some homogeneous boundary conditions. 
In this case, we can construct the function 
g(x, t; č, t) (t >) satisfying the following con- 
ditions: (i) L[g] =0 except for x=€. t=1; 
(ii) 

exp(—(x—)*/4c7(t —1)) 


2c. /n(t —T) 


+ (regular function) 


g(x, t; č, 1)= 


in a neighborhood of x= č, t =q; and (iii) 
g(x, t; č, t) satisfies the given homogeneous 
boundary conditions at x =a and x =b. Then 


t fb 
u(x, t)= | | g(x, t; Š, T)f(, de dt 
Ova 


+f g(x, t; č O)p(c)de 


a 


represents the solution of the problem stated 
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in this section for regular functions f, g. The 
function g(x, t; č, t) is called Green’s function 
relative to the boundary value problem. De- 
tailed consideration of such elementary cases is 
found in [6]. 


G. Kernel Functions 


The kernel function is closely related to 
Green’s function of A (the Laplacian) rela- 
tive to the first boundary value problem in a 
domain in R?. 

First we explain the general definitions of 
the kernel function. Let E be a general set, and 
let & be a tHilbert space of complex-valued 
functions defined on E with a suitable inner 
product (f, g). Suppose that we are given a 
function K(x, y) defined on E x E satisfying 
the following conditions: (i) For any fixed y, 
K(x, y) regarded as a function of x belongs to 
&; and (ii) for any f(x)e&, (f(x), K(x, y= 
f(y). Then K(x, y) is called a kernel function 
or reproducing kernel. The kernel function, if it 
exists, is unique and is tpositive definite Her- 
mitian; that is, 


n 


DY Kp Y);Š 20. (1) 


Conversely, any positive definite function is 

a reproducing kernel of some Hilbert space. 
A necessary and sufficient condition for the 
existence of the kernel function is that for 

any yeE, the linear functional f—> f(y) be 
bounded. In this case, the minimum of || f || 
under the condition f(y)=1 (fe) is attained 
by the element K(x, y)/K(y, y), and its value is 
K(y, y) 12. When § is a tseparable space, then 
by an torthonormal system {@,(x)}, we can 
represent K(x, y) as 


K(x,y)= PAD. (2) 


As an example of kernel functions, the fol- 
lowing case is of particular importance. Let E 
be an n-dimensional tcomplex manifold, g@ and 
W be holomorphic fdifferential forms of degree 
n on E, and let the following inner product be 
given: 


woui=| paw. 


Now ¥# is given by §={@|¢9, o)< +00}, and 
the kernel function is called the kernel dif- 
ferential. When E is a domain in C", regarding 
the coefficients of the differential form as func- 
tions, we call the kernel function Bergman’s 
kernel function. Moreover, if E can be mapped 
onto a bounded domain by a one-to-one 
holomorphic mapping, then 


ds? =) (0? log K(z, Z)/02,02,) dz, dz, 
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is positive definite and gives a Kahler metric 
which ts called the Bergman metric. 


H. Kernel Functions for Domains in the 
Complex Plane 


Let E be a domain D in the complex plane 
(z=x+iy). Let K(z,¢) be Bergman’s kernel 
function of D, and let G(z, ) be Green’s func- 
tion of A relative to the first boundary con- 
dition with a pole at ¢. Then we have 


K(z, 0) = —(2/m)6°G(z, 0)/d20€. (3) 


Next, let U(z, ¢) be the kernel function of 
the Hilbert space consisting of the holomor- 
phic differential forms whose integrals are 
single-valued, and let N (z, ¢) be Neumann’s 
function of A, i.e., the function that is har- 
monic in D— {¢}, has the same singularity as 
G at ¢, and whose derivative in the normal 
direction 6N/én is constant along the bound- 
ary. Then we have 


U(z, 0) =(2/n)07N(z, 0)/d260. (4) 


Now the kernel H(z, €)=(N(z, 0) — G(z, ¢))/27 
is the kernel function relative to the Hilbert 
space consisting of all real tharmonic functions 
whose integral mean value along the boundary 
I is 0 and having the inner product 


dp oO + K 


The kernel H(z, ¢) is called a harmonic kernel 
function. 

Suppose that the boundary T is tpiecewise 
smooth, and consider the space of all holo- 
morphic functions in D that are continuous on 
the boundary of D. The inner product of such 
functions ọ and yw is given by (ọ,y)= froy 
ds (ds is the element of the arc length of T). 
Hence we have a Hilbert space. Then the 
kernel function relative to this Hilbert space is 
called Szegö’s kernel function, which has a 
close relation with tbounded functions. 

The kernel functions enable us to represent 
holomorphic mappings that map the domain 
D onto various canonical domains (— 77 
Conformal Mappings). 
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A. General Remarks 


Consider the first and the third boundary 
value problems for the elliptic equation 
Alu) = —Au+ Sax) “rela = f(x) 

(— 323 Partial Differential Equations of Ellip- 
tic Type). Let D be a bounded domain of R” 
whose boundary S consists of a finite number 
of smooth hypersurfaces. By the tmethod of 
orthogonal projection, we take the domain 
GA) as follows: (i) {u(x)|u(x)e H?(D) and 
u(x)=0 for xeS} or (ii) {u(x)|u(x)e H?7(D) and 
u/n + B(x)u=0 for xe S$} according as we are 
considering the first or third boundary value 
problem, where H?(D) is the Sobolev space 
(— 168 Function Spaces). If the operator A is 
a one-to-one mapping from &(A) onto the 
function space L,(D), we call the inverse 
operator A‘ Green’s operator relative to the 
boundary condition, and we denote it by G. In 
general, the existence of A~' is not guaranteed. 
However, if we take real t large enough, G,= 
(A+t1)~! exists. 

Consider the general case where / is a com- 
plex parameter: (AJ — A) [u] = f(x), f(x)EL,(D). 
Letting G, act from the left, we have (J -—(A+ 

dG) [u] = —-Gf ane if ue L,(D) is 

a solution, clearly u(x)e YA), and u(x) satis- 
fies the first partial differential equation and 
the boundary condition. Since G, is a tcom- 
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pact operator in L,(D), the *Riesz-Schauder 
theorem can be applied (— 68 Compact and 
Nuclear Operators). In particular, if 2+¢ is not 
an ‘eigenvalue of G, u(x) =(AI — A) 1 f= —(I — 
(A+t)G,)~'G,f represents a unique solution. 

In the equations in the first paragraph of 
this section, if a,(x)=0 and c(x) and f(x) are 
real, then G, is a tself-adjoint operator in 
L,(D), and therefore the tHilbert-Schmidt 
expansion theorem can be applied. Namely, let 
{A;} be the eigenvalues of A such that Aw;(x)= 
A,(x), where {œw;(x)} is an torthonormal 
system in L,(D). Then for any f(x)e L,(D), f(x) 
=> ~,(f@,a;(x), where the right-hand side 
is taken in the sense of tmean convergence. 
Furthermore, for f(x)e (A), we have the 
expansion (Af)(x)= 17%, 1;(f, @)@,{x) in the 
same sense. 

When G, is not self-adjoint, let G* be the 
tadjoint operator of G, in L,(D). Then G* 
represents Green’s operator relative to the 
equation 


n je] 


(A* +1 [0] = —Av— Y ao) 


+(c(x)+0)v 
= g(x), 


corresponding to the boundary conditions (i) 
v(x) =0, x eS (first boundary condition) and (ii) 
(6/6n)v + B’'(x)v =0, xe S, where 


B(x) =p + ¥ ax) cosnx;, 
i=1 


with n the outer normal (third boundary con- 
dition) [2]. 


B. Elliptic Equations of Higher Order 


Green’s operator can be defined for elliptic 
equations of higher order. Consider the 
equation 


A(x, 0/0x)u(x) = f(x), 
B(x, 6/0x)u(x) = 0, 


xeD; 
xeS, 
j=1,2,...,b(=m/2), (1) 


where A is an ‘elliptic operator of order m and 
the boundary operators {B;} satisfy: (i) At 
every point x of S, the normal direction is not 
characteristic for any B,; and (ii) the order m; 
of B; is less than m, and m;#m, (j #k). The 
domain YA) of A is defined by 


GA) = {u(x)|ue H"(D) and 
B,(x, 6/éx)u(x)=0 for xeS, 
j=1,2,...,b}. 


When A is a one-to-one mapping from 2(A) 
onto L,(D), the inverse G = A`! is called 
Green’s operator. 
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This general boundary value problem, es- 
pecially the existence theorem, was treated 
under some algebraic conditions on A and 
{B;} by M. Schechter [5] who showed that 
G[u]e H”(D) if u(x)eL,(D) and that G[u] 
depends continuously on u. In particular, if 
m>n/2, then by fSobolev’s theorem, G is a 
continuous mapping from L.(D) into C°(D), 
and G is represented by an tintegral operator 
of Hilbert-Schmidt type (L. Garding [6]). 
Namely, for any f(x)e L,(D), 


canon | G(x, E)f(¢) dé, 
D 


| G(x, €)|? dx dé < +00. 


In general, the function G(x, č), obtained by 
the kernel representation of Green’s operator 
G, is called Green’s function. 

On the other hand, consider, for exam- 
ple, the *Dirichlet problem of A in R*. Then 
Green’s function is defined in the following 
way (— 188 Green’s Functions): G(x, €)= 
—(4n|x — é|)! + u(x, č), where u(x, ë) satisfies 
(i) A, u(x, €)=0, and (ii) G(x, ĉ)lxes=0. The 
function defined in this manner coincides with 
Green’s function defined as a kernel represen- 
tation of Green’s operator [1,2]. 

Suppose that in problem (1) the telliptic 
operator A(x, 0/0x) is independent of x, and let 
E(x) be a tfundamental solution, i.e., E(x) 
is a distribution solution of A(é/dx)E(x)=6(x) 
(6(x) is fDirac’s 6-function; — 112 Differential 
Operators). Then E(x) is a C®-function away 
from the origin. Moreover, in a neighborhood 
of the origin the following estimates hold: For 
|a|<m, 


a) 


where c is some positive constant. When prob- 
lem (1) has Green’s operator G, we can say 

the following, using the fundamental solution: 
Green’s function G(x, č) exists and can be 
written as G(x, €)= E(x —€) + u(x, é), where for 
any fixed ëe D, u(x, č) satisfies (i) A(0/dx)u(x, é) 
=0 and (ii) B(x, 6/0x)G(x, €)=0, xe S, j= 

1,2, ...,b. 


ex rr, m—n—ļ|aļ|<0, 


<< clog|x| 1, m—n—|a|=0, 


C, m—n—|a|>0, 


C. Hypoelliptic Operators 


Let 
A Ai 
x, 6/ Ye a oe) ax)? 


C 


A Ne gll 
ôx] Ox™... Axe" 
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be a general partial differential operator with 
C”-coefficients. If the kernel E(x, é) satisfies 
A(x, 6/6x) E(x, €)=6(x — č), that is, if we have 
the relation <E{x, č), 'A(x, 6/6x)p(x)>, = (6) 
for any ọ(x)€ 2, then E(x, €) is called a funda- 
mental solution of A, where ‘A is the transposed 
operator of A: 


qa Na 
‘A(x, 0/dx)o(x)= Y (-1)" (Z) (a,(x)v(X)). 
lal <m ox 

Now if there exists a fundamental solution 
E’(x, €) of ‘A(x, 0/0x) such that (i) E(x, č) de- 
fines a kernel that gives rise to two continuous 
mappings, one of which maps the space 2; 
into &,, and the other of which maps the space 
Y,, into é, (— 168 Function Spaces), and (ii) 
for x# č, E(x, č) a C*”-function of (x, č), then 
any distribution u(x) satisfying A(x, 6/0x)u(x) 
= g(x) is a C”-function, where g(x) is of class 
C”. In general, an operator A with the prop- 
erty that any solution u(x) of A(x, 6/éx)u(x) = 
g(x) is of class C® whenever g(x) is of class 
C~”, is called hypoelliptic. Elliptic and para- 
bolic operators are both hypoelliptic. L. Hör- 
mander characterized the hypoelliptic dif- 
ferential operators with constant coefficients 
[8] (— 112 Differential Operators). 

A kernel E(x, č) such that 


A(x, 6/0x) E(x, ¢) = d(x — E) + w(x, ¢), 


with w(x, é) a C’-function, is called a para- 
metrix of A. To prove the hypoellipticity of A, 
it suffices to show the existence of a parametrix 
E'(x, č) of the operator 'A having the prop- 
erties (i) and (ii) mentioned in the previous 
paragraph. 

To explain the notion of the fundamental 
solution for the tevolution equations, suppose 
that we are given an evolution equation 


am 


LiuJ= 5p, t) 


EE (2) 2 wx,9=0 
Ag AX, ZT] Bp UY, FD HY, 
j<m RA ü Ox ot! 

xER", to<t<T. A kernel E(x, t; č, to) (to < 
t<T) is called a fundamental solution to the 
evolution equation if 


Ly (E(x, t; Gs to))=0, t> tos 
and 
ato Ot! Venetis ô(x— č) j=m-—l. 
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A. Definition 


Let G be a nonempty set. Suppose that for any 
elements a, b of G there exists a uniquely deter- 
mined element c of G, which is called the prod- 


‘uct of a and b, written c=ab. We call G a 


group or multiplicative group if (1) the associa- 
tive law a(bc)=(ab)c holds, and (ii) for any 
elements a, beG there exist uniquely deter- 
mined elements x, ye G satisfying ax =b, 
ya=b. Then the mapping (a, b)—ab is called 
multiplication in G. Condition (ii) is equivalent 
to the following two conditions: (tii) There 
exists an element e (called the identity element 
or unit element of G) such that ae =ea =a for 
any element a of G; and (iv) for any element a 
of G there exists an element x such that ax = 
xa=e. 

The element x in condition (iv) is called the 
inverse (or inverse element) of a, denoted by 
a‘, The uniqueness of the identity element e 
and the inverse a~' follows readily from the 
axioms. The identity element of a multiplica- 
tive group is sometimes denoted by 1. If ab 
= ba, then we say that a and b commute. The 
commutative law, ab = ba for any elements a, 
beG, is not assumed in general. A group satis- 
fying the commutative law is called an Abelian 
group (or commutative group) in honor of N. 
H. Abel, who made use of commutative groups 
in his study of the theory of equations. The 
product in a commutative group is often writ- 
ten in the form a+ b, and in this case the 
mapping (a, b)>a+ b is called addition. The 
element a+b is called the sum of a and b, and 
G is called an additive group. In an additive 
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group the identity element is usually denoted 
by 0 and called the zero element, and the in- 
verse of a is denoted by —a(— 2 Abelian 
Groups; 277 Modules). To describe the tlaw of 
composition, we sometimes use notation differ- 
ent from multiplication or addition (— 409 
Structures). 


B. Examples 


A ‘linear space over a ffield K is an additive 
group with respect to the usual addition of 
vectors (— 256 Linear Spaces). A field is an 
additive group with respect to the addition, 
and the set of nonzero elements of a field 
forms a group with respect to the multiplica- 
tion, which is called the multiplicative group 
of the field (— 149 Fields). 

All invertible n x n matrices over a ring R 
form a group with respect to the usual multi- 
plication of matrices. This group is called the 
tgeneral linear group of degree n over R (— 60 
Classical Groups). 

All one-to-one mappings from a set M onto 
itself (i.e., all permutations on M) form a group 
with respect to the composition defined by 
(f og) (x)= f(g(x)) (xe M). (Sometimes the 
product fog is denoted by gf and f(x) by xf. 
Then x(gf)=(xg)f) The group of ali permuta- 
tions on M is called the symmetric group on 
M. A group G is called a permutation group 
(on M) if every element of G is a permutation 
on M. For instance, the general linear group of 
degree n over a field K may be regarded as a 
permutation group on the set of n-dimensional 
vectors, and it is also regarded as a permuta- 
tion group on a ftensor space. 

All ‘motions in a Euclidean space form a 
group with respect to the usual composition of 
motions. All invertible n x n matrices over K 
leaving a given tquadratic form invariant form 
a group with respect to the usual multiplica- 
tion of matrices. This group is called the 
torthogonal group belonging to the given 
quadratic form. If K is the fcomplex number 
field or the real number field, then these 
groups are Lie groups (— 13 Algebraic 
Groups; 151 Finite Groups; 161 Free Groups; 
249 Lie Groups; 423 Topological Groups). 


C. Fundamental Concepts 


If a group G consists of a finite number of 
elements, then G is called a finite group; other- 
wise, G is called an infinite group. The number 
of elements of G is called the order of G. A 
nonempty subset H of G.is called a subgroup of 
G if H is a group with respect to the multipli- 
cation of the group G. Hence a nonempty sub- 


190 C 
Groups 


set H is a subgroup of G if and only if a~'beH 
for any a, be H. For a family {H,} of sub- 
groups of G, the intersection {\, H, is also a 
subgroup. 

The associative law of multiplication says 
that elements a,, a,, a, of G determine the 
product a,a,a3, which is the common value of 
(a, a,)a, and a,(a,a,). This law can be gen- 
eralized to say that any ordered set of n ele- 
ments a,,43,...,d, (n> 2) of G determines their 
product a,a,...a, (general associative law). 
When a, =a, =- =a, =a, we denote the 
product aa...a by a". If we define a" for n>0 
by a? =e and a~"=(a")~', we then have a"a™ 
=a"*™ (a"y"=a"" for any n, meZ.. If there 
exists a positive integer n such that a" =e, then 
the smallest positive integer d with af =e is 
called the order of the element a. If there is no 
such n, then a is called an element of infinite 
order. If a is of infinite order, then its powers 
a? (=e), a*!, a*?,..., are all unequal. If a is of 
order d, then the different powers of a are a? 
(=e), a, a*,...,a7-'. All the powers of a form 
a subgroup <a> of G, called a cyclic subgroup. 
The order of an element a is the same as the 
order of the subgroup <a>. The group <a> 
itself is called a cyclic group and is an example 
of an Abelian group (— 2 Abelian Groups). 

Let S be a subset of a group G. Then the 
intersection of all subgroups of G containing S 
is called the subgroup generated by S and is 
denoted by <S>. It is the smallest subgroup 
containing S, and if S is nonempty, <S con- 
sists of all the elements of the form 
ay'as?...ar” (a,eS,m;€Z). If <S>=G, the 
elements of S are called generators of G. When 
G has a finite set of generators, G is said to be 
finitely generated. When S = {a}, then <S> 
coincides with <a>, and the element a is the 
generator of the cyclic group <a>. Suppose 
that elements a,,...,a, of G satisfy an equa- 
tion of the form af" a3"2...a7"= 1. This equa- 
tion is then called a relation among the ele- 
ments a,,...,d,- If we have a system of gen- 
erators and all relations among the generators, 
then they define a group (— 161 Free Groups). 
It is, however, still an open problem to find a 
general procedure to decide whether the group 
determined by a given system of generators 
and the relations among them contains ele- 


_ments other than the identity (— 161 Free 


Groups B). 

For a subset S and an element x of a group 
G, the set of all elements x~! sx (seS) is de- 
noted by x 'Sx or S*, and S and S* are called 
conjugate. We have (ab )*=a*b*, (a Y 
=(a*)7!. If H is a subgroup, then H* is also a 
subgroup. For a subset S, the set of all ele- 
ments x satisfying S*=S forms a subgroup 
N(S), called the normalizer of S. The set of all 
elements that commute with every element of 
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S forms a subgroup Z(S), called the centralizer 
of S. The centralizer Z of G is called the center 
of G. The set of all elements conjugate to a 
given element a of G is called a conjugacy class. 
A group G is the disjoint union of its conju- 
gacy classes. 

Let H be a subgroup of a group G and x an 
element of G. The set of elements of the form 
hx (he H) is denoted by Hx and is called a 
right coset of H. A left coset xH is defined 
similarly. G is the disjoint union of left (right) 
cosets of H. The cardinality of the set of left 
cosets of H equals that of the set of right cosets 
of H; it is called the index of the subgroup H 
and is denoted by (G: H). Given two subgroups 
H and K of G, the set HxK = {hxk|heH, ke K} 
is called the double coset of H and K, and G is 
the disjoint union of different double cosets of 
H and K. If the left cosets of a subgroup H are 
also the right cosets, i.e., if Hx =xH for every 
xeG, then H is called a normal subgroup (or 
invariant subgroup) of G. An equivalent con- 
dition is that H = H* for all xe G. The center of 
G is always a normal subgroup of G. If H isa 
normal subgroup of G, the set of all products 
of an element of Ha and an element of Hb 
coincides with Hab. Thus if we define the 
product of two cosets Ha and Hb to be Hab, 
then the set of cosets of H forms a group. This 
group is denoted by G/H and is called the 
factor group (or quotient group) of G modulo 
H. (When G is an additive group, G/H is also 
denoted by G—H and is called the difference 
group.) The group G itself and {e} are normal 
subgroups of G. If G has no normal subgroup 
other than these two, then G is called a simple 
group. A subgroup of finite index contains a 
normal subgroup of finite index. If H is a 
subgroup of finite index, then we can find a 
common complete system of representatives of 
the left cosets and the right cosets of H. If G is 
finitely generated, then so is any subgroup of 
G of finite index. 

Let R be an tequivalence relation defined in 
a group G. If xRx’ and yRy’ always imply 
(xy)R (xy), then we say that R is compatible 
with the multiplication. The tquotient set G/R 
is a group with respect to the induced multipli- 
cation. This group is called the quotient group 
of G with respect to R. The equivalence class 
H containing e is a normal subgroup, and 
xRx if and only if x~! x'e H, i.e., x and x’ are 
contained in the same coset of H. Thus G/R 
coincides with G/H. 

If G is a finite group of order n, then the 
order and the index of any subgroup of G, the 
order of any element of G, the cardinal number 
of any conjugacy class of G, and the number of 
different conjugate subgroups of any subgroup 
of G are all divisors of n. 


706 


D. Isomorphisms and Homomorphisms 


If there is a one-to-one mapping a «>a of the 
elements of a group G onto those of a group 
G' and if aa and beb imply abab’, then 
we say that G and G’ are isomorphic and write 
GSC. If we put a’ = f(a), then f:G->G' isa 
*byection satisfying f(ab)= f(a) f(b) (a, be G). 
More generally, if a mapping f:G—G’ satisfies 
f (ab) = f(a) f(b) for all a, be G, then f is called a 
homomorphism of G to G’. An tinjective (fsur- 
jective) homomorphism is also called a tmono- 
morphism (*epimorphism). If there is a surjec- 
tive homomorphism GG’, then we say that 
G’ is homomorphic to G. The composite of two 
homomorphisms is also a homomorphism. If a 
homomorphism f:G—>G’ is a bijection, then f 
is called an isomorphism. In this case f7! is 
also an isomorphism, and we have G&G”. 

For a subgroup H of a group G, the in- 
jective homomorphism f: H>G defined by 
J(a)=a (ae H) is called the canonical injection 
(or natural injection). For a factor group G/R 
of G, the surjective homomorphism f:G—>G/R 
such that ae f(a) (ae G) is called the canonical 
surjection (or natural surjection). 

Let f:G—G’' be a homomorphism. Then the 
image f(G) of f is a subgroup of G, and the 
kernel H = {ac G| f(a)= e (the identity of G’)} 
of f is a normal subgroup of G. The equiva- 
lence classes of the equivalence relation given 
by f(x)= f(y) are just the cosets of H, and f 
induces an isomorphism f:G/H— f(G). The 
latter proposition is called the homomorphism 
theorem of groups. This theorem is extended in 
the following way: For simplicity let f: G>6G' 
be a surjective homomorphism. (i) If H’ is a 
normal subgroup of G’, then the inverse image 
H = f~'(H')is a normal subgroup of G, and f 
induces the isomorphism f:G/H >G'/H'. (ii) if 
H is a subgroup and N is a normal subgroup 
of G, then HN = {hn| he H,neN} is a subgroup 
of G, and the canonical injection H -HN 
induces an isomorphism H/HN N>HN/N. (iii) 
If H and N are two normal subgroups of G 
such that H >N, then the canonical surjection 
G-—>G/N induces an isomorphism G/H > 
(G/N)/(H/N). Propositions (i), (ii), and (iii) are 
called the isomorphism theorems of groups. 

A homomorphism of G to itself is called an 
endomorphism of G, and an isomorphism of G 
to itself is called an automorphism of G. The 
set of automorphisms of G forms a group with 
respect to the composition of mappings, called 
the group of automorphisms of G. Given an 
element a of G, the mapping xa 'xa (xeG) 
yields an automorphism of G which is called 
an inner automorphism of G. The set of inner 
automorphisms of G forms a normal subgroup 
of the group of automorphisms of G, called the 
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group of inner automorphisms of G, which is 
isomorphic to the factor group of G modulo its 
center. The factor group of the group of auto- 
morphisms of G modulo the group of inner 
automorphisms of G is called the group of 
outer automorphisms of G. 

Ifa mapping {:G->G’ from a group G to 
another group G’ satisfies {(ab) = f(b) f(a) 
(a,be G), then f is called an antihomomor- 
phism. A bijective antihomomorphism is called 
an anti-isomorphism. When G=G’, f is called 
an anti-endomorphism or anti-automorphism 
(e.g., f: GG defined by f(a)=a™! is an anti- 
isomorphism). 


E. Groups with Operator Domain 


Let Q be a set and G a group. Suppose that for 
each EQ and xeG, the product @xeéG is 
defined and satisfies 0(xy) = 6(x)(@y). Then Q is 
called an operator domain of G, and G is called 
a group with operator domain Q, or simply an 
Q-group. (We sometimes write x° instead of 
0x.) The mapping (0, x) 0x from Q x G to G is 
called the toperation of Q on G. If G is an Q- 
group, then any element 0 of Q induces an 
endomorphism 0,:x—0x of G. Conversely, if 
we are given a mapping 0-6, of Q to the set 
of endomorphisms of G, then we may regard G 
as an 9-group. Any group may be regarded as 
an Q-group with Q equal to the empty set or 
to the set consisting of the identity automor- 
phism of G. Thus the general theory of groups 
can be extended to the theory of groups with 
operator domain, and in some cases effective 
use of suitable operator domains can be fruit- 
ful in the investigation of the properties of 
groups themselves. (— 2 Abelian Groups; 277 
Modules). 

A subgroup H of an Q-group G is called an 
Q-subgroup (or admissible subgroup) if 0x € H 
for any 0EQ and xeH. In this case, H is also 
an Q-group. If an equivalence relation R de- 
fined in G is compatible with the multiplica- 
tion and also compatible with the operators, 
namely, if xRx’ implies (8x) R(x’) for any 
EQ, then the quotient group G/R is also an 
Q-group. The equivalence class containing e is 
an admissible normal subgroup. Conversely, if 
H is an admissible normal subgroup, then the 
equivalence relation defined by H is compat- 
ible with the operators, and the factor group 
G/H is an Q-group. A homomorphism f:G 
->G of an Q-group G to an Q-group G’ is 
called an Q-homomorphism (admissible homo- 
morphism or operator homomorphism) if {(0x) 
= Of(x) for any 0€Q and xeG. If f is an iso- 
morphism, then f is called an Q-isomorphism 
(admissible isomorphism or operator isomor- 
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phism). We have the homomorphism theorem 
and the isomorphism theorems of Q-groups if 
we consider only admissible subgroups and 
admissible homomorphisms. 


F. Sequences of Subgroups 


Let H,, H,... be an infinite sequence of 
(normal) subgroups of a group G. If H; $ H;., 
(i= 1, 2,...), then the sequence is called an 
ascending chain of (normal) subgroups. If H, 
ZH;,, (i=1,2,...), then it is called a descend- 
ing chain of (normal) subgroups. If there is no 
ascending (or descending) chain of (normal) 
subgroups of G, we say that G satisfies the 
ascending (or descending) chain condition for 
(normal) subgroups. These conditions are the 
same as the ascending (or descending) chain 
condition in the ordered set of all (normal) 
subgroups of G (— 311 Ordering C). A group 
G satisfies the ascending chain condition for 
subgroups if and only if every subgroup of 

G is finitely generated. Also, for groups with 
operator domain we have similar results. The 
structure of Abelian groups satisfying the 
ascending (descending) chain condition is 
completely determined (— 2 Abelian Groups). 
It is not known whether there is an infinite 
group satisfying both the ascending and de- 
scending chain conditions for subgroups. A 
group satisfying the descending chain con- 
dition for subgroups has no element of infinite 
order, but the converse is not true. There is an 
infinite group which is finitely generated and 
has no element of infinite order (— 161 Free 
Groups C). 


G. Normal Chains 


A finite sequence G=Gy>G,>G,>...5G, 

= {e} of subgroups of a group G is called a 
normal chain if G; is a normal subgroup of G,_, 
for i=1,2,...,r. We call r the length of the 
chain. The sequence G)/G,, G,/G,,...,G,_,/G, 
is called the sequence of factor groups of the 
normal chain. A normal chain G=H,>H,> 
H,>...>H,= {e} is called a refinement of 

the chain G=G)>G,>...>G,={e} if every G; 
appears in this chain. Two normal chains with 
the same length are called isomorphic if there 
is a One-to-one correspondence between their 
sequences of factor groups such that corre- 
sponding factor groups are isomorphic. Any 
two normal chains have refinements which are 
isomorphic to each other (Schreier’s refine- 
ment theorem). A normal chain is called a 
composition series (or Jordan-Hdélder sequence) 
if it consists of different subgroups of G and in 
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any proper refinement there appear two suc- 
cessive subgroups which are the same. The 
sequence of factor groups of a composition 
series is called a composition factor series, and 
the factor groups appearing in this series are 
called composition factors. Any composition 
factor is a simple group. As a direct conse- 
quence of the refinement theorem we see that if 
a group G has a composition series, then the 
composition factor series is unique up to iso- 
morphism and the ordering of the factors. 
(This theorem is due to O. Holder. C. Jordan 
proved that if G is a finite group, then the 

set of orders of composition factors is inde- 
pendent of the choice of composition series. 
Hence we call the theorem the Jordan-Hélder 
theorem.) 

For an Q-group G, if we consider only Q- 
subgroups, we have definitions and theorems 
similar to those in this section. When we take 
the group of inner automorphisms of G as Q, 
then a composition series of the Q-group G is 
called a principal series. If we take the group of 
automorphisms of G as Q, then a composition 
series is called a characteristic series. An in- 
finite group G does not always have a compo- 
sition series. Even if G has a composition 
series, G may have an infinite normal chain 
G,<G,¢...<G such that each G; is a normal 
subgroup of G,,, and |_) G,=G. In fact, there is 
a simple group which has such an infinite 
normal chain (P. Hall). Two groups which 
have isomorphic composition series are not 
necessarily isomorphic. A subgroup of a group 
G is called a subnormal subgroup of G if it may 
appear in some normal chain. The intersection 
of two subnormal subgroups is also sub- 
normal, but their join (i.e., the subgroup gen- 
erated by both of them) is not necessarily 
subnormal in an infinite group. The set of 
subgroups and the set of normal subgroups of 
a group form ‘lattices with respect to the inclu- 
sion relation (for the relationship between 
these lattices and the group structure — [8]). 


H. Commutator Subgroups 


Given two elements a and b of a group G, we 
call a’'b~'ab=[a,b] the commutator of a and 
b. The subgroup C generated by all commu- 
tators in G is called the commutator subgroup 
(or derived group) of G. The subgroup C is a 
normal subgroup of G, and the factor group 
G/C is Abelian. On the other hand, if Bis a 
normal subgroup of G and G/B is Abelian, then 
B contains C. For two subsets A, B of G, the 
subgroup generated by the commutators [a,b] 
(ae A, be B) is called the commutator group of 
A and B and is denoted by [A, B]. If A and B 
are normal subgroups of G, then C’=[A, B] is 
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also a normal subgroup of G, and A/C’ com- 
mutes with B/C’ elementwise in the factor 
group G/C’. Furthermore, [A, B] is the mini- 
mal normal subgroup with the property. The 
subgroup [G, G] is the commutator subgroup 
of G. 

If the commutator subgroup of G is Abelian, 
then G is called a meta-Abelian group. If a 
group G has a normal chain G(=G))>G,> 
G,(= {e}) of length 2 and the factor groups 
G/G,, G,/G, are Abelian, then G is meta- 
Abelian. Meta-Abelian groups are special 
cases of solvable groups, discussed in Section I. 


I. Solvable Groups 


Suppose that we are given a series of subgroups 
G; i=0, 1,2,...) of G such that G = Go and 
LG;,G,]=G;,,,. Then we have a normal chain 
G=G)>G,>G,>.... If G.={e! for some 

r, then G is called a solvable group. For the 
normal chain G (= Gp) >G, >...G, (= {e}) the 
factor groups G,/G,,, (i=0,1,...,r—1) are all 
Abelian. A finite group G is solvable if and 
only if G has a composition series G = Họ > 
H,>H,>...>H,={e} such that the factor 
groups H,/H,,,; (i=0,1,...,s—1) are all of 
prime order. An tirreducible algebraic equa- 
tion over a field of tcharacteristic 0 is solvable 
by radicals if and only if its Galois group is 
solvable (— 172 Galois Theory). 


J. Nilpotent Groups 


The sequence of subgroups G=G,)>G,> 
G>... defined inductively by setting G, = 

[G, G,_,] (r=1,2,...) is called the lower 
central series of G. If G,= {e} for some n, then 
G is called a nilpotent group, and the least 
number n with G, = {e} is called the class of the 
nilpotent group G. A nilpotent group ts solv- 
able. Let Z, be the center of G, Z,/Z, be the 
center of G/Z,, and so on. Then we have a 
sequence of subgroups Zo = {e} CZ, CZ, 0..., 
called the upper central series of G. A group G 
is nilpotent if and only if Z„= G for some m, 
and the least number m with Z,,=G is the 
class of G. For the subgroups G, and Z, (r= 
1,2,...), we have [G,_,;,Z,]={e}. If Gisa 

*Lie group, then G is nilpotent if and only if 
the corresponding *Lie algebra g is nilpotent, 
i.e., g"=0. 


K. Infinite Solvable Groups 


The concepts of solvability and nilpotency are 
generalized in several ways for infinite groups. 
For instance, a group G is called a generalized 
solvable group if any homomorphic image of G 
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which is unequal to {e} contains an Abelian 
normal subgroup unequal to {e}, and G is 
called a generalized nilpotent group if any 
homomorphic image (+ {e}) of G has center 
unequal to {e}. These definitions coincide with 
the previous ones for finite groups but not for 
infinite groups [7]. 


L. Direct Products 


Let G,,...,G, be a finite number of groups. 
The set G of all elements (x,, ...,x,,) with x,EG, 
(i=1,...,n) is a group if we define the prod- 
uct of two elements x =(x,,...,x,) and y= 
(Yis, Yn) tO be xy =(X1 Yis -+3 Xn ¥,). WE 

call G the direct product of groups G4, ..., G, 
and write G=G, x... x G,. If e, is the identity 
element of G,, then e=(e,,...,e,) is the identity 
element of G. The mapping (x,,...,X,)—X; 
from G to G; is a surjective homomorphism, 
called the canonical surjection. The subgroup 
H;={(e;,... .5€,)|X;EG;} is 
isomorphic to G;. The subgroups H; (i= 
1,...,n) satisfy the following conditions: (i) H; 
is a normal subgroup of G. (ii) H; commutes 
with H, elementwise if ij. (iii) Any element of 
G can be written uniquely as the product of 
elements of H,,...,H,,. Conversely, if a group 
G has subgroups H,,..., H, satisfying these 
three conditions, then G is isomorphic to H, 
x... X H,. In this case we also write G= H, 
x... X H,, and we call this a direct decompo- 
sition of G. Each H; is called a direct factor of 
G. Conditions (i), (it), and (iii) are equivalent 

to condition (i), (i) G = H4 H, ... H„, and 

H, ... H;i- N H;= {e} (i=2,...,n). 

A group G is called indecomposable if G 
cannot be decomposed into the direct product 
of two subgroups unequal to {e}, and com- 
pletely reducible if G is the direct product of 
simple groups. If G satisfies the ascending or 
descending chain condition for normal sub- 
groups, then G can be decomposed into the 
direct product of indecomposable groups. 
Such a decomposition is not unique in general, 
but if G has two direct product decompo- 
sitions G=G, x... X Ga= H, x... x H,, where 
G; and H, are indecomposable and not equal 
to {e}, then m=n and the factors G; are iso- 
morphic to the factors H, for some j; moreover, 
if G, corresponds, say, with H,, then we have 
G=H,xG,x...xG,,. This fact was first 
stated by J. H. M. Wedderburn, and a com- 
plete proof of the theorem was given by R. 
Remak and O. Schmidt. Later W. Krull ex- 
tended it to more general groups (with opera- 
tor domain), and we call it the Krull-Remak- 
Schmidt theorem. O. Ore formulated it as a 
theorem on tmodular lattices. 

For an infinite number of groups G, (AeA) 
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we define the direct product J],., G; of these 
groups similarly. The set of all elements 
(...,X),...}(x,€G,) such that almost all x, (e., 
all except a finite number of 4) are identity 
elements is a subgroup of the direct product, 
called the direct sum (or restricted direct prod- 
uct) of {G;}. 


M. Free Products 


Given a family of groups {G,},.,, we define 
the most general group G generated by these 
groups, called the free product of {G,}, to- 
gether with canonical injections f,:G,-G. 

Let S be the disjoint union of the sets 
{G,},-q, and regard G, as a subset of S. A 
word is either void or a finite sequence a,, 
a,,...,a, of elements of S, and we denote the 
set of all words by W. The product of two 
words w and w is defined by connecting w 
with w’ so that the tassociative law holds. We 
write w>w’ when two words w and w satisfy 
one of the following two conditions: (i) The 
word w has successive members a, b which 
belong to the same group G,, and the word w’ 
is obtained from w by replacing a, b by the 
product ab. (ii) Some member of w is an iden- 
tity element, and w’ is the word obtained from 
w by deleting this member. For two words w 
and w’, we write w=w’ if there is a finite se- 
quence of words w=Wo, W,,...,W,=W such 
that for each i (1 <i<n), either w,_, >w; or 
w,>w;,_,. This relation is an equivalence rela- 
tion and is compatible with the multiplica- 
tion. Thus we may define a multiplication for 
the quotient set G of W by this equivalence 
relation, and then G is a group whose identity 
element is the equivalence class containing the 
void word. Any xeG, is regarded as a word, 
and we have an injective homomorphism 
f,:G,->G by assigning the corresponding class 
to each element of G,. The group G is called 
the free product of the system of groups 
{Gita and f} is called the canonical injection. 
The free product G of {G,},., is characterized 
by the following universal property: Given a 
group G’ and homomorphisms f}: GG 
(AeA), we can find a unique homomorphism 
g:G—G’ such that go f, =f. The free product 
is the dual concept of direct product and is also 
called the tcoproduct (— 52 Categories and 
Functors). If each G, is an infinite cyclic group 
generated by a,, then the free product of the 
G; is the free group generated by {a,} (— 161 
Free Groups). 

The concept of free product is generalized in 
the following way. Let H be a fixed group. We 
consider the family of pairs (G,j), where G is a 
group and j: H >G is an injective homomor- 
phism. A homomorphism of pairs f :(G,j)—> 
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(G’,j’) is defined to be a group homomorphism 
f:G—G' such that foj=,’. For a given family 
of pairs {(G,,j,)}, we have the amalgamated 
product (G, j) of the family and the canonical 
homomorphism f, :(G,,j,)>(G,j), which is 
characterized by the following universal prop- 
erty: Given a pair (G', j’) and homomorphisms 
SiA(G,,j,) (Gj), we have a unique homo- 
morphism g:(G,j)—(G’,j’) such that go f, = 
fj. If H = {e}, then the amalgamated prod- 

uct is the same as the free product. Now f, is 
an injection. If we regard G, as a subgroup of 
G, then G is generated by the subgroups G, 
and G,NG,=j,(H)=j,(H) Q+ p). 

The notion of the amalgamated product is 
useful in constructing groups with interesting 
properties. For instance, we have a group 
whose nonidentity elements are all conjugate 
(B. H. Neumann and G. Higman), and a group 
generated by a finite number of elements such 
that its homomorphic image (# {e}) is always 
an infinite group (Higman) so that we have an 
infinite simple group generated by a finite 
number of elements. 


N. Extensions 


Let N and F be groups. A group G is called an 
extension of F by N if G has a normal sub- 
group N isomorphic to N and G/N = F. The 
problem of finding all extensions was solved 
by Schreier (Monatsh. Math. Phys., 34 (1926); 
Abh. Math. Sem. Univ. Hamburg, 4 (1928)). 
Suppose that (1) to each oe F there corre- 
sponds an automorphism s, of N; (2) there 
exist elements c, (0, TEF) of N such that 
54(5,(€)) =Cg,e(Sye(@))q.! (a€ N); and (3) Cy. <Cos,p 
= Sq(C,,p)€e,c9- Then the set G of all symbols as, 
(ae N,oeéF) is an extension of F by N if we 
define multiplication by as, - bs, =(as,(b)cq, .)Sor- 
In fact, the set of all elements @=ac,\s, (ae N) 
is anormal subgroup N of G such that G/N 
=F. Any extension can be obtained in this 
way. A system (s,,¢,,,) satisfying (1), (2), and (3) 
above is called a factor set belonging to F. 
Two factor sets (s,,c,,,) and (t,,d,,,) are said to 
be associated if there exist elements a, (o€ F) 
of N such that t,(a)=s,(a,aa,') and d, ,= 
d,(S,(4,))Cq..45;- In this case, two extensions 
determined by these factor sets are isomorphic. 
If (s,,C,,,) IS associated with (t,,d, ,) (do, = 1 for 
any o, tE F), then we say that the correspond- 
ing extension is a split extension. In this case, 
the extension G contains a subgroup F iso- 
morphic to F, and G=FN, FON ={e}. We 
call such an extension a semidirect product of 
N and F. 

If N is Abelian, then condition (2) is simply 
s,(s,{a)) =5,,(a), since the only inner automor- 
phism of N is the identity mapping. The con- 
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ditions of associated factor sets and split ex- 
tension are also simplified. If N is contained 
in the center of G, then G is called a central 
extension of N. 


O. Transfers 


Let H be a subgroup of finite index in G and g; 
(i=1,...,h) be representatives of the right 
cosets of H. For be Hg, we write g;=b. Then 
for xeG an element X= H' JI} g;x(g,;x) ! of 
H/H’ is determined uniquely (independent of 
the choice of representatives), where H’ is the 
commutator subgroup of H. The correspon- 
dence G'x-—X yields a homomorphism of G/G' 
to H/R’, which is called the transfer from G/G’ 
to H/H’. 


P. Generalizations 


The concept of group can be generalized in 
several ways. A set S in which a multiplication 
(a, b)—>ab satisfying (ab)c = a(bc) (the associa- 
tive law) is defined is called a semigroup. If S is 
a commutative semigroup in which ax = bx 
implies a=b (the cancellation law), then S can 
be embedded in a group G so that the multi- 
plication in S is preserved in G and any xeG is 
the quotient of two elements of S:x=a™'b= 
ba™ (a, be S). Such a group G is determined 
uniquely by S. We call it the group of quotients 
of S. 

The notion of semigroup is obtained by 
taking only associativity from the group 
axioms. On the other hand, if Q is a set with a 
law of composition (a, b)—>ab which is not 
necessarily associative but satisfies the con- 
dition that any two among a, b, c in the equa- 
tion ab=c determine the third uniquely, then 
Q is called a quasigroup. A quasigroup with an 
identity element e such that ea=ae=a for 
every element a is called a loop. For loops, we 
have an analog of the structure theory of 
groups (R. H. Bruck, Trans. Amer. Math. Soc., 
60 (1946)). 

If we give up the possibility of forming 
products for all pairs of elements or the 
uniqueness of the product in the axioms for 
groups, then we have the following generaliza- 
tions of groups. A set M with multiplication 
under which to any elements a, be M there 
corresponds a nonempty subset ab of M is 
called a hypergroupoid. Moreover, if the asso- 
ciative law (ab)c =a(bc) holds and for any 
elements a, be M there exist x, ye M such 
that bexa, beay, then M is called a hyper- 
group. 

A set M is called a mixed group if (1) M can 
be partitioned into disjoint subsets My, M,, 
M,,...;(2) for ae Mo, be M; (i=0, 1,2,...), 
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elements ab, a\b of M; are defined such that 
a(a\b) =b; (3) for b, ce M;, an element b/c of 
Mo is defined such that (b/c): c =b; and (4) the 
associative law (ab)c =a(bc) (a,be Mọ,cE M) 
holds (A. Loewy, 1927). 

A set M is called a groupoid if (1) M can be 
partitioned into disjoint subsets M,; (i, j = 
1,2,...); (2) for ae M;; and be M,,, an ele- 
ment abe M; is defined; (3) for ae M;; and 
be Mir an element a\be M,, is defined such 
that a(a\b) =b; (4) for ae M;, and be M,,, an 
element a/be M; is defined such that (a/b)b 
=a; and (5) for ae M;j, be Mp, and ce My, the 
associative law a(bc)=(ab)c holds (H. Brandt, 
1926). These generalized concepts also have 
some practical applications (— 2 Abelian 
Groups; 13 Algebraic Groups; 60 Classical 
Groups; 69 Compact Groups; 92 Crystallo- 
graphic Groups; 122 Discontinuous Groups; 
151 Finite Groups; 161 Free Groups; 243 
Lattices; 249 Lie Groups; 277 Modules; 362 
Representations; 422 Topological Abelian 
Groups; 423 Topological Groups; 437 Unitary 
Representations). 


Q. History 


The concept of the group was first introduced 
in the early 19th century, but its rudiments can 
be found in antiquity; in fact, it was virtually 
contained in the concept of motion or trans- 
formation used in ancient geometry. From the 
time it took explicit form in the late 19th cen- 
tury, it has played a fundamental role in all 
fields of mathematics. 

In their study of algebraic equations in the 
late 18th century, J. L. Lagrange, A. T. Van- 
dermonde, and P. Ruffini saw the importance 
of the group of permutations of roots; using 
this idea N. H. Abel showed that a general 
equation of degree > 5 cannot be solved alge- 
braically. A. L. (Cauchy studied the group of 
permutations of roots for its own interest, but 
a complete description of the relationship 
between groups and algebraic equations was 
first given by E. tGalois. C. Jordan developed 
a detailed exposition of the theory given by 
Abel and Galois in his Traité des substitutions 
(1870) [1]. Up to that time, a group meant a 
permutation group; the axiomatic definition of 
a group was given by A. Cayley (1854) and L. 
Kronecker (1870). F. *Klein emphasized the 
significance of group theory in geometry in his 
tErlangen program (1872), and M. S. tLie 
developed the theory of tLie groups in the 
1880s. In 1897, W. Burnside published his 
Theory of groups [3], whose second edition 
(1911) is one of the classics in group theory 
and is still valuable. Since 1896, G. Frobenius 
[2] and others have developed the theory of 
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representation of groups by matrices (— 362 
Representations). By that time, the theory of 
finite groups had acquired all its essential 
features. Among the branches of abstract 
algebra, the theory of groups was the first to 
develop; it led to the progress of abstract alge- 
bra in the 1930s. Since the latter half of that 
decade, the theory of finite groups has been 
developed further; there has been increased 
interest in the theory, and many significant 
results have been obtained, especially since 
1955 (— 151 Finite Groups). 
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Differential geometry studies differentiable 
manifolds and geometric objects or structures 
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on them. Among the geometric structures, the 
Riemannian and complex structures, with their 
contacts with other fields of mathematics and 
with their richness in results, occupy a central 
position in differential geometry. Perhaps it is 
impossible to say precisely what a differential 
geometric structure is or should be. However, 
the notion of G-structure allows a unified 
description of many of the interesting known 
geometric structures, such as Riemannian and 
complex structures. 


A. The Notion of G-Structures 


Let M be an m-dimensional C”-manifold, and 
let t: T(M)—>M be its ttangent bundle. For 
xe M, T,(M)=t '(x) is the vector space of 
tangent vectors at x. Let n: F(M)—M denote 
the fframe bundle of M. It is a GL(m; R)- 
principal bundle on M and F,(M)=7 '(x) is 
the set of linear isomorphisms (called frames at 
x) of R” onto T,(M). Here GL(m; R) is the 
general linear group of R". Hence F,(M) can 
be naturally identified with the set of ordered 
bases of T,(M). Write 0=(0', ...,0”) for the 
canonical form of F(M), which is the R™- 
valued 1-form on F(M) defined by 0(v)= 

p ‘(x,{v)) for any ve T,(F(M)). Given a dif- 
feomorphism f:M =N of M onto another 
C*-manifold N, we can naturally define the 
bundle isomorphism f: F(M)& F(N) by 
SD) =f, OP. 

Let G be a tLie subgroup of GL(m; R). A 
principal G-subbundle z,: PM of the frame 
bundle z: F(M)—M is called a G-structure 
over M. Thus P is a regular submanifold of 
F(M) satisfying the following three conditions: 

(Al) x(P)=M. 

(A2) for peP and oe GL(m;R), poe P if and 
only if oeG. 

(A3) for any xe M, there exist an open 
neighborhood U of x and a C*-mapping 
s: U >P such that z(s(y))=y for any ye U. 
Conversely, if a regular submanifold P of F(M) 
satisfies the above three conditions (A1), (A2), 
and (A3), then tp=7| p: P->M is a G-structure 
over M. When G is closed, then condition (A3) 
is automatically satisfied. The restriction of 0 
onto P is called the canonical form of P and is 
also denoted by 0. For an open subset U of M, 
the restriction P| U=zp'(U) is a G-structure 
over U. 

Let tp:P>M and 7g:Q-N be G-structures 
over M and N, respectively. A diffeomorphism 
f{:M=N is called a G-isomorphism of P onto 
Q if f"(P)=Q. When such an f exists, we say 
that the G-structures P and Q are equivalent. A 
G-isomorphism of P onto itself is called a G- 
automorphism of P. Write Aut(M, P) for the 
group of G-automorphisms of P. 
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Let g be the Lie algebra of G. Since G acts 
on P from the right, we have the natural Lie 
algebra homomorphism ::g—>T (P, T(P)). Here 
T(P, T(P)) is the Lie algebra of C” —vector 
fields on P. Actually z is injective, and we write 
A* for 1(A) (Aeg). We call A* the fundamental 
vector field corresponding to A. 

For xe R”, define po(x)E F(R”) by 
Do(x)((v/))= LP, v4(2/0x’),, where x',...,.x” 
are the natural coordinates of R™. Any open 
subset D of R” has the natural G-structure 
P(D, G) defined by P(D, G)= { po(x): o € F(R”); 
xeD,coeG}. We say that a G-structure zp: 
PM over M is integrable if for each xe M, 
there exists a local coordinate system (U, ọ, D) 
around x such that ọ is a G-isomorphism 
of P|U onto P(D, G). Equivalently, if we set 
p=(x',...,x”), then the frame {(0/0x'),, ..., 
(é/6x™),} belongs to P for each ye U. 


B. Examples 


The following examples (B1)—(B7) show that 
some classical geometric structures can be 
treated uniformly from the viewpoint of G- 
structures. 

(B1) An absolute parallelism on M is, by 
definition, a system {X,,..., X,,} of C” -vector 
fields on M such that at each point xe M 
{X,(x), ..., X,,(x)} is a basis of T(M),. Clearly 
this is equivalent to giving an {e}-structure 
Tp: P—M, where e stands for the identity 
matrix of GL(m; R). Then P is integrable if and 
only if [X;, X;]=0, 1 <i, j<m. In this case, 
Aut(M, P) is a finite-dimensional tLie trans- 
formation group on M. 

(B2) Let E be a k-dimensional ‘distribution, 
namely, E is a k-dimensional vector subbundle 
of T(M). Write R” = R* @ R™ =. Set GL(k, m; R) 
= {¢€GL(m; R); o(R*) <R*} and P={peF(M); 
p(R*)< E}. Then P is a GL(k, m; R)-structure 
over M. This gives a bijective correspondence 
between the k-dimensional distributions on M 
and the GL(k, m; R)-structures on M. More- 
over, P is integrable if and only if E is tin- 
volutive (Frobenius theorem). 

(B3) Let g be a Riemannian metric on M. 
Set O(m)= {0 € GL(m; R);'co =e}. Define P(g) 
by P(g) = {(015 «++. Um)€F(M); g(v;, v) = 6,;}. 
Then P(g) is an O(m)-structure on M. This 
gives a bijective correspondence between the 
Riemannian metrics on M and the O(m)- 
structures over M. Then Aut(M, P(g)) is 
the group of tisometries of g, and a finite- 
dimensional Lie transformation group on M. 
Moreover, P(g) is integrable if and only if g is 
flat in the sense that the Riemannian curvature 
tensor of g is zero. 

(B4) Two Riemannian metrics g, and g, are 
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called conformally equivalent if there exists a 
C®-function p >0 with g, = pg. This is an 
equivalence relation among the Riemannian 
metrics on M. An equivalence class {g} is 
called a conformal structure on M. Set CO(m) 
={aA, AcO(m),aeR—{0}}. Define P({g}) 
by P({g})={(1, sq) €F(M); g(01, 0) = 
ad,;,aeR— {0}}. Then P({g}) is a CO(m)- 
structure on M. This gives a bijective cor- 
respondence between the conformal struc- 
tures on M and the CO(m)-structures over 

M. Then Aut(M, P({g})) is the group of tcon- 
formal transformations of {g} and is a finite- 
dimensional Lie transformation group. More- 
over P({g}) is integrable if and only if the 
conformal structure {g} is conformally flat in 
the sense that at each point xe M, there exists 
a local coordinate system (x',...,x™) around x 
such that g(0/0x', 6/dx4) = pô; where p is a 
positive function. 

(B5) Assume that m is even, say m= 21. By an 
almost symplectic structure on M, we mean a 
differential 2-form Q of maximal rank, i.e., 
QA... AQ (l times) never vanishes. Let A be 
the standard skew-symmetric bilinear form on 
R” defined by A((u’), (v) =(u! v? —u2v')4... 
+(u?!~! y?!— y?!p?!“1), The symplectic group 
Sp(l; R) is defined by Sp(i; R)= {oe GL(21; R); 
A(ou, ov) = A(u, v), u, ve R?”}. Set P(Q)= 
{pe F(M); p*A=Q,,x=2p(p)}. Then P(Q) is 
an Sp(/;R)-structure over M. This gives a 
bijective correspondence between the almost 
symplectic structures on M and the Sp(I; R)- 
structures over M. Then Aut(M, P(Q)) is the 
group of symplectic transformations of Q, 
which is never a finite-dimensional Lie trans- 
formation group. Moreover P(Q) is integrable 
if and only if Q is a symplectic structure, i.e., 
dQ =0. 

(B6) Assume that m is even, say m=21. We 
identify C’ with R” as a real vector space. 
Then GL(m; C) is a Lie subgroup of GL(2/; R). 
Let J be an almost complex structure on M. 
Set P(J)= {pe F(M); Jp(u) = p(iu), ue R” =C'}. 
Then P(J) is a GL(/; C)-structure on M. This 
gives a bijective correspondence between the 
almost complex structures on M and the 
GL(l; C)-structures over M. Then Aut(M, P(J)) 
is the group of J-analytic transformations 
on M. Furthermore, Aut(M, P(J)) is a finite- 
dimensional Lie transformation group when 
M is compact. Moreover P(J) is integrable if 
and only if J is a complex structure. 

(B7) It is easy to see that there is a bijec- 
tive correspondence between the SL(m; R)- 
structures over M and the set of volume ele- 
ments on M. Any SL(m;R)-structure over M is 
always integrable and Aut(M, P) is the group 
of diffeomorphisms preserving Q, which is 
never a finite-dimensional Lie transformation 
group. 
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C. Structure Functions 


Let G be a Lie subgroup of GL(m;R) and g the 
Lie algebra of G. For convenience, we write V 
for R” and V* for the dual space of V. Then 
V @ A?V* can be considered as the space of 
skew-symmetric bilinear mappings from V x V 
into V, and g ® V* can be identified with the 
space of linear mappings from V into g. Define 
a linear mapping 0:g Q V*>V @ A?V* by 
(6T)(u, v) = —T(u)v+ T(v)u 

for Teg®& V*, uver. 


Put 
H?'(g)=V@ A?V*/d(g @ V*). 


Let tp: P—M be a G-structure over M and 
@ the canonical form of P. Take any frame p in 
P. An m-dimensional linear subspace H of 
T,(P) is said to be a horizontal subspace of 
P at pif 0,: H —>V(=R”) is a linear isomor- 
phism. Then we write fy for 0, ': V-H. Write 
H(T,(P)) for the set of horizontal subspaces of 
P at p. For He 9(T,(P)), define c(p, HEV © 
A2V* by 


c(p, H)(u, v)=dO( falu), fulv)) for 
Then we have 
c(p, H,) < c(p, H,)e o(g & V*) 

for H,,H,€(T,{(P)). 


u,veV. 


Therefore we have a well-defined element 
c(p)€ H> + (g) by c(p) = {e(p, H)}. We call the 
mapping c: P— H?” ' (g) the structure function of 
the G-structure zp: P>M. 

Let Q>N be another G-structure over N 
and f: MN bea G-isomorphism of P onto 
Q. Then we have c( f(p))=c(p) for peP. In 
particular, it is easily seen that if P is inte- 
grable, then c=0. However, the converse is not 
true in general. 


D. Prolongation of Linear Lie Algebras and 
Groups 


Let K denote either R or C. Let g be a Lie 
subalgebra of gl(m; K). For k=0,1,2,..., let g, 
be the space of K-symmetric multilinear 
mappings 

t:K” x... x K”—>K” 


—— 
(k+ 1) times 


such that for each fixed u}, ..., u€ K”, the 


linear transformation 
veK”=>t(v, u, ... u EK” 


belongs to g. In particular, go =g. We call g, 
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the kth prolongation of g. If g, =0, then g,,, 
=, 42=.-..=0. The first integer k such that g, 
=0 is called the order of g. If g, #0 for all k, 
then g is said to be of infinite type. When g 
contains a matrix of rank 1 as an element, then 
g is of infinite type. When K =C, the converse 
is also true [3]. However, when K=R, the 
converse is not true in general. In fact, if we 
consider gl(l; C) as a real Lie subalgebra of 
gl(2/; R) (example (B6)), then gl(/; C) is of in- 
finite type, having no matrix of rank 1 in it. In 
general, a Lie subalgebra g of gl(m; R) is said 
to be of elliptic type if g contains no matrix of 
rank 1 as an element. We consider g; as an 
Abelian Lie subalgebra of gl(K” @ g) by the 
correspondence teg; tegl(K™ @ g), where t 
is defined by 


t(u)=t(-,u) for ueK", 


t(A)=0 for Aeg. 


More generally, we consider g,,, to be an 
Abelian Lie subalgebra of gl(K” 8 g,@...@ 
g,) by virtue of the correspondence téq,,,+> 
tegl(K"™ @ qo ®... ® gı), where t is defined by 


t(u)=t(-,...,°,Weg, for uek”, 


AED... Bg. 


Identifying gl((K” ® go ® ... ® 9,-1) ® g,) with 
al(K™ ® go ® ... ® g,), we have the important 
identity 


(D1) (G01 =Sks15 


Let G be a Lie subgroup of GL(m; R) and g 
the Lie subalgebra of gl(m; R) corresponding to 
G. For k=1,2,..., let G, be the connected 
Abelian Lie subgroup of GLAR” @ go @... ® 
9-1) corresponding to the Abelian Lie sub- 
algebra g,; namely, G, consists of the linear 


mappings o(t) (te g) defined by 


t(A)=0 for 


k=0, 1, 2,.... 


o(t)(u)=ut+t(-,...,°,u) for ueR”, 


a(t)(AJ=A_ for AEgo®@... Bay-1- 
We call G, the kth prolongation of G. We say 
that G is of finite (resp. infinite) type if g is of 


finite (resp. infinite) type. We say that G is of 
elliptic type if g is. From (D1), we have 


(D2) (Gi) =Gert- 


We know all irreducible Lie subalgebras g of 
gl(m; K) with order >2 [3,6]. 

(D3) An irreducible Lie subalgebra g of 
gl(m; C) is of infinite type if and only if g is one 
of 


gl(m; C), si(m; C), coo(c), <0( 0). 


where csp(m/2; C)=C @ sp(m/2, C). 
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(D4) An irreducible Lie algebra g of gl(m; R) 
is of infinite type if and only if g is one of 


fe). ale mC 
g 2? ’ S$ 2° > csp 4’ > 
m 
—;C ’ 
a(€) 


gl(m;R), sl(m; R), sp(Z:R), so( SR), 


where csp(m/2; R)=R © sp(m/2; R). 


(D5) Let S be an m-dimensional irreducible 
Hermitian symmetric space of compact type, 
which is different from the complex projective 
space. Let L(S) be the identity connected com- 
ponent of the group of biholomorphic trans- 
formations of S. Take any point o€S and set 
Lo(S)= {oe L(S); a(0)=0}. Let g(S) be the 
linear isotropy Lie subalgebra of gl(m; C) cor- 
responding to L(S). Then g(S) is an irreduc- 
ible Lie subalgebra of gl(m; C) of order 2. Con- 
versely, every irreducible Lie subalgebra g of 
gl(m; C) of finite type with order > 2 is equal to 
g(S) as given above. 

(D6) Let S be an m-dimensional irreducible 
symmetric space of compact type. Assume that 
S admits a finite-dimensional connected Lie 
transformation group L(S) which strictly con- 
tains the connected component of the group of 
isometries of S. Take any point o€S, and set 
L(S)= {a € L(S); (0) =0}. Let g(S) be the 
linear isotropy Lie subalgebra of gl(m; R) cor- 
responding to L,(S). If (S, L(S)) ¢(sphere, pro- 
jective transformation group) or (S, L(S))# 
(complex projective space, complex projective 
transformation group), then g(S) is an irre- 
ducible Lie subalgebra of gl(m; R) of order 2. 
Conversely, every irreducible Lie subalgebra 
g of gl(m; R) of finite type with order > 2 is 
equal to g(s) as given above. For example, if 
we take an m-dimensional sphere as S and the 
group of conformal transformations of S as 
L(S), then g(s)=co(m). 


E. Prolongation of G-Structures 


Let G be a Lie subgroup of GL(m; R) and let g 
be the Lie subalgebra of gl(m; R) correspond- 
ing to G. We choose once and for all a linear 
subspace C of V® A? V* such that 


VA /\2V*=A(g@V*)@C. 


In general there is no natural way of choosing 
such a C. 

Let np: P—M be a G-structure on M. For 
each horizoantal subspace H of P at p (Le., 
HeO(T,(P)), we define the frame (p, H)e F,(P) 
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to be a linear isomorphism (p, H): R” @ g> 
T,(P) given by 


(p, H)(v ® A)= falv) D A* for veR”, Aeg, 


where A* is the fundamental vector field on P 
induced from Aeg. Let P, = {(p, H)e F(P); 
He HXA(T,(P)), c(p, H)e C}. Then P,>P isa 
G,-structure over P. We call P, >P the first 
prolongation of P. The kth prolongation P, of 
P is defined inductively by P,=(P,_,), =the 
first prolongation of P,_,. From (D2), P, is a 
G,-structure over P,_,. 

Take any feAut(M, P). It can be proved 
that fe Aut(P, P,). Since P, =(P,_,),, we can 
define fe Aut(P,_;, P) inductively by f = 
(JPV. By the correspondence f f™, we 
can consider Aut(M, P) as a subgroup of 
Aut(P,_,, P,). 

If G is connected, then 


(E1) Aut(M, P)=Aut(P,_,,P,), k=1,2,.... 


Let Q>N be another G-structure. By a similar 
argument to that above, we see that P>M 
and Q—>N are equivalent if and only if P,—> 
P,-; and Q,->Q,-_, are equivalent. In [1], E. 
Cartan studied general equivalence problems 
of two G-structures. In that problem the above 
prolongation procedure plays an important 
role. 


F. Automorphisms of G-Structures of Finite 
Type 


S. Kobayashi proved the following: 

(F1) Theorem [4]. Let P>M be an {e}- 
structure over M. Then Aut(M, P) is a finite- 
dimensional Lie transformation group on 
M such that dim Aut(M, P)<dim M. More 
precisely, for any point xe M, the mapping 
o€Aut(M, P)t+a(x)eM is injective, and its 
image {ø(x); c€ Aut(M, P)} is a closed sub- 
manifold of M. The submanifold structure on 
this image makes Aut(M, P) into a finite- 
dimensional Lie transformation group on M. 

Now let G be a Lie subgroup of GL(m; R) 
of finite type, say G, = {e}. Then P,>P,_, 
is an {e}-structure. From theorem (F1) 
above, it follows that Aut(P,_,, P) is a finite- 
dimensional Lie transformation group. As 
explained in Section E, Aut(M, P) is a sub- 
group of Aut(P,_,, P,). Clearly Aut(M, P) is 
closed in Aut(P,_,, P,). Since every closed 
subgroup of a Lie group is again a Lie group, 
we have the following: 

(F2) Theorem. Let P>M be a G-structure 
over M. If G is of finite type, then Aut(M, P) is 
a Lie transformation group of dimension 
<dim(R” ®g@q, @... Pg,-}). 
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G. Automorphisms of G-Structures of Elliptic 
Type 


The following theorem of R. Palais allows us 
to prove that the automorphism groups of 
many general geometric structures are finite- 
dimensional Lie transformation groups. 

(G1) Theorem [5]. Let L be a group of C”- 
transformations of a C*®-manifold M. Let I be 
the set of all vector fields X on M which gener- 
ate global 1-parameter groups g,=exptX of 
transformations of M such that 9,¢ L. If the 
set I generates a finite-dimensional Lie alge- 
bra of vector fields on M, then L is a finite- 
dimensional Lie transformation group, and | is 
the Lie algebra of L. 

Let P>M be a G-structure over M. For any 
representation p:G-—>GL(V), we write E(p) for 
the associated vector bundle of P with respect 
to p. Let p,:G—GL(gl(m; R)) be the represen- 
tation defined by p,(¢)A=aAo ' for ceG, 
Aegl(m; R). Let p, :G—GL(g) be the represen- 
tation defined by p,(a)A=aAo | for cEG, 
Aeg. Then p, is a subrepresentation of p,, 
since p,(o)A=p,(a)A for ceG, Aeg. We re- 
mark that E(p,)=Hom(T(M), T(M)). We 
write g(M) for E(p,). Then g(M) is a vector 
subbundie of Hom(T(M), T(M)). We write F 
for the quotient vector bundle Hom(T(M), 
T(M))/g(M). Let ®: Hom(T(M), T(M))— F be 
the natural projection. We fix an affine con- 
nection V on M which preserves P. We write 
T for the torsion tensor field of V. For each 
vector field X er(M, T(M)), we define a C®- 
section VX + T(X,:) in '(M, Hom(T(M), T(M)) 
by 


ue T(M),t>V,X + T(X(x), ule T,(M). 


Then define a first-order linear differential 
operator 


D:T(M, T(M))>T(M, F) 


by D(X)=®(VX + T(X,:)). It is easy to see 
that 


(G2) I(M, P)ckerD, 


where I(M, P) is the Lie algebra of all vector 
fields X €I'(M, T(M)) which generate global 
1-parameter groups y,=exptX of transforma- 
tions of M such that p,e Aut(M, P). Then we 
can show: 

(G3) D is an elliptic operator if and only if G 
is of elliptic type. 

Now suppose M is compact. Then from the 
standard fact on linear elliptic operators on 
compact manifolds, we know the dimension of 
ker D is finite if D is elliptic. Thus from (G2), 
(G3), and theorem (G1), we have the following: 

(G4) Theorem [7]. Let P> M be a G- 
structure on an m-dimensional compact mani- 
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fold M. If G is of elliptic type, then Aut(M, P) 
is a finite-dimensional Lie transformation 
group. 


H. Local Equivalence Problem 


Let tp:P>M and mg:Q-N be two G- 
structures. We say that P>M and Q>N 
are locally equivalent if the following holds: 

(H1) For arbitrarily given (p, g)e P x Q, there 
exists a local G-isomorphism f:U =V of P|U 
onto Q|V such that f)(p)=q, where U (resp. 
V) is an open neighborhood of zp(p) (resp. 
rola). 

From now on, we assume that the structure 
function cp (resp. Cg) of P (resp. Q) is constant. 
If such an f as in (H1) exists, we must have 
col fP(p))=c,(p') for p'e P| U. Therefore cp 
and Cg must be the same constant. Cartan 
reduced the local equivalence problem be- 
tween P and Q to that of certain differential 
systems on P x Q. In fact, let 6p =(0',..., 0”) 
and 0)=(w’,...,w”) be the canonical forms of 
P and Q respectively. Let a: Px OP, B:P x 
Q- 0 be the canonical projections such that 
a(p,q)=p, B(p, q)=q. Let X be the differen- 
tial system on P x Q generated by {a*6' — 
Beyp',...,0%0"™—Payp™). IfX is the m- 
dimensional regular submanifold of P x Q 
given by the graph of f" in (H1), i.e., X = 
{(p', f(p))eP x Q; p'e P| U}, then X is an 
m-dimensional integral submanifold of X 
satisfying the following conditions: 


(H2) (p,qeX, 


(H3) «a*6',...,0*60” are linearly indepen- 
dent on T,,, ,,(X). 


(p.a) 


Conversely any m-dimensional integral sub- 
manifold of X satisfying (H2) and (H3) is the 
graph of a local G-isomorphism f required in 
(H1). Therefore the local equivalence problem 
between P and Q is equivalent to the problem 
of finding an m-dimensional integral submani- 
fold X of £ satisfying (H2) and (H3) for any 
(p,gyeP xQ. 

We say that G is involutive if there exist 
linear subspaces 0= he V c... cV,,=R”™ 
such that 


(H4) dimV,=k (k=0,...,m), 
(H5) dimg,=} dimg(V,), 
k=0 


where g is the Lie algebra of G and g(V,)= 
{Aeg; A(u)=0 for ue V,}. Now we have: 
(H6) Proposition. Let PM and Q>N be 
two G-structures such that their structure 
functions are constant and equal. The dif- 
ferential system X on P x Q defined as above 
is involutive at any m-dimensional integral 
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element E of X on which x*8!,..., a0" 
are linearly independent if and only if G is 
involutive. 

Combining proposition (H6) above with the 
classical tCartan-Kahler theorem, we obtain 
the following theorems: 

(H7) Theorem ([1], also see [8]). Let P>M 
and Q-N be two real analytic G-structures 
over M and N, respectively, such that their 
structure functions are constant and equal. 
Suppose G is involutive. Then PM and 
Q-N are locally equivalent. 

(H8) Theorem. Let PM be a real analytic 
G-structure over M. Suppose G is involutive. 
Then P is integrable if and only if the structure 
function of P is zero everywhere. 

Finally, we remark that for a general Lie 
subgroup G c GL(m, R), G, is involutive for 
k>ky [8]. 


I. Cartan-Kahler Theorem on Differential 
Systems 


Let U be an open subset of R”. We write 
A*(U) for the space of real analytic differential 
k-forms on U. Put A*(U)=A°(U)@...® 
A™(U). Then A*(U) is a graded R-algebra 
with respect to the usual exterior product. By 
a differential system on U we mean an ideal 
» of A*(U) such that 


1) X=A(UINT@AU(U)NNIG... 
®=NA"(U), 

(12) d(2)c 2, 

(13) 2 is finitely generated as an ideal. 


For convenience we write 2 = YM A*(U). 
When 2°) = {0}, we call X a restricted dif- 
ferential system. 

Let 2 be a differential system on U c R”. By 
a k-dimensional integral manifold of 2, we 
mean a k-dimensional regular real analytic 
submanifold of U such that for any «e X, we 
have a| X =0. In the above definition, it is 
sufficient to know that «| X =0 for any ce 2™. 

For xe U, we write G,(T,(U)) for the set of 
all k-dimensional linear subspaces of T,.(U). 
Set G,(T(U)) = (<u G(L,(U)) (disjoint union); 
then G,(T(U)) is naturally a real analytic 
manifold of dimension m +m(m — k). Let X 
be a differential system on U. A k-plane Ee 
G,{T,(U)) is called a k-dimensional integral ele- 
ment of X at x if for any xe 2), we have a| E 
=0. Thus X is a k-dimensional integral sub- 
manifold if and only if T,.(X) is a k-dimensional 
integral element of X for each xe X. We write 
I,(2) for the set of k-dimensional integral ele- 
ments of X. In general /,(2) ts a real analytic 
subset of G,(T(U)). For any k-dimensional 


pale; 


integral element E at xe U, we define the polar 
space H(E) by 


H(E)= (ue T,(U); a(u, v1, ...,0,) =0 
for ye Z" D v,,...,0,EE}. 


Then Ec H(E). Moreover FeG,.,(T,(U)) 

with F>E isin J,,,(2) if and only if F c H(E). 
Let E be a k-dimensional integral element of 
X. We choose ọ!,..., øte A! (U) such that 
o'|E,...,@*|E are linearly independent. Define 
U(p*,...,0") by 


Uo',....p={FeG,(T(U)) p'|F, ..., p*|F 
linearly independent}. 
Then X(t, ...,p*) is an open neighborhood 


of E in G,(T(U)). For any element Fe lot, 

..,@*), define F}, ..., FLE F by o'(F,)=6). Thus 
{F,,...,F,} is the dual basis of o'|F,..., o*|F. 
Any element «e 2“) defines a real analytic 
function a, on %(o',..., 0") by a (F)=a(F,, 
..., F). We call E regular if there exist «!,..., 
x'e X™ and an open neighborhood ¥ of E in 
U(p',...,@*) such that 


(14) IMENI ={FeV:01(F)=...=a!(F) 
=0}, 


(15) dal,...,da} are linearly independent 
ony, 


(I6) dim H(F) is constant for any Fey. 


We say that Ee/,(2) is an ordinary element if 
E contains a (k —1)-dimensional regular in- 
tegral element. 

The following is the well-known theorem 
due to Cartan and Kahler [7]. 

(17) Theorem. Let X be a restricted dif- 
ferential system on UCR”. Let X bea k- 
dimensional integral submanifold of Z. Sup- 
pose that T,(X) is regular for a point xe X and 
that there exists a (k + 1)-dimensional integral 
element F at x with F> T,(X). Then there 
exists a (k + 1)-dimensional integral submani- 
fold Y of X such that xe Y, T,(Y)=F, and 
X <Y ina neighborhood of x. 

We say that X is involutive at Ee I,(2) if 
there exist 0=E°CE1c...cE*! CE such 
that E/ is a j-dimensional regular integral 
element of X (j=0,1,...,k—1). Applying 
theorem (G7) inductively, we obtain: 

Corollary. Let X be a restricted differential 
system on UCR” If E is a k-dimensional 
involutive integral element of X at x, there 
exists a k-dimensional integral submanifold of 
& such that xe X and T.(X)=E. 

In view of this corollary, it is important to 
know when & is involutive at Ee (X). 

(18) Lemma [9}. Let X be a restricted dif- 
ferential system on U cR”, and E be a k- 
dimensional integral element of £X. Then S is 
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involutive at E if and only if there exist 0 = E° 
cE'c...cE*' cE such that R(X) is a sub- 
manifold of G,(T(U)) near E with dimension m 
+k(m—k)— LEd t; where t; =m —dim H(E’). 
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A. Fourier Transforms 


Let f(x) be an element of the tfunction space 
L,(—oo, œ) and t a real number. Then the 
integral 


f)=(n) 1? | i fx)e dx (1) 


converges, and the function f(t) is continuous 
in (—00, 0). We call f the Fourier transform 
of f. It f(x)e L (—œ, œ), then f(x)eL,(—a,a) 
for any finite interval (—a, a), and if we set 


f=)? f f(xje "dx, 


then f, tconverges in the mean of order 2 as 
ao toa function fin L3. In this case, we 
define f to be the Fourier transform of f 
(e€ L3). Furthermore, in this case, if we set 


Sax) = (27)? r fei dt, 


then 1.i.m.,.,. f,(x) = f(x) (Plancherel theo- 
rem). Moreover, we have tParseval’s identity: 
f2 Gy? dx = f2 AOI? dt (— 160 Fourier 
Transform). 

Suppose that f(x) is periodic with period 22 
and feL,(—7,7). We set 


a, =(27) t i f(xje "dx 


(Fourier coefficients). The nth partial sum of 
the tFourier series s,(x)}= È” „a,e™* con- 
verges in the mean of order 2 to f(x), and 
Parseval’s identity 


Lay 2 
a | oras È Jan]? 


holds. On the other hand, if {a„} is a given 
sequence such that E£ _,,|a,|? < œ, then 
>"._,a,e'”* converges in the mean of order 2 
to a function f(x), the Fourier coefficients of 
f(x) are {a,}, and Parseval’s identity holds 


(— 159 Fourier Series). 


B. Bochner’s Theorem and Herglotz’s Theorem 


A complex-valued function f(x) defined on 
(—co, œ) is said to be of positive type (or posi- 
tive definite) if ©? -1 f(x; —x,)6&, 20 for any 
finite number of reals x,, x»,...,X, and com- 
plex numbers €,, &5,...,¢,- If f(x) is measur- 
able on (—00, 00) and of positive type, then 
there exists a ‘monotone increasing real- 
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valued bounded function a(t) such that 


fQx)= | g e"*da(t) (2) 


for almost all x. If a( —co)=0 and a(t) is right 
continuous, then «(t) is unique (Bochner’s 
theorem). Conversely, if «(t) is nondecreasing 
and bounded and f(x) is defined by (2), then 
J (x) (called the Fourier-Stieltjes transform of 
a(t)) is continuous and of positive type. 

A sequence {a,} (—00 <n < œ) is said to 
be positive definite (or of positive type) if 
Eik jn Sj 20 for finitely many arbitrarily 
chosen complex numbers €,, €,...,¢,. If {a,} 
is of positive type, then there exists a mono- 
tone increasing bounded function a(t) on 
[ —z, z] such that 


T 
a,= | e™ da(t) 
(Herglotz’s theorem). Conversely, if x(t) is 
monotone increasing and bounded and a, is 
defined by the above integral, then the se- 
quence {a,} is of positive type. 


C. Poisson’s Summation Formula 


If f(x)EL,(—, œ) is of bounded variation 
and continuous and if f(t) is its Fourier trans- 
form, then we have 


Ja ¥ flak)=,/b a) Ff (bk), 
k=- œ =-0 

where ab = 2q (a> 0). This is called Poisson’s 

summation formula. 


D. Generalized Tauberian Theorems of Wiener 


Suppose that we are given a function f(x)e 
L,(—©, œ) whose Fourier transform f(t) is 
never zero. Then the set of functions given by 


h(x)= | S(x— y)g(y) dy, 


where géL,(—, œ), is dense in L; (~œ, œ). 
Hence we can deduce the tgeneralized Tauber- 
ian theorem of Wiener: If the Fourier trans- 
form k (®© of k,(x)eL,(—0o, œ) does not van- 
ish for any real t and 


im | kie=aroa=c | ki (y)dy 


— 00 


for a function f(x) that is bounded and 
measurable on (~, œ), then for any k€ 
L,(—0, co), 


fee) 


k,(y)dy 


— oa 


tim | kalo ofondy=C | 


mm 2) 
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(— 160 Fourier Transform G). Hence we can 
deduce *Tauberian theorems of the Littlewood 


type [3]. 


E. Harmonic Analysis 


Let «(A) be a complex-valued function on 
(—æœ, cœ) that is of bounded variation and 
right continuous. If we have the expression 


J= | e' da), (2’) 


then we say that the function f(t) is repre- 
sented by the superposition of harmonic oscil- 
lations e™™. Conversely, when f(t) is given, we 
have the problem of finding a function «(A) as 
above such that f(t) can be expressed in the 
form of (2’). When such a function «(A) exists, it 
is also an important problem (in harmonic 
analysis) to find the amplitude «(/)— «(4 —0) of 
the component of a proper oscillation. Con- 
cerning this problem, we have the following 
three theorems: 

(I) A necessary and sufficient condition for 
f(x) to be representable in the form (2’) is 


ef EP) mene 


(II) For a function f(t) expressed in the form 
(2'), we have: (i) for any 45, 


dà <œ. 


1 fT f 
a(Żo)— a(o —0) = lim | f(e dt, 
pa -T 


and (ii) if x(4) is continuous at A=A,—o and 


4= åo +0 (o>0), then 
a(o —0) 


sin A 


allo +0)— 


= lim (= 
T>% 


(III) In (2’), suppose that the discontinuity 
points of (å) are 4,, 2,,..., and set a, =a(A,) — 


fe > dt. 


2(/, 0) (n=1,2,...); then 
5 tee oO 
lim- | f(t+s)fis\ds=¥ |a,|2ei*" 
S>% o n=1 


F. The Paley-Wiener Theorem 


In formula (1), if we change the variable from t 
to a complex variable { =t + ic, then we have 


rosea f" fixe dx, (3) 


which is called the Fourier-Laplace transform 
of f(x). In particular, if f(x) has bounded tsup- 
port, then F({) is an tentire function. Concern- 
ing Fourier-Laplace transforms, Paley and 
Wiener proved the following theorems: 


192 H 
Harmonic Analysis 


(I) A necessary and sufficient condition for 
an entire function F({) to be the Fourier- 
Laplace transform of a tfunction of class C® 
having its support in a finite interval [ — B, B] 
is that for any N, there exists a constant C> 0 
such that |F(O] < Ca(1 +161) ~ e2"! for all (= 
t+io. 

(II) If g(t)e L,(0, 00), then its one-sided *La- 
place transform 


oO 


f(z)=(2n)'? | g(tje “dt 


0 
satisfies: (i) f(z) is holomorphic in the right 
half-plane Rez>0, and (ii) 


sup | | f(x +iy)/? dy < oo. 
x>0 Seale 6) 

Conversely, if f(z) = f(x + iy) (x >0) satisfies 
(i) and (ii), then the boundary function f(iy)e 
L (—œ, œ) exists and is such that 


im | Lf (iv) —f(x + iy)? dy =0, 


its Fourier transform in L, 


N 


fiiyje® dy 


SN. 


g(t)= Lim. (27) 1? | 


vanishes at almost all negative t, and f(z) is the 
one-sided Laplace transform of g(t). 


G. Harmonic Analysis on Locally Compact 
Abelian Groups 


The general theory of harmonic analysis on 
the real line was extended to a theory on lo- 
cally compact Abelian groups by A. Weil, I. 
M. Gelfand, D. A. Raikov, and others. The 
theory of normed rings was utilized for the 
development of the theory (— 36 Banach 
Algebras). This theory is called harmonic 
analysis on locally compact Abelian groups, 
and it has been developed as described in the 
following sections. 


H. Group Rings 


Let L, =L,(G) be the set of all integrable 
functions with respect to a tHaar measure on a 
locally compact Abelian group G. If we define 
the norm and multiplication in L,(G) by 


fll -| [f(x)| dx, 
G 


f s= fixy *)g(y)dy, 
G 


respectively, then L, has the structure of a 
commutative *Banach algebra. (We call f -g 
the convolution (or composition product) of f 
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and g.) If the topology of G is not ‘discrete, 
L(G) does not have a unity for multiplication. 
Hence, adjoining a formal unity 1 to L,(G), 
we set R={al+f|« is a complex number, 
feL,(G)}, and 


lat + fl =lol+ If il, 
(al +f)+(P1+g)=(a+A)1+(f+9), 
(al +f) (PI +g)=(aB)1+(Bf+ag+f-g). 


Then R is a commutative Banach algebra with 
unity. When G is discrete, R = L,(G). The 
Banach algebra R is called the group algebra of 
G. The group algebra R of G is tsemisimple. R 
is algebraically isomorphic to a subalgebra of 
C(Wè), which is the associative algebra of all 
continuous functions on the compact Haus- 
dorff space WM consisting of all maximal ideals 
in R (— 36 Banach Algebras). By this corre- 
spondence, if ge R corresponds to (M), 
which is a function on W, then supyemlo(M)l| 
<||e!|. L (G) belongs to W if and only if the 
group G is not discrete. 


I. Fourier Transforms 


According as the group G is discrete or not, we 
set N= Mt or R=IM—{L,(G)}. Then there 
exists a One-to-one correspondence between 
the elements Me and the elements y of the 
‘character group G of G such that the follow- 
ing formulas are valid: 


s=] x(x) f(x) dx, fel, ’ (4) 
x)= AM/FM), (5) 


where f,(x)= f(xy ') and f is a function such 
that f(M)#0. This correspondence M +>% 
gives a homeomorphism between the locally 
compact space N and Ĝ. Hence if we identify 
M with y and set f(M)= f(y), then f is a con- 
tinuous function on Ĝ, called the Fourier trans- 
form of f(x). Since f-+f(M) is an algebraic 
isomorphism, (fg) (7) = f(W9(0. If f= 
g(x), then f is equal to g in L,(G). This is the 
uniqueness theorem of Fourier transforms. 
From it we can deduce the fmaximal almost 
periodicity of locally compact Abelian groups. 
If G is not discrete, then L,(G)eM, and 
f(L,(G))=0 for fe L,(G). Hence {x| |fOI> e} 
is a compact subset of G. This means that f is 
a continuous function vanishing at infinity on 
G. (This is a generalization of the *Riemann- 
Lebesgue theorem concerning the cases G=R! 
or T'=R/Z.) Any continuous function u(x) on 
G vanishing at infinity is approximated uni- 
formly by f(y), which is the Fourier transform 
of fe L,(G). 
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J. Positive Definite Functions 


A function (x) defined on G is said to be 
positive definite (or of positive type) if the in- 
equality X7 ,-1 P(x;X, ')aja, > 0 holds for arbi- 
trary elements x,,...,x, in G and arbitrary 
complex numbers «,,...,a,. We denote by Pg 
the set of all positive definite functions on G. If 
peP,, then p(e)>0(e is the identity of G), 
|o(x)|<¢e(e), and w(x !)=¢(x). If G is locally 
compact, we further assume that ge P, is 
measurable with respect to the Haar measure 
of G. Then for any fe L,(G), 


[ [o ooa dy>0. 


Any e P; is equal almost everywhere to a 
continuous @, € P,. (Concerning positive de- 
finite functions on locally compact groups and 
their relation with unitary representations — 
437 Unitary Representations B.) 


K. Harmonic Analysis and the Duality 
Theorem 


If G is a locally compact Abelian group, then a 
function (x) on G belongs to P, if and only 

if there exists a nonnegative measure u(Ĝ)< 

œ on G such that p(x) = fe x(x) du(y). (When 
G=R', this theorem is tBochner’s theorem, 
whereas when G =Z, it is Herglotz’s theo- 
rem.) Hence we can prove a spectral resolu- 
tion of unitary representations of G: U(x)= 
fex(x) dE(y) (a generalization of *Stone’s 
theorem). If fe L,(G)N Pg, then f(y)>0, fe 
L,(G), and the inversion formula of Fourier 
transforms f(x)= f6 x(x) f(y) dx holds, provided 
that the Haar measure on G is suitably chosen. 
If feL,(G)NL,(G), then fe L,(G), and Par- 
seval’s identity falf? dx = fel fG)? dx holds. 
If we put Uf =f and VÎ= f, then U is extend- 
able uniquely to an isometry of L,(G) onto 
L(G) and V is extendable uniquely to its 
inverse transformation, respectively (Plan- 
cherel’s theorem on locally compact Abelian 
groups). By the inversion formula and Plan- 
cherel’s theorem, we can prove that the char- 
acter group G of Gis isomorphic to G as a 
topological group. This is called the Pon- 
tryagin duality theorem of locally compact 
Abelian groups (— 422 Topological Abelian 
Groups). In particular, if G is compact, Ĝ is a 
discrete Abelian group. Then we can normal- 
ize the Haar measure of G and G so that the 
measure of G and the measure of each element 
of G are 1. Plancherel’s theorem implies that 
the set of characters of G is a tcomplete tortho- 
normal set in L(G). 


723 


L. Poisson’s Summation Formula 


Suppose that G is a locally compact Abelian 
group, H is its discrete subgroup, and G/H is 
compact. Then the tannihilator T of H is a 
discrete subgroup of G. For any continuous 
function f(x) on G, if £e f(xy) is convergent 
absolutely and uniformly (hence fe L,(G)) 
and Fere is convergent absolutely, then 
Zale hia O: where c is a constant 
depending on the Haar measures of G and G. 
This is called Poisson’s summation formula on 
a locally compact Abelian group (a generali- 
zation of tPoisson’s summation formula on 
G=R). 


M. Closed Ideals in L, (G) 


In the following, the group operations in G 
and G are denoted by +, and the value of the 
character y(x) (xe G, ye Ĝ) is written as (x, y). 

For a function f defined on G and any yeG, 
the translation operator q, is defined by t, f(x) 
= f(x— y). A closed subspace of L,(G) is an 
ideal in L; (G) if and only if it is invariant 
under all translations (N. Wiener). A closed 
ideal I coincides with L,(G) if and only if the 
set of zeros of I, i.e., Z)=()\ re, f | (0), is 
empty (Wiener’s Tauberian theorem). A closed 
ideal J is maximal if and only if Z(J) consists of 
a single point. If the dual G of G is discrete, 
then the closed ideals in L,(G) are completely 
characterized by the zeros; that is, tspectral 
synthesis is possible, but this situation does 
not hold generally. P. Malliavin’s theorem 
states that if G is not discrete, then there exists 
a set E in G and two different closed ideals I 
and J such that Z(I)= Z(J)=E. Such a set E is 
called a non-S-set. For example, if Ĝ = R?, the 
unit sphere is a non-S-set (L. Schwartz). 


N. Operating Functions 


Denote by A(G) the set of all Fourier trans- 
forms of the functions in L,(G). Let fe A(G) 
and ® an analytic function in a neighborhood 
of the range of f Furthermore, assume that 
®(0)=0 if G is not discrete. Then there exists 
a ge A(G) such that 9(y)= (f(y) for yeG 
(Wiener-Lévy theorem). In general a function 
® defined on a set D in the complex plane is 
said to operate in a function algebra R or to be 
an operating function on R if B( f)eR for all 
feR whose range lies in D. The converse of 
the Wiener-Lévy theorem holds in the follow- 
ing form. Let G be an infinite Abelian group 
and ® a function on the interval [ —1, 1]. If ® 
operates in A(G), then ® is analytic in a neigh- 
borhood of the origin if G is compact, and 
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analytic in a neighborhood of [ —1, 1] if G is 
not compact [14, 15]. Let E be a subset of G, 
and I, the set of all fe A(G) such that f=0 on 
E. Let A(E)= A(G)/I, be the quotient algebra. 
The set E is called a set of analyticity if every 
operating function on A(E) is analytic. For a 
characterization of such a set — [11]. 


O. Measure Algebras 


For a locally compact Abelian group G, let 
M(G) be the set of all tregular bounded com- 
plex measures. For 4 and u in M(G), the 
convolution À u is defined by (A * p) (E)= 

fc A(E— y): du(y), where E is a Borel set in 

G. Then M (G) is a semisimple commutative 
Banach algebra whose product is defined to be 
the convolution. The Fourier-Stieltjes trans- 
form of pe M (G) is defined by 


a=] (x,y)du(x), ye. 
G 


A continuous function on G is positive definite 
if and only if it is the Fourier-Stieltjes trans- 
form of a positive measure in M(G) (Bochner’s 
theorem). Assume that G is not discrete. A 
function on the interval [ —1, 1] that oper- 
ates in the Fourier-Stieltjes transforms of 
measures in M(G) can be extended to an entire. 
function, and a function on the whole complex 
plane that operates in the tGel’fand repre- 
sentation of M(G) is an entire function [15]. 
From this fact, it follows that M(G) is asym- 
metric and nonregular. Furthermore, there 
exists a measure we M(G) such that ji(y)> 1 
but 1/4 is not a Fourier-Stieltjes transform of 
M(G). (See also Wiener and Pitt [16], Shreider 
[17], and Hewitt and Kakutani [18]; for the 
general description of measure algebra, see 
Rudin [11] and Hewitt and Ross [12].) 


P. Idempotent Measures 


A measure „eM (G) is called idempotent if p + u 
=p, that is, fi(y)=0 or 1 for all yeG. Then fis 
the characteristic function of the set {ye Ĝ| iy) 
= 1}. The smallest ring of subsets of G that 
contains all open cosets of subgroups of G is 
called the coset ring of G. The characteristic 
function of a set E in G is the Fourier-Stieltjes 
transform of an idempotent measure in M(G) 
if and only if E belongs to the coset ring of G 
[19]. A simple proof of this theorem is given 
by T. Ito and I. Amemiya (Bull. Amer. Math. 
Soc., 70 (1964)). When G is the unit circle, the 
coset ring consists of sequences periodic except 
at a finite number of points, and for this case 
the theorem was obtained by H. Helson. Let 
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Ny, na, ..., Ag be distinct integers and du(x)= 
Di, e'*-dx. Then u is an idempotent mea- 
sure on the unit circle. J. E. Littlewood conjec- 
tured that the norm of u exceeds clogk, where 
c is a positive constant not depending on the 
choice of {n,}. A partial answer was given 

by P. J. Cohen [19] for compact connected 
Abelian groups and was improved by H. 
Davenport (Mathematika, 7 (1960)) and E. 
Hewitt and H. S. Zuckerman (Proc. Amer. 
Math. Soc., 14 (1963)). 


Q. Mappings of Group Algebras 


Let G and H be two locally compact Abelian 
groups and ¢ a nontrivial homomorphism of 
L,(G) into M(H). Associated with ọ there is a 
mapping ọ* of a subset Y of A into G such 


A 


that p(f)(y)=f(e*(y)) for ye Y and =0 for 
yé Y, or symbolically o(f)=f(o"*). A con- 
tinuous mapping « of Y into G is said to be 
piecewise affine if there exist a finite number 
of mutually disjoint sets S, j= 1, ...,n, in the 
coset ring of H and mappings a, such that (i) 
Y =|)? S;; (ii) x; is defined on an open coset 
K, of H, where K;>S;; (iii) a, =a on S; and. 
(iv) ly +y —y")=a,(y) + %,(y’)—o,(y") for all y, 
yy" €K;,j=1,...,n. P. J. Cohen’s theorem is: 
If ọ is a homomorphism of L,(G) into M(H), 
then Y belongs to the coset ring of H and ¢* is 
a piecewise affine mapping of Y into G. Con- 
versely, for any piecewise affine mapping «, 
there is a homomorphism ¢ of L,(G) into 
M(H) such that @* =x. Related theorems have 
been studied by A. Beurling, H. Helson, J.-P. 
Kahane, Z. L. Leibenson, and W. Rudin [11]. 


R. Exceptional Sets 


Let G be a locally compact Abelian group. A 
subset E is said to be independent if n; x; + 

... +,x,=0, where the n; are integers and x;€ 
E implies n,x,;=0, j=1,...,k. A set E in G is 
called a Kronecker set if for every continuous 
function p on E of absolute value 1 and e>0 
there exists a ye Ĝ such that |~(x)—(x, y)|<e, 
xe E. Every Kronecker set is independent and 
of infinite order, but independent sets are not 
necessarily Kronecker sets. For a group G 
whose elements are of finite order p, a set E is 
called of type K, if for every continuous func- 
tion g on E with values exp(2zik/p), k=0,...,p 
— 1, there is a ye Ĝ such that y=) on E. If E is 
a compact Kronecker set in G and yp is a mea- 
sure with support in E, i.e., we M(E), then ||| 
= ||fi\|,,. A compact set E is called a Helson set 
if there is a constant C such that [iul <C || fi|| ,, 
for we M(E). Every K, set is also a Helson set. 
For a Helson set E, C(E)= A(E). A discrete 
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analog of a Helson set is a Sidon set. A subset 
F of a discrete group G is called a Sidon set if 
there is a constant C such that Y,.pla,|< 
Csup,|L,<rd,(x, y)| for every polynomial 

È a,(x, y). For example, a flacunary sequence 
{nk}, 41 /M, > q> 1, of integers is a Sidon set. 
These sets are deeply connected with har- 
monic analysis on groups, and measures con- 
centrated on these sets have some unexpected 
pathological properties (— e.g., [11, 13]). 


S. Tensor Algebras and Group Algebras 


Let X and Y be compact Hausdorff spaces, 
and denote by V(X, Y) the projective tensor 
product C(X) ® C(Y) of continuous func- 
tion spaces C(X) and C(Y). The norm of 

P(x, Y= LZ, f(x)g(y) is defined by ||¢|| 

=inf De, || fll lgl where the infimum is 
taken for all expressions of o. If G is an infi- 
nite compact group, then there exist two sub- 
sets K, and K, such that (i) K, and K, are 
homeomorphic to the *Cantor ternary set; (ii) 
the expression y, +y, of an element of E= K, 
+ K, is unique, where y; € K; and y, € K,; (iti) 
K, NK, # Ø; and (iv) K, UK, is a Kronecker 
set or a set of type K, for some p. Varopoulos’ 
theorem states that the algebra V(K,, K,) is 
isomorphic to A(E) which denotes the algebra 
of restriction of functions in A(G) on the set E. 
By this theorem, the problems of spectral 
synthesis and operating functions of group 
algebras are transformed into problems of 
tensor algebras. For a more precise discussion 
— [20]. 
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193 (X.29) 
Harmonic Functions and 
Subharmonic Functions 


A. General Remarks 


A real-valued function u of tclass C? defined in 
a domain D in the n-dimensional Euclidean 
space R” is called harmonic if it satisfies the 
Laplace equation 


07 u ĝu 
Au(P)=-— +... +7350 (P=(x1, 3X9) 
0Xi OX, 


in D. A harmonic function is, by definition, 
twice continuously differentiable, but turns out 
to be real analytic. It is not true, however, that 
the solutions of the Laplace equation are real 
analytic. For example, for the function u(x, y) 
=Reexp(—z *)(z=x+iy #0), u(0,0)=0, uxx 
and u,, exist everywhere, and u satisfies the 
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Laplace equation, but is not continuous at the 
origin. 

The fundamental properties of harmonic 
functions do not depend essentially on n. 

A real-valued function u of class C? satisfy- 
ing the inequality Au >0 is called subharmonic. 
For a more general definition of subharmonic 
functions and their properties — Sections P- 
U. 


B. Invariance of Harmonicity 


Harmonicity in R? is invariant under any 
tconformal transformation. Namely, when 
there exists a conformal bijection sending a 
domain D in the xy-plane onto a domain D’ in 
the €y-plane, every harmonic function u(x, y) 
on D is transformed into a harmonic function 
of (¢,7) on D’. In R” for n> 3, harmonicity is 
not generally preserved under conformal 
transformations. However, harmonicity is 
preserved in the following special case: Let D 
be a domain in R” (n> 3), and consider the 
inversion f:D—D’ defined by f(x,,...,x,)= 
(Xise, Xh) =(A7x, /r?,...,a7x,/r?), r=(x2 + 

... x2)", Let u(x,,...,x,) be a harmonic 
function on D, and let v(x‘, ...,x}) be the func- 
tion on D’ obtained by applying the Kelvin 
transformation to u. Namely, v(x}, ...,x))= 
(a/r')" 7u(a? x/r’, ...,a?x),/r'*), where r? = 
x? +... +x2. Then the function v is harmonic 
on D’. A function u that is harmonic outside 

a compact set is called regular at the point 

at infinity if any Kelvin transform of u is har- 
monic in a neighborhood of the origin, in 
which case u(P)—>0 as OP> œ. Now let T: 

Xk =X, (X},...,X,), 1<k <n, be a one-to-one 
analytic transformation of a domain D’ onto 
another domain D. If there exists a posi- 

tive function g(x’,,...,x/,) in D’ such that 
P(X, og Hy) UK (X s es Xh oo. Kyl Xi e X5) iS 
harmonic for any harmonic function u(x,, 
...,X,) in D, then T is conformal. A conformal 
transformation as it is known in differential 
geometry is either (i) a ‘similarity transforma- 
tion, (ii) an inversion with respect to a sphere 
or a plane, or (iii) a finite combination of trans- 
formations of types (i) and (ii). 


C. Examples of Harmonic Functions 


(1) If u is a polynomial in x,,...,x,, and har- 
monic in R”, then the terms of degree k in u 
form a harmonic function for each k>0. A 
harmonic homogeneous polynomial is said to 
be a spherical harmonic. (2) logr in R? and r?~™” 
in R” (n> 3) are harmonic except at r =0. (3) 
Every tlogarithmic potential in R? and every 
*Newtonian potential in R” (n> 3) is harmonic 
outside the fsupport of the measure. Con- 
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versely, any harmonic function defined on a 
domain D is represented in an arbitrary rela- 
tively compact domain D’ in D as the sum of 
a logarithmic (n= 2) or Newtonian (n> 3) 
potential of a measure on ôD’ and the poten- 
tial of a double layer. (4) Both the real part u 
and the imaginary part v of an analytic func- 
tion of a complex variable are harmonic. We 
call v a conjugate harmonic function of u. If u is 
harmonic on a fsimply connected domain D, 
then the conjugate v of u is given by 


- (x,y) 6 G 
v(x, v=| (—Stas + ay) + constant, 
(a,b) y x 


where (a, b) is a fixed point in D and the path 
of integration is contained in D. When D is a 
tmultiply connected domain, v may take many 
values in accordance with the thomology 
classes of the paths of integration. 


D. Green’s Formulas 


In the following one should substitute “curve” 
for the term “surface” when n=2. Let D be a 
bounded domain whose boundary S consists 
of a finite number of closed surfaces that are 
piecewise of class C!. Let u and v be harmonic 
in D, and suppose that all the first-order par- 
tial derivatives of u and v have finite limits 

at every boundary point. We call D the inside 
of S. Let n be a normal on S toward the out- 
side of D. Then the relation 


| (we) do=0 (1) 
s\ ôn ôn 


follows immediately from *Gauss’s formula, 
where do is the tsurface element on S. In par- 
ticular, when v is identically equal to 1, for- 
mula (1) gives 


| odo =0. (2) 
sS 


on 


Equations (1) and (2) are called Green’s and 
Gauss’s formulas, respectively. Conversely, u is 
harmonic in D if u is a function of class C? 

in D and at every point pe D, there is a se- 
quence {r,' decreasing to zero and such that 

I sce,ry(u/On) do =0 (k= 1,2, ...), where S(P, r) 
is the spherical surface with center at P and 
radius r,. Another sufficient condition for u to 
be harmonic is that u is of class C! and, at 
every point Pe D, there be an rp>0 such that 
{scp,y(Cu/On) do =0 for every r,0<r<rp 
(Koebe, Bocher). 


E. Mean Value Theorems 


We assume that u is a harmonic function, D 
the domain of definition of u, and S the bound- 
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ary of D. The mean value of u on the surface 
or the interior of any ball in D is equal to the 
value of u at the center of the ball. Namely, 


u(P)=—- udr= | udo, 
Ta J BPN) Ont S(P,r) 


where 1, and o, are the volume and surface 
area of a unit ball in R”, respectively, B(P, r) is 
the open ball with center at P and radius r, 
and dt is the volume element. These relations 
are called mean value theorems. Conversely, if 
v is continuous in D and at every point PeD, 
there is a sequence {r,} decreasing to zero and 
such that the mean value of v over B(P,r,) or 
S(P, 7,) is equal to v(P) for each k, then v is 
harmonic in D. This result is called Koebe’s 
theorem. From the mean value theorems the 
maximum principle for harmonic functions 
follows: Any nonconstant u assumes neither 
maximum nor minimum in D. If both u and v 
are harmonic in D and have the same finite 
boundary value at every point on S, then u=v 
in D by the maximum principle. This is called 
the uniqueness theorem. 


F. Boundary Value Problems 


The first boundary value problem (or Dirichlet 
problem) is the problem of finding a harmonic 
function defined on D that assumes boundary 
values prescribed on S (— 120 Dirichlet Prob- 
lem). The second boundary value problem (or 
Neumann problem) is the problem of finding a 
harmonic function u whose normal derivative 
6u/én is equal to a function f prescribed on the 
piecewise smooth boundary S. The solution, if 
it exists, is uniquely determined up to an addi- 
tive constant. In order for the solution to exist, 
f should satisfy the condition fs fdo =0. The 
third boundary value problem is the problem of 
finding a harmonic function u on D that satis- 
fies 0u/On=hu-+ f on S, where h and f are 
functions prescribed on S. All these problems 
can be reduced to certain Fredholm integral 
equations. There is also the boundary value 
problem of mixed type, in which the boundary 
values are prescribed in a part of S and the 
normal derivatives are prescribed on the rest. 


G. The Poisson Integral 


Let D be a bounded domain with smooth 
boundary S and u a function harmonic in 
D and continuous on DUS. Let G(P, Q) be 
Green’s function in D. Then (1) yields 


oG(P, 
| QP 2) ao), 


s ông 
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In particular, if D = B(0, r), then 
r?— 0P? | u(Q) 
s 


u(P)= LE do(Q). 


(0,r} PO” 
Conversely, given an integrable function f on 
S(0, r), we set 


r? —OP? | f(Q) 
S(O,r) PQ" 


Then u(p) is harmonic in B(0, r) and converges 
to f(Q) as P tends to any point Q on S(0,r) 
where f is continuous. We call u a Poisson 
integral. Sometimes it is possible to represent a 
harmonic function u in B(0,r) in the following 
form, which is more general than the Poisson 
integral: 


o,f 


u(P)= do(Q). 


Opt 


2_Nnp2 
u(p)= 2 | =, d(Q), 3) 
Onf son PQ 
where « is a signed tRadon measure on 
S(0,r). In order for u to admit such a repre- 
sentation, it is necessary and sufficient that 
Ís, lulde be a bounded function of r’ for 0< 
r <r, or equivalently, that the tsubharmonic 
function |u| have a tharmonic majorant. Fur- 
thermore, if « is absolutely continuous, then 
the Poisson integral representation of u is 
possible, and vice versa. A necessary and suffi- 
cient condition for the function u to admit the 
Poisson integral representation is that there 
exist a positive convex function g(t) on t>0 
such that @(t)/t> oo as t> oo and ¢(|u|) has a 
harmonic majorant. 

When D is a general domain in which 
Green’s function exists, every positive har- 
monic function u(P) is represented uniquely as 
the integral | K(P, Q)du(Q), where K(P, Q) is a 
*Martin kernel and u is a Radon measure on 
the Martin boundary B whose support is 
contained in a certain essential part of B, each 
point of which is called a minimal point. A 
similar integral representation appears in the 
theory of Markov processes (— 260 Markov 
Chains I). The representation f| K(P, Q)du(Q) is 
a generalization of (3). In terms of a function 
similar to g(t), we can give a necessary and 
sufficient condition for u(P) to be represented 
in the form f K(P, Q)f(Q)dv(Q), which corre- 
sponds to the Poisson integral representa- 
tion and in which v is determined by 1 = 
| K(P, Q)dv(Q). This condition is equivalent 
to the condition that u(P) be quasibounded, 

Łe., that there exist an increasing sequence of 
bounded harmonic functions that converges to 
u [7]. 

A positive harmonic function u is said to be 
singular if any nonnegative harmonic minorant 
of u vanishes identically. Every positive har- 
monic function can be expressed as the sum of 
a quasibounded harmonic function and a 
singular one. 
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H. Expansion 


Let Po =(x?,...,x®) be a point in D, and de- 
note the distance from P, to S by r. Then a 
harmonic function u is expanded uniquely into 
a power series 


O n 
Xa, m kn — xq) + (X,— xp) ; 
ki,- k 29, 


in B(Po, (/2 —1)r). Thus u is (real) analytic in 
D. 1f u vanishes on an open set in D, then u=0 
in D. If the power series is written as X, h, with 
spherical harmonics h, of degree k=1, 2,..., 
then this series converges over all of B(P,,7). 


I. Sequences of Harmonic Functions 


In this section, {um} is a sequence of harmonic 
functions in a bounded domain D. First, if 
each um is bounded and continuous on DUS 
and {u,,} converges uniformly on S, then 

{u,,| converges uniformly in D, and the limit- 
ing function u is harmonic in D. Moreover, 

Oki tty /Axk: ...Oxk converges to 081 *-**ay/ 
éxk: ...éx* uniformly on any compact subset 
of D (Harnack’s first theorem). Second, if u; < 
u <... in D and there is a point of D at which 
{um is bounded, then {u,,} converges uni- 
formly on any compact subset of D (Harnack’s 
second theorem). The following Harnack’s 
lemma is useful: If u is positive harmonic in D, 
P, is a point of D, and K is a compact subset 
of D, then there exist positive constants c and 
c’, depending only on P, and K, such that 
cu(Po) <u(P)<c'u(Po) on K. 

Any family of (locally) uniformly bounded 
harmonic functions is fnormal. A family of 
positive harmonic functions that is bounded 
at a point is also normal by Harnack’s lemma. 
If [pluk — unl” dt—>0 as k, m> œ for p>1, then 
tHölder’s inequality implies that {um} con- 
verges uniformly on any compact subset of D. 
It follows that if [p|grad(u,—u,,)|?dt0 as k, 
m—oo and {u,} converges at a point in D, then 
there exists a harmonic function u in D such 
that J,|grad(u,,—u)|’'dt0 as m> oo and um 
converges to u uniformly on any compact 
subset of D, where P, is any point in D. Finally, 
if | plu,’ dt (p> 1) are bounded, then {u,} 
forms a normal family. 


J. Level Surfaces and Orthogonal Trajectories 


The set {P|u(P)=constant} is called a level 
surface (niveau or equipotential surface). When 
ais given as the constant, the level surface is 
called the a-level surface. Assume that u is not 
a constant. A point where grad u vanishes is 
called critica]. The set of critical points consists 
of at most countably many freal analytic 
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manifolds of dimension <n—2 (n=dimD, and 
a manifold of dimension 0 is understood to be 
a point). Any compact subset of D intersects 
only a finite number of such manifolds; we 
express this fact by saying that the manifolds 
do not cluster in D. Each of these manifolds is 
contained in a certain level surface. The 
complement of the critical points with respect 
to any level surface consists of real analytic 
manifolds of dimension n—1 that do not 
cluster in D. 

For each noncritical point there exists an 
analytic curve passing through it such that 
grad u is parallel to the tangent to the curve at 
each point on the curve. A maximal curve with 
this property is called an orthogonal trajec- 
tory (or line of force). Along every orthogonal 
trajectory, u increases strictly in one direction 
and hence decreases in the other, so that none 
of the orthogonal trajectories is a closed curve. 
There is exactly one orthogonal trajectory 
passing through any noncritical point. There- 
fore no two orthogonal trajectories intersect, 
and no orthogonal trajectory terminates at a 
noncritical point. Moreover, the set of limit 
points of any orthogonal trajectory in each 
direction does not contain any noncritical 
point. When u is a Green’s function G(P, Q), 
every orthogonal trajectory is called a Green 
line, and a Green line that originates at the 
pole Q and along which u decreases to 0 is 
called regular. For any sufficiently large a, the 
a-level surface X, is an analytic closed surface 
homeomorphic to a spherical surface. Let E be 
a family of orthogonal trajectories originating 
at the pole. If the intersection A of E anda 
closed level surface X, is an (n — 1)-dimensional 
measurable set, then the tharmonic measure 
of A at Q with respect to the interior of 2, is 
called the Green measure of E. M. Brelot and 
G. Choquet proved that all orthogonal trajec- 
tories originating at the pole except those 
belonging to a family of Green measure zero 
are regular. Consider a domain D bounded by 
two compact sets, and denote by u the har- 
monic measure of one compact set with re- 
spect to D. Assume that u is not a constant. 
Then u changes from 0 to 1 along all ortho- 
gonal trajectories except those belonging to a 
family that is small with respect to a measure 
similar to the Green measure (see “flux” de- 
fined in Section K). 


K. Harmonic Flows 


Denote by Z, the a-level surface for a harmonic 
function u, and by £? the complement of the 
set of critical points with respect to 2. Let o 
be an (n— 1)-dimensional domain in Z? such 
that the (n —2)-dimensional boundary of ø is 
piecewise of class C*. Suppose that u assumes 
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the value b (>a) on each orthogonal trajec- 
tory passing through a. Consider the union of 
orthogonal trajectories that pass through ø. 
The subset of this union on which u assumes 
values between a and b forms a set called a 
regular tube. The parts of the boundary corre- 
sponding to a and b are called the lower and 
upper bases of the tube; accordingly, ø is the 
lower base. The integral |(Gu/dn)do on any 
section (1.e., the part of a level surface in the 
tube) is constant and is called the flux of the 
tube. The family of orthogonal trajectories 
passing through an (n— 1)-dimensional domain 
(not necessarily bounded by a smooth bound- 
ary) in X? is called a harmonic flow, and a 
subfamily is called a harmonic subflow if its 
intersection A with X? is measurable (in the 
(n — 1)-dimensional sense). The flux of a har- 
monic subflow is defined to be f, (ĉu/ôn)do. 
Then the Green measure of family E of Green 
lines originating at the pole is equal to the flux 
of E divided by o,. We can compute the exact 
value of the textremal length of any harmonic 
subflow. 


L. Isolated Singularities 


Let u be harmonic in an open ball except at 
the center O. It is expressed as the sum of a 
function h(P) harmonic in the entire ball and 
Dd 2-9 Hn (P)/OP?2"*!, where H,, is a tspherical 
harmonic of degree m. If OP*u(P)—0 as P>O 
for a>0, then u(P) is equal to h(P)+c/OP +... 
+ H,,(P)/OP2"*) with m<a- 1. In particular, 
if u is bounded in a neighborhood of O, then O 
is a removable singularity for u. If u is bounded 
above (below), then u(P)=h(P)+c/OP" ?, 
where c <0 (c >0). (When n= 2, we have u(p) 
=h(p)+clog1/OP. For the removability of a 
set of capacity zero — 169 Function-Theoretic 
Null Sets). 

If u is harmonic near the point at infinity, 
i.e., outside some closed ball, then 


H,,(P) 
Op2™ti +U 


u(P)= 2 
where the first sum is regular at the point at 
infinity and U,, is a spherical harmonic of 
degree m. If OP ~“*u(P)>0 as OP— with 

a >0, then U,,=0 for all m>«. If u is bounded 
above or below, then U,,=0 for all m>1. If u is 
harmonic in R” and OP “u(P)—>0 as OP >œ 
with «>0, then u is a polynomial of degree m 
(<a). If u is harmonic and bounded above or 
below in R”, then u is constant. Brelot called a 
function u harmonic at the point at infinity if 


H,,(P) 


u(P)=constant + 2 Opn 


(note that m> 1) near the point at infinity [1]. 
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M. Harmonic Continuation 


If u vanishes in a subdomain of D, then u=0 in 
D. If u, is harmonic in D, (k= 1,2), D; 1D, 4 
@, and u, =u, in D,ND,, then u, and u, de- 
fine a harmonic function in D, U D}. If the 
boundaries of mutually disjoint domains D, 
and D, have a surface Sọ of class C! in com- 
mon, u, is harmonic in D, (k= 1,2), u; =u, 
on So, and du, /én and —éu,/dn exist and 
coincide on Sp, then u, and u, define a har- 
monic function in the domain D, US) U D3. 
We express this fact by saying that one of u, 
and u, 1s a harmonic continuation of the other. 
It follows that u=0 in D if the boundary of 
. D contains a surface S, of class C! and u = 
du/én=0 on Sy. Consider the case n=2. If the 
boundary of a Jordan domain contains an 
analytic arc C and u (or du/én) vanishes on C, 
then a harmonic continuation of u into a cer- 
tain domain beyond C is possible. If n= 3, 
however, nothing is known except in the case 
where Sp is a part of a spherical surface or a 
plane and u=0 (or du/én=0) on Sp. 
Boundary values of u do not always exist, 
but in some special cases, u has limits. For 
instance, a positive harmonic function in a ball 
has a finite limit at almost every boundary 
point Q if the variable is restricted to any 
angular domain with vertex at Q. 


N. Green Spaces 


As a generalization of tRiemann surfaces, 
Brelot and Choquet introduced &-spaces [3]. 
It is required that & be a separable connected 
topological space and satisfy the following two 
conditions: (i) At each point P there exists a 
neighborhood Vp of P and a homeomorphism 
between Vp and an open set V; in the tAlexan- 
drov compactification R"U {00 }; (ii) if A= Vp, N 
Vp, # Ø and A, is the part of Vp, that corre- 
sponds to A (k= 1, 2), then the correspondence 
between A} and Aj via A is a conformal (pos- 
sibly with the sense of angles reversed) trans- 
formation when n=? and an tisometric trans- 
formation (which keeps oc invariant) when 
n> 3. Ifa tGreen’s function exists on &, then 

& is called a Green space. Harmonic functions 
and the fDirichlet problem on a Green space 
have been discussed from various points of 
view. 


O. Biharmonic Functions 


A function v is called polyharmonic if A‘ v =0 

(k > 2) and biharmonic if AAv =0; sometimes, 
polyharmonic functions are also called bi- 
harmonic. A biharmonic function in a plane 
domain D is written as Re(zf(z)+ g(z)), where f 
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and g are complex analytic in D (Goursat’s 
representation). Biharmonic functions are used 
in the theory of elasticity and hydrodynamics. 


P. Subharmonic Functions 


Let D be a tdomain in the n-dimensional Eu- 
clidean space R” (n> 2). A real-valued function 
u(P) in D is called subharmonic if (1) —co < 
u< +00, u# —oo; (2) u is tupper semicon- 
tinuous; and (3) at every point P, of D, the 
mean value of u over the surface of any closed 
tball in D with center at P, is not smaller than 
u(Pp), i.e., 


1 
u( Po) <——— fe do = L(P},r), 


or 


where o, is the area of the surface of a unit ball 
in R”. Condition (3) can be replaced by: (3°) 
The mean value A(P,,1r) of u over the closed 
ball is >u(P,). In order that an upper semi- 
continuous function u be subharmonic it is 
necessary and sufficient that, for any sub- 
domain D’ of D and for any harmonic function 
hin D’, the maximum principle hold for u— h. 

We call —u superharmonic when u is sub- 
harmonic. A harmonic function is subhar- 
monic and superharmonic. The converse is 
also true (— Section E). 

When u is of tclass C?, then u is subhar- 
monic if and only if 


ĝu Cru 

Au=—>+...42520 (P=(x,,...,X,)) 

ôx? x 
When u is an upper semicontinuous function 
that is not necessarily differentiable, u is sub- 
harmonic if and only if Au interpreted as a 
tdistribution is a positive tmeasure. 

Ifu,,...,u, are subharmonic and a,,..., a 
are positive constants, then a; u; +... + apuy 
and max(u,(P), u (P), ...,u,(P)) are subhar- 
monic. If a subharmonic function u is replaced 
by the tPoisson integral for the boundary 
function u inside a closed ball in D, then the 
resulting function in D is subharmonic. If f(t) 
is a monotone increasing convex function, 
then f(u) is subharmonic. If v> 0 and logv is 
subharmonic, then v is subharmonic. If f(z) is 
a holomorphic function of the complex vari- 
able z and A>0, then ålog|f(z)| and hence 
| {(z)|* are subharmonic. If h is harmonic, then 
|h| is subharmonic. Any tlogarithmic potential 
(n= 2) or Newtonian potential (n> 3) is super- 
harmonic in R”. 


Q. Properties of Mean Values 


Condition (3) (resp. (3’)) can be replaced by the 
condition that there exists an r(P,)>0 at any 
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Py such that u(P,) < L(Po, r)(u( Po) < A(Po,1)) 
for every r, 0<r<r(Po). The relation —œ < 
A(Po,r)<L(Po,7r) always holds, and both 
A(Po,r) and L(Po,r) decrease to u(P,) as r|0. 
On any ‘compact subset of D, u is tintegrable. 
Both A(P),r) and L(P,,r) increase with r and 
are convex functions of --logr (n=2) and r*" 
(n> 3); hence they are continuous functions of 
r. If D' is relatively compact in D and rẹ is the 
distance between ôD’ and ôD, then A(P,r) isa 
continuous subharmonic function of P in D’, 
where r is fixed in the interval (0, rọ). By taking 
the average of A(P,r) k times, a subharmonic 
function of class C* is obtained that decreases 
to u as r}0. If o,((x7 +... +x2)"?) is suitably 
chosen, then the fconvolution u * ọ, is a sub- 
harmonic function of class C” and decreases 
to u as rļ0. 


R. Sequences of Subharmonic Functions 


The limit of a decreasing sequence or a 
downward-directed net of subharmonic func- 
tions is subharmonic or equal constantly to 
—oo. The limit of a uniformly convergent 
sequence of subharmonic functions is sub- 
harmonic. If u,,u,,... are subharmonic, then 
max(u,,...,4,) is subharmonic for every k, but 
sup(u,,uU,,...) may not be subharmonic. Let U 
be a family of subharmonic functions in D that 
are uniformly bounded above on every com- 
pact subset of D. Then the upper envelope of 
U, i.e., the function defined by sup,,yu in D, 
coincides with a subharmonic function except 
on a set of tcapacity zero. 


S. Harmonic Majorants and Riesz 
Decompositions 


Suppose that we are given a subharmonic 
function u in D. If there is a harmonic function 
h satisfying h>u in D, then h is called a har- 
monic majorant of u. When there is a harmonic 
majorant of u, there exists a least one among 
them, denoted by hp. For any relatively com- 
pact subdomain D’ of D, hp, always exists and 
equals the *Perron-Brelot solution in D’ for the 
boundary function u. As D’ increases to D, hp 
increases to a function h that is either har- 
monic or equal constantly to oo. If hp exists, it 
coincides with h, and hence h is harmonic. 
Conversely, if h is harmonic, then hp exists and 
equals h. Generally, there is a unique tRadon 
measure pin D with the following property: 
Let 6 be any subdomain of D such that h; and 
the tGreen’s function G; exist in 6 (6 may 
coincide with D). Then h;—u is equal to the 
potential [;G,dyu, and u=h,—{,G,dy. In gen- 
eral, a representation of a superharmonic 
(subharmonic) function as the sum (difference) 
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of a harmonic function and a potential is 
called a Riesz decomposition. 


T. Boundary Values 


Let D be a domain in which a Green’s function 
exists, and consider the tfine topology on the 
*Martin compactification of D. Any negative 
subharmonic function u in D has a finite limit 
with respect to the fine topology at every point 
of the *Martin boundary A except at the points 
of a subset of A of harmonic measure zero 
(J. L. Doob). When D is a ball, u has a limit 
in the ordinary sense along almost every 
radius. However, it may happen that even if u 
is bounded there exists no angular limit at 
any point of the boundary. If the mean value 
of |u| on every concentric smaller ball in D is 
bounded, then u can be decomposed into the 
sum of a nonnegative harmonic function and a 
negative subharmonic function by the Riesz 
decomposition, and hence u has a limit both 
radially and with respect to the fine topology. 

Let D be a domain and K be a compact 
subset of D of tcapacity zero. If a function u is 
subharmonic and bounded above in D— K, 
then u can be extended to be a subharmonic 
function in D. A function that is equal to a 
subharmonic function almost everywhere is 
called almost subharmonic, and an almost 
subharmonic function satisfying condition (3’) 
is called submedian. 

Subharmonic functions can be discussed in 
a space more general than R”, e.g., a tRiemann 
surface (n= 2), or more generally, an t&-space 
of dimension n (> 2) in the sense of Brelot and 
Choquet. 


U. The Axiomatic Treatment 


tNewtonian potentials were the main object of 
interest in the early stages of tpotential theory. 
A major part of potential theory can be dis- 
cussed on the basis of the theory of superhar- 
monic functions [8]. For example, a tpolar set 
is defined as a set on which some superhar- 
monic function assumes the value oo, and a set 
X is ‘thin at a point P ,¢X if and only if P) has 
a positive distance from X or there exists a 
superharmonic function v(P) in a neighbor- 
hood of P, such that lim sup v(P)> v(P,) as 
Pex tends to P). Moreover, we can discuss 
tbalayage, define potentials, and obtain Riesz 
decompositions. Generalizing results of Doob 
(1954) and starting from a family of harmonic 
functions defined axiomatically in a locally 
compact Hausdorff space, M. Brelot defined 
superharmonic functions and potentials and 
discussed balayage, Riesz decompositions, and 
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the fDirichlet problem (1957). Further pro- 
gress in axiomatic potential theory has been 
made by Brelot, H. Bauer, C. Constantinescu, 
A. Cornea, and others [11, 14]. 
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194 (VII.11) 
Harmonic Integrals 


A. Introduction 


*De Rham’s theorem shows that the coho- 
mology group with real coefficients of a tdif- 
ferentiable manifold of class C® is isomorphic 
to the cohomology group of the cochain com- 
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plex of *differential forms with respect to the 
exterior derivative d. Thus every element of the 
cohomology group can be represented by a 
class of tclosed differential forms. Harmonic 
forms enable us to choose one definite dif- 
ferential form in each cohomology class. The 
theory of harmonic forms, called the theory of 
harmonic integrals, is modeled after the theory 
of holomorphic differentials and their inte- 
grals (Abelian integrals) in function theory 
[2,4, 5,8]. 


B. Definitions 


Let X be an oriented n-dimensional differ- 
entiable manifold of class C® with a tRie- 
mannian metric ds? of class C® (— 105 Differ- 
entiable Manifolds, 364 Riemannian Mani- 
folds A). For every (C%) p-form @ on X we 
define an (n — p)-form * ọ on X as follows: 
First denote the tvolume element of X by 

dv. If we choose a basis {w,,...,@,} of the 
space of 1-forms on an open set U of X such 
that ds? =? and dv=@,A...Aq,, then 

ọ can be expressed on U in the form ọ = 
(PDE Pi, OA ADi If we let xọ = 
(n-p) p); sjy ph A AOp where 
(*@);. iao (1p)È ON sinp Pipin then 
*@ is an (n—p)-form on U that does not 
depend on the choice of (@,,...,@,,) and is 
determined only by ¢. Since X is covered by 
open sets as above, + defines a linear map- 
ping that transforms p-forms to (n— p)-forms. 
If we let ds? =$ g,,dx/dx* in terms of the 
local coordinate system (x',...,x") and ọ = 
(Up), idx" A... A dxtr, then, in the nota- 
tion of tensor calculus, we have 


x =(1/(n—p)!)(* P)j ig pO" A... dxin-p, 
(* Pin p VI E ig ap 
(g = det(g;;)). 


For two p-forms ọ and y, we define the 
inner product by (ọ, Y)= fx p ^ *y if the right- 
hand side converges. In order for the inner 
product (¢, y/) to be defined, it suffices that 
either p or y has compact tsupport. Then 
(~, W) is a Symmetric, positive definite bilinear 
form. 

If we let 6=(—1)"?*"*! dx operate on p- 
forms, where d ts the texterior derivative, then 
d and ô are adjoint to each other with respect 
to the inner product. That is, if either ọ or 
y has compact support, we have (dg, /) = 
(o, dw) (*Stokes’s theorem). We call A = dô + 
od the Laplace-Beltrami operator, which is 
a self-adjoint telliptic differential operator. 
These operators satisfy relations such as 
#* =(—1)?""P), dd=0, 66=0, sA=Ax, 
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+õ=(—1)d*, and ô*=(—1)"7?+! «d (when 
they operate on p-forms). 

A differential form ¢ is said to be harmonic 
if dp =0 and 6g =0. Then Ag=0. Since A is 
an elliptic operator, a tweak solution ọ of the 
equation Ag = is an ordinary solution of 


class C® on the domain where u is of class C”. 


Therefore, if o is harmonic (as a weak solu- 
tion), ọ is of class C®. 


C. Harmonic Forms on Compact Manifolds 


On a compact manifold X, any ọ with Ag =0 
is harmonic, since (p, Ag) =(d, de) + (dg, d¢@). 
Let L,(X) be the linear space of p-forms of 
class C” on X, and denote by &,(X) the com- 
pletion of L,(X) with respect to the inner 
product (p, Y). Then 2,,(X) is the Hilbert space 
of square integrable measurable p-forms. Then 
§,(X)= {ye L,(X)|Ag=0 (in the weak sense)} 
is a finite-dimensional subspace of 2,(X) and 
is contained in L,(X), as we have seen before. 
Also, §,(X) is closed in £,(X), and the tpro- 
jection operator H: 2,(X)—§,(X) is an tinte- 
gral operator with kernel of class C”. The 
orthogonal complement of §,(X) in 2,(X) is 
mapped onto itself by A and has the inverse 
operator G of A, which ts a continuous oper- 
ator of the Hilbert space. By letting G=0 

on §,(X), we can extend G to an operator 
from £,(X) to 2,(X) that is called Green’s 
operator. It is also denoted by G, maps L, into 
itself, commutes with d and 6, and satisfies 

GH = HG =0, H+ AG =1 (=identity map- 
ping). Therefore, for pe L,(X) we have p=Ho 
+ Gôd + ddG¢, which shows that H is thomo- 
topic to the identity mapping of the tcochain 
complex (1, L,(X), d). From this we infer that 
every cohomology class of de Rham cohomol- 
ogy contains a unique harmonic form that 
represents the cohomology class. However, 
since products of harmonic forms are not 
always harmonic, it is not appropriate to use 
harmonic forms to study the ring structure of 
cohomology. G is also an integral operator 
with kernel of class C” outside the diagonal 
subset in X x X. 


D. Harmonic Forms on Noncompact 
Manifolds 


If X is a noncompact manifold, let L,(X) be 
the space of p-forms of class C” with compact 
support, and let 2,(X) be its completion. Let 
B(X) and BF(X) be the respective closures of 
dL,-,(X) and 6L,,,(X) in 2,(X), and let 
3p(X) and 35(X) be the respective orthogonal 
complements of B(X) and BF(X) in LX). 
Then 3,(X)N 33(X)=9,() ts a subspace of 
the square integrable harmonic forms, and we 
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have the direct sum decomposition £,(X)= 
B,(X) + B*(X)+ H,(X). In this decomposition 
any component of a form of class C® is also of 
class C”. 

If X is an open submanifold of another 
manifold Y, X is compact, and 6X=X —X isa 
closed submanifold of Y, then the theory in 
this section is just a generalized potential 
theory with boundary condition @=0 on éX. 
We sometimes treat decompositions of other 
Hilbert spaces that correspond to other bound- 
ary conditions. 


E. Generalization to Complex Manifolds 


If X is a complex manifold, we consider 
complex-valued differential forms (— 72 Com- 
plex Manifolds C). Then the space L,(X) of p- 
forms is the direct sum of the spaces L, ,(X) of 
forms of type (r,s), and the exterior derivative 
d has the expression d =d +d”, where d' is of 
type (1,0) (Le., L, s(X)> L,+1.s(X)) and a” is of 
type (0, 1). If we are given a holomorphic vec- 
tor bundle E on X, we can define an operator 
d” on differential forms with values in E, and 
we have the generalized tDolbeault theorem. If 
X is compact and has a tHermitian metric, we 
can define a Hermitian inner product on E as 
follows: There is an open covering {U,} such 
that over each U, the vector bundle E is iso- 
morphic to U, x C*. A point of E over U, is 
represented by (x, ¢,), where xe U; and če C4. 
For xe U;N U, we have (x, €)) = (x, čą) (the sides 
are the respective expressions over U, and U,) 
if and only if €;=g4(x)¢,, where g(x) is a 
holomorphic mapping from U;M U, to GL(q, C) 
satisfying 9.91 =g; On U;AU,OU,. A differen- 
tial form @ with values in E is expressed as a 
family {@;} of differential forms on U, with 
values in C? such that ~,(x) = gy(x)¢p,(x) on 

U; U,. If we take a positive definite Hermitian 
matrix h; whose components are C*-functions 
on U; such that ‘g,.h,9,,=h, on U;N Up, then 
{'¢;h,¢;} determines a Hermitian inner product 
on each fiber of E. We can also endow the 
space L, (E, X) (of forms of type (r,s) of class 
C” with values in E) with a Hermitian inner 
product by setting (9, Y)= fx Ea ghigo a * WF 
for o, WEL, (E, X) (where the pf (x=1,...,q) 
are the components of ,). If we denote by d 
the adjoint operator of d” with respect to this 
inner product and let A=d"d+0dd", then Aisa 
self-adjoint elliptic differential operator, and 
results similar to those for A mentioned above 
hold for A. For example, the space §, ,(E, X) 
of harmonic forms of type (r, s) is of finite 
dimension, and there is a continuous linear 
operator G on £, (E, X), the completion of 
L,.(E, X), that satisfies 1 = H + AG, HG = 

GH =0, d” G = Gd”, and dG = Gd. Here H 
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denotes the projection 2, which is an 
integral operator with kernel of class C®. 
Also, G maps L, ,(E, X) into itself. Therefore 
H is homotopic to the identity on the co- 
chain complex (XL, ,(E, X), d”), and any ele- 
ment of tDolbeault’s cohomology groups (d’- 
cohomology groups) is represented by a uni- 
que harmonic form (— 232 Kahler Manifolds 
B). 


F. Other Generalizations 


Even if a manifold X is not of class C”, if X is 
a manifold of class C1, we can develop the 
theory of harmonic forms [6]. We say that X 
is of class C! if it is of class C? and has a set of 
local coordinate systems whose transition 
functions have derivatives satisfying the *Lip- 
schitz condition. 

If X is a real analytic manifold with a real 
analytic Riemannian metric, then harmonic 
forms are also real analytic. Using this fact, we 
can embed real analytically a compact mani- 
fold with a real analytic Riemannian metric 
into a Euclidean space (P. Bidal and G. de 
Rham; this result is now included in the 
theorems of C. B. Morrey and H. Grauert). 

We can consider the theory of harmonic 
forms with singularities [4, 5], a generalization 
of the theory of differential forms of the second 
and third kinds. Here the notion of tcurrent is 
very useful. 


G. Cohomology Vanishing Theorems 


Since the operator A is closely related to the 
Riemannian metric, some metrics may admit 
no harmonic forms of certain degrees except 
zero. This is important since it means that the 
corresponding cohomology group of the mani- 
fold vanishes. The condition for this phenom- 
enon to occur can be described in terms of 

the curvature of the metric. This study has its 
origin in S. Bochner’s results [2]. 

Here is an example of a vanishing theorem: 
Let B be a holomorphic line bundle on a com- 
pact complex manifold X of dimension n. If 
the *Chern class of B is expressed by a real 
closed differential form of type (1,1) as œ = 
at È a,gdz* ^ dz’, where the Hermitian 
matrix (a,,) is positive definite at every point of 
X, then H4(X,Q?(B))=0 for p+q>n. In this 
case, ds? =2 È a, dz* ^ dZ’ is a tHodge metric 
on X {-+ 232 Kahler Manifolds D). 
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Harmonic Mappings 


A. General Remarks 


The theory of harmonic mappings between 
Riemannian manifolds has its origin in the 
study of tPlateau’s problem. The basic prob- 
lem in the theory is to deform a given map- 
ping into a harmonic one, which is a problem 
of the calculus of variations and tglobal ana- 
lysis (— 46 Calculus of Variations, 183 Global 
Analysis). Recently, the theory of harmonic 
mappings has been applied to problems in 
various branches of geometry [7—9, 11]. 


B. Definitions and Examples 


Let (M, g) and (N, h) be Riemannian mani- 
folds with metrics g = È g,,dx'dx/ and h= 
“h,gdy*dy", respectively. We define the energy 
of a C'-mapping f: MN by 


1 
n= Idf(x)|? dx, 


where |df(x)| is the tHilbert-Schmidt norm of 
the differential df.: T.(M)— T,,,.(N) of f at 
xeM and dx is the canonical Lebesgue mea- 
sure defined by g on M (assumed compact). 
Thus E(f) can be considered to be a generali- 
zation of the classical fDirichlet integral for 
functions. The integrand e( f)(x) =|df(x)|? is 
called the energy density of f; it measures 

the sum of the squares of elements of length 
stretched on a complete set of mutually per- 
pendicular directions. 
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The *Euler-Lagrange differential equations 
of the energy functional E(f) yield a vector 
field t(f) along f, i.e., a section of the bundle 
f* T(N) induced from the tangent bundle 
T(N) of N by f. In fact, given a family f, of 
mappings depending differentiably on t with 
Jo= f, we have 

pac 
t=0 


EED- o 
where < , > denotes the inner product of 
tangent vectors along f. The vector field t( f) is 
called the tension field of the mapping f; it 
indicates the direction in which the energy of f 
decreases most rapidly. 

The Euler-Lagrange differential equations 
t( f)=0 are a system of tquasilinear elliptic 
partial differential equations of the second 


order. In local coordinates, these can be writ- 
ten in the form 


ae 


Ox' @xfi 


Af +S gT) 


where A is the tLaplace-Beltrami operator on 
M and the I%,(f)(x) are the tChristoffel sym- 
bols on N at f(x). (The f* are local coordi- 
nates of the point f(x), and (gë) is the inverse 
matrix of (g;).) A C?-mapping f: MN is said 
to be harmonic if its tension field t(/) vanishes. 
Thus, if M is compact, a C?-mapping f:M>N 
is harmonic if and only if it is an extremal of 
the energy functional E(f). 

Examples of harmonic mappings appear in 
various contexts of differential geometry. For 
instance: 

(1) If N=R, then the harmonic mappings 
M=>R are the tharmonic functions on M. 

(2) If M is the circle S', then a harmonic 
mapping S'—N is a closed tgeodesic of N 
parametrized by arc length. 

(3) Let f: MN be an tisometric immersion 
of M into N. Then f is harmonic if and only if 
it is a tminimal immersion. 

(4) If M and N are tKahler manifolds, then 
every tholomorphic or antiholomorphic map- 
ping M >N is harmonic, where by an anti- 
holomorphic mapping is meant a mapping 
whose differential mapping carries a differen- 
tial form of type (1,0) into that of type (0, 1). 
We note that each (anti-)holomorphic map- 
ping is an absolute minimum for the energy in 
its homotopy class. There are also examples of 
nonholomorphic (and nonantiholomorphic) 
harmonic mappings between Kahler manifolds 
(— Section D). 


C. Fundamental Properties 


(1) Regularity. Since a harmonic mapping is a 
solution of a second-order quasilinear elliptic 
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system of partial differential equations t( f)=0 
(— Section B), it is a smooth (i.e., of class C”) 
mapping. More generally, it is known that a 
continuous mapping that satisfies t(f)=0 in 

a weak sense is smooth [4, 6]. 

(2) Unique continuation property. The follow- 
ing unique continuation theorem is valid for 
harmonic mappings: If two harmonic map- 
pings of M into N agree up to infinitely high 
order at some point of M, then they are iden- 
tical (M being assumed connected). In partic- 
ular, a harmonic mapping that is constant on 
an open set is a constant mapping. 

The global natures of harmonic mappings 
are closely related to the curvatures of the 
manifolds under consideration. For instance, 
suppose that M and N are compact and that 
the sectional curvatures of N are nonpositive 
everywhere. Then we have: 

(3) Uniqueness. Let f: M—N be a harmonic 
mapping, and assume that there is a point of 
J(M) where the sectional curvatures of N are 
negative. Then f is unique in its homotopy 
class unless f(M) is a closed geodesic y of N; 
and in this case we have uniqueness up to 
rotation of y, i.e., an isometry of y which moves 
each point of y a fixed oriented distance along 
y. 

(4) Degeneracy. Suppose further that the 
tRicci tensor (R,) of M is positive semidefinite 
everywhere. Then the energy density e(f)is a 
tsubharmonic function for every harmonic 
mapping. This implies that any harmonic 
mapping f: MN is ‘totally geodesic. More- 
over, if N is of negative sectional curvature, 
then f is either constant or maps M onto a 
closed geodesic of N; if (R;) is positive definite 
at some point, then f is constant. 

(5) Finiteness. Assume now that N is of 
negative sectional curvature. Then, for each 
K 21, there are only finitely many noncon- 
stant harmonic mappings f: M >N of dilata- 
tion bounded by K. Here, we say that the dila- 
tation of f is bounded by K if and only if at 
each point of M we have df =0 or (A,/4,)'7 < 
K, A, 24, >...>0 being the positive eigen- 
values of the pullback quadratic form f *h(x) 
on T,(M) induced from the metric h of N by f. 


D. Harmonic Mappings of a Surface 


Let M be a compact surface. Then the energy 
of a mapping M >N is the tDirichlet-Douglas 
functional, and harmonic mappings are closely 
connected with solutions of Plateau’s problem 
(— 334 Plateau’s Problem). In fact, if a tcon- 
formal mapping M >N minimizes the tarea 
functional, then it also minimizes the energy 
functional. 

Now let M and N be compact orientable 
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surfaces whose genera are denoted by p and q, 
respectively. Then the problem of existence (or 
nonexistence) of harmonic mappings is well 
understood. In fact: 

(1) When q £0, for any metrics g and h on 
M and N, every homotopy class of mappings 
M-N contains a harmonic mapping. 

(2) When g=0 (i.e., N is the 2-sphere S°), 
every harmonic mapping whose tdegree d 
satisfies |d| > p is holomorphic or antiholo- 
morphic with respect to the complex struc- 
tures associated with g and h. For example, 
consider the homotopy classes of mappings 
from the 2-torus T? to S? with any metrics. 
Then all classes with degree |d|>2 have har- 
monic representatives, and any such is holo- 
morphic or antiholomorphic; and the classes 
with d= +1 have no harmonic representatives. 

(3) When g=0 and |d|<p— 1, we have, for 
every such p and d, a surface M of genus p and 
a metric h on S* such that there exists a har- 
monic nonholomorphic (and nonantiholo- 
morphic) mapping of degree d from M to S?. 


E. Existence Theorems 


The basic problem in the study of harmonic 
mappings is to prove their existence in general 
geometric contexts. 

(1) In regard to this problem, translating the 
problem of the elliptic system t(f)=0 into the 
finitial value problem of the corresponding 
nonlinear tparabolic system df/6t =1(f), J. 
Eells and J. H. Sampson [1] proved that if 
M and N are compact and if N has nonposi- 
tive sectional curvature everywhere, then every 
homotopy class of mappings M >N contains 
a harmonic mapping that minimizes the energy 
in that class. Subsequently, the uniqueness of 
these harmonic mappings was established 
by P. Hartman [2] in the form stated in Sec- 
tion C. 

(2) For harmonic mappings of surfaces, 
more general existence results have been 
known. 

First, by the tdirect method of the calculus 
of variations, L. Lemaire (1977) and others 
proved that if M and N are compact and if M 
is 2-dimensional, then every conjugacy class of 
homomorphisms 2,(M)>7,(N) of the funda- 
mental groups is induced by a minimizing 
harmonic mapping. It follows that if, in partic- 
ular, the second homotopy group z,(N) of N 
is zero, then every homotopy class of map- 
pings of a compact surface M to N contains 
a harmonic representative realizing the mini- 
mum of the energy in that class. 

On the other hand, by making use of the 
generalized tMorse theory for a perturbed 
energy functional, J. Sacks and K. Uhlen- 


195 Ref. 
Harmonic Mappings 


beck [5] succeeded in giving a satisfactory 
answer to the structure of z (N), which is a 
m,(N)-module, in terms of harmonic mappings. 
They proved that there exists a generating set 
for z (N) consisting of harmonic mappings of 
spheres that minimize energy and area in their 
homotopy classes. We note that these har- 
monic mappings are minimal immersions with 
tbranch points. 

(3) Next, we mention the case of harmonic 
mappings of manifolds with boundary. In this 
case, we can naturally formulate the fDirichlet 
and the *Neumann boundary value problem 
for harmonic mappings. 

In his study of Plateau’s problem on Rie- 
mannian manifolds, C. Morrey (1948) dis- 
cussed the Dirichlet problem for harmonic 
surfaces with boundary. 

The problem in arbitrary dimensions has 
been studied by R. S. Hamilton [3], who 
extended the result of Eells and Sampson 
mentioned above to the case where M and N 
have boundaries. In fact, let M and N be com- 
pact Riemannian manifolds with boundary, 
and assume that N has nonpositive sectional 
curvature and that the boundary ôN of N is 
tconvex (or empty). Then there exists a unique 
minimizing harmonic mapping in each trela- 
tive homotopy class determined by the pre- 
scribed Dirichlet boundary value. We note 
that if ON is not convex, then it is easy to 
formulate Dirichlet problems with no solu- 
tions. Hamilton also treated the Neumann 
problem. 

Subsequently, S. Hildebrandt, H. Kaul, and 
K.-O. Widman [4] gave another existence 
proof of solutions of the Dirichlet problem 
that covers the case where N admits positive 
sectional curvature. 
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David Hilbert January 23, 1862—February 14, 
1943) was born in Königsberg, Germany. He 
attended the University of Königsberg from 
1882 to 1885, when he received his doctoral 
degree with a thesis on the theory of invar- 
iants. It was there that he established a life- 


long friendship with H. Minkowski. In 1892 he | 


became a professor at the University, and in 
1895 he was appointed to a professorship at 
the University of Göttingen, a position he held 
until his death. He obtained his basic theorem 
on invariants between 1890 and 1893, and next 
began research on the foundations of geometry 
(— 155 Foundations of Geometry) and the 
theory of talgebraic number fields. Concerning 
the former, he published Grundlagen der Geo- 
metrie (first edition, 1899), in which he gave the 
complete axioms of Euclidean geometry and 

a logical examination of them. Concerning 

the latter, he systematized all the important 
known results of algebraic number theory in 
his monumental Zahlbericht (1897). In number 
theory, he enunciated his significant conjecture 
on felass field theory. At the international 
congress of mathematicians held in Paris in 
1900, he put forth 23 problems as targets for 
mathematics of the 20th century (Table 1). 
Between 1904 and 1906 he conducted research 
on the fDirichlet principle of tpotential theory 
and on the direct method in the tcalculus of 
variations. Around 1909 he established the 
foundations of the theory of tHilbert spaces. 
After 1910 he was chiefly involved in research 
on the tfoundations of mathematics, and he 
advocated the standpoint of tformalism. He is 
one of the greatest mathematicians of the first 
half of the 20th century. 
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Table 1. The 23 Problems of Hilbert 


(1) To prove the continuum hypothesis (— 33 
Axiomatic Set Theory D). 


(2) To investigate the consistency of the 
axioms of arithmetic (— 156 Foundations of 
Mathematics E). 


(3) To show that it is impossible to prove 

the following fact utilizing only congruence 
axioms: Two tetrahedra having the same alti- 
tude and base area have the same volume. 
Solved by M. Dehn (1900). 


(4) To investigate geometries in which the line 
segment between any pair of points gives the 

shortest path between the pair (— 155 Foun- 
dations of Geometry). 


(5) To obtain the conditions under which a 
topological group has the structure of a Lie 
group (— 423 Topological Groups M). Solved 
by A. M. Gleason and D. Montgomery and L. 
Zippin (1952) and H. Yamabe (1953). 


(6) To axiomatize those physical sciences in 
which mathematics plays an important role. 


(7) To establish the transcendence of certain 
numbers (— 430 Transcendental Numbers B). 
The transcendence of 2V2, which was one of 
the numbers put forth by Hilbert, was shown 
by A. Gel’fond (1934) and T. Schneider (1935). 


(8) To investigate problems concerning the 
distribution of prime numbers; in particular, to 
show the correctness of the Riemann hypoth- 
esis (— 450 Zeta Functions). Unsolved. 


(9) To establish a general law of reciprocity (— 
59 Class Field Theory A). Solved by T. Takagi 
(1921) and E. Artin (1927). 


(10) To establish effective methods to deter- 
mine the solvability of Diophantine equations 
(— 97 Decision Problem; 182 Geometry of 
Numbers). Solved affirmatively for equations 
of two unknowns by A. Baker, Philos. Trans. 
Roy. Soc. London, (A) 263 (1968); solved nega- 
tively for the general case by Yu. V. 
Matiyasevich (1970). 


(11) To investigate the theory of quadratic 
forms over an arbitrary algebraic number field 
of finite degree (— 348 Quadratic Forms). 


(12) To construct class fields of algebraic num- 
ber fields (— 73 Complex Multiplication). 


(13) To show the impossibility of the solu- 
tion of the general algebraic equation of the 
seventh degree by compositions of continu- 
ous functions of two variables. Solved nega- 
tively. In general, V. I. Arnold proved that 
every real, continuous function f(x;,x2,X3) 
on [0, 1] can be represented in the form 

E? h:(g:(X1, X2), X3), where h; and g; are real, 
continuous functions, and A. N. Kolmogorov 
proved that f(x,,x ,x3) can be represented 
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in the form 27-1 h,(gi1(%1) + Gi2(X2) + gis(X3)), 
where h; and g;; are real, continuous func- 
tions and g,, can be chosen once for all inde- 
pendently of f (Dokl. Akad. Nauk SSSR, 114 
(1957), Amer. Math. Soc. Transl., 28 (1963)). 


(14) Let k be a field, x,,...,x, be variables, 
and f,{x,,...,x,) be given polynomials in 
k[x,,...,x,] (i= 1,...,m). Furthermore, let 

R be the ring formed by rational functions 
F(X,,...,X,,) in k(X,,..., Xm) such that F(f,, 
wes ImEK[X,,...,X,]. The problem is to deter- 
mine whether the ring R has a finite set of 
generators. Solved negatively by M. Nagata, 
Amer. J. Math., 81 (1959). 


(15) To establish the foundations of algebraic 
geometry (— 12 Algebraic Geometry). Solved 
by B. L. van der Waerden (1938-1940), A. 
Weil (1950), and others. 


(16) To conduct topological studies of alge- 
braic curves and surfaces. 


(17) Let f(x,,...,x,) be a rational function 
with real coefficients that takes a positive value 
for any real n-tuple (x,,...,x,). The problem is 
to determine whether the function f can be 
written as the sum of squares of rational func- 
tions (— 149 Fields O). Solved in the affirma- 
tive by E. Artin (1927). 


(18) To express Euclidean n-space as a disjoint 
union | }, P,, where each P, is congruent to 
one of a set of given polyhedra. 


(19) To determine whether the solutions of 
regular problems in the calculus of variations 
are necessarily analytic (— 323 Partial Differ- 
ential Equations of Elliptic Type). Solved by 
S. N. Bernshtein, I. G. Petrovskii, and others. 


(20) To investigate the general boundary value 
problem (— 120 Dirichlet Problem; 323 Par- 
tial Differential Equations of Elliptic Type). 


(21) To show that there always exists a linear 
differential equation of the Fuchsian class with 
given singular points and monodromic group 
(— 253 Linear Ordinary Differential Equa- 
tions (Global Theory)). Solved by H. Röhrl 
and others (1957). 


(22) To uniformize complex analytic functions 
by means of automorphic functions (— 367 
Riemann Surfaces). Solved for the case of one 
variable by P. Koebe (1907). 

(23) To develop the methodology of the 


calculus of variations (— 46 Calculus of 
Variations). 
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A. General Remarks 


The theory of Hilbert spaces arose from prob- 
lems in the theory of tintegral equations. D. 
Hilbert noticed that a linear integral equation 
can be transformed into an infinite system of 
linear equations for the *Fourier coefficients 
of the unknown function. He considered the 
linear space l, consisting of all sequences of 
numbers {x,} for which Z2; |x,|7 is finite, 
and defined for each pair of elements x= {x,}, 
y={y,} el, their inner product as (x, y)= 

Enzi Xn), The space l, can be regarded as 

an infinite-dimensional extension of the notion 
of a Euclidean space. F. Riesz considered the 
space of functions now termed L,-space and 
succeeded in giving a satisfactory answer to 
the Fourier expansion problem. In his book 
[3], J. von Neumann established a rigorous 
foundation of quantum mechanics employing 
Hilbert spaces and the spectral expansion of 
self-adjoint operators. The following axiomatic 
definition (— Section B) of Hilbert spaces is 
due to von Neumann. H. Wey! later justified 
the tDirichlet principle of Riemann by the 
method of orthogonal projection in a Hilbert 
space, and thus paved the way for the function- 
analytic study of differential equations. 


B. Definition 


Let K be the field of complex or real numbers, 
the elements of which we denote by a, f,.... 
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Let H be a tlinear space over K, and to any 
pair of elements x, ye H let there correspond a 
number (x, yje K satisfying the following five 
conditions: (1) (x, +X, Y)}=(x1; Y) + (x2, y); (ii) 
(ax, y) = a(x, y); (iii) (x, y) = (y, x); (iv) (x, x) 2 0; 
and (v) (x, x)=0<x=0. Then we call H a pre- 
Hilbert space and (x, y) the inner product of x 
and y. 

With the norm ||x||=./(x,x), H isa 
tnormed linear space. If H is tcomplete with 
respect to the distance ||x — y|| (i.e, |X, — Xm I| — 
0 (m, n> co) implies the existence of lim x, =x), 
then we call H a Hilbert space. According 
as K is complex or real, we call H a complex 
or real Hilbert space. A Hilbert space is a 
*Banach space. 

A normed linear space with norm ||x|| can 
be made a pre-Hilbert space, by defining an 
inner product (x, y) so that ||x|| =,/(x, x), if 
and only if the equality ||x + yl? + llx— yl? = 
2(\|x||?+ || y||7) holds for any x, y. 


C. Orthonormal Sets 


Two elements x, ye H are said to be mutu- 
ally orthogonal if (x, y)=0. A subset 2 of H is 
called an orthogonal set (or system) if 0¢ X and 
every distinct pair x, ye is mutually ortho- 
gonal. If every element of an orthogonal set X 
is of norm 1, then X is called an orthonormal 
set. Any orthogonal set 2 = {x,} can be normal- 
ized into an orthonormal set {x;/|x;||}. A 
maximal orthonormal set is called a complete 
orthonormal set or an orthonormal basis. All 
the complete orthonormal sets of a given H 
have the same cardinal number, which we call 
the dimension of H. Two Hilbert spaces are 
isomorphic if and only if they have the same 
dimension. 

Let 2 = {x;} be an orthonormal set. Then for 
every xe H, its Fourier coefficients (x, x;) van- 
ish for all but a countable number of i, and the 
Bessel inequality || x ||? >>,|(x, x,)|? holds. The 
following three statements are equivalent in a 
Hilbert space: (i) X is complete; (ii) Parseval’s 
equality ||x||* =, |(x, x)|? holds for every x; 
(iti) every x can be expanded in a Fourier series 
x= (x, x,)x; (— 317 Orthogonal Functions). 


D. Examples of Hilbert Spaces 


The space l, (— Section A) is a Hilbert space 
of dimension Ny. The function space L, ona 
measure space (X, u) is a Hilbert space if the 
inner product of f, ge L, is defined by (f,g)= 
|x fGdu. In the case of the *Lebesgue mea- 
sure in a Euclidean space, L, is of dimension 
No, so that it is a Hilbert space isomorphic 
to l. Further examples of Hilbert spaces are 
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A2(Q), W;(Q) (= H'(Q)), and Hy(Q) (— 168 
Function Spaces). 


E. Closed Linear Subspaces and Projections 


Let M be a closed linear subspace of a Hilbert 
space H, i.e., a linear subspace that is closed in 
the norm topology of H. It is a Hilbert space 
with respect to the restriction of the inner 
product in H. For a given M the set of all 

x€H such that (x, y)=0 for every ye M forms 
a closed linear subspace M+ called the orthog- 
onal complement of M. The orthogonal com- 
plement of M+ is M (i.e, M++ = M), and H is 
the direct sum of M and M+ (i.e., every xe H 
can be uniquely represented as x= y +z, yE 
M, ze Mt, and ||x||* =|] yll? + ||z||?). Thus the 
quotient space H/M is isomorphic to M+ and 
is also a Hilbert space. The operator Py that 
maps x to y is called the projection (or ortho- 
gonal projection or projection operator) to M. 
A bounded linear operator P is a projection if 
and only if it is idempotent (P? = P) and self- 
adjoint ((Px, y)=(x, Py) for any x, ye H) (— 
251 Linear Operators). 


F. Conjugate Spaces 


A linear operator from H to K is called a 
linear functional. The set H’ of all continuous 
linear functionals f on H forms a Hilbert 
space with norm || f || =sup{| f(x)|| |||] = 1}. 
For every fe H’ there exists a unique ye H 
such that f(x)=(x, y) for all xe H (Riesz’s 
theorem), and the correspondence f—y gives 
an tantilinear isometric operator from H’ onto 
H (for ‘linear operators on Hilbert spaces 

— 68 Compact and Nuclear Operators; 251 
Linear Operators; 390 Spectral Analysis of 
Operators). 
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Holomorphic Functions 


A. Differentiation of Complex Functions 


Let f(z) be a tcomplex-valued function defined 
in an open set D in the tcomplex plane C. We 
say that f(z) is differentiable at z if the limit 


lim (f(z + A) — f(z)/h= f(z) (1) 


exists and is finite as the complex number h 
tends to zero. We call f’(z) the derivative of 

J (2) at z. This definition is a formal extension 
of the definition of differentiability of a func- 
tion of a real vartable to that of a complex 
variable (— 106 Differential Calculus), but it is 
a much stronger condition than the differentia- 
bility of a real function, since z +h in (1) may 
be an arbitrary point in a 2-dimensional neigh- 
borhood of z. Hence many results essentially 
different from those for functions of a real 
variable follow from it. 

If a function f(z) is differentiable at each 
point of an open set D, it is said to be holo- 
morphic (or regular) in D, or f(z) is a holo- 
morphic function on D. (For the definition of 
holomorphy of a complex-valued function 
of several complex variables — 21 Analytic 
Functions of Several Complex Variables C.) 

Let E be an arbitrary nonempty subset of C. 
We say that f(z) is holomorphic on E if it is 
defined in an open set D containing E and is 
holomorphic on D. Some results valid for 
differentiable real functions also hold for 
holomorphic functions. For instance, the de- 
rivative of a sum, product, or quotient is given 
by the usual rules. The derivative of a com- 
posite function ts determined by the chain rule. 
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The set of all functions holomorphic in a 
tdomain D forms a tring. 

Suppose f(z) is holomorphic on D and 
f'(zo) #9, zoe D. Then two curves that form 
an angle at z) are mapped by f to two curves 
forming the same angle at f(z). Because of 
this property, the mapping f is said to be 
conformal at all points z with f’(z) 40. 

The following four conditions are equivalent 
for a function f =u + iv defined on an open 
set P. (1) f is holomorphic in D. (2) u=u(x, y) 
and v=v(x, y) are ttotally differentiable at 
each point z=x+ iy and satisfy the Cauchy- 
Riemann differential equations 


Ou/éx=dv/dy,  Ou/dy= —6v/Ox. 


(3) f is represented by a tpower series D9 c,(Z 
—a)" in a neighborhood of each point a of D; 
that is, f(z) is analytic in D. (4) f is continuous 
and |. f(z)dz=0 for every rectifiable Jordan 
closed curve C whose interior is contained, 
together with C, in D. The proposition that (1) 
implies (4) is called Cauchy’s integral theorem, 
and the proposition that (4) implies (1) is called 
Morera’s theorem. 

The hypothesis of Morera’s theorem can be 
weakened as follows: Let f(z) be continuous in 
a domain D. If fe f(z)dz=0 for every rectangle 
C in D with sides parallel to the axes and 
whose interior consists of only points of D, then 
f(z) is holomorphic in D. In the statement of 
this theorem, if we let C be an arbitrary circle, 
we get the same conclusion. 

The following complex differential operators 
are often useful: 


ô 1fé ð ô 1/3 ð 
zei) z- u) 
Generally, (6@/0z) = 0@/0z. The Cauchy- 
Riemann equations above can be expressed in 
a single equation : 0f/éz =0. If f is holomor- 
phic, 0f/0z = f’(z). 

In order to show that f=u-+iv is holomor- 
phic in D, assumption (2) can be weakened. 
Actually, we have the Looman-Men’shoy 
theorem: Suppose that u and v are continuous 
in D, 0u/éx, Ou/Cy, 6v/Ox, and Gv/éy exist at 
every point of D except for at most a count- 
able number of points, and the Cauchy- 
Riemann equations hold in D except for a set 
of 2-dimensional tmeasure zero; then f =u + iv 
is holomorphic in D. D. E. Men’shov extended 
this theorem and obtained various conditions 
for holomorphy. For example, he proved the 
following theorem: If f is a topological map- 
ping of D and f is conformal in D (i.e., 


limarg( f(z +h) — f(z))/h 


exists) except for at most a countable number 
of points, then f is holomorphic in D. 
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As another type of sufficient condition for 
holomorphy, we have the proposition : If f is 
locally tLebesgue integrable in D and satis- 
fies the Cauchy-Riemann equation (£=0 in 
the sense of ‘distribution, then there exists a 
holomorphic function g in D such that g=f 
talmost everywhere. 


B. Cauchy’s Integral Theorem 


Cauchy’s integral theorem can be stated as 
follows: If f(z) is a holomorphic function in a 
tsimply connected domain D in the complex 
plane, the equality fe f(z)dz=0 holds for every 
(rectifiable) closed curve C in D. In particular, 
the integral ff f(z)dz (a, p€ D) is uniquely deter- 
mined by « and $ provided that its path of 
integration lies in D. The function F(z)= 

ESC) dt (ze D) is called the indefinite integral 
of f. F(z) is holomorphic in D and F’(z)= f(z). 

In the proof of his integral theorem, Cauchy 
assumed the existence and continuity of the 
derivative f’(z) in D. However, E. Goursat 
proved the theorem utilizing only the existence 
of f’(z), Actually, by virtue of the integral 
formula (2) in this section, the existence of f’(z) 
in D implies the continuity of f(z). This is 
sometimes called Goursat’s theorem. 

Let C, C,, C, ...,C, be rectifiable Jordan 
curves. Suppose that C,,C3,...,C, are in the 
interior of C and that each one lies in the ex- 
terior of the others. If f(z) is holomorphic in 
the region D bounded by these n+ 1 curves 
and continuous on DUCUC,U...UC,=D, 
then we have 


[sod f(z)dz. 
c KEL Jc, 


Here the curvilinear integrals are taken in the 
positive direction (i.e., we take the direction 
such that {.(z—a)"'dz=|¢,(z—a) ' dz=2ni for 
a point a in the interior of C or C,, respec- 
tively). Henceforth, an integral along a closed 
curve is taken in the positive direction unless 
otherwise noted. Cauchy’s integral theorem 
under the assumption that f is holomorphic 
in D and is continuous on D is sometimes 
called the stronger form of Cauchy’s integral 
theorem. 

Under the same assumptions as in the 
stronger form, we have Cauchy’s integral for- 
mula for ze D: 


1 1 
Tl C, 


cé—z 2ni Ic. C-Z 
l LQ 
| Ea (2) 


. This integral formula expresses the value of 
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f(z) at a point z in the domain D in terms of 
the values of f on the boundary of D. In par- 
ticular, when n=O, the integral formula reads 
as 


fe=5 | JO dt. 3) 


Furthermore, if C is a circle |z|= R (i.e., D is 
the disk |z| < R), we obtain Poisson’s integral 
formula: 


f(z)= 

1 2n R R*—r? 

=| f(Re Ri pi = Rees Sey a 
S(2)= 


1 Re frei Ret? dp + itm f(0) 
2n SET OR age ee 


z=re®, O<r<R. (3) 


Formula (3)’ is valid for a harmonic function. 
Let C be a rectifiable curve and f(f) be a 
continuous function defined on C; then the 


integral of Cauchy type 
1 
rem | ie) dt 
2ni Jol —z 


is holomorphic outside C. The nth derivative 
F of F is given by (n!/27i) fe f(O —2)"** dé; 
moreover, F can be expanded in a Taylor 
series about every point a outside C: 

Eo] FY 
F=S aa", a= —, 


n=0 n! 


which converges in |z—a|<p, p being the 
distance from a to C. In particular, formula (3) 
implies that a holomorphic function f is in- 
finitely differentiable and is expanded in a 
Taylor series about every point of D as above. 
Let C be a closed curve not passing through 
a point a. Then the integral (1/27i) |-dz/(z—a) 
is an integer. It is called the winding number of 
C about a and is denoted by n(C; a). A cycle y 
(a finite sum of oriented closed curves) in an 
open set D is said to be homologous to zero 
in D if n(y;a)=0 for all points a in the com- 
plement of D. The general form of Cauchy’s 
integral theorem is stated as follows. If f is 
holomorphic in D, then f, f(z)dz=0 for every 
cycle y which is homologous to zero in D (E. 
Artin). From this we have the general form of 
Cauchy’s integral formula: if f is holomorphic 
in a domain D, then 


1 
nly; asos Las, 


for every cycle y which is homologous to zero 
in D. 
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C. Zero Points 


Let f(z) be a holomorphic function not identi- 
cally equal to zero. If f(a)=0, we call a a zero 
point of f. Every zero point of f is an isolated 
point, and there exists a unique positive inte- 
ger k and a function h holomorphic at a such 
that 


f(2)=(2—a)g(2), 


We call k the order of the zero point a and a 
a zero point of the kth order. The equality (4) 
implies that the tTaylor series of f(z) at a 
begins with the term c,(z—a)*. Suppose that a 
is a zero point of f(z)— y of the kth order; then 
we call a a y-point of the Ath order. 

For a function f(z) defined in a neighbor- 
hood of the tpoint at infinity, we set f(1/w) = 
g(w) (f(œ)=g(0)) and call f holomorphic at 
co if g is holomorphic at 0; f is said to have a 
zero of order k at œ if g has a zero of order k 
at 0. If two functions f and g are holomorphic 
in D and f{(z)=g(z) on a subset E that has an 
taccumulation point in D, then f is identically 
equal to g in D (theorem of identity or unique- 
ness theorem) since the zeros of holomorphic 
functions must be isolated. 


g(a) #9. (4) 


D. Isolated Singularities 


Let f(z) be holomorphic in an annulus D 
={z|R,<|z—a|<R,,0<R,<R,< +0}. 
Then f(z) is expanded in the fLaurent series 


+ 00 


f= È ¢(2—a)" (5) 
This is called the Laurent expansion of f 
about a. The coefficients c,, are given by c,= 
(1/2ni) fe f(QatKE—a)"** with C= {z||z—al= 
r}, Ri <r<R,. In particular, if f(z) is holo- 
morphic in D={z|0<|z—a|<R} (or, if a= 

oo, in D={z|R<|z|< +}) but not holomor- 
phic in DU {a}, we call a an isolated singular 
point (or isolated singularity) of f. By utilizing 
the local canonical parameter t= z—a (or 
t=1/z for a= œ), the Laurent expansion (5) 

of f is then written as f(z)= Dj -o Cat" + 

7 ¢,t". The second sum is an ordinary 
power series, called the holomorphic part of 
f(z). The first sum is a power series of 1/t with 
no constant term, called the singular part of 
f(z) at a or the principal part of the singularity 
(or of the Laurent expansion at a). 

If we have lim,_,y tf{(z)=0, the Laurent 
expansion (5) of f(z) lacks its singular part, 
and the limit of f(z) exists as t>0 (za) and is 
equal to Co. If we set f(a)= co, then the func- 
tion f(z) is holomorphic in DU {a}. In this 
case, the point a is called a removable singular- 
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ity. If f(z) is bounded in a neighborhood of a 
singularity a, then a is removable (Riemann’s 
theorem). Usually, we assume that the remov- 
able singularities of a function have already 
been removed in this way. 

When the singular part of f(z) at a exists 
and consists of a finite number of terms, the 
point a is called a pole; when it consists of an 
infinite number of terms, the point is called- 
an essential singularity. If a is a pole, f(z) is 
represented by the Laurent series 1. _,¢,t” 
(c_, #0) and ;(z)—> œ as z—a. In this case, the 
index k is called the order of the pole a. Then a 
relation such as (4) holds, where the index k is 
replaced by —k. Hence the point a is some- 
times called a zero point of the — kth order. If a 
is an essential singularity, then for an arbitrary 
number c there exists a sequence z, converging 
to a such that lim, ,,, f(z,)=c (the Casorati- 
Weierstrass theorem or simply Weierstrass’s 
theorem). Related to Weierstrass’s theorem, 
we have *Picard’s theorem, which gives a de- 
tailed description of the behavior of a function 
around its singularities. 


E. Residues 


Let a( # œ) be an isolated singularity of f(z). 
Then the coefficient c_, of (z—a)~' in the 
Laurent expansion (5) of f(z) is called the 
residue of f(z) at a and is denoted by Res[ f ]a, 
R(a; f), or R(a) if we need not indicate f. We 
have 


1 
R@)=c4=5— SO dK, 


\S-al=r 

where the integral is taken in the positive 
direction along a path for O0<r<R. If f(z) is 
holomorphic at z=a, then R(a)=0. If f(z) has 
a pole of the first order at a, 


R(a)= lim (z—a)f(z). 


The residue at the point at infinity is defined to 
be —a_,, where a is the coefficient of 1/z of 
the Laurent expansion of f(z) at œ: f(z)= 
Eo 4,2", and we have 


—1 
ae eer | f(Qdl, R <r< +o. 
T Jil=r 


Thus the notion of the residue of f(z) is actu- 
ally related to the differential form f(z)dz and 
not to f(z) itself. 

From the first formula in this section and 
the formula for —a_,, the residue theorem 
follows (Cauchy, 1825): Let C be a rectifi- 
able Jordan curve in the complex plane. Let 
d,,..-,4, be a finite number of points inside C, 
and let D be a domain containing C and its 
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interior. If f(z) is a function holomorphic in D 
—{d,,...,4,}, we have 


{ m 
— dz= 5 R(a,). 
sh | flende= $ Ray 
Furthermore, if f(z) is holomorphic in the 
extended complex plane (including the point at 
infinity) except for a finite number of poles, the 
sum of all residues is equal to zero. 


F. Calculus of Residues 


The calculus of residues is a field of calculus 
based on application of the notion of residues. 
For example, we have methods for the calcu- 
lation of definite integrals. Actually, one of the 
reasons why Cauchy studied the theory of 
complex functions was that he believed that 
the theory would provide a unified method of 
computing definite integrals. For example, if 
(z) is a rational function without poles on 
the real axis and with a zero point at infinity 
whose order is at least 2, then we have 


| p(x)dx=2ni }, R(%; p(2)), (6) 


=% Ima>0 


| e*p(x)dx=2ni Y Rae e(z)). (7) 
= 69 Ima>0 

Here the sums are taken over all the poles in 
the upper half-plane. Formula (7) is valid also 
for a rational function @(z) with a simple zero 
at infinity. If @(z) has simple poles at a, (k= 
1,...,n) on the real axis, then we take the 
principal values of the integrals at those poles 
and add ziR(a,) (k=1,...,n) to the terms on 
the right-hand side of (6) and (7). Sometimes 
we use the residue theorem to obtain the value 
of the sum of a series (e.g., the ‘Gaussian sum) 
by expressing it as an integral. 

Let f(z) be a single-valued function that ts 
tmeromorphic and not identically equal to 
zero in a domain D, and let (z) be a function 
holomorphic in D. Draw a rectifiable Jordan 
curve C in D such that the interior of C is 
contained in D and f(z) has neither zeros nor 
poles on C. Let a,,...,%y and f,,..., Bp be the 
zeros and poles inside C, respectively (where 
each of them is repeated as often as its order). 
Then we have 


è BS septa ian es 
x p(a,)— 2 olb) = aa I (z) A dz. 


If o(z)=1, we get 

1 

=| darg f(jJ=N-P. 
2n Je 


This is called the argument principle. Next, 
let f(z) be a function meromorphic for |z| < 
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R< +æ, f(0)40, #00, and set w(z)=logz. 
Take C as a closed curve consisting of the 
boundary of an annulus 0<p<|z|<r<R 
(where p is sufficiently small) and two sides of 
a suitable fcrosscut joining a point of |z| = p 
and |z|=r. Then we obtain Jensen’s formula: 


04,025 ++ 9 Oy 


log 
By, Ba, +--> Bp 


=log|f(0)|+(N — P)logr 


Poa ; 
al log|f(re™)| dy. 
n Jo 


The argument principle can be utilized to 
prove Rouché’s theorem: Let f(z) and g(z) be 
functions holomorphic in a domain D that 
contains a rectifiable Jordan curve C and its 
interior. Suppose that f(z)+ Ag(z) never van- 
ishes on C for any 4 with 0< å< 1. Then the 
number of zeros of f(z) in the interior of C is 
equal to that of f(z)+g(z). If | f(z)|>|g(z)| on C 
or arg f(z)—arg g(z) 4(2n + 1)z (n is an integer), 
the hypothesis of Rouché’s theorem is satisfied. 
This theorem is useful in proving the existence 
of a zero of a complex function (for example, a 
polynomial) and in finding its position. 


G. Analytic Continuations 


Let f(z) be a holomorphic function tn a do- 
main D of the complex plane C and D* be a 
domain containing D as a proper subset. If 
there exists a function F(z) holomorphic in D* 
that coincides with f(z) in D, then F(z) is called 
an analytic continuation (or analytic prolonga- 
tion) of f(z) from D to D*. By the theorem of 
identity an analytic continuation of F(z) is 
uniquely determined if it exists. 

The function f,(z) defined by the power 
series P(z;a)= Ło a,(z—a)" with the radius 
of convergence r, >0 is holomorphic in the 
domain D; :|z—a|<rj,, and at a point b of D, 
it can be expanded into a power series P(z; b) 
=~ 9b, (z — 5)" with the radius of conver- 
gence r, (2r,—|b—al). Ifr,>r,—|b—al, the 
domain D,:|z—b|<r, is not entirely contained 
in D,. Let f,(z) be the function defined in D, 
by P(z; b). Then the function F(z) that is equal 
to f;(z) in D, and to f(z) in D, is an analytic 
continuation of f;(z) from D, to D, UD, (a 
direct analytic continuation by power series). 

We have the following classical theorems 
about analytic continuations: 

Let D, and D, be two disjoint domains, and 
suppose that their respective boundaries C; 
and C, are trectifiable simple closed curves 
and that the intersection of C, and C, contains 
an open arc T. If two holomorphic functions 
f,(2), f>(z) defined in D, and D,, respectively, 
have finite common ‘boundary values at every 
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point of F, then there exists an analytic con- 
tinuation F(z) of f(z) and f(z) to D, UTUD, 
(Painlevé’s theorem). We sometimes call f,(z) 
a continuation of f, (2) beyond F. If T is not 
rectifiable, the continuation beyond T does not 
exist, in general. 

Let f(z) be holomorphic in a ‘Jordan do- 
main D lying in the half-plane on one side of 
the real axis and containing an open interval 7 
of the real axis in its boundary. If f(z) has 
finite real boundary values at every point of I, 
then it can be continued analytically beyond I 
to the other side of the real axis; there the 
continued function is given by {(Z) (Schwarz’s 
principle of reflection). This theorem can be 
generalized to the case where the real interval 
is replaced by an tanalytic curve. 

A harmonic continuation of harmonic func- 
tions is defined analogously to analytic con- 
tinuation. Let D be a Jordan domain lying in 
the half-plane on one side of the real axis and 
having an open interval 7 on the real axis as a 
part of its boundary. If u(z) is harmonic in D 
and has the boundary value 0 at every point 
of I, then u(z) has a harmonic continuation 
beyond 1. 

For other properties of holomorphic func- 
tions — 43 Bounded Functions; 429 Tran- 
scendental Entire Functions. 
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A real-valued function f(t) of a real variable t 
is said to be analytic at t= ty if it can be repre- 
sented by a fpower series in t—t, in a neigh- 
borhood of tẹ in R. If f(t) is defined on an 
open set of R at every point of which it is 
analytic, then f(t) is called an analytic function, 
or, more precisely, a real analytic function. 

Analogously, a complex-valued function f(z) 
of a complex variable z defined on a tdomain 
D of the complex plane C is said to be analytic 
at z=Zp, (€ D) if it can be represented by a 
power series in z—Z, in a neighborhood of zo 
in C, and f(z) is an analytic function in D if it is 
analytic at every point of D. In the remainder 
of this article, we are concerned with analytic 
functions in this sense. To distinguish them 
from the real case, they are also called complex 
analytic functions. A complex analytic function 
f(z) defined on D is tdifferentiable in D; there- 
fore it is Tholomorphic in D. The converse is 
also true. Thus the term “analytic function” is 
synonymous with “holomorphic function” 
insofar as it concerns a complex function (i.e., 
a complex-valued function of a complex vari- 
able) on a domain, but in the theory of func- 
tions it takes on an additional meaning that is 
explained in the following section. 
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I. Analytic Functions in the Sense of 
Weierstrass 


Let a be a point of the tz-sphere and t the 
‘local canonical parameter at a; i.e., t=z—a if 
a#o and t=z_' if a=oo. Ifa power series 
P(z;a)=D0¢,t" has a positive radius r, of 
convergence, we call P(z; a) a function element 
with center a on the z-sphere, after K. Weier- 
strass. P(z;a)=D 9c,(z—a)" if ax oo, and 
P(za)= Dpeoc,z "tfa=oo. These represent a 
holomorphic function in |z—a|<r, or in r! < 
|z| < 00, respectively. If b is a point inside the 
circle of convergence of the function element 
P(z; a), by the *Taylor expansion of P(z; a) at 
z=b, we obtain the power series P(z; b) in 
z—b, which is a direct analytic continuation 
of P(z;a). Let a and b be two points on the 
z-sphere, and let C:z=z(s) (0<s<1,z(0)= 

a, z(1)=b) be a curve joining a and b. We say 
that P(z; a) is analytically continuable along 

C and that we obtain P(z; b) at the end point b 
by the analytic continuation of P(z; a) along C 
if the following two conditions are satisfied: (i) 
To every se[0, 1] there corresponds a function 
element P(z;z(s)) with center z(s); (ii) for every 
So €[0, 1], we can take a suitable subarc z= 
z(s) (|S —So|<é,€>0) of C contained inside 

the circle of convergence of P(z;z(so)) such 
that every function element P(z; z(s)) with 

|S —So|<é is a direct analytic continuation of 
P(z;z(So)). When P(z; a) and the curve C are 
given, the analytic continuation along C is 
uniquely determined (uniqueness theorem of 
the analytic continuation). 

Given a function element P(z; a) with center 
a, the set of all function elements obtained by 
every possible analytic continuation along 
every curve starting from a is called an ana- 
lytic function in the sense of Weierstrass deter- 
mined by P(z; a). In this definition, we can 
restrict the curves to polygonal lines. An ana- 
lytic function in this sense is completely deter- 
mined by a single arbitrary function element 
belonging to it, so two analytic functions are 
identically equal if they have a common func- 
tion element. 

A tgerm of a holomorphic function is iden- 
tical to a function element, and the set of all 
germs has the natural structure of a tsheaf O. 
In the terminology of sheaves, an analytic 
function is a connected component of ©, and 
an analytic continuation along a curve C is a 
continuous curve F in © whose projection is C. 


J. Values and Branches of Analytic Functions 


The value of an analytic function at a point b 
is, by definition, the value at b of its function 
elements with center b (whose existence is 
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assumed; there may be several such elements). 
An analytic function is, in general, a multiple- 
valued function because analytic continua- 
tions along different curves with the same end 
points may lead to different function elements. 
For a given analytic function f(z), if the max- 
imal number of its function elements with the 
same center is n, we say it is n-valued, and if 
n>2 we Say it is multiple-valued (or many- 
valued). The number of function elements of 
f(z) with the same center is at most tcountably 
infinite, so the value of f(z) at a point is a 
countable set (Poincaré-Volterra theorem). By 
introducing a fRiemann surface instead of the 
complex plane as the domain of definition of 
an analytic function, we can regard multiple- 
valued analytic functions as single-valued 
functions defined on a suitable Riemann sur- 
face (— 367 Riemann Surfaces). 

Let f(z) be an analytic function and P(z;a) be 
a function element belonging to f(z), where a is 
a point of a domain D. The set of all function 
elements obtained from P(z; a) by every pos- 
sible analytic continuation along all curves in 
D is called a branch of f(z) in D determined by 
P(z; a). When D coincides with the whole com- 
plex plane, the branch of f(z) in D is the func- 
tion f(z) itself. A function holomorphic in a 
domain D can be expanded in a power series 
with any point of D as its center, and the set of 
these power series (function elements) consti- 
tutes a branch of an analytic function. 

If analytic continuations of a function ele- 
ment are possible along all curves in D, then 
the analytic continuations along two tho- 
motopic curves in D lead to the same result 
(monodromy theorem). In particular, if D is 
‘simply connected and if analytic continua- 
tions of P(z; a) are possible along all curves in 
D starting from a, then the branch of f(z) in D 
determined by P(z; a) is single-valued. 


K. Invariance Theorem of Analytic Relations 


Suppose the following four conditions hold: (1) 
F(z, w) is a holomorphic function of two vari- 
ables for zeA, and weA,, where A,, A, are 
domains in the complex plane. (2) A curve C: 
z=2z(s) (O<s<1,z(0)=a, z(1)=b) and two sets 
of function elements P(z; z(s)) and Q(z; z(s)) 
defined for every s (0<s<1) are given. (3) 

P(z; a) and Q(z; a) can be continued analyti- 
cally along C using P(z; z(s)) and Q(z; z(s)), 
respectively. (4) There exists a positive number 
R(s) for every s (0<s< 1) such that, if |z—z(s)| 
< R(s), the values of P(z; z(s)) and Q(z; z(s)) 
belong to A, and A,, respectively. Under these 
conditions, if F(P(z; a), Q(z; a))=0 holds for 
|z—a|< R(0), then F(P(z; b), Q(z; b)) =0 holds 
for |z—b|< R(1). In other words, an analytic 
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relation between function elements belonging 
to two analytic functions that holds in a neigh- 
borhood of the starting point of a curve C is 
conserved for function elements with center at 
the terminal point b of C. This is called the 
invariance theorem of analytic relations. The 
same statement is valid for relations among 
more than two analytic functions and their 
derivatives (differential equations). 


L. Inverse Functions 


Suppose that P(z; a) (a4 co) belongs to an 
analytic function f(z) and P’(a; a) #0. We 
consider the inverse function of P(z;a) ina 
neighborhood of a and let B(w; «) («= P(a; a)) 
be its expansion as the power series in w—«. 
We call 8(w; «) the inverse function element (or 
simply inverse element) of P(z; a) and the ana- 
lytic function determined by B(w; æ) the inverse 
analytic function (or simply inverse function) of 
f(z). The inverse function is completely deter- 
mined by f(z) and is independent of the choice 
of P(z; a). For example, analytic functions rep- 
resented by Jw or log w are defined as the 
inverse function of z? or e7, respectively. 


M. Singularities of Analytic Functions 


Hereafter, when we speak of a curve C:z=2(s) 
(O<s< 1), it is always supposed that C is a 
curve in the complex plane starting at a and 
ending at w. Let K, be the open disk |z— œ| < 
r; we denote by C, the connected component 
of CNK, that contains œ. If analytic continu- 
ations of P(z; a) are possible along any subarc 
of C with a terminal point arbitrarily near œ 
but impossible along the whole C, we say that 
the analytic continuation of P(z; a) along C 
defines a singularity Q of the coordinate œ, and 
that Q lies over w. For example, if P(z; a) has 
a finite radius of convergence, for a suitable 
point œw on the circle of convergence the ana- 
lytic continuation of P(z; a) along the radius aw 
defines a singularity over w. Now take a point 
z, on C, and denote by F.(z) the branch of an 
analytic function determined by P(z;z,) in K,. 
Let Q be a singularity determined by C and 
P(z;a), and suppose that we are given another 
singularity Q* over œ determined by C* and 
P*(z; a*). If they define the same branch F(z) 
for every K,, by definition, we put QN=O*. 
Thus F,(z) defines an tunramified covering 
surface W, of K,— {œ}, and it is single-valued 
on W.. 

Singularities are classified according to the 
geometric structure of W, and the value distri- 
bution of F,(z) on it. First, if W, has no trela- 
tive boundary over 0<|z—@|<r for a suitable 
r, then Q is called an isolated singularity of the 


745 


analytic function. In this case, the number k of 
points of W, lying over a point z in K,— {a} 

is constant. If k= 00, W, has a tlogarithmic 
branch point over œ, and Q is called a loga- 
rithmic singularity. If k < oo, F,(z) can be repre- 
sented as a single-valued holomorphic func- 
tion in 0<|t|<r’'* by putting z=% +t". In this 
case, if we introduce an additional point P, 
corresponding to z =w, then W.U {Po} has 
only an talgebraic branch point over w. Now, 
taking into account the value of w= F,(z), we 
call Q an algebraic singularity if lim w exists. In 
this case, we have F.(z)=2-,c,t”, and if we 
admit analytic continuations in the wider 
sense (—> Section O), P(z; a) is analytically con- 
tinuable along the whole C. 


N. The Natural Boundary 


Given a domain D and an analytic function 
J(z} holomorphic in D, if all boundary points 
of D are singularities of f(z) and f(z) is not 
continuable to the exterior of D, the boundary 
of D is called the natural boundary of f(z). This 
phenomenon was first discovered for telliptic 
modular functions. Many results are known 
about power series for which the circle of 
convergence is the natural boundary (— 339 
Power Series). For any given domain D in C, 
there exists an analytic function whose natural 
boundary is the boundary of D. The original 
proof of this fact, given by Weierstrass, con- 
tained a defect that was corrected by J. Besse. 


O. Analytic Continuation in the Wider Sense 


Let two tLaurent series (with parameter t) z= 
P(t)= 22, a,t” and w=Q(t)=Z2,5,t" (k 

and / are integers, and a,b, #0) converge in 
0<|t|<r, and let (P(t,), Q(¢,)) A(P(t2), Q(t2)) 
if t; At,; then we say that the pair (P, Q) de- 
fines a function element in the wider sense. If a 
change of parameter t=r,t+r,t?+... (7,4 

0 and the radius of convergence > 0) gives 
P(t)= (1), O(t)= K(t), we say that (M, K) and 
(P, Q) define the same function element. By a 
suitable choice of parameter, any function 
element can be given in the form z=t*+ a (or 
a=t "),w=>~,b,t", and the elimination of t 
gives the representation of w as a tPuiseux 
series of z. So if k= 1 and />0, it reduces to a 
holomorphic function element. When k= 1, 
with !<0 not excluded, the above element is 
called a rational element. If k > 1 it is called a 
ramified element, and if /<0 it is called a polar 
element. 

If P’, Q’ are the direct analytic continuations 
of P and Q at tọ (O<|fo|</r), i.e., their Taylor 
expansions at ty, the function element (P’, Q’) is 
called a direct analytic continuation of (P, Q), 


198 P 
Holomorphic Functions 


which is also considered its own direct analytic 
continuation. For a fixed r, the set of all direct 
continuations of (P, Q) thus obtained is called 
an analytic neighborhood of (P, Q), and these 
neighborhoods define a topology in the set of 
all function elements. A curve in this topolog- 
ical space is called an analytic continuation in 
the wider sense, and a tconnected component 
of this space is called an analytic function in 
the wider sense. An analytic function in the 
wider sense is a set of function elements in the 
wider sense, but it can also be regarded as a 
function w= f(z) (with an independent variable 
z and a dependent variable w) defined by each 
function element p(z, w): z = P(t), w= Q(t). 

An analytic continuation in the wider sense, 


p(s) = p(z, w; s); 


w= > c,(s)t", 


n=I(s) 


z=z(s)+ t9, 0<ss<ÎÍ, 
is sometimes called an analytic continuation 
along the curve C:z=2(s)(0<s<1) in the 
complex plane. If all p(s) are holomorphic 
function elements, this coincides with the 
analytic continuation along C in the original 
sense, but if this is not the case, p(0) and C do 
not necessarily determine p(1) uniquely. Actu- 
ally, an analytic function in the wider sense is 
just an analytic function in the original sense 
with at most a countable number of ramified 
or polar elements added. 


P. Singularities of Analytic Functions in the 
Wider Sense 


Suppose the following three conditions hold: 
(1) For every point on C except a, that is, for 
z(s) (0<s< 1), a function element in the wider 
sense p(z, w; s) is given. (2) For every 4 (<1), 
p(z, w; s) (0< s <A) constitutes an analytic con- 
tinuation in the wider sense. (3) It is impossible 
to find a function element p(z, w; 1) such that 
p(z, w; s)(0<s<1)is an analytic continuation 
in the wider sense. When these three condi- 
tions are satisfied, we say that p(z, w; s) (0 <s < 
1) defines a transcendental singularity Q with 
@ as its coordinate. The method of determin- 
ing a branch w= F,(z) in an open disk with 
center w is completely parallel to the case of 
holomorphic analytic functions. Because of the 
appearance of function elements in the wider 
sense in F,(z), the covering surface W, of K, 
defined by F,(z) may have algebraic branch 
points. If W, has a logarithmic branch point 
over œ, Q is called a logarithmic singularity. If 
W, has no point over œ for suitable r, Q is 
called a direct transcendental singularity; other- 
wise, it is called an indirect transcendental 
singularity. The logarithmic singularities are 
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direct singularities. The inverse function of z= 
wsinw has a direct singularity over z= œ 

that is not logarithmic, and the inverse func- 
tion of z=(sin w)/w has an indirect singularity 
over z=0. Taking into account the value of 
w= F(z), if the tcluster set of F, at Q: So = 
(\->0 {F,(z)} consists of only one point, it is an 
ordinary singularity; if not, it is an essential 
singularity. 


Q. History 


A function of a complex variable is monogenic 
in the sense of A. L. Cauchy if it is differenti- 
able at every point of its domain of definition. 
It was B. Riemann who succeeded in develop- 
ing Cauchy’s concept. Riemann considered an 
analytic function as a function defined on a 
*Riemann surface, that is, a 1-dimensional 
complex analytic manifold. On the other hand, 
Weierstrass constructed the theory of analytic 
functions starting from power series. When we 
speak of single-valued functions defined in a 
domain of the complex plane, the monogenic 
functions of Cauchy and the analytic functions 
of Weierstrass are identical. Although the 
analytic functions are very special functions, 
the study of complex analytic functions is 
usually called the theory of functions of a 
complex variable, or simply the theory of 
functions. 

By considering the following point set C, 
which is more general than a domain, E. Borel 
showed that a monogenic function on C is not 
necessarily holomorphic in the ordinary sense. 
Take a countable dense subset in a subdomain 
D’ of a domain D and a double sequence of 
positive numbers {r}. Put SM = {z||z—z,|< 
rP} and C”=D—\|),;2, SP. By a suitable 
choice of r, we can suppose that the C™ are 
connected and monotone increasing with 
respect to h. Put C=| 2, C™. A function 
defined in C is by definition monogenic if it is 
differentiable in C® for every h. For such a 
monogenic function, Cauchy’s tintegral for- 
mula in a generalized form holds, and the 
function is infinitely differentiable. If f(z) and 
g(z) are monogenic in C and coincide on a 
curve in C, then they are identical in C. Let D 
be the set {z|0<Rez<1,0<Imz<1} and {z,} 
be all rational points in D (z,=(p+ig)/m). For 
a natural number h, we define C™ to be the set 
D minus the union of open disks with radius 
exp(—e”™)/h and center (p + ig)/m. The 
function 


fo} S 


is monogenic in C in the above-mentioned 
sense, but not holomorphic in C. The study of 


exp(—e”") 
z—(p+igq)/m 
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these functions developed into the theory of 
tquasi-analytic functions. 

The concept of tanalytic functions of several 
complex variables can also be defined analo- 
gously to the case of one variable. Then non- 
uniformizable singularities appear that lead to 
a generalization of the concept of manifolds 
(— 23 Analytic Spaces). 
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A. General Remarks 


Let M be a ‘differentiable manifold. If a tLie 
group G acts ttransitively on M as a tLie 
transformation group, the manifold M is said 
to be a homogeneous space having G as its 
transformation group (— 431 Transformation 
Groups). The tstabilizer (isotropy subgroup) 
H, of G at a point x of M is a closed subgroup 
of G, and a one-to-one correspondence be- 
tween G/H, and M preserving the action of G 
is defined by associating the element sH, (se G) 
of G/H, with the point s(x) of M. This corre- 
spondence is a tdiffeomorphism between the 
manifold M and the quotient manifold G/H, if 
the number of connected components of G is 
at most countable. Under this condition we 
may therefore identify a homogeneous space 
M with the quotient manifold G/H of a Lie 
group G by a closed Lie subgroup H (— 249 
Lie Groups). However, H is not uniquely 
determined by M, and it may be replaced by 
Hœ =sH,s ‘(seG). Each element h of the 
stabilizer H, at a point x induces a linear 
transformation h on the ttangent space V, of 
M at the point x. The set A, of all his called 
the linear isotropy group at the point x. 


199 B 
Homogeneous Spaces 


If we represent the homogeneous space M 
as G/H, we obtain the canonical map z:s->sH 
of G onto M, which we call the projection of G 
onto M. Let g be the Lie algebra of G, and þh 
the Lie subalgebra corresponding to the closed 
subgroup H. When we identify g with the 
tangent space at the identity element e of G 
and b with its subspace, the projection z in- 
duces a linear isomorphism of g/b with the 
tangent space V, of M at the point x =z(e). 
The tadjoint representation of G gives rise to a 
linear representation h-Ad(h) modulo b of the 
group H on the linear space g/b. Through the 
linear isomorphism between g/b and the tan- 
gent space V, defined by the projection z, this 
representation of H is equivalent to the one 
which associates with h the linear transforma- 
tion h defined by h on the tangent space V,. 

The homogeneous space G/H is said to be 
reductive if there exists a linear subspace m of 
g such that g=h+m (direct sum as linear 
spaces) and (Ad H)mcm. H is said to be re- 
ductive in g if the representation h- Ad(h) of 
H in g is tcompletely reducible. 

If a ttensor field P on the homogeneous 
space M =G/H is G-invariant (namely, invar- 
iant under the transformations defined by the 
elements of G), then the value of P at the point 
x =7(e) is a ftensor over the tangent space V, 
at x which is invariant under the linear iso- 
tropy group A. Conversely, such a tensor 
over V, is uniquely extended to a G-invariant 
tensor field on M. If G/H is reductive, then G- 
invariant tensor fields over M are in one-to- 
one correspondence with H-invariant tensors 
over m. For instance, if H is compact, then H 
is reductive in g and an A-invariant positive 
definite quadratic form on m defines a G- 
invariant fRiemannian metric on G/H. 

We say that the homogeneous space M = 
G/H is a Riemannian (linearly connected, 
complex Hermitian, Kahler) homogeneous 
space if there exists on M a G-invariant Rie- 
mannian metric (*linear connection, tHer- 
mitian metric, Kahler metric). Concerning 
such homogeneous spaces, there are various 
results on their structures and geometric prop- 
erties [1-5] (— 412 Symmetric Riemannian 
Spaces and Real Forms; 427 Topology of Lie 
Groups and Homogeneous Spaces). 


B. Examples 


Stiefel Manifold. A k-frame (1 <k <n) in a real 
n-dimensional Euclidean vector space R” is an 
ordered system consisting of k linearly inde- 
pendent vectors. If we regard the real tgen- 
eral linear group of degree n, GL(n, R), as the 
regular linear transformation group of R”, 
GL(n, R) acts transitively on the set V, ,(R) of 
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all k-frames in R”. Therefore, if H denotes the 
subgroup of GL(n, R) consisting of the ele- 
ments which leave fixed a given k-frame v}, 
we may identify the set V,’, and the quotient 
set GL(n, R)/H. Transferring the differenti- 
able manifold structure of GL(n, R)/H to V’ x 
through this identification, we see that V; (R) 
=GL(n, R)/H becomes a homogeneous space 
(the differentiable manifold structure of V, , is 
defined independently of the choice of vå). 
The space V; (R) is called the (real) Stiefel 
manifold of k-frames in R”. 

A k-frame is called an orthogonal k-frame 
if the vectors belonging to the frame are of 
length 1 and are orthogonal to each other. The 
set V, (R) of all orthogonal k-frames is a sub- 
manifold of V; (R). The torthogonal group 
O(n) acts transitively on V; (R), which is a 
homogeneous space represented as V, ,(R)= 
O(n)/1, x O(n —k). The manifold V, (R) is 
actually the (n— 1)-dimensional sphere. We call 
V, (R) the (real) Stiefel manifold of orthogonal 
k-frames (or simply Stiefel manifold). The 
complex Stiefel manifold V, ,(C) = U(n)/I, x 
U(n—k) is defined analogously. 


Grassmann Manifold. Let M, ,(R) (1 <k <n) 
be the set of all k-dimensional linear sub- 
spaces of R”. The group O(n) acts transitively 
on M,,,(R), so that we may put M, ,(R)= 
O(n)/O(k) x O(n—k). Here O(k) and O(n—k) 
are identified with the subgroups of O(n) con- 
sisting of all elements leaving fixed every point 
of a fixed (n—k)-dimensional subspace and of 
its orthogonal complement, respectively. In 
this way, M, a(R} is a homogeneous space, 
which we call the (real) Grassmann manifold. 
The tproper orthogonal group SO(n) acts 
transitively on M,, (R), and M,, ,(R) may be 
represented as a homogeneous space having 
SO(n) as its transformation group. It follows 
that M,, ,(R) is connected. The homogeneous 
space M,, ,(R) = SO(n)/SO(k) x SO(n—k) is 
called the Grassmann manifold formed by 
oriented subspaces. M,, (R) and M, ,(R) may 
be identified with the (n— 1)-dimensional real 
projective space and the (n — 1)-dimenstonal 
sphere, respectively. l 

Applying the above process for real Grass- 
mann manifolds to the complex Euclidean 
vector space C” instead of R”, we see that the 
set M„ (C) of all k-dimensional linear sub- 
spaces in C” is a homogeneous space with 
the tunitary group U (n) of degree n as its 
transformation group, and we represent it as 
U(n)/U(k) x U (n —k). This space is called the 
complex Grassmann manifold. The mani- 
fold M, (C) is a simply connected complex 
manifold and has a cellular decomposition 
as a tCW complex whose cells are tSchubert 
varieties (— 56 Characteristic Classes E). On 
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the other hand, M,, (C) may be regarded as 
the set of all (k — 1)-dimensional linear sub- 
spaces in the (n— 1)-dimensional complex 
projective space. Then, by using the tPlicker 
coordinates of these subspaces, M,, ,(C) can be 
realized as an talgebraic variety without sin- 
gularity in the projective space of dimension 


a —1 (= 90 Coordinates B). Sometimes 


M,,,,(R) is denoted by G,,,(R) or G(n, k). In the 
same way, the homogeneous space represented 
as Sp(n)/Sp(k) x Sp(n—k) is called the tquater- 
nion Grassmann manifold and is denoted by 
M, (H). 


Flag Manifold. Let k,,...,k, be a sequence of 
integers such that n>k,>...>k,>0, and let 
F(k,,...,k,) be the set of all monotone se- 
quences V; >... > V, where V,(i=1,...,r)isa 
k,-dimensional linear subspace in R”. For the 
two sequences V, >... > V, and Vj>... 3 W 
belonging to F(k,,...,k,), there exists an ele- 
ment se GL(n, R) such that s(V,)= V; (i= 
1,...,r). Therefore F(k,,...,k,) is a homogene- 
ous space with GL(n, R) as its transforma- 
tion group, and is called the proper flag mani- 
fold. Since the unitary group U(n) of degree n 
acts transitively on it, F(k,,...,k,) 1s also re- 
garded as a homogeneous space admitting 
U(n) as its transformation group. In this case, 
putting F(k,,...,k,)=U(n)/H, H is isomorphic 
to the direct product U (k; —k,) x U(k, —k3) 

x ... x U(k,). In particular, when r=n—1, 
k,=n—i, the homogeneous space is the quo- 
tient space of the compact Lie group U (n) by a 
maximal ftorus T. In general, the quotient 
space G/T of a compact connected Lie group 
G by a maximal torus of G is called a flag 
manifold. If G acts effectively on G/T, Gis a 
tsemisimple compact Lie group. The complex 
Lie group G° is a Lie transformation group of 
tbiregular transformations which acts transi- 
tively on the flag manifold G/T, a simply con- 
nected Kahler homogeneous space. Here G° is 
a complex Lie group having G as a maximal 
compact subgroup. If B is a maximal tsolvable 
Lie subgroup (tBorel subgroup) of G°, G/T is 
represented as G°/B. 
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A. General Remarks 


Homological algebra is a new branch of math- 
ematics that developed rapidly after World 
War II. The introduction of the theory was 
motivated by the observation that some alge- 
braic ideas and mechanisms that arose in 

the development of talgebraic topology, in 
particular, thomology theory, can provide 
powerful tools for treating from a unified 
viewpoint various problems in algebra that 
previously were treated differently. One of its 
characteristic features lies in emphasizing, 
from the standpoint of categories and functors 
(— 52 Categories and Functors), the functional 
structure of the objects to be investigated 
rather than their inner structure. Thus the 
theory of derived functors constitutes the main 
theme of homological algebra. This new 
theory turned out to have wide applications in 
other areas of mathematics, and the philos- 
ophy embodied in the theory has been influen- 
tial in the general progress of mathematics. 
For general references — [2,5, 6]; for tsheaf 
cohomology — [3,4,8]. 


B. Graded Modules and Graded Objects 


Let A be a tring with unity element and X bea 
tunitary A-module. If we are given a sequence 
of A-submodules X, (ne Z) such that X = 
Ynez X, (*direct sum), we call X a graded 
A-module and X, the component of degree n 

of X. Each element x of a graded A-module 

X has a unique representation x = ¥,,-7 Xn 
(x, € X,); we call x, the component of degree n 
of x. An A-submodule Y of a graded A-module 
X is called homogeneous if xe Y implies x,¢ Y 
(ne Z). In this case, Y = $, Y, and the quotient 
module X/Y=),, X,,/Y, are graded A-modules, 
where Y,= YN X,. Let X =X, X, and Y=), Y, 
be graded A-modules and f: X > Y be an A- 
homomorphism. If there is a fixed integer p 
such that f(X c Y,,, for any neZ, f is called 
an A-homomorphism of degree p. In this case, 
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Ker f=, Ker f, and Im f=, Im f,_, are 
homogeneous A-submodules of X and Y, 
respectively, where fa: X,,— Yp+p is the restric- 
tion of f on X,. 

Sometimes, by a graded module we mean 
only a sequence {X,,} of A-modules X,,, with- 
out considering the direct sum ¥,, X,. Simi- 
larly, we have the notion of a graded object 
{C,} in any category @. 


C. Chain Complexes and Homology Modules 


By a chain complex (X, 0) over A we mean a 
graded A-module X = ¥,, X, together with an 
A-homomorphism 6: X +X of degree —1 such 
that 6od=0. Hence a chain complex over A is 
completely determined by a sequence 


Cnet ĉ 
e> Xn S Xn OX ee 


of A-modules and A-homomorphisms such 
that 6,06,,, =0 for all n. We call ð the bound- 
ary operator. For a chain complex (X, ô), we 
write Kerdé=Z(X), Ker ô, = Z,(X), Im ô= B(X), 
Imô, +; =B,(X). Then Z(X)=5£,Z,(X), B(X) 
=), B,(X) are homogeneous submodules of 
X, called the module of cycles and the module 
of boundaries, respectively. B(X) is a homog- 
eneous submodule of Z(X), and the quotient 
modules Z(X)/B(X), Z,(X)/B„(X) are denoted 
by H(X), H,(X), respectively. We call H(X)= 
>, H,(X) the homology module of the chain 
complex (X, ô). 

If (X, ô), (Y, 6’) are chain complexes over 
A, an A-homomorphism f:X > Y of degree 
0 satisfying d'o f =f oô (i.e., Of, =f,_16,) 
is called a chain mapping of X to Y. For a 
chain mapping f, we have f{(Z,(X))< Z,(Y), 
J(B,(X))< B,(Y), and hence f induces an A- 
homomorphism f: H(X)—>H(Y) of degree 0, 
which is called the homological mapping in- 
duced by f. We have (1x) = lig, and (go f), 
=g,.0 f, for chain mappings f:X—>Y and g: 
YZ. 

Let f, g:X >Y be two chain mappings. If 
there is an A-homomorphism D: X > Y of 
degree +1 such that f—g=Dodé+0'0D, we 
say that f is chain homotopic to g and write 
f ~g; D is called a chain homotopy of f to g. 
If f is chain homotopic to g, we have f, = 
g,:H(X)~>H(Y). For chain complexes X 
and Y, if there are chain mappings f:X — Y 
and g: YX such that fog~1l,y and gof~1y,, 
we say that X is chain equivalent to Y. In this 
case f: H(X)—>H(Y) is an isomorphism and 
gą: H(Y)> H(X) is its inverse. 

Let (X, 0) be a chain complex over A and 
Y=, Y, be a homogeneous A-submodule 
of X such that ôY c Y. Then Y and X/Y are 
chain complexes over A with the boundary 
operators induced by ô. Y is called a chain 
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subcomplex of X, and X/Y is called the quo- 
tient chain complex of X by Y or the relative 
chain complex of X mod Y. For a chain com- 
plex X and its subcomplex Y we have an 
texact sequence 0 YX 4 X/Y-0, where i is 
the tcanonical injection and j the tcanonical 
surjection. 

Let (W, 0’), (X, ô), (Y, 6”) be chain com- 
plexes over A, and f: WX, g:X—Y be chain 
mappings such that 0> WX 4 Y-0 is 
exact. Then an A-homomorphism 6, :H(Y)> 
H(W) of degree —1, called the connecting 
homomorphism, is defined by 6,(y + B(Y)) 
=f~'odog '(y)+ B(W) (yeZ(Y)), and we 
have the exact sequence of homology: 


(W)5H,(X)5H,(¥) 
SHAW) SH, AIX) ORY) oe 
For a ‘commutative diagram 
0-W >X >Y 0 

tb y fy (1) 
0>W'>X'>Y'>0 


consisting of chain complexes and chain 
mappings in which each row is exact, we have 
0,0W,=9,00,:H(Y)>H(W’). 

For the tinductive limit lim X, of chain com- 
plexes X, over A, we have 


H(lim X,)=lim H(X,). 


A chain complex X is said to be positive if X, 
=0 for all n<0. If X is a positive chain com- 
plex over A and M is an A-module, then we 
mean by an augmentation of X over M an A- 
homomorphism £: Xo—>M such that the com- 
position X, SX >M is trivial:eo 0, =0. A 
positive chain complex X together with an 
augmentation ¢ of X over M is called an aug- 
mented chain complex over M. It is said to be 
acyclic if the sequence 


sek OX os x SSO 


is exact, namely, if H,(X)=0 (n 40) and £ 
induces an A-isomorphism H,(X)= M. In this 
case X is also called a left resolution of M. 
Moreover, if each X, is a fprojective A- 
module, X is called a left projective resolution. 
For any A-module M, there exists a left pro- 
jective resolution of M. 

Let x: M >M’ be an A-homomorphism of A- 
modules, and X, X’ be augmented chain com- 
plexes over M, M’ having augmentations e, £’, 
respectively. Then a chain mapping f: X > X’ 
satisfying £'o fg =a0¢ is called a chain map- 
ping over «. If X, X’ are left projective reso- 
lutions of M, M’, respectively, then there exist 
chain mappings of X to X’ over «, and any 
two such mappings are chain homotopic. In 
particular, a left projective resolution of an A- 
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module M is uniquely determined up to chain 
homotopy. 


D. Tor 


Given a right A-module M and a left A- 
module N, Z-modules Tor4 (M, N) (n= 
0,1,2,...), called the torsion products (or Tor 
groups), are defined as follows: Let 


¥:..0Y,3Y,.,3...5%5N20 


be a projective resolution of N, and consider 
the chain complex 


(M@ 4Y,1@0):... 
1@é, 


>M Q 4¥, 7M Q 4h%y-17>--- 
Luly ®) ato >Q 

obtained by forming the ttensor product of M 
and Y. Then we see that the homology module 
H,(M ® 4Y) is uniquely determined for any 
choice of left projective resolution of N. We 
define Tor4(M, N)=H,(M @ 4Y). In partic- 
ular, we have Tord(M,N)=M® 4N. 


Properties of Tor. (1) If M is a tflat A-module, 
we have Tor4(M, N)=0 (n=1,2.,...). 

(2) An A-homomorphism f: M, >M, in- 
duces a homomorphism f,: Torf (M, N)> 
Torý(M,3, N). We have (14) = 1, and (go f), = 
Gx Of, for f:M,>M,, 9:M,>Ms3. 

(3) For an exact sequence 0>M, 4M,% 
M,-0, we have the following exact sequence 
of Tor: 


... 3Tor4(M,, N)STor4(M,, N) 
% Tor4(M,, N)ŠTor4 ,(M,, N)>... 
+Tor4(M;, N)ŠM,®@ ,N>M,@ ,N 
>M,® ,N-0, 


where 0, are the connecting homomorphisms. 
(4) For a commutative diagram 


0-M,-M,-M,-0 


le } |e 
0-M',>-M5-M;3-0 


of A-modules and A-homomorphisms with 
exact rows, we have 0,0W,=@, 06,. 

(5) Tory (È, Ma, N)= 2X, Torn(M,, N) 

(6) Tor; (lim M,, N)=lim Tor4(M,, N). 

On the other hand, take a left projective 
resolution X of M and consider the chain 
complex X © ,N. Then we have H,(X Q ,N) 
= Tor4(M, N) for n=0, 1,.... Therefore 
properties similar to (1)—(6) hold with respect 
to the second variable N of Tor4(M, N). 

(7) If A° is a ring tanti-isomorphic to A, 
then Tor4(M, N)= Toré (N, M). In particular, 
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if A is commutative, then Tor4(M, N) is an A- 
module and we have Tor4(M, N)=Tor4(N, M). 
(8) Let A be a tprincipal ideal ring. Then 
Tor4(M, N)=0 (n=2,3,...) and Tor?(M, N) is 
also denoted by M x ,N. For an exact se- 
quence 0—> M, >M, —>M,—>0, we have the 
exact sequence 0> M, * ,N>M,*,N- 
M3* 4N>M, @ s4N>M, © 4N>M;, 84N > 
0. In particular, Z*,N =0 and (Z/nZ)*,N=,N 
(={xeN|nx=0}). 


Universal Coefficient Theorem for Homology. 
If (X, 6) is a chain complex over A and N isa 
left A-module, then (X © ,N,é@ 1) is a chain 
complex. If A is a principal ideal ring and each 
X,, is a ttorsion-free A-module, then we have a 
formula 


H,(X @ 4N) 
= H,(X) Q aN + A,-1(X)* 4N, 


called the universal coefficient theorem. 


E. Double Chain Complexes 


By a double chain complex (X, 4, 0’, 0”) over 
A we mean a family of left A-modules X, , 
(p,q¢Z) together with A-homomorphisms 
Ogi X pig? Xp-1.4 aNd OF: Xp, >X p.a- Such 
that ĉp-1,4 0 0p,q=Op,q~1 O Ôp.a = Îp.q-1 O Op. 
+ 671,406, ,=0. We define the associ- 

ated chain complex (X,,, 8) by setting X,= 

E ptacnX p,a On =Z p+q-n0p a Popa: We call ô 
the total boundary operator, and 0’, 6” the 
partial boundary operators. 

Given a chain complex X consisting of right 
A-modules and a chain complex Y consisting 
of left A-modules, a double chain complex 
(Zp, 0’, 0") is defined by setting Z, = 
Xp 8 4%, p= 0 Q l, ôg a= (—1)1 @ Ôg» 
where ĉp, 6, are the boundary operators of X, 
Y, respectively. It is called the product double 
chain complex of X and Y, and the homology 
module of its associated chain complex is 
denoted by H(X © 4Y). With respect to this 
homology module, the following facts hold. If 
X is a left projective resolution of a right A- 
module M and Y is that of a left A-module N, 
then H,(X @ ,Y)=Tor4(M, N). If A is a prin- 
cipal ideal ring and each X, is a torsion-free A- 
module, then we have the formula 
H,(X @4Y)= È} H,(X)® 4H,(Y) 


ptq=n 


+ $ H,(X)* 4H,(Y), 


ptq=n-1 
the Kiinneth theorem. 
F. Cochain Complexes 


By a cochain complex (X,d) over A we mean a 
graded A-module X together with an A- 
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homomorphism d: X >X of degree +1 such 
that dod=0; d is called the coboundary oper- 
ator or differential. For a cochain complex 
(X,d), we denote by X” the component of 
degree n of X, and by d”, X"> X"*! the restric- 
tion of d on X”. Then a chain complex (Y, ĉ) is 
defined by Y, = X “" and ô,: Y, —> Y,_, is equal 
tod™:X tS: 

For a cochain complex (X, d), we write 
Kerd"=Z"(X), Kerd= Z(X) (Z(X)=XZ"(X)), 
Imd”! = B"(X), Imd = B(X) (B(X) = B"(X)), 
and Z2"(X)/B"(X)=H"(X)), Z(X)/B(X) = 
H(X) (H(X)= È H"(X)). These modules 
Z(X)(Z"(X)), B(X) (B"(X)), and H(X) (H"(X)) 
are called the module of cocycles, the module of 
coboundaries, and the cohomology module of 
X, respectively. If we consider the associated 
chain complex (Y, ô) of (X,d), then H_,(Y) 
corresponds to H"(X). In this way, results on 
chain complexes give results on cochain com- 
plexes. Thus the concepts of cochain mapping, 
cochain homotopy, cochain equivalence, cochain 
subcomplex, and relative cochain complex can 
be defined as in the case of chain complexes in 
B, and we have corresponding results. In par- 
ticular, given an exact sequence 0— WSX45Y 
—0 of cochain complexes and cochain map- 
pings, the connecting homomorphism d,,:H"(Y) 
— H"*!(W) is defined, and the exact sequence 
of cohomology 


SAW) SAX) SHY) 
SH (WSH (X) a (Y)3... 
exists. For a commutative diagram 
0>W >X >Y >0 
le Low 
0-W'> x’ Y'>0 


of cochain complexes and cochain mappings 
with exact rows, we have d, of, =9, od,. 

A cochain complex X is said to be positive if 
X"=0 for n<0. If X is a positive cochain 
complex over A and M is an A-module, we 
mean by an augmentation of X over M an A- 
homomorphism ¢:M— X° such that the com- 
position MX o4, y1 is trivial. If the sequence 


0 n 
oM SK Seas SK 


is exact, X is called a right resolution of M. 
Moreover, if each X” is an tinjective A- 
module, X is called a right injective resolution 
of M. For any A-module M, there exists a 
right injective resolution of M, and any two 
such resolutions are cochain homotopic. 


G. Ext 


Given left A-modules M and N, Z-modules 
Ext",(M, N) (n=0,1,2,...), called the Ext 
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groups, are defined as follows: Let X:...>X, 
—~X,_17... 7X )7~M-0 be a projective reso- 
lution of M, and consider the cochain complex 


Hom ,(X, N): 
0>Hom,(Xo9, N)>...>Hom,(X,_,,N) 
+Hom,(X,, N)>... 


obtained by forming the tmodule of A- 
homomorphisms. Then we can show that the 
cohomology module H"(Hom,(X, N)) is 
uniquely determined for any choice of projec- 
tive resolution of M. We define Ext",(M, N) 
= H"(Hom ,{X, N)). This can also be defined 
as the cohomology module H"(Hom,(M, Y)) 
of the cochain complex Hom, (M, Y):0> 
Hom,(M, Y°)>...+Hom,(M, ¥" 1)> 
Hom,(M, Y")>..., where Y:03N—-Y°3...3 
Y" Y”... is a right injective resolu- 

tion of N. Furthermore, for a left projective 
resolution X of M and a right injective re- 
solution Yof N, we see that Ext (M, N) is 
isomorphic to the cohomology module 
H"(Hom,(X, Y)) of the associated cochain 
complex of the double cochain complex 
Hom,(X, Y)=(Hom,(X,, Y°), d', d"), where 
d, a: Hom, (Xp, ¥)>Hom,(X,.,, Y°) and 

d; a: Hom, (Xp, Y)>Hom,(X p, Y9) 

are given by d, (u) =u0 0p+1, dp 4(u)= 

(—1) +1 d1ou (ueHom,(X,, Y4)) by using 
the boundary operator ô of X and the co- 
boundary operator d of Y. 


Properties of Ext. (1) We have Ext? (M, N) = 
Hom ,(M, N). 

(2) If M is a projective A-module or N is 
an injective A-module, then Ext",(M,N)=0 
(n=1,2,...). 

(3) An A-homomorphism f:M, >M, 

(resp. f: N; >N) induces a homomor- 

phism f *: Ext} (M2, N) > Ext (M, N) (resp. 
Ja: Exti(M, Ni)>Ext5 (M, N2)). We have 

1% =1 and (gof)*=f* og* for f:M,>M,, 
g:M,—>M, (resp. 1y, =1 and (gos), =9,0f, 
for f:N,>N,, g: Na >N3). 

(4) For an exact sequence 0>M,>M,->M, 
—0 (resp. O3N,>N,—>N,-0), we have the 
exact sequence of Ext: 


0—-Hom,(M;, N)>Hom,(M,, N) 
—+Hom,(M,,N)>Ext)(M,, N) 
—Ext)(M,,N)>... 
(resp. O+Hom,(M, N,)~Hom(M, N>) 
—+Hom(M, N;)>Ext}(M, N,) 
+ Ext)(M,N,)>...) 


I] Na) = I] Ext",(M,, Ng). 


a B 
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(6) If A is a principal ideal ring, then 
Ext} (M, N)=0 (n= 2, 3,...), and Ext4(M, N) 
is also denoted by Ext,(M, N). In particu- 
lar, Ext,(Z, N)=0, Ext,(Z/nZ, N)= N/nN, 
Ext,(M, Q/Z) =0, Ext,(M, Z/nZ) = M/nM, 
where M = Hom,(M, Q/Z). 


Universal Coefficient Theorem for Coho- 
mology. If X is a chain complex over a prin- 
cipal ideal ring A such that each X, is a free A- 
module, then for any A-module N we have the 
formula 


H"(Hom,(X, N)) 
= Hom ,(H,(X), N) a Ext,(H,-; (X), N), 


the universal coefficient theorem. This is gen- 
eralized as follows: Let X be a chain complex 
and Y a cochain complex, both over a prin- 
cipal ideal ring A. Assume that each X, isa 
free A-module or that each Y” is an injective 
A-module. Then we have the formula 


H"(Hom4(X, Y)) 
= > Hom,(H,(X), H(Y)) 


ptq=n 


+ ) Ext,(H,(X), H4(Y)) 


p+q=n-1 


(— 201 Homology Theory). 


H. Complexes in Abelian Categories 


We mainly consider general Abelian cate- 
gories @. Consideration may, however, be 
restricted to the tcategory (Ab) of Abelian 
groups (whose fobjects are Abelian groups and 
whose tmorphisms are homomorphisms) or 
the tcategory g% of R-modules. 

A (cochain) complex C in an Abelian cate- 
gory @ is a graded object {C”} in @ with 
differentials d”: C”—C”*! subject to the con- 
dition that d"*! od"=0 (ne Z). The nth coho- 
mology H"(C) of C is defined by the texact 
sequence 0> B"(C)>Z"(C)> H"(C)-30, where 
B"(C) and Z"(C) are objects representing 
Imd"! and Ker d”, respectively. The complex 
C is called positive (negative) if C”=0 for n<0 
(n>0). We sometimes interchange positive 
superscripts and negative subscripts and write 
C_,, instead of C”. Then the differentials 
become d,:C,—-C,_,, and C is then called a 
chain complex. The quotient of Kerd, =Z,, by 
Imd,,,, =B, is called the nth homology H,(C). 
Negative complexes are usually described in 
this manner. When C”, Z", B", and H” are 
sets, as in the category pM of R-modules, 
their elements are called cochains, cocycles, 
coboundaries, and cohomology classes, respec- 
tively. Similarly, in the group C, of chains, 
residue classes of cycles (€Z,,) modulo bound- 
aries (€B,,) are called homology classes (€H,). 
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A morphism (or chain transformation) f:C—> 
C’ is a ‘natural transformation of the com- 
plexes considered as tfunctors Z-@; i.e., 

f isa family of morphisms f":C">+C” (neZ) 
satisfying f""' od"=d"o f”. It induces a 
morphism of cohomology H"(C)> H"(C’). 

A subcomplex of C is an equivalence class 

of tmonomorphisms D>C, usually denoted 
by any representative D of the class. A (chain) 
homotopy between two chain transforma- 
tions f, g:C—>C' is a family of morphisms 
h":C">+C""! (neZ) satisfying f"—g"= 
h"*!od"+d" 'oh". If there exists a homo- 
topy between f and g, then f and g induce the 
same morphism of cohomology. A morphism 
f:C>C is called a (chain) equivalence if there 
exists a morphism f{’:C’->C such that f’o f 
and fof’ are homotopic to the identities of C 
and C’, respectively. In this case we have 
H"(C)=H"(C’). An exact sequence of com- 
plexes 0O>C’ »C>C’"0 gives rise to the 
connecting morphisms H"(C")> H"*'(C’) 
(neZ), and the resulting sequence... 

H" '(C")3H"(C)>H"(C)3 H"(C)> 
H"*!(C’)-... is exact (the exact cohomology 
sequence), and similarly for homology instead 
of cohomology. An object Ae@ defines a 
complex (also denoted by A) such that A° = A, 
d? =0. A positive complex C together with a 
morphism e: A—>C is called a complex over A, 
and £ is the augmentation. A complex C over A 
is acyclic if0+A>C°>C!-... is exact. An 
acyclic positive complex over A is called a 
right resolution of A. Let {C, e}, {C’,é} be 
complexes over A and A’, respectively, and a a 
morphism A— A’. Then a morphism f:C>C’ 
satisfying foe=e' oats called a morphism 
over «. For a negative complex C, we define 
similarly augmentations €: C> A, acyclicity, 
left resolutions, etc. 

A bicomplex (or double complex) C in € 
consists of objects C”! (p, qe Z) and two dif- 
ferentials d):C?4>C?*!4, dy: CP 43CP471 
subject to d? = d? =0 and didy = dyd; (some- 
times replaced by anticommutativity, dydy + 
dydi =0). Morphisms of bicomplexes are de- 
fined as for single complexes. A bicomplex 
C becomes a (single) complex if we put C" = 
È p+4-n C”? (when the sum exists) and define 
the differential d to be d, +(—1)?d, on C?4. 
Then d is called the total differential and d}, d, 
the partial differentials. On the other hand, Cf 
={C™1(pe Z), dı} constitutes a complex for 
each q, whose cohomology H?(Cf) is denoted 
by H?(C*). Then dy induces morphisms H?(C‘) 
— HP(C4*'), so that we obtain a complex 
HP(C). The cohomology of H?(C) is denoted 
by Hf(HP(C)). We define H?(H#(C)) similarly. 
The cohomology of C with respect to the total 
differential is denoted simply by H"(C). Similar 
constructions are applied to double chain 


200 I 
Homological Algebra 


complexes {C, ,} and further to multiple 
complexes, as we shall show in the case of 
bicomplexes. 

Let T be a ‘bifunctor @, x @, >@ and C; be 
complexes in @; (i= 1,2). Then T(C,,C,) is a 
bicomplex in @’. For instance, Hom(C’, C) is a 
positive (bipositive) complex if C(C’) is a posi- 
tive (negative) complex in @. If C, C’ are com- 
plexes in Wp, pM, respectively, the ttensor 
product C ® gC’ is a complex in (Ab) (the 
product complex). There is a canonical mor- 
phism H,(C) @ H,(C’))>H,.,4(C @ C’). If C, 
and B, are tflat for all ne Z, we have the fol- 
lowing exact sequence (Kiinneth’s formula): 
0> ¥ H(C)@H(C)>H,(C@C) 


pt+q=n 


> }, Tor,(H,(C), H,(C’))+0 
ptq=n-1 

(for the definition of Tor — Section D). For C’ 
=A€p.W, Kiinneth’s formula reduces to the 
exact sequence 0-H,(C) ® A> H,(C @ A)> 
Tor, (H,_,(C), A)—>0 (universal coefficient 
theorem). The corresponding exact sequence 
for cohomology is 


0 Ext!(H,_,(C), A)> H"(C, A) 
—Hom(H,(C), A) 0 


(— Section G; 201 Homology Theory). 


I. Satellites and Derived Functors 


Let € and @’ be Abelian categories. All func- 
tors in this section are tadditive. A tcovariant 
functor T:> is called exact if T maps 
every exact sequence in @ to an exact sequence 
in @’. T is called half-exact, left exact, or right 
exact if for every short exact sequence 0— A 
—+B-—C-0, the sequence T(A)— T(B)— T(C), 
0+ T(A)> T(B)> T(O), or T(A) > T(B)> T(C) 
—0, respectively, is exact. Similar definitions 
apply for tcontravariant functors. The functor 
Hom: @ x @-(Ab) (which defines the category 
€) is left exact in both factors. An object P is 
projective if h,(-)= Hom(P,:) is exact, while Q 
is injective if h2(-)= Hom(-, Q) is exact. If 
every object A admits an tepimorphism from 
a projective object PA (resp. fmonomor- 
phism into an injective object AQ), € 

is said to have enough projectives (injec- 
tives). An object G is called a generator (cogen- 
erator) if the natural mapping Hom(A, B)—> 
Hom(h,(A), hg(B)) (Hom(h9(B), h°(A))) is 
one-to-one. 

An Abelian category @ is called a Gro- 
thendieck category if (1) @ has a generator, (2) 
*direct sums always exist, and (3) the identity 
(() A) B=|)(4;) B) holds for any object A, 
its tsubobject B, and a ttotally ordered family 
{A,} of subobjects. A Grothendieck cate- 
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gory has enough injectives (R. Baer, 1940, 
for (Ab); A. Grothendieck, 1957, for general 
g). A monomorphism into an injective ob- 
ject f: A> @ is called an injective envelope if 
Im fNO Img #0 for any nonzero monomor- 
phism g: BQ. Every object A in a Grothen- 
dieck category admits an injective envelope, 
which is unique up to isomorphism (B. Eck- 
mann and A. Schopf, 1953, for p; B. Mit- 
cheli, 1960, for general @). 

We say that a covariant d-functor €>” is 
given if we have a sequence of covariant func- 
tors T={T':@-+@’} and the connecting mor- 
phisms ô: T'(A")> T'*!(A’) for an arbitrary 
short exact sequence 0 A’ -> A> A” 50 satis- 
fying the following conditions: (i) do T'(f”) 
=T'*'(f')oé for a morphism f of short 
exact sequences; and (ii) the sequence ...— 
THUANS TA) STA) THA") THA) 
... constitutes a complex. T= {T°} is called a 
covariant 0*-functor if instead of ô there are 
given 0*: Tİ(A”)—> TŻ! (A’) satisfying similar 
conditions (i*) and (ii*). By taking duals, we 
define the notion of contravariant ð- and d*- 
functors. They are also called connected se- 
quences of functors. A 6-(6*-)functor defined 
for — œ <i< +00 is called a cohomological 
functor (homological functor) if the sequence in 
condition (ii) (resp. (1i*)) is always exact. A 
morphism of -functors f:S—T consists of 
natural transformations f':S'—T' that com- 
mute with the connecting morphisms. A ð- 
functor S defined for a<i<b is called universal 
if for any 6-functor T defined in the same in- 
terval and any natural transformation g:S*> 
T°, there exists one and only one morphism 
f:S>T such that f°= 9. Let F:@>@' bea 
covariant functor and b any positive integer. A 
universal covariant 6-functor S defined for 
0<i<bis called a right satellite of F if S° = F 
(S is then denoted by {S'F}). If such an S 
exists, then it is unique and satisfies S'*!(F) = 
S'(S'F). If @ has enough injective objects, 
the right satellites always exist, and if F is left 
exact, then {S'F} is a cohomological functor. 
The universality of 6*-functors is defined by 
reversing the arrows; the satellites {S;F} are 
then written as {S~‘F} and called the left 
satellites. 

Let @ be an Abelian category with enough 
injectives. An injective resolution of an object A 
is a right resolution Q = {Q'} such that all Q' 
are injective. Every A admits an injective reso- 
lution, which is unique up to chain equivalence 
(H. Cartan, 1950). For a covariant functor F: 
€ >’, the functor A—> H (F(Q)), called the ith 
right derived functor R'F of F, is independent 
of Q. {RİF} is a cohomological functor. By the 
universality of satellites, there exists a mor- 
phism of é-functors {S'F}—{R'F} which is an 
isomorphism if and only if F is left exact. The 
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left derived functors L;F of a contravariant 
functor F are defined similarly and are isomor- 
phic to the left satellites when F is right exact. 
If @ has enough projectives (instead of injec- 
tives), we define left (right) derived functors of 
covariant (contravariant) functors via projec- 
tive resolutions. For a multifunctor, we define 
partial derived functors as well as (total) de- 
rived functors of the functor viewed as a func- 
tor defined in the tproduct category. For 
instance, let T(A, B):@, x $, >@' be contra- 
variant in A and covariant in B. When @, has 
enough injectives, we obtain R} T(A, B)= 
H'(T(A, Q)) using an injective resolution Q 

of B. Suppose that T satisfies condition (i) 
A—>T(A, B) is exact for any injective B. Then 
for a fixed injective B, R} T(A, B) is a cohomo- 
logical functor in A. When A has a projec- 
tive resolution P in @,, we obtain Rj T(A, B) 

= H'(T(P, B)) as well as the equation for the 
total derived functor R‘T(A, B)=H'(T(P, Q)). 
We say that a functor T is right balanced if it 
satisfies (i) and also (ii) B— T(A, B) is exact for 
any projective A. In this case, the three derived 
functors are isomorphic. The left balanced 
functors are defined similarly. When the right 
derived functors of the functor Hom (which. 
defines the category) exist, they are denoted by 
Ext'(A, B). 


J. Spectral Sequences 


In this section, we deal with cohomology in 
the category pH of R-modules. A similar 
theory for homology is obtained by modifying 
the theory in a natural way. Similar construc- 
tions are also possible for general Abelian 
categories [3, 8]. 

A filtration F of a module A ts a family of 
submodules {F?(A)|peZ} such that F?(A)> 
F?*'(A). We say that the filtration F is con- 
vergent from above (or exhaustive) if |), F?(A) 
= A, and F is bounded from below (or dis- 
crete) if F?(A)=0 for some p. The tgraded 
module G(A)= {G?(A)=F?(A)/F?*'(A)|peZ} 
is said to be associated with A. A morphism of 
filtered modules f: A> A’ is a module homo- 
morphism such that f(F?(A))<¢ F(A’). It in- 
duces a homomorphism of the graded modules 
G(A)>G(A’). A filtration of a complex C= 
{C",d} consists of subcomplexes F?(C)= 
{F(C}; such that F?(C)> F?*'(C). We 
assume that the complex C satisfies |_),, F?(C) 
=C, and is bounded from below; i.e., for every 
n there exists some p such that F?(C")=0. In 
particular, if F°(C)=C, F?*!}(C”)=0, the com- 
plex C is called canonically bounded. Writing 
C4 = G?(C?*4), we obtain a tbigraded module 
{C4}, in which p, q, and p+q are called the 
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filtration degree, the complementary degree, 
and the total degree, respectively. 

A spectral sequence {F,} with a graded 
module D={D"} as its limit (denoted by 
E} 1 = ,D") consists of a family of doubly 
graded modules E,={E?**|p,qeZ} (r>2 or 
sometimes r > 1) and differentiations d,: EP" — 
EP*"4a-'*! (p, geZ) of degree (r, 1 —r) satisfy- 
ing d? =0 and satisfying the following two 
conditions: (i) H(E,) (with respect to d,) is 
isomorphic to E,,, (hence there exists a se- 
quence of graded submodules of E,:0= B, c 
B,c...cZ,<Z,=E, such that Z,/B,= E,); 
and (ii) there are submodules Z,, and B,, such 
that |), Bc Bo € Zo € (\r Zr and E,, = 
Z.,/B,. is isomorphic to the doubly graded 
module associated with a certain filtration F of 
D (i.e., E81 = G?(D?*4)). We assume that Z,, = 
(\,Z, and B,, = |); By (weak convergence). 
Suppose that F is convergent from above and 
bounded from below and that Z,(E$9) is 
stationary for every p,q. Then {E,} is called 
regular. {E,} is bounded from below if for every 
n there exists a po such that E$”? =0 for 
P<Po. In particular, if E3-4=0 (p<0,q <0), 
then {E,} is called the first quadrant (or coho- 
mology spectral sequence). In the latter case, 
the edge homomorphisms E2? —> E?:°, E°:4— 
E9" are defined through base terms EP? and 
fiber terms E?’?, respectively. A morphism 
of spectral sequences f:{E,, D}>{E;, D'} con- 
sists of f.: E,—> E; of degree (0,0) and f:D>D' 
of degree 0 which preserve the mechanism of 
spectral sequences. When the spectral se- 
quences are regular, a morphism f is an iso- 
morphism if one of the f, is an isomorphism. 
Addition is naturally introduced in the set of 
morphisms so that spectral sequences form an 
additive category. An additive functor from 
an Abelian category @ to this category is 
called a spectral functor. A filtered complex 
{C, F} gives rise to a spectral sequence E2" => 
G(H(C)) if we put Z? = {ae F?”(C)|dae F”*+(C)} 
B? =dZ?, EP =ZPZ}Ż] + BP), E, =}, EP. 
A double complex C = {C™1, dy, dy} admits two 
natural filtrations Fj: FP(C)=Z,5,2,C°" 
and Fy: FACC) = È, >g Xp C7". By the pro- 
cedure above, these filtrations give rise to 
spectral sequences HP(H4(C)) > ,H"(C) and 
Ay(HP(C)) > ,H"(C), respectively. Compari- 
son of these sequences yields many useful 
results. Let T be an additive covariant func- 
tor from an Abelian category @ to pM, C be 
a complex in @, and Q={Q?'4} be an injec- 
tive resolution of C. The double complex Q 
gives rise to spectral sequences H?(R47T(C)) => 
A(T(Q)) and RP T(H%™(C)) > H(T(Q)). The 
limit H(T(Q)) is independent of Q and is 
called the hypercohomology of T with respect 
to C [2,8]. We can similarly define hyper- 
cohomology of multifunctors. The theory of 
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spectral sequences was initiated by J. Leray 
(1946), and suitable algebraic formulations 
were given by J. L. Koszul (1950). 


K. Categories of Modules 


The category gM (resp. Mpg) of left (right) R- 
modules over a tunitary ring R is not only an 
Abelian category but also a Grothendieck 
category (— 277 Modules). The tfull embed- 
ding theorem permits us to deduce many pro- 
positions about general Abelian categories 
from the consideration of p.#@. An object P of 
rM is projective if and only if it ts isomorphic 
to a direct summand of a ffree module. Any 
projective module is the direct sum of count- 
ably generated projective modules (I. Kap- 
lansky, 1958). It follows that any projective 
module over a flocal ring is free. Finitely 
generated projective modules P, and P, 

are said to be equivalent if there exist finite- 
ly generated free modules F,, F, such that 

P, OF, =P, ® F,. The equivalence classes 
then form an Abelian group with respect to 
the direct sum construction called the projec- 
tive class group of a ring R. The category of 
complex vector bundles over a compact space 
X is equivalent to the category of projective 
modules over C(X), the ring of complex- 
valued continuous functions on X, and simi- 
larly for other types of spaces and bundles. 
Many investigations have been made involv- 
ing the problem of whether every projective 
module over a polynomial ring is free (J.-P. 
Serre, 1955). This problem was settled affirma- 
tively by D. Quillen [13] and independently 
by A. Suslin. It has been observed that “big” 
projective modules are often free: for example, 
nonfinitely generated projective modules over 
an tindecomposable weakly Noetherian ring 
are free (Y. Hinohara, 1963). 

The nth right derived functor of Hom,(A, B) 
is denoted by ExtR(A, B) (— Section G). This is 
a bifunctor pæ x pM (Ab), contravariant in 
A and covariant in B. Ext? is isomorphic to 
and identified with Hom ,. An exact sequence 
Q-— A'+A—A"-—0 gives rise to the con- 
necting homomorphisms A": Ext}(4’, B)> 
Ext''(A”, B), and the following sequence 
is exact: ... > Ext! (A, B) Extk(A”, B)> 
Ext} (A, B) > Ext} (A', B) S Ext?! (A", B)>... 
(the exact sequence of Ext). Similarly, an exact 
sequence 0—> B'—B-— B” 50 gives rise to 
A": Ext} (A, B’) + Ext"! (A, B’) and to an exact 
sequence of Ext. An extension of A by B (or of 
B by A) is an exact sequence (E):0->B> X > 
A-—(. The set of equivalence classes of exten- 
sions of A by B is in one-to-one correspon- 
dence with Ext,(A, B) by assigning to (E) its 
characteristic class y,,=A°(1)¢ Ext}(A, B), 
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where 1 denotes the identity of Hom,(B, B). In 
this correspondence, the sum of two extensions 
is obtained by a construction called Baer’s 
sum of extensions. Similarly, Ext,(A, B) is 
interpreted as the set of the equivalence classes 
of n-fold extensions 0>B>X,_,>... 7X97 
A-0 (exact). This point of view permits us to 
establish a theory of Ext, etc., in more general 
(additive) categories lacking enough projec- 
tives or injectives (N. Yoneda, 1954, 1960). 

The tensor product A © gB is a right exact 
covariant bifunctor Wp ® pH —(Ab). If the 
functor tp(:)=- ®© P is exact, P is called a tflat 
module. A projective module is flat. In general, 
a flat module is the tinductive limit of finitely 
generated free modules (M. Lazard, 1964). A 
flat module P is called tfaithfully flat if P mP 
for every maximal ideal m of R. The functors 
® and Hom are related by tadjointness (— 52 
Categories and Functors). From this view- 
point © can be introduced in more general 
categories. Left-derived functors of A ® gB are 
denoted by Tor®(A, B) and are called nth tor- 
sion products of A and B. Tor’ (A, B) is often 
denoted by A * gB. The functor Qx is left 
balanced, hence Tor is calculated by using 
projective resolutions of A, B, or both A and 
B. We have Tor = @x. An exact sequence 0— 
A'+A-+A"-0 gives rise to A,,:Tor®, ,(A”, B)—> 
Tor®(A’, B) and the infinite exact sequence of 
Tor, and similarly for the second variables. 

From the various relations between Hom 
and ® follow the corresponding relations 
between their derived functors. When A and T 
are algebras over K and Q=A @T, we can 
define the external product (T -product), which 
is a mapping 


T : Tor}(A, B) @ Tor! (4’, B’) 


>Tor?, (A @ A’, B@ B’). 


In particular, if A and T are K-projective and 
Tor*(A, A’)=0 (n> 0), then we can define the 
wedge product (\/-product) V: Ext? (A, B) ® 
Ext2(A’, B') > Ext2*4(A Q A’, B® B’). The latter 
is described in terms of the composition of 
module extensions. When K=A =T =Q, 

the T-product reduces to the internal product, 
called the m-product. If A is a tHopf alge- 

bra over K, the tcomultiplication A> A @A 
induces Ext, g,—Ext,. This, combined with 
the \/-product, yields the cup product (~- 
product)~ : Ext} (A, B) © Ext4(A’, B')> 
Ext*4(A ® A’, B® B’). We define similarly 

L -product, /\-product, w-product, and ~- 
product (cap product) [2]. Let A, T, and 2 be 
algebras over K, with A K-projective; let 

AE Mypgr, BEAM z, CE Mrez and assume 
Tor}(A, B)=0 (n> 0). The natural isomor- 
phism Hom,gr(A, Hom;(B, C) = Homyg 5 
(A ® ,B, C) then yields a spectral sequence 
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Ext? gr (A, Ext}(B, C)) >, Exttgs (A ® 4B, C) 
by the double complex argument in Section D. 
The homological dimension h dim, A, 
dh, A, or projective dimension proj dim, A of 
AEpgM is the supremum (<œ) of n such 
that Ext,(A, B) 40 for some B. The relation 
hdimg A <0 means that A is projective. The 
injective dimension injdim, B of Be pM is 
defined similarly by means of the functor 
Extk(-, B), and the weak dimension w dim, C 
of Ce gM by the functor Tor®(-, C). The 
common value sup{projdimg A| AE RM} 
=sup{injdimp B| Bez} is called the left 
global dimension 1 gl dim R of R. It is identical 
with the supremum of homological dimensions 
of tcyclic modules (M. Auslander, 1955). The 
right global dimension r gl dim R is defined 
similarly. The common value sup{wdim, A| 
Ac Mz} =sup{wdimg C|CE€pgM} is called the 
weak global dimension w gl dim R of R. We 
have w gl dim R <1 gl dim R, r gl dim R. The 
equality may fail to hold (Kaplansky, 1958). If 
R is tNoetherian, the three global dimensions 
coincide (Auslander, 1955) and are called 
simply the global dimension of R: gl dim R. The 
condition 1 gl dim R =0 (or r gl dimR=0) 
holds if and only if R is an tArtinian semi- 
simple ring, while w gl dim R =0 if and only 
if R is a tregular ring in the sense of J. von 
Neumann (M. Harada, 1956). A ring R is 
called left (right) hereditary if 1 gl dim R <1 
(r gl dim R < 1), and left (right) semihereditary 
if every finitely generated left (right) ideal is 
projective. A left and right (semi)hereditary 
ring is called a (semi)hereditary ring. Since 
projectivity and tinvertibility of an ideal of a 
(commutative) integral domain R are equiva- 
lent, R is hereditary if and only if R is a tDede- 
kind ring. In this case, the projective class 
group of R reduces to the tideal class group. 
An integral domain R is semihereditary if and 
only if w gl dim R < 1 (A. Hattori, 1957), and in 
that case R is called a Prüfer ring. A tmaximal 
order over a Dedekind ring is hereditary. A 
commutative semihereditary ring R is charac- 
terized by the property that flatness of R- 
modules is equivalent to torsion-freeness (S. 
Endo, 1961). A Noetherian ring R is left self- 
injective if and only if R is tquasi-Frobenius 
(M. Ikeda, 1952), and the global dimension of 
a quasi-Frobenius ring is 0 or œ (S. Eilenberg 
and T. Nakayama, 1955). A polynomial ring R 
=K[X,,...,X,] over a commutative ring K 
satisfies gl dim R =gl dim K +n. When K is a 
field, this is a reformulation of Hilbert’s theory 
of syzygy sequences (— 369 Rings of Poly- 
nomials). In this sense, the study of the global 
dimension of rings and categories is sometimes 
called syzygy theory (Eilenberg, 1956). The 
homological algebra of commutative Noe- 
therian rings has been studied extensively and 
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is useful in algebraic geometry. Since gl dimR 
=sup,, gi dim Rn (Rn is the tring of quotients 
relative to m), with m running over the max- 
imal ideals of R, the problem of determining 
gl dim R reduces to the case of flocal rings. A 
finitely generated flat module over a local ring 
R 1s free. If K denotes the residue field R/m, 
where m is the maximal ideal of the local 

ring R, Tor®(K, K) has the structure of a 
Hopf algebra (E. F. Assmus, Jr., 1959). De- 
tailed results concerning the Betti numbers 
dim Tor?(K, K) of R have been obtained (J. 
Tate, 1957, and others). In particular, R is 
tregular if and only if gl dim R < œ (Serre, 
1955). A local ring R is called a Gorenstein 
ring if the injective dimension of R-module R 
is finite. This is a notion intermediate between 
regular rings and *Macaulay rings (— 284 
Noetherian Rings). 

Consideration of a ring R in relation to a 
subring S leads to relative homological algebra. 
Foundations for this theory were established 
by Hochschild (1956). An exact sequence of 
R-modules that ‘splits as a sequence of S- 
modules ts called an (R, S)-exact sequence. 

An R-module P is called an (R, S)-projective 
module if Hom,(P,:) maps any (R, S)-exact 
sequence to an exact sequence. (R, S)-injective 
modules are defined similarly. Based on these 
notions, Ext;g,s, and Tor®® are defined as the 
relative derived functors of Hom, and ® pz, 
respectively. We also have a relative theory 
from a different viewpoint (S. Takasu, 1957). 
Relative theory is extended to general cate- 
gories from various viewpoints [6, 14] (— 
Section Q). 


L. Cohomology Theory for Associative 
Algebras 


Let A be an talgebra over a commutative 
ring K and A a ‘two-sided A-module. Let 
C” be the module of all n-linear mappings 
of A into A called n-cochains (C° = A). De- 
fine the amare: operator 6":C"+C""! 


o ("FA À n41) =A (Aass Anr) + 
-ı(— 1f oo pa 
“YY Yds od, An+1- 


me thus obtain a ee whose coho- 
mology is denoted by H"(A, A) and is called 
the nth Hochschild’s cohomology group of A 
relative to the coefficient module 4 (Hochs- 
child, 1945). A cochain f is called normalized if 
J(A,,.--,4,) =0 whenever one of the å; is 1. We 
obtain the same cohomology group H"(A, A) 
from es caus oa of normalized cochains. 
{H"(A,-)} is a cohomological functor from the 
category „Ma of two-sided A-modules to the 
category gM. Using the enveloping algebra A‘ 
=A ® x A°, where A° is an anti-isomorphic 
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copy of A, a, can be identified with ,eW% 
(and M,e). If A is K-projective, {H"(A,-)} is 
isomorphic to {Exte(A, -)}. (In [2], H"(A, A) is 
defined as Extie(A, A) in general.) We have 
H°(A, A)= {ae A| 2a =a), Vie A}. We call 1- 
cocycles derivations (or crossed homomor- 
phisms) of A in A and 1-coboundaries inner 
derivations. Thus H'(A, A) is the derivation 
class group and is related to the ramification. 
When K is a field, H!(A,-)=0 if and only if A 
is a tseparable algebra. In general, an alge- 
bra A over a commutative ring K is called a 
separable algebra if A is A°-projective, 1.e., if 
Exthe(A,:)=0 (Auslander and O. Goldman, 
1960). This is a generalization of the notion of 
*maximally central algebras (Nakayama and 
G. Azumaya, 1948). We have a one-to-one 
correspondence of H(A, A) to the family of 
algebra extensions of A with kernel A (i.e., K- 
algebras I’ containing A as a two-sided ideal 
such that T/A =A) satisfying A?=0. Any 
extension of an algebra A over a field K such 
that H?(A,:)=0 splits over a nilpotent kernel 
(J. H. C. Whitehead and G. Hochschild). This 
holds in particular for a separable algebra, and 
we obtain the fWedderburn-Mal’tsev theorem. 
There are some interpretations of H?(A, A) in 
terms of extensions. 

The supremum (<œ) of n such that 
H"(A, A)40 for some A is called the cohomo- 
logical dimension of A and written dim A. For 
a finite-dimensional algebra A over a field K, 
dim A < œ if and only if A/N is separable and 
gl dim A < œ, where N is the tradical of A (N. 
Ikeda, H. Nagao, and Nakayama, 1954). 

The homology groups H,(A, A) of A relative 
to a coefficient module A are defined similarly. 
If A is K-projective, {H,(A,-)} is isomorphic to 
{Torf (8): 


M. Cohomology of Groups 


The pair consisting of an algebra A over K 
and an algebra homomorphism ¢:A-—K is 
called a supplemented algebra [2] (or aug- 
mented algebra [6]), of which ¢ is the augmen- 
tation. The tgroup algebra Z[G] of a group G 
over the ring of rational integers is a supple- 
mented algebra, in which the augmentation is 
defined by e(x)= 1 (xe G). The category of left 
G-modules is identified with the category of 
left Z[G]-modules. For a finite group G, a 
finitely generated projective G-module is not 
necessarily free (D. S. Rim, 1959) and is iso- 
morphic to the direct sum of a free module 
and a left ideal of Z[G]. It follows that the 
projective class group of Z[G] ts a finite group 
(R. G. Swan, 1960). The cohomology groups 
and homology groups of G relative to A€ gM 
(Eilenberg and S. MacLane, 1943) are defined 
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by H"(G, A)=Ext},¢)(Z, A) and H,(G, A)= 
Tor“l°1(Z, A), respectively. Their concrete 

description is given usually via the Z[G]- 

standard resolution of Z. 

(1) Homogeneous formulation. The group of 
homogeneous n-chains is the free Abelian group 
with basis G x ... x G (n+ 1 times), on which 
G operates by x(Xo, ...,X,)=(XXq, -+ XXa) 
and the boundary operator is defined by 
Dice MNS Be CHL (X05 ca es Xn). 

(2) Nonhomogencous formulation. The group 
of nonhomogeneous n-chains is the Z[G]-free 
module with basis G x ... x G (n times), and 
the boundary operator is defined by d(x,, 

a MEK Son he = Pas 
XXi ey Xa t(— D" (x1, ---, Xp-1). A non- 
homogeneous 2-cocycle is sometimes called 
a factor set. H°(G, A) is the submodule A°% 
of A consisting of the G-invariant elements, 
while H,(G, A) is the largest residue class 
module A, of A on which G acts trivially. 
Given two groups G and K, an exact sequence 
of group homomorphisms 1—>K >E>G—>1 is 
called a group extension of G over the kernel 
K. When K is Abelian, the extension canoni- 
cally induces a G-module structure on K, and 
the deviation of K from being a semidirect 
factor of E is measured by a factor set. The 
group H?(G, A) is thus in one-to-one corre- 
spondence with the set of equivalence classes 
of the tgroup extensions of G over A which 
induce the originally given G-module structure 
on A (— 190 Groups N). This point of view is 
essential in the proof of the fSchur-Zassenhaus 
theorem (— 151 Finite Groups). H3(G, A) is 
interpreted as the set of obstructions for exten- 
sions (Etlenberg and MacLane, 1947). For a 
free group F, H"(F, A)=0 (n> 1). If a group 
G is represented as a factor group F/R ofa 
free group F, we have a group extension 1—> 
K>E->G-1, where K=R/[R,R] and E= 
F/[R, R]. Let ëe H? (G, K) correspond to this 
extension. Then for any G-module A, the cup 
product y>7~é€ followed by the pairing 
Hom(K, A) ® K—A provides isomorphisms 
H"(G, Hom(K, A)) = H"*'?(G, A) (n>0) (the cup 
product reduction theorem of Eilenberg and 
MacLane, 1947); similarly, we have the reduc- 
tion theorem for the homology. The Z-algebra 
H(G, Z)= Ło H"(G, Z) under the multipli- 
cation defined by the cup product is finitely 
generated if G is a finite group (B. B. Venkov, 
1959; L. Evens, 1961). 

The following are mappings relative to a 
subgroup H. 

(1) The inner automorphism by xeG in- 
duces an isomorphism of H"(H, A) and 
H"(xHx +, A) which reduces to the identity 
of H"(G, A) if H =G. Hence if H is a normal , 
subgroup, H”(H, A) has the structure of a G/H- 
module, and similarly for H,(H, A). 
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(2) When H is a normal subgroup of G, the 
mapping (x,,...,X,)(x,H, ...,x,H) of non- 
homogeneous chains induces the inflation 
(or lift) Inf: H"(G/H, A#)>H"(G, A) and the 
deflation Def: H,(G, A)> H,(G/H, Ay). 

(3) The embedding of nonhomogeneous 

chains induces the restriction Res: H"(G, A) 
— H"(H, A) and the injection Inj (or corestric- 
tion Cor): H,(H, A) H,(G, A). The theory of 
tinduced representation gives another con- 
struction of these mappings; that is, if we put 
1°(A) = Homz,;,(Z[G], A), Res is obtained by 
the isomorphism H"(G,1°(A))= H"(H, A) com- 
bined with the homomorphism induced by A 
—1°(A); while if we put 1¢(A)= Z[G] ® grin, 
Inj is obtained by the isomorphism H,(H, A)= 
H,(G, 1¢(A)) followed by the homomorphism 
induced by :,(A)> A. 

(4) If (G: H) < œ, we have :%(A)=1,(A). 

The composition of H"(H, A) > H"(G, ig{A)) 

~ H"(G, A) defines on the cohomology groups 
the Inj: H"(H, A) > H"(G, A), while the compo- 
sition H,(G, A) H,(G, 1°(A))— H,,(H, A) gives 
the Res: H,,(G, A)-> H,(H, A). In particular, 
Res: H,(G, Z) > H, (H, Z) coincides with the 
ttransfer G/L[G, G] > H/[H, H]. 

(5) Let H be a normal subgroup of G. 
Consider the additive relation p (the corre- 
spondence) between he Z"(H, A)® and fe 
Z"*1(G/H, A®) determined by p(h, f) if and 
only if there exists a ge C”(G, A) such that h = 
Resg and Inf f = ôg. If the relation induces a 
homomorphism H"(H, A)° >H"! (G/H, A”), 
then it is called the transgression. If H‘(H, A) 
=0 (0<i<n), the sequence 0+H"(G/H, A”) 
> H”(G, A)>H"(H, A) > H"*! (G/H, A) 
H"*'(G, A) composed of inflation, restric- 
tion, and transgression mappings is exact 
(Hochschild and Serre, 1953) and is called 
the fundamental exact sequence. This exact 
sequence can be derived from a certain spec- 
tral sequence H"(G/H, H4(H, A)) =, H”(G, A) 
(R. C. Lyndon, 1948; Hochschild and Serre, 
1953). 

The relative (co)homology theory relative to 
a subgroup (I. T. Adamson, 1954) can be dealt 
with in terms of the relative Ext and the rela- 
tive Tor (Hochschild, 1956). Many results in 
the absolute case are generalized to the rela- 
tive case: for example, the fundamental exact 
sequence (Nakayama and Hattori, 1958). The 
relative theory is further generalized to the 
cohomology theory of tpermutation represen- 
tations of G (E. Snapper, 1964). 


Non-Abelian Cohomology. For a non-Abelian 
G-group A, the cohomology “set” H'(G, A) 
(and H°(G, A)) is defined as in the Abelian case 
by means of the nonhomogeneous cochains 
(— e.g., [9]). Some efforts are being made 
toward the construction of a more general non- 
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Abelian theory (J. Giraud, Cohomologie non 
abeliénne, Springer, 1971). 


N. Finite Groups 


Let G be a finite group and A a G-module. 
Define the norm N: AA by N(a)=È ga, 
and denote Ker N by pA. The kernel of the 
augmentation ¢:Z[G]|—Z is denoted by I. Put 
H"(G, A)=H"G, A) (n>0), 4°(G, A)= A®/NA, 
A(G, A)=yA/IA, and H~"(G, A)=H,,_,(G, A) 
(n> 1). Then {f"(G,-)} forms a cohomological 
functor, called the Tate cohomology (E. Artin 
and J. T. Tate), that can be described as the set 
of cohomology groups concerning a certain 
complex called a complete free resolution of Z. 
(Similar arguments are valid more generally 
for quasi-Frobenius rings (Nakayama, 1957), 
and a theory of this kind is called complete 
cohomology theory.) We have H"(G, A)=0 
(ne Z) if and only if hdimz;g) A < 1 (Naka- 
yama, 1957). If A satisfies the conditions (i) 
A'(G,, A)=0 for any Sylow p-subgroup G, 

of G, and (ii) there exists a če Ê? (G, A) such 
that Res če Ĥ?(G,, A) has the same order as G, 
and generates all of H?(G,, A), then the homo- 
morphisms A"(H, B) > H"*?(H, A @ B) (ne Z) 
defined by the cup product with Res are 
isomorphisms for every subgroup H and every 
G-module B such that Tor(A, B)=0 (Naka- 
yama, 1957; for B=Z, Tate, 1952). If G is 
cyclic, the mappings H"(A)> H"*?(A) (neZ) 
defined by the cup product with a generator 
of H?(Z) are isomorphisms. (The notation is 
abbreviated by omitting G.) If the orders of 
°(A) and Â! (A) are finite, their ratio is called 
the Herbrand quotient h(A) of A. If 0> A >A 
>A" 0 is exact, then h(A)=h(A)h(A”"). If A is 
finite, then h(A)= 1. By combining these two 
facts we obtain Herbrand’s lemma: If A’ is a G- 
submodule of A of finite index and h(A’) exists, 
then h(A) also exists and h(A)=h(A’). The 
periodicity H"(A) = "*"(A) (neZ, AegM) 
holds if and only if every Sylow subgroup is 
cyclic or a tgeneralized quaternion group 
(Artin and Tate; [2]). 

Let L/K be a finite tGalois extension with 
the tGalois group G. The cohomology groups 
of various types of G-modules related to L/K 
are called the Galois cohomology groups (— 
172 Galois Theory). Using continuous cocycles, 
a cohomology theory (Tate cohomology) is 
developed for infinite Galois extensions as well 
[9, 10]. By means of Galois cohomology (— 59 
Class Field Theory), the cohomology theory of 
finite groups and of ftotally disconnected 
compact groups (which are ‘profinite groups) 
plays an important role in class field theory 
and its related areas. 
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O. Cohomology Theory of Lie Algebras 


Let g be a tLie algebra over a commutative 
ring K, and assume that g is K-free. The ten- 
veloping algebra U = U (g) is a tsupplemented 
algebra over K. For a q-module (= U-module) 
A, Ext} (K, A) and Tor¥(K, A) are called the 
cohomology groups H"(g, A) and homology 
groups H,(q, A), respectively, of g relative to 
the coefficient module A. They are usually 
described by means of the U-free resolution 
U ® Ax(g) of K (called the standard com- 
plex of g) constructed by C. Chevalley and 
Eilenberg (1948), where /\,(g) is the exterior 
algebra of the K-module g and (denoting 

1 (xA -AX by (x1, ...,X,)) the differenti- 
ation is given by 


d(x e Xd = $ (SDE a eee Gas) 
i=l 


+ Yo (CDHE x], x.. 


l<i<j<n 


For n>[g:K], H"(g, A\=H,(g, A)=0. If gis a 
tgemisimple Lie algebra over a field K of char- 
acteristic 0, we have H'(g, A)=0, H? (g, A)=0, 
while H3(g, K) 40. H' (g, A)=0 is equivalent to 
Weyl’s theorem, which asserts the complete 
reducibility of finite-dimensional representa- 
tions (— 248 Lie Algebras E). H? (g, A) and 
H3(g, A) are interpreted by means of Lie 
algebra extensions as in the cohomology of 
groups. The theorem on Levi decomposition 
is derived from H?(g, A)=0. Chevalley and 
Eilenberg constructed this cohomology by 
algebraization of the cohomology of compact 
*Lie groups. They also introduced the notion 
of relative cohomology groups H”(g, b, A) 
relative to a Lie subalgebra b of g, which cor- 
respond to the cohomology of homogeneous 
spaces. H”(g, b, A) does not always coincide 
with Exttq) væ (K, A) (Hochschild, 1956), but 
does so in an important case where K is a field 
of characteristic 0 and b is treductive in g (— 
248 Lie Algebras). 

For ‘transformation spaces of thnear alge- 
braic groups G over a field K, the rational 
cohomology groups are introduced using the 
notion of rational injectivity (Hochschild, 
1961). In particular, if G is a tunipotent alge- 
braic group over a field K of characteristic 0, 
then H(G, A) is isomorphic to H(g, A), where g 
is the Lie algebra of G. There 1s also a relative 
theory. 


P. Amitsur Cohomology 


Let R be a commutative ring, and F a co- 
variant functor from the category @, of com- 
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mutative R-algebras to the category of Abelian 
groups. For S€@, and n=0, 1,2, ..., we write 
SM=S$@®...@S (n-fold tensor product over 
R). Let g: St -> S”? (i=0, 1,...,n +1) be 
€-morphisms defined by £;(xo @ ... Q X,)= 
Xp @... OX;-, ©1 Ox; @... @x,. Defining 
d": F(S™*)) > F(S“*?) by d"= Li29(—1)'F(e), 
we obtain a cochain complex {F(S"*”), d"}. 
This complex and its cohomology groups are 
called the Amitsur complex and the Amitsur 
cohomology groups, and are usually denoted 
by C(S/R, F) and H"(S/R, F), respectively. 

If S/R is a finite Galois extension with 
Galois group G, then the group H"(S/R, U) 
of the unit group functor U is naturally iso- 
morphic to H"(G, U(S)). If S/R is a finite pure- 
ly inseparable extension, then H"(S/R, U)=0 
for n>3. The group H? (S/R, U) is related to 
the *Brauer group B(S/R) (— 29 Associative 
Algebras K). 


Q. Relative Theory 


In the course of the development of homolog- 
ical algebra, it has been recognized that the 
notion of projective (resp. injective) resolutions 
should be generalized ([14, 4]; Hochschild, 
1956). In the meantime, a method has been 
introduced that utilizes simplicial objects in 
order to define the derived functor of an arbi- 
trary functor with Abelian category as its range 
([15, 17]; J. Beck, 1967). As a consequence of 
these developments, there emerged a view- 
point, which we describe below, making it 
possible to unify various known definitions of 
(co-)homology theories that has been designed 
for particular cases. 

Let æ be a category and ¥ a class of ob- 
jects in æ. In this section, we denote the 
set Hom,,(A, B) of morphisms by .o7(A, B). 
A morphism f:A—B in æ is called a P- 
epimorphism if the induced mapping (P, f) 
(=Hom(P, f)):.(P, A) > A(P, B) is surjective 
for any Pe. The class F is called a projective 
class in æ if there exist an object Pe? and a 
P-epimorphism f:P—A for each object Ae.. 
To any category .o/, we associate a preaddi- 
tive category Zæ, adding a zero object to æ if 
necessary: Put ObZ = Ob, Z.A (A, B)=free 
Abelian group generated by the set (A, B). £ 
is regarded as a subcategory of Z by the 
natural inclusion J: >Z.. Any functor T 
from . into an Abelian category 2 has a 
unique additive extension T: Z +B such 
that T= TJ. If æ is an additive category, 
there is a canonical projection 0:27 >. 
such that 8J = Id. If furthermore T is additive, 
then T = T0. 

Now suppose that a projective class 7 in £ 
is given. For Ae.v, an augmented chain com- 
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is called a P-projective resolution of A if (i) 
X,€P (n20), and (ii) it is P-acyclic (i.e., the 
sequence 


+ Zod (P, X,) ZLAP, X44). 
S ZLAP, Xo) ZAP, A)>0 


is exact for any PeY [16]). Note that the 
#-acyclicity in this case implies the existence 
of a contracting homotopy 


h: ZAP, Xn > ZAP, Xn+1b 
n>—1,X_,=A, 


satisfying d,,,h, +h,_,d,= L. 

Comparison theorem: Given two 2- 
projective resolutions X.>A and Y.—A, we 
have that the chain complexes X, and Y. are 
chain equivalent in Zeg. 

Let T: >Z be a (covariant) functor with Z 
an Abelian category. The nth left derived func- 
tor LT :.d7 > (n>0) of T, with respect to 
P, is defined by L, T(A)=H,(TX.) for a 2- 
projective resolution X.— A. The derived func- 
tors L,,T will remain unaffected if we replace 
the projective class 2 by its enlargement P= 
{objects in P together with their retracts}. 

We can easily verify that (i) Lo T(P)= T(P) 
and L,, T(P)=0 (n>0) for Pe, and (ii) a 
short exact sequence 0> T’> T->T" 50 of 
functors: > induces a long exact sequence 
of derived functors 
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If . is preadditive and has a zero object 
and kernels, then it is routine to give a F- 
projective resolution of any object. If has 
finite limits (i.e., finite products and tequal- 
izers), it can be proved that there exists a 
P-projective resolution for any object [17]. 
There is a standard functorial construction 
which provides canonically a projective class 
Pinacategory . anda ¥-projective reso- 
lution of any object in <7. Let (G, £, ô) be a 
cotriple (or comonad [18] or functor coalge- 
bra) in £. Here G: oa is an endofunctor, 
e:G—ld and ô: G—G?° =GG are natural trans- 
formations such that Geod=eGod=1, and 
Gd06=6Go6. A cotriple comes usually from 
a pair (F, U) of tadjoint functors U:.7 >G, 
F:€— with natural bijection 4:.A~(F(C), A) 
X@(C, U(A)). Putting G= FU: A 9, &(A)= 
A7 (lua): FU(A) >A, nC) = AC ro): C> 
U F(C), we have a cotriple (G = FU, £, ô = FyU) 
in &. Conversely, it is known that any cotri- 
ple in Z is induced from a suitable pair of 
adjoint functors. 
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Given a cotriple (G, e, 6) in X, we define a 
projective class A, = {G(A)| AeA} (or its 
enlargement Z) in .o, and an augmented 
simplicial object G,(A)— A for any object A as 
follows. Put G,(A)=G"*!(A) (n>0), 6,=0"= 
GieG""'(A):G,(A)>G,_ , (A) (face operator) 
and 6;=6?=G'dG""*(A):G,(A)>G,,, (A) (de- 
generacy operator) for O<i<n, G_,(A)=A. 
Then G,(A)~—A gives rise to a Z; (or equiva- 
lently Y,)-projective resolution of A in ZW 
with differentials d, = 0?_,(—1)'6;. This is 
the bar resolution (or standard resolution) 
in a generalized sense, and there are many 
(co-)homology theories defined by means of 
such constructions. 

Most of the above definitions and construc- 
tions can be dualized so as to give injective 
classes .%, f -injective resolutions, and right 
derived functors with respect to ¥, triples (or 
manads), etc. (See also [19] for generalized 
(co-)homology). 
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A. History 


Homology theory is the oldest and most ex- 
tensively developed portion of talgebraic to- 
pology. Historically, it started with measuring 
the higher-dimensional connectivity of a space 
in the sense that the 0-dimensional connectiv- 
ity is the number of tconnected components of 
the space. For example, take a t2-sphere S? 
and a t2-torus T*. Then T? is distinguished 
from S? by the fact that on T? a closed curve 
can be drawn without forming a boundary, 
while this is not true for S°. In fact, a curve (c; 
or c} in Fig. 1) can be drawn on T? so that it 
does not form a boundary of an embedded 2- 
disk. On more complicated tsurfaces there are 
many kinds of such closed curves. The maxi- 
mum number of such closed curves is the 1- 
dimensional connectivity of the surface; this is 
a topological property of the surface. For 
example, the 1-dimensional connectivity of S? 
is 0, that of T? is 2, and that of the surface in 
Fig. 2 is 6. A more general consideration of the 
bounding properties of g-dimensional ‘closed 
submanifolds of a manifold led E. Betti (Ann. 
Mat. Pure Appl., 4 (1871)) to introduce the 
notion of the g-dimensional connectivity of the 
manifold, which was a precursor of homology 
theory. 


The foundation of homology theory was 
laid by fH. Poincaré [1]. He started his study 
of homology with analytic treatment of mani- 
folds, which led to a series of complications. 
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Poincaré then introduced a new method for 
the study, now called tcombinatorial topology: 
He decomposed the manifold into elementary 
pieces or tcells, which adjoin one another in 

a regular fashion; he then substituted the 
algebraic notions of tcycles and tboundary 
operators for the geometrical notions of closed 
submanifolds and boundaries. Thereby the 
notion of homology groups acquired an exact 
logical meaning, and the fundamental for- 
mulas, which are now called Poincaré for- 
mulas, were proved. 

After Poincaré, much of the development of 
homology theory centered around the ques- 
tion of the topological invariance of homology 
groups, that is, the independence of the ho- 
mology groups on the choice of ‘cellular de- 
composition. Through the development of 
tsimplicial complexes and their techniques, 

J. W. Alexander (Trans. Amer. Math. Soc., 28 
(1926)) gave the first fully satisfactory proof for 
topological invariance of the homology groups 
of polyhedra. In those days, the homology 
groups themselves were barely recognized; 
instead, one dealt with numerical invariants 
such as the *Betti numbers and the ttorsion 
coefficients [2, 3]. 

During the period 1925-1935 there was a 
gradual shift of interest from the numerical 
invariants to the homology groups themselves, 
and homology theory developed intensively 
[4,5]. S. Lefschetz (Trans. Amer. Math. Soc., 28 
(1926)) added the theory of the tintersection 
products to the homology of manifolds. He 
also invented trelative homology theory (Proc. 
Nat. Acad. Sci. US, 13 (1927)) and generalized 
the duality theorems of Poincaré and Alex- 
ander (ibid., 15 (1929)). G. de Rham (J. Math. 
Pures Appl., 10 (1931)) obtained a duality 
theorem that relates the texterior differential 
forms in a manifold to the homology groups of 
the manifold. L. S. Pontryagin (Ann. Math., 

(2) 35 (1934)) proved the complete group- 
invariant form of the Alexander duality theo- 
rem. These duality theorems seemed to reflect 
the existence of a theory dual to homology 
theory, and the genesis of this dual theory, 
now called ftcohomology, occurred in 1935, in 
the work of Alexander and A. N. Kolmo- 
gorov. It was discovered subsequently by 
Alexander (Ann. Math., (2) 37 (1936)), E. Cech 
(ibid.), and H. Whitney (ibid., 39 (1938)) that 
the cohomology of a polyhedron can be made 
into a ring. 

On the other hand, after L. Vietoris (Math. 
Ann., 97 (1927)) and P. S. Aleksandrov (Ann. 
Math., (2) 30 (1928)), many devices were inven- 
ted to extend the homology theory of poly- 
hedra to general ttopological spaces, and 
numerous variants of homology theory ap- 
peared at the hands of Cech (1932), Lefschetz 
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(1933), Alexander (1935), and Kolmogorov 
(1936). This development served to clarify the 
relations between combinatorial and set- 
theoretic methods in topology (— 426 Topol- 
ogy), whereas it produced complexity and 
confusion in homology theory [6]. S. Eilen- 
berg and N. E. Steenrod (Proc. Nat. Acad. Sci. 
US, 31 (1945) and [7]) cleared the air by treat- 
ing homology axiomatically. 

Roughly speaking, a homology theory as- 
signs tAbelian groups to ttopological spaces 
and thomomorphisms to fcontinuous map- 
pings of one space to another. In this way, a 
homology theory is an algebraic image of 
topology; it converts topological problems to 
algebraic problems. Starting from this view- 
point, Eilenberg and Steenrod selected some 
fundamental properties as axioms to charac- 
terize homology theory. This unified homol- 
ogy theories and allowed systematic treatment 
of homological problems which had previ- 
ously been done separately “by hand” in each 
case. Moreover, it motivated the birth of a 
new branch of mathematics, called thomo- 
logical algebra. 


B. Homology of Chain Complexes 


A chain complex C = {C,,6,} is a collection of 
(additive) Abelian groups C,, one for each 
integer q, and of homomorphisms 6,:C,>C,_, 
such that 0,00,,,=0 for each q. Elements of 
C, are called q-chains of C, and ô, is called the 
boundary operator. A subcomplex C’ = {C}, 0;} 
of C is a chain complex such that C; c C, and 
6, = 04| C; for each q. 

The tkernel of ô, is denoted by Z,(C), and its 
element is called a q-cycle of C. The timage of 
04+, Is denoted by B,(C), and its element is 
called a q-boundary of C. The relation 6,00,,, 
=0 implies B,(C)<Z,(C). The *quotient group 
Z,(C)/B,(C) is denoted by H,(C), the qth ho- 
mology group of C. Elements of H,(C) are 
called g-dimensional homology classes of C. 
Two cycles representing the same homology 
class are said to be homologous. The ‘direct 
sum $, H,(C) is denoted by H,(C) and is 
called the homology group of C. 

If C={C,,0,} and C’={Cj, 0;} are chain 
complexes, a chain mapping (chain map) ọ:C 
—C’ is a sequence of homomorphisms ¢,: C, 
>C; such that 0,0 9, = @,-; O 6, for each q. If 
~:C—C' is a chain mapping, then g, sends 
Z,(C) to Z,(C’) and B,(C) to B,(C’), and hence 
œ induces a homomorphism of H,(C) to 
H,(C’). 

If H,(C) is ffinitely generated, it can be de- 
composed into the direct sum of a free Abelian 
group B,(C) and a finite Abelian group 7,(C). 
B,(C) and T,(C) are called the qth Betti group 
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of C and the qth torsion group of C, respec- 
tively. The trank p, of B,(C) is called the qth 
Betti number of C. T,(C) is isomorphic to the 
direct sum of t(q) finite cyclic groups of orders 
Of, 04, ..., 04,), where 07> 1 and 67 divides 04, 
for i=1,...,t(q)--1(— 2 Abelian Groups). The 
numbers 07,09, ...,02,, are called the qth tor- 
sion coefficients of C. 

If H,,(C) is finitely generated, then the num- 
ber ¥(C)=,(—1)"p, is called the Euler num- 
ber, the Euler characteristic, or the Euler- 
Poincaré characteristic of C. In this case a 
*polynomial 2, p,t4 with variable t is called 
the Poincaré polynomial of C. 

Let C be a chain complex such that, for each 
q, C, is a tfree Abelian group of finite trank. 
Then the gth Betti number and the qth torsion 
coefficients are well defined for each g. Denote 
the ranks of C, and B,(C) by «, and f, respec- 
tively. Then it holds that p,=«,—f,—B,-1, 
and hence x(C)= X,(—1)*a,. (Euler-Poincaré 
formula). Moreover, there exists a set of bases, 
one for each C,, with the following properties: 
For each q, the base for C, is composed of 
five types of elements, af (1<i<f,—t,), b? 
(1<i<t,), c? (1 <i<p,), d? (1 <i<t,_,), and e? 
(1<i<f,-,—t,-1); 6, satisfies daf =0, ðbf =0, 
6c#=0, 0d? =02"' b8~', and def =a!"'. Such a 
set of bases is called the canonical basis of C. 

Let C be a chain complex such that each C, 
is a free Abelian group with a given base {0f}. 
Then the incidence number [o/:0/7"']€Z is 
defined by 0,(07)=2 [07:67 "Jo '. This 
notion was commonly used in the early days 
of topology. 


C. Homology of Simplicial Complexes 


Let K be a tsimplicial complex. An oriented q- 
simplex c of K is a q-simplex se K together 
with an equivalence class of ttotal orderings of 
the vertices of s, two orderings being equiva- 
lent if they differ by an even permutation of 
the vertices. If ag,...,a, are the vertices of s, 
then [ay,4,,...,@,] denotes the oriented q- 
simplex of K consisting of the simplex s to- 
gether with the equivalence class of the order- 
ing dg<a,<...<a, of its vertices. For every 
vertex a of K there is a unique oriented 0- 
simplex [a], and to every q-simplex with q > 

1 there correspond exactly two oriented q- 
simplexes, which are said to be opposites of 
one another. 

Let C,(K) denote the Abelian group 
generated by the oriented g-simplexes of K 
with the relations o +0’ =0 if o’ is the oppo- 
site of o. If we choose an oriented q-simplex 
of for each q-simplex sf of K, then each 
element of C,(K) is written uniquely as a 
finite sum È; g;cf with integers g; #0, and 


201 D 
Homology Theory 


C,(K) is a free Abelian group generated by 
the set {0f}. We define a homomorphism 

ĝa: C,(K)>C,_1(K) by 6,[4@0,44,-.-,4g1= 
Lho(—1)'Lao, ---. 4-1, G41, ---,4,]- Then 
6,064, =0 holds, and we have a chain com- 
plex C(K)={C,(K), 6,}(C,(K) = {0} if q <0), 
called the (oriented) simplicial chain complex. 
The homology group H,(C(K)) is denoted by 
H,,(K) and is called the (integral) homology 
group of the simplicial complex K. 

Let K, and K, be simplicial complexes, and 
let f:K, >K, be a tsimplicial mapping. Then 
for each q, a homomorphism fy,:C,(K,)> 
C,(K2) can be defined by fyq([@o,41,---,4,])= 
[ f (ao), f(a), ---, f(a,)], where the right-hand 
side is understood to be 0 if f(a), f(a) ..., 
f(a,) are not distinct. The sequence fy = { fy,4} 
is a chain mapping of C(K,) to C(K,), and it 
induces a homomorphism /,.:H,(K,)—> H,(K)). 

If K is a ‘finite simplicial complex, then the 
Betti numbers, the torsion coefficients, and the 
Euler characteristic of K are defined to be 
those for the chain complex C(K). 

Let K, and K, be a subcomplexes of a sim- 
plicial complex K. Then we have the follow- 
ing fexact sequence which relates the homol- 
ogy group of K, UK, to the homology groups 
of K,, K}, and K,K):...3H,(K,NK2)> 
H,(K,) + H,(K)>H,(K, UK2) 3H, (K, N 
K,)—..., where a, B, and é, are defined as 
follows. Let i: K; NK >K; and j,:K,-K,U 
K, (/=1, 2) be inclusion mappings; then «(a)= 
(i,«(4), —iz+(a)) and B(a,, d2)=j1+(@1) +J2*(a). 
If z=c, +c, (c;E C(K))) ts a cycle of C(K, UK,) 
then 6(c,)= — G(c2) is a cycle of C(K, N K3); 6, 
sends the homology class of z to the homol- 
ogy class of 0(c,). The sequence is called the 
Mayer-Vietoris exact sequence of the couple 
{K,,K,}, and ô, is referred to as the connect- 
ing homomorphism. The prototype of the 
Mayer-Vietoris exact sequence was obtained 
by W. Mayer (Monatsh. Math. Phys., 36 (1929)) 
and L. Vietoris (ibid., 37 (1930)). The present 
form is due to Eilenberg and Steenrod [7]. 


D. Homology of Polyhedra 


If K is a tsimplicial complex and K’ is a tsub- 
division of K, then there exists a canonical 
isomorphism H,(K)= H,(K’). This proves that 
if K, and K, are tsimplicial decompositions of 
a polyhedron then H,(K,) and H,(K,) are 
isomorphic, because there exists a common 
subdivision of K, and K,. Thus we may de- 
fine the (integral) homology groups H,,(X) of 
a polyhedron X to be the homology group 
H,,(K) of a simplicial decomposition K of X. 
Let X and Y be polyhedra, and let f:X >Y 
be a continuous mapping. Take a *stmplicial 
approximation g:K->L of f. Then a homo- 
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morphism of H,(X) to H,(Y) given by the 
induced homomorphism 9, : H,(K)—> H,,(L) is 
independent of the choice of ¢, and is denoted 
by f- The following properties hold: (i) 1,, 
=1:H,(X)-H,,(X), where 1 is the identity; (11) 
If f:X>Y and g: YZ are continuous map- 
pings, then (go f), =9, 0 f,:H,(X)— H,,(Z); 
(iii) If ff’: X > Y are homotopic, then f, = 
f, H,(X)> H,(Y). These imply the homo- 
topy invariance of the homology group stated as 
follows: If X and Y are polyhedra which are 
thomotopy equivalent, then H,(X) and H,(Y) 
are isomorphic. Specifically, the homology 
group is a topological invariant. Thus if X is 
a ttriangulable space (for example, if X is a 
tdifferentiable manifold), then its homology 
group H,(X) can be defined to be the homol- 
ogy group H,(K), where (K, t) is a ftriangu- 
lation of X. This homology group is referred 
to as the simplicial homology group of X. 
Similarly, if X is a compact triangulable space, 
the Betti numbers p,(X) of X, etc. can be de- 
fined to be those for the chain complex C(K). 

If we denote by pt a single point, then H,(pt) 
=Z (the group of integers) and H,(pt)=0 if 
q#O0. For an n-sphere S”, a 2-torus T?, and a 
treal projective plane P?, the homology groups 
can be computed as follows by means of their 
triangulations: (1) H)(S")= H,,(S") = Z, and 
H,(S")=0 if q 40, n; (2) (T?) = HT?) =Z, 
H,(T’?)=Z+Z, and H,(T’)=0 if q 40, 1, 2; 
(3) Ho(P?)& Z, H,(P?)=Z,, and H,(P?)=0 if 
q#0, 1, where Z, = Z/2Z. 

Two surfaces are homeomorphic if and only 
if their integral homology groups are isomor- 
phic (— 410 Surfaces). 


E. Singular Homology 


There are various devices for defining ho- 
mology groups of general topological spaces. 
A familiar one is the singular homology theory 
initiated by S. Lefschetz (Bull. Amer. Math. 
Soc., 39 (1933)) and improved by S. Eilenberg 
(Ann. Math., (2) 45 (1944)). 

The standard q-simplex is the convex set 
Atc R*"! consisting of all (q + 1)-tuples (to, t1, 
...,¢,) of real numbers with ¢;>0, to +t, + 
... +f, = 1. Any continuous mapping of A‘ to 
a topological space X is called a singular q- 
simplex in X. The ith face of a singular q- 
simplex o: A1->X is the singular (q — 1)-simplex 
oog: AT '!— X, where the linear embedding 
&1 A1! A4 is defined by g; (to, ... 

s.s ta) = (fos -+s ti-15 0, titis -5 ta) 

For each integer q, let S,(X) denote the free 
Abelian group generated by the singular q- 
simplexes in X (S,(X)=0 if q <0), and define a 
homomorphism @,:5S,(X)—>S,_(X) by 0,(0) = 


sfi-15 lita 


764 


4_,(—1)'oo¢;. Then we have a chain com- 
plex S(X)={S,(X), 6,}, called the singular chain 
complex of X. The homology group H,(S(X)) 
is denoted by H,(X) and is called the integral 
singular homology group of the topological 
space X. 

Given a continuous mapping f:X > Y,a 
chain mapping f,:S(X)—S(Y) is defined by 
sending each singular simplex o:A’—X to the 
singular simplex foa:A‘?— Y, and it induces 
the homomorphism f4: H,(X)—> H,(Y). The 
properties (i), (ii), (iii) of f, in Section D hold 
for continuous mappings of topological spaces, 
and hence the singular homology group is a 
homotopy invariant. 

The homology group H,.(K) of a simplicial 
complex K is isomorphic to the singular ho- 
mology group H,(|K|) of the polyhedron |K]. 
Therefore the simplicial homology group of a 
triangulable space is isomorphic to the sin- 
gular homology group of the space. 

If {X;} is the set of tarcwise connected com- 
ponents of a topological space X, then H,(X) 
=È; H,,(X;). If X is arcwise connected, then 
H(X) =Z. If {A,} is the collection of all the 
compact subsets of X directed by inclusion, 
then H(X) is isomorphic to the tinductive 
limit lim H,,(A,). It is not true that there is a 
Mayer-Vietoris sequence in singular homology 
for any couple {X,, X,} of subsets of X. How- 
ever, for certain couples {X,, X,}, there is a 
Mayer-Vietoris exact sequence of {X,, X3}: 
SHAK, 1. X,)5H,(X,)+ H(X) H(X, U 
X2) H-1 (X, N X2)5.... For example, this 
holds if X = Int X, U Int X,, where Int denotes 
the tinterior. 

Let c: X —>pt be the mapping of a topolog- 
ical space to a single point. Then the kernel 
of cą: H (X)>H,(pt) is denoted by A(X) 
and is called the reduced homology group of 
X. It holds that H (X) = A,(X)+ H,(pt). 
Regard the tsuspension SX as the union of 
two copies of the tcone over X. Then the 
connecting homomorphism in the Meyer- 
Vietoris sequence gives an isomorphism 
ASX) = H,_,(X) for any q. The inverse of 
this isomorphism is called the suspension iso- 
morphism for homology. 

Let M be a tC”-manifold. A C”-—singular q- 
simplex in M is a singular q-simplex o: A1—>M 
such that o extends to a C”-mapping from an 
open neighborhood of A% in {(to,f,,.--,t€ 
R**"|to+t,+...+t,=1} to M. The totality 
of C” -singular simplexes in M generates a 
subcomplex of S(M), denoted by $”(M). The 
inclusion S*(M)< S(M) induces an isomor- 
phism H,(S°(M))= H,(M). 

Let M be an n-dimensional ftopological 
manifold. Then H,(M)=0 unless 0<q <n, and 
H,,(M) is finitely generated if M is compact. 
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F. Homology of CW Complexes 


Homology theory is tractable in the category 
of *CW complexes by virtue of the facts stated 
below. 

Let X be a topological space and A its sub- 
set. Then we denote by X/A the quotient space 
obtained from X by shrinking A to a point, 
understanding X/@ to be the disjoint union 
X Upt, If X,, X, are tsubcomplexes of a CW 
complex, then the Mayer-Vietoris exact se- 
quence of {X,, X,} and the excision isomor- 
phism i% : H; (X, AX, N X2))= H,((X4 U X)/X2) 
(i: inclusion) are valid. If A is a subcomplex of 
a CW complex X, then we have the following 
reduced homology exact sequence of (X, A):... 
3 A (A)-5 A(X) 8,(X/A)3 A,_1(A)3..., 
where i, and j, are induced by the inclusion 
i: A-+X and the collapsing j: X > X/A, and 
é, is given by a commutative diagram 


A(X /A)+4,_,(A) 
~ th ~|s 
A (X UCA)-3A,(SA). 


Here CA is the cone over A, S is the sus- 
pension isomorphism, and h:X UCA>(X U 
CA)CA=X/A and k: X UCA>(X UCA)/X = 
SA are collapsings. More generally, if A, B 
are subcomplexes of a CW complex X and 

A >B, then we have the following reduced 
homology exact sequence of (X, A, B): ... 3 

Ñ (4/B)5 A (X/B)5 Ã (X/A, ,(A/B)> 
.... Furthermore, the homology group H,(X) 
of a CW complex X can be computed in the 
following manner. 

Let X4 denote the q-skeleton of X, i.e., 
the union of all cells of dimensions <q. Put 
C,(X) = A,(X4/X9), and let 0,:C,(X)> 
C,-,(X) be the connecting homomorphism 
ôy: A (XXT) f,- (X1 !/X1?) in the re- 
duced homology exact sequence of (X1, XT !, 
X12), Then C(X)={C,(X), 6,} is a chain com- 
plex such that C,(X) is a free Abelian group 
with one generator for each q-cell of X. If X is 
a polyhedron |K|, then C(X) coincides with 
the simplicial chain complex C(K). The ho- 
mology group H,(C(K)) is called the cellular 
homology group of the CW complex X. This is 
isomorphic to the singular homology group 
H,,(X). 

Since a CW decomposition frequently re- 
quires fewer cells than a simplicial decompo- 
sition, the cellular homology groups are use- 
ful in calculating the homology groups. For 
example, the tcomplex n-dimensional projec- 
tive space CP" has a CW decomposition with 
a single 2i-cell for each i=0, 1,...,n, and hence 
we see immediately that H,(CP") = Z if q=2i 
(O<i<n) and =0 otherwise. 

If X is a finite CW complex and a, denotes 
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the number of q-cells of X, then we have the 
Euler-Poincaré formula y(X) = %,(—1)"a,. In 
particular, if X is homeomorphic to S? then we 
have the Euler theorem on polyhedra: xo —«, + 
æ, = 2. This was the first important result in 
topology (L. Euler, 1752). 


G. Homology with Coefficients in Abelian 
Groups 


Given a chain complex C and an Abelian 
group G, we have a new chain complex C ® G 
given by (C &® G),=C, @ G and ô (c 8 g)= 

ĉc Q g (ceC,,geG), where @ is the ttensor 
product of Abelian groups. For a topological 
space X, the homology group of the chain 
complex S(X) ® G is denoted by H,(X; G) and 
is called the singular homology group of X 
with coefficients in G. The homology group 
H,,(K; G) of a simplicial complex K with coeffi- 
cients in G and the cellular homology group 
H,.(C(X); G) of a CW complex X with coeffi- 
cients in G are similarly defined. The ho- 
mology group with coefficients in Z is the 
integral homology group. 

The previous results for the integral ho- 
mology groups generalize in a straightforward 
fashion to homology groups with coefficients 
in G. 

We have the homomorphism x: H,(X) @ 
G—H,(X; G) sending a ® ge H(X) @ G to the 
homology class of z ® ge Z,(S(X) ® G), where 
z is a representative cycle of a. The follow- 
ing theorem is known as the universal coeffi- 
cient theorem for homology, since it expresses 
H(X; G) in terms of H,(X), H,_,(X), and G: 
There is an exact sequence 0>H,(X) @ GS 
H(X; G)>Tor(H,_,(X), G)>0, and this se- 
quence is split (— 200 Homological Algebra). 
Universal coefficient theorems of this type 
were first shown by S. Eilenberg and S. Mac- 
Lane (Ann. Math., (2) 43 (1942)). 

Let A be a fring with a unit 1. Then a chain 
complex over A is a chain complex C such that 
each C, is a A-module and each ô, is a tA- 
homomorphism. The homology groups H,(C) 
of a chain complex C over A are A-modules. If 
C is a chain complex, C @ A forms naturally a 
chain complex over A. In particular, if X is a 
topological space, then S(X) @ A is a chain 
complex over A, and H,(X; A) are A-modules. 
In this case, the induced homomorphisms 
Ja: HX; A) H,(Y; A) are A-homomorphisms. 
The homology groups with coefficients in a 
field k are tvector spaces over k and are useful 
in applications. If H(X) is finitely generated, 
then y(X)=%,(—1)*dim, H,(X; k) holds for 
any field k. 
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H. Cohomology 


A cochain complex C = {C‘, 5%} is a collection 
of Abelian groups C”, one for each integer q, 
and of homomorphisms 64:C?—C**! such 
that 67*' 0 64=0. Elements of C? are called q- 
cochains, and ôf is called the coboundary 
operator. The notions of subcomplex of a co- 
chain complex, cocycle, coboundary, coho- 
mology group, and cochain mapping are de- 
fined as in chain complex. Let Hom(A, B) 
denote the tgroup of homomorphisms from an 
Abelian group A to an Abelian group B. Given 
a chain complex C and an Abelian group G, a 
cochain complex C* = Hom(C, G) is defined 
by C4=Hom(C,, G) and (64u)(c)=u(0,,, 0) 
(ueCt, cE Cyri). 

For a topological space X, the cochain 
complex Hom(S(X), G) is called the singular 
cochain complex of X with coefficients in G, 
and elements of Hom(S,(X), G) are called 
singular g-cochains of X. The cohomology 
group of Hom(S(X), G) is denoted by H*(X;G) 
and is called the singular cohomology group of 
X with coefficients in G. We write H*(X) for 
H*(X; Z); this is called the integral cohomol- 
ogy group of X. Similarly, the cohomology 
group H*(K; G) of a simplicial complex K 
with coefficients in G and the cellular coho- 
mology group H*(C(X); G) of a CW complex 
X with coefficients in G are defined. There are 
isomorphisms H*(K; G)= H*(| K|; G) and 
H*(C(X); G)& H*(X;G). 

If f:X—>Y is a continuous mapping, then a 
cochain map f* :Hom(S(Y), G) > Hom(S(X), 
G) is defined by (f*u)(c)=u(f,c) with ue 
Hom(S,(Y), G) and ceS,(X). Therefore f in- 
duces the homomorphism f *:H*(Y; G)> 
H*(X;G). The following properties hold: (i) 
1*=1; (tt) (go f)* =f* og*; (iii) If f and f’ are 
homotopic, then f* = f'*. In particular, the 
singular cohomology groups are homotopy 
invariants. The tcokernel of c*: H*(pt;G)> 
H*(X; G) induced by the mapping c: X >pt 
is denoted by H*(X;G) and is called the re- 
duced cohomology group of X. 

For če H1(X; G) and ae H,(X), the Kro- 
necker index <¢,a>€G is defined naturally in 
terms of representatives of č and a. We have 
the following universal coefficient theorem for 
cohomology: There is an exact sequence 0> 
Ext(H,_;(X), G)N > H"(X; G)> Hom(H,(X), G)> 
0, and this sequence is split, where x is given 
by the Kronecker index (— 200 Homological 
Algebra). 

If A is a ring with 1, then a cochain complex 
over A is defined analogously to a chain com- 
plex over A. The singular cochain complex 
Hom(S(X), A) forms naturally a cochain com- 
plex over A, and H(X; A) are A-modules. For 
Ce H4(X; A) and ae H,(X; A), the Kronecker 
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index <ë, aX €A is defined naturally. If k is 
a field, then the vector spaces H4(X;k) are 
identified with the dual space of H,(X,k) by 
means of the Kronecker index. 

If M is a C”~-manifold, then there is a co- 
chain complex D(M)={D%(M), d} over the 
field R of real numbers, where D*(M) is the 
real vector space consisting of the tdifferen- 
tial forms of degree q on M, and d:D4(M)—> 
D11 (M) is the texterior differentiation. The 
cochain complex D(M) is called the de Rham 
complex of M, and its cohomology group 
H*(D(M)) is called the de Rham cohomology 
group of M. A cochain mapping .4:D(M)—> 
Hom(S“(M), R) is defined by 


(F(o)) (a) -| o*a, 
Af 


where we D4(M), o: A’ M is a C® singular q- 
simplex in M, and o*q@ denotes the tpullback 
of w by o. We have isomorphisms 


H*(D(M))2 H*(Hom(S*(M), R)}<— H*(M; R), 


where i* is induced by the inclusion S°(M)c 
S(M). This result is called the de Rham theo- 
rem on the cohomology of manifolds (— 105 
Differentiable Manifolds). 


I. Cohomology Rings 


Given a topological space X and a ring A, the 
cup product u ~ ve Hom(S,,,,(X), A) of co- 
chains ue Hom(S,(X), A) and ve Hom(S,(X), A) 
is defined by (u ~ v)(o)=u(oo8)v(a oe’), where 
a: A’*4-+X is a singular (p+q)-simplex, ¢:A’> 
A’*4 and ¢’':A2A?*4 are given by efto, t,, 

sey lp) =(lo,t1,---ytp,0,...,0), and e(t,, tpi, 
sorb pr g=(0, ...,0, tps tp+1> -+> tp+q) The prod- 
uct operation is bilinear, and the formula 
(uw v)=du— v+(—1)’u — dv holds. There- 
fore it gives rise to the cup product €~ ye 
H’*4(X; A) of cohomology classes če H?(X; A) 
and ne H?(X; A). This cup product operation 
makes H*(X; A) into a ring, which is called 
the singular cohomology ring of X with coeffi- 
cients in A. If A has 1, the cohomology class 
represented by the 0-cocycle taking the value 
1 on each singular 0-simplex serves as 1 of 
H*(X; A). If A is commutative, then éy = 
(—1)’4né holds. The induced homomorphism 
JS *:H*(Y; A)>H*(X; A) preserves the prod- 
uct, and hence the cohomology ring is a ho- 
motopy invariant. 

If K is a simplicial complex, the cup prod- 
uct operation — :H?(K; A) @ HYK; A)> 
H?*4(K; A) is induced from the opera- 
tion ~ :Hom(C,(K), A) @ Hom(C,(K), A)> 
Hom(C,,,(K), A) defined as follows. Adopt- 
ing a ‘linear ordering of vertices of K, we write 
all oriented simplexes in this ordering. Then, 
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for ue Hom(C,(K), A) and ve Hom(C,(K), A), 
we define u ~ ve Hom(C,,,,(K), A) by (ur 

U) Ld. a1, +- ,ap+4])=u( Cao, 44, ---4p]) (Cay, 
An+t>+++>Ap4q]). The canonical isomorphism 
from the cohomology of K to the singular 
cohomology of | K| preserves the cup product. 

On the de Rham complex D(M) of a C”- 
manifold M, we have the texterior product 
w@AbeD?*{M) of we D?(M) and 66 DYM). 
This makes H*(D(M)) into a ring, which is 
called the de Rham cohomology ring. The 
canonical isomorphism of H*(D(M)) to 
H*(M; R) preserves the product (— 105 Dif- 
ferentiable Manifolds). 

Examples. (1) Let T"=S! x... x S! denote 
the n-dimensional torus, and let z;:T">S! 
denote the projection to the ith factor (1 < 
i<n). Take a generator € of the A-module 
H?! (St; A), and put č; = nž(é)e H'(T"; A). Then 
H*(T"; A) is the fexterior algebra over A gen- 
erated by ¢,,...,¢,. (2) If we denote by CP” 
the complex n-dimensional projective space, 
then H4(CP"; A) is A if g=2i (O<i<n) and 0 
otherwise. If € is a generator of the A-module 
H?(CP"; A), then č’ generates the A-module 
H?(CP"; A) (O<i<n). Thus H*(CP"; A) is the 
quotient ring ALE]/(é"*') of the tpolynomial 
ring A[E] by the tideal (€"*'). (3) If P” denote 
the real n-dimensional projective space, then 
H*(P";Z,)=Z,[E]/(E"*!), where € is the gen- 
erator of H!(P";Z,). 


J. Homology of Product Spaces 


If C and D are chain complexes, their tensor 
product C ® D is a chain complex given by 
(C @ D),=Lpig-nC, Q D,, and 6,(c @ d)= 
6,(c)@d+(—1)’c Q 2 (d) (ceC,, de D,). The 
following Eilenberg-Zilber theorem (Amer. J. 
Math., 75 (1953)) is the link between the alge- 
bra of tensor products and the geometry of 
product spaces: For the product space X x Y 
of topological spaces X and Y, there is an 
isomorphism p,:H,(X x Y;G)=H,(S(X)@ 
S(Y) ® G) induced from a chain mapping p: 
S(X x Y)>S(X) @ S(Y) defined as follows: 
Given a singular n-stmplex ¢:A">X x Y, 
we define for each p (0< p <n) a singular p- 
simplex g, in X to be the composite A? >A" 
X x YX, where efto tis -s fp) = (tostis 5 
t,,9,...,0) and z, is the projection to the first 
factor. Similarly, we define for each q (0< 
q <n) a singular g-simplex of in Y to be the 
composite A?5A"X x YŠ Y, where Elti 
-<-s 6,)=(0,...,0,t,-4,.-.,¢,) and 7, is the pro- 
jection to the second factor. Then p is defined 
by p(o)=È p+4=10p Q oy and is called the 
Alexander-Whitney mapping (Alexander- 
Whitney map). 

Given a ring A, a chain mapping p:(S(X) ® 
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A) @ (S(Y) @ A)>S(X) @ S(Y) @ A is de- 
fined by n((c @ A) Q Udi) =c Dd AA"; it 
induces homomorphisms yp, : H,(X; A) @ 
H,(Y; A)>H,,,(S(X) @ S(Y) @ A). The cross 
product a x be H,,,(X x Y; A) of ae H (X; A) 
and beH,(Y; A) is defined to be p,.'(u,.(a x b)). 
If A is a commutative ring with 1, then the 
cross product defines a A-homomorphism x : 
H,(X; A) @, HY; A)> 4 4(X x Y; A), and 
satisfies (a x b) x c=a x (b xc), T (ax b)= > 
(—1)™b xa, (f x g),(a x b)= f(a) x g,.(b), 
where T:X x YY x X is the mapping inter- 
changing factors, and f: X-X’, g: YY’ are 
continuous mappings. If A is a tprincipal ideal 
domain, there is an exact sequence 

0> È A,(X;A)@,H(¥:A)-3H,(X x YA) 


ptq=n 


> }_ Tor,(H,(X;A), H,(Y;A))9, 
ptq=n-1 
and this sequence is split (— 200 Homolog- 
ical Algebra). In particular, if k is a field we 
have the following isomorphism of vector 
spaces: 
x: $} A,(X3k)@® H,(Y;k)=H,(X x Y;k). 
ptq=n k 
This is called the Kiinneth theorem, since the 
prototype was proved by D. Kiinneth (Math. 
Ann., 90 (1923); ibid., 91 (1924)). The present 
form was given by H. Cartan and S. Eilenberg 
[8]. 

The tensor product C © D of cochain com- 
plexes C and D is defined analogously to that 
of chain complexes. Given topological spaces 
X, Y and a ring A, a cochain mapping p: 
Hom(S(X), A) ® Hom(S(Y), A)> Hom(S(X) 

@ S(Y), A) is defined by (uu © v))(c @ d) 
=u(c)v(d), where ue Hom(S,(X), A), ve 
Hom(S,(Y), A), ce S,(X), deS,(Y), and u(c)o(d) 
is understood to be 0 if (p, g)¥(s, t). We then 
have the composite H?(X; A) ® H4(Y; A)S 
H?’*4(Hom(S(X) Q S(Y), A) H?*4(X x Y:A), 
where p ts the Alexander-Whitney mapping. 
For €€ H?(X; A) and ne H4(Y; A), the cross 
product € x n€ H’'4(X x Y; A) is defined to be 
p* u,(€ @ n). The cohomology cross product 
satisfies the properties analogous to the homol- 
ogy cross product. 

The cup product and the cohomology cross 
product are given in terms of each other: ë~ 4 
=d*(¢ x n), č x n =n] (č) ~ 23 (n), where d: X > 
X x X is given by d(x)=(x,x), and z; : X x Yo 
X, n3: X x YY are projections. 

We have the following Künneth theorem for 
cohomology: If A is a principal ideal domain 
and each H (X; A) is finitely generated over A, 
then there is an exact sequence 
0> $ H(X;A)@, HY; A)-5H"(X x Y;A) 


ptq=n 


> SY Tor,(H?(X; A), H4(Y;A))>09, 


ptqą=n+i 
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and this sequence is split (— 200 Homological 
Algebra). For če H?(X; A), ne H4(Y; A), c'e 
H*(X; A), n'€ H'(Y; A), the formula (é x 4) ~ 
(Ox m')=(-1)®(C~ č) x (n~ 17’) holds. There- 
fore, if k is a field and dim, H,(X;k)< œ for 
each q, then the cohomology ring H*(X x Y;k) 
is determined by the cohomology rings 
H*(X;k) and H*(Y;k). 

tFiber bundles can be considered as gen- 
eralized product spaces. Let E be the total 
space of a fiber bundle with base B and fiber F. 
The following Leray-Hirsch theorem (J. Math. 
Pures Appl., 29 (1950)) asserts that, under 
certain conditions, the cohomology of E is 
additively isomorphic to that of B x F: Let A 
be a principal ideal domain, and assume that 
H,,(F; A) is free and finitely generated over A. 
Furthermore, assume that there is a homo- 
morphism 0: H*(F; A)> H*(E; A) such that the 
composite H*(F; A) H*(E; A) H* (p~! (b); A) 
is an isomorphism for each be B, where p: E> 
B is the projection and i,:p '(b)c E. Then 
an isomorphism ®: H*(B; A) ®, H*(F;A)= 
H*(E; A) is given by ®(¢ @ n)=p*E ~ O(n), 
where če H*(B; A), ne H*(F; A). 

A general connection between (co)homol- 
ogy of E and B x F is given by means of spec- 
tral sequences (— 148 Fiber Spaces). 


K. Cap and Slant Products 


There are other products closely related to the 
cup product or the cross product that involve 
cohomology and homology together. 

Given a topological space X and a ring 
A, the cap product v ~ceS,(X)@ A of a co- 
chain ve Hom(S,(X), A) and a chain c= £; c; © 
A;€S,4q(X)@ A is defined by v~ c=}; o;o 
£Q v(g;06')4;, where o; are singular (p + q)- 
simplexes in X, A;e A, and ¢:A?>A?*4, g'i A> 
A’*4 are the mappings used in the definition 
of cup product. For any ue Hom(S,(X), A), 
the formula <u ~v, cò = <u, v~ c> holds. The 
cap product satisfies 6(v ~ c)=(—1) ôv ~ c + 
va ĉc, and hence it induces the cap product 
¢-— ae H, (X; A) of a cohomology class če 
H4(X; A) and a homology class ac H,+,(X; A). 
If A is a commutative ring with 1, then the 
cap product operation is bilinear and satisfies 
the following properties: (č — &’) ~a=é ~ 
(Cn a), ff *0 aan fala) ~ a=a, 
(¢ xn) ~ lax b)=(—1)P" (FE ~ a) x (nab), 
where če H?(X; A), € € H” (X; A), ne H4(Y; A), 
aéH,(X; A), be H,(Y; A), and f:X > Y isa 
continuous mapping. 

Given topological spaces X, Y and a ring 
A, the slant product w/d € Hom(S,(X), A) of 
a cochain we Hom(S(X) @ S(Y)),,,, A) and 
a chain d= );,7; ® 4E SY) © A is defined 
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by (w/d)(a)=X(w(o ® 7,))A;, where o is a 
singular p-simplex in X, q; are singular q- 
simplexes in Y, and 4; A. The slant operation 
satisfies 6(w/d)=(dw)/d —(—1)?w/éd. There- 
fore, under the identification H*(X x Y; A)= 
H*(Hom(S(X) © S(Y), A)), it induces the 

slant product ¢/be H?(X; A) of a cohomology 
class e H’**(X x Y; A) and a homology class 
beH,(Y; A). For any ae H,(X; A), it holds that 
<¢/b, a> = <0, ax b). 

Let G, G’ and G” be Abelian groups. Given a 
homomorphism G’ ® G” >G, we write g'g” for 
the image of g' © g” eG’ © G” in G. Then the 
cap product ~ : H(X; G) @ H,,,(X;G")> 
H(X; G) can be defined in the same way as 
before. Similar definitions are valid for the cup 
product, the cross products for homology and 
cohomology, and the slant product. 


L. Relative Homology 


If C’ is a subcomplex of a chain complex C, 
then we have a chain complex C/C’ = {C,/C;, 
6,}, where C,/C, denotes the quotient group 
and ô, ts induced from ô, by passing to the 
quotient. C/C’ is called the quotient complex of 
C by C'. 

A topological pair (X, A) is composed of a 
topological space X and its subset A. Given a 
topological pair (X, A) and an Abelian group 
G, we have the chain complex (S(X)/S(A)) @ G 
and the cochain complex Hom(S(X)/S(A), G). 
The homology group of (S(X)/S(A)) @ G is 
denoted by H,,(X, A; G) and is called the rela- 
tive singular homology group of X modulo A 
with coefficients in G or the singular homology 
group of (X, A) with coefficients in G. The 
homology group H,(X; G)=H,,(X, Ø; G) is 
sometimes called the absolute homology group. 
Similar definitions are made for the coho- 
mology group H*(X, A; G) of the cochain 
complex Hom(S(X)/S(A), G). 

A simplicial pair (K, L) is composed of a 
simplicial complex K and its subcomplex 
L, and a CW pair (X, A) is composed of a 
CW complex X and its subcomplex A. The 
relative homology group H,(K, L; G)= 
H,,((C(K)/C(L)) @ G) of a simplicial pair 
(K, L) is isomorphic to H,(|K|,|L{; G). For a 
CW pair (X, A), there is an isomorphism 
H,(X, A; G)= (X/A; G). Similar statements 
hold for the cohomology groups. 

A continuous mapping f:(X, A)-(Y, B) of 
topological pairs is a continuous mapping f: 
X >Y such that f(A)cB. If f:(X, A)(Y, B) is 
a continuous mapping, then f,:S(X)—>S(Y) 
sends S(A) to S(B), and hence f induces homo- 
morphisms f,: H(X, A; G)>H,,(Y, B; G) and 
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{*:H*(Y, B; G)> H*(X, A; G). Since the prop- 
erties are analogous, we state them below 
only for the case of relative homology. 

The following six properties are fundamen- 
tal. (i) 1, =1. (i) (gos), =g, © fẹ- (tii) Homo- 
topy property: If f, f’:(X, A)—>(Y, B) are tho- 
motopic, then f, = f,. (iv) Exactness property: 
There exists a homology exact sequence of 
A A):...3H,(A;G)3H,(X; G) H(X, A:G)3 

-ı (å; G)>.. , where i: Ac X,j1(X, OB) 
a A), and ô, sends the homology class of a 
cycle of (S(X ys (A)) @ G represented by a 
chain ce S(X)® G to the homology class of ĉc 
which is a cycle of S(A) @ G. ô, is called the 
boundary homomorphism or the connecting 
homomorphism. (v) Naturality of 0,: For any 
continuous mapping f:(X, A)—>(Y, B), it holds 
that 6,0 f, =(f| A), 06,. (vi) Excision pro- 
perty: If U is a subset of X such that the clo- 
sure U is in Int A, then the excision isomor- 
phism H,(X — U, A—U;G)=H,(X, A; G) is 
induced by inclusion. 

The exactness property extends to the 
homology exact sequence ofa triple (X, A, B): 

.3H,(A, B; G)4H, (X, B; G) H(X, A; gs 
i ,(A, B;G)5.... A couple {X,,X,} of sub- 
sets of X is said to be excisive if H,(X,,X, 
X) = H,(X,U X7, X2) is induced by inclusion. 
If {X,, X2} is excisive, so is {X,, X,}. For ex- 
ample, if X =Int X, UInt X, or if X, and X, 
are subcomplexes of a CW complex, then 
{X,, X2} is excisive. If {X,,X,} and {A,, A2} 
are excisive couples such that A, c X, and 
A, € X,, then we have the relative Mayer- 
Vietoris exact sequence: ... >H (X, N X2, A, N 
Ay; G)> HX, A1: G)+ H(X, A2; G)> H(X, U 
X>, A4, U Ag; G)> H, (X1 N Xp, A NA; G)... 
For the case of relative cohomology, we use 
terms such as cohomology exact sequence and 
coboundary homomorphism, correspondingly. 

The universal coefficient theorems are 
valid for the relative (co)homology groups. 
Given a homomorphism C © G">G, if 
{A, B} is excisive in X, then the cup product 
<: H?(X, A: G’) Q HX, BG") H?*4(X, AU 
B; G) and the cap products ~ : H4(X, B; G) © 

H,44(X, AU B; G")> H,(X, A; G) can be de- 
fined. The product (X, A) x (Y, B) is defined 
to be the pair (X x Y, A x YU X x B). Given a 
homomorphism G’ ® G” >G, the homology 
cross products x : H (X, A; G’)@ H,(Y, B; G”) 
>H,+4((X, A) x (Y, B); G) and the slant prod- 
ucts /: H’*4((X, A) x (Y, B); G) © H,(Y, B; G”) 
>H”(X, A; G) can be defined. If {A x Y, X x 
B} is excisive, then the cohomology cross 
products x :H?(X, A; G) ® H4(Y, B; G”)—> 
H?*4((X, A) x (Y, B), G) can also be defined. 

If {A x Y, X x B} is excisive, the Künneth 
theorems are valid for the relative 
(co)homology groups [9, 10,11]. 
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M. Čech Homology Theory 


Another homology theory is commonly used 
along with the singular theory. The theory was 
originated by E. Čech (Fund. Math., 19 (1932)) 
and was modified by C. H. Dowker (Ann. 
Math., (2) 51 (1950)). 

Given a topological space X and an Abelian 
group G, the Čech homology group H,(X; G) 
and the Čech cohomology group H*(X;G) are 
defined as follows. We take the family of all 
topen coverings of X directed by frefinement, 
and we consider the tnerve K(U) of each open 
covering H, that is, the simplicial complex 
whose simplexes are finite nonempty subsets 
of U with nonempty intersection. If W is a 
refinement of U, then a simplicial mapping 
n(U, W): K(W)— KU) is obtained by assign- 
ing to each U’eU’ an element UeU such 
that U’'c U. The induced homomorphisms 
mW, UW’), H (KW); G)> A,,(K(U); G) and 
nU, W')*: H*(K (UW); G) > H*(K (WW); G) are inde- 
pendent of the choice of z(U, U’), and we have 
the tinverse system {H,.(K(U); G), n(U, W),.} 
and the direct system {H*(K(U); G), 2(U, W’)*}. 
We now define H, (X; G)=lim H, (K (4); G) 
and H*(X; G)=lim H*(K(U y; G). 

A continuous mapping f:X—>Y induces 
homomorphisms f, : H,(X; G)>H,(Y;G) and 
f*: Ä*(Y;G)> Ā*(X; G) as follows. If 8 is an 
open covering of Y then a simplicial mapping 
Sy: K(f *(&))> K (B) is defined by fyf (V)) 
=V (Ve). The induced homomorphisms 
Sar: A, (K(f ~'(8)); G)> A, (K(X); G) for 
all open coverings B of Y gives rise to f,: 
H,(X; G)>H,(Y; G). Similarly f induces f*. 

Another approach to Cech cohomology 
theory is called the Alexander-Kolmogorov 
construction (Proc. Nat. Acad. Sci., 21 (1935) 
and C. R. Acad. Sci. Paris, 202 (1936)). The 
approach was improved by E. H. Spanier 
(Ann. Math., (2) 49 (1948)), and the theory is 
now called the Alexander (or Alexander- 
Kolmogorov-Spanier) cohomology theory. 

The Alexander cohomology group H*(X; G) 
is defined as follows. Let @4(X; G) be the 
Abelian group of all functions from the (q+ 1)- 
fold product space X**' to G with addition 
defined pointwise. An element ye ®*(X; G) 
is said to be locally zero if there is an open 
covering U of X such that (xo, ...,x,) van- 
ishes if Xo, ...,X, are contained simultaneously 
in some U eU. The subgroup of P(X; G) con- 
sisting of locally zero functions is denoted by 
D(X; G). We define a homomorphism 67: 
D(X; a G) by (640) (Xp, X1; -3 
Xg+1)= Fol D @( Xx Xi- 1s Xiti +++ Xqt1)- 
Then ae G) = {®1(X; G), 6%} is a cochain 
complex, and P(X; G)= {O3(X; G), 64} is its 
subcomplex. We now define H*(X; G) to be 
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the cohomology group of the quotient com- 
plex P(X; G)= ©(X; G)/®,(X: G). 

If f:X > Y ts a continuous mapping, then a 
cochain map f* :®(Y;G)>®(X; G) is defined 
by (f* Po X15 X) = Plo), f), 
{(x,)), and it induces the homomorphism f*: 
H*(Y; G)-> H*(X;G). There is a natural iso- 
morphism H*(X;G)=H*(X;G). 

The (co)homology group of a simplicial 
complex K is isomorphic to the Cech (co)- 
homology group of |K]. If X is a manifold or 
a CW complex, then its singular (co)homology 
group and its Cech (co)homology group are 
isomorphic. However, even for compact met- 
ric spaces X, the singular (co)homology group 
of X is not necessarily isomorphic with the 
Cech (co)homology group of X. 

If {X,} is an tinverse system of compact 
Hausdorff spaces and x = lim X,, then there 
are isomorphisms lim H4(X;; G)= H,(X;G) 
and lim H*(X,;G)= H*(X; G). This is called 
the continuity property for Cech theory. If A is 
any closed subset of a manifold M, then there 
is an isomorphism lim H*(W; G) = H*(A;G), 
where W varies over neighborhoods of A in M 
directed downward by inclusion. If the tcover- 
ing dimension of X is n, H4(X;G)=0 for g>n. 

The cup product in the Cech cohomology is 
introduced simply by passing to the limit with 
cup products in simplicial complexes, and the 
cup product in the Alexander cohomology is 
induced from the operation ~ :®?(X; C) Q 
P(X; G”) +? "4(X; G) defined by (p ~ W) (Xo, 
Xis t Xpt) = P(X, X15 +++ XY (Xp Xpt 
sei Noga): 

The relative Čech homology group H,(X, 

A; G) and the relative Čech cohomology group 
A*(X, A; G) are defined as follows: An open 
covering of (X, A) is a pair (U, R) of an open 
covering U of X and an open covering N of 
A such that Rc U. To such a pair (U, N) we 
assign a simplicial pair (K (4), K'(®)), where 
K'(NR) is the nerve of RNA={NNA|NEV. 
Considering the family of all open cover- 
ings of (X, A), we define now H,(X, A; G)= 
lim H,,(K (UW), K'(N); G) and H*(X,A:G)= 
lim H*(K (U), K’(I0; G). 

The relative Alexander cohomology group 
H*(X, A; G) is defined to be the cohomology 
group of the cochain complex which is the 
kernel of the cochain mapping i*: ®(X; G)> 
(A; G) induced by inclusion. There is a nat- 
ural isomorphism H*(X, A; G)& H*(X, A; G). 

The relative Cech (co)homology groups 
satisfy the properties analogous to the relative 
singular (co)homology groups except the 
exactness property for homology (— Section 
Q). In certain cases, the excision property is 
strengthened for Cech (co)homology. For 
example, we have the following theorem: As- 
sume that X and Y are compact Hausdorff 
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spaces, A and B are closed subsets of X and Y, 
respectively, and f:(X, A)—(Y, B) is a continu- 
ous mapping which maps X — A onto Y— B 
homeomorphically. Then f, : H,(X, A; G)= 
H,(Y, B; G) and f*:H*(Y, B; G)= A*(X, A; G) 
hold. 


N. Fundamental Classes of Manifolds 


For a topological space X and a point x of X, 
the local homology groups H,.(X, X — x) repre- 
sent a topological property of X around x. 

The notion of torientation for differentiable 
manifolds and triangulable manifolds general- 
izes to ttopological manifolds by using local 
homology groups as follows. Let M be an 
n-dimensional (topological) manifold with 
*boundary ðM. If x is a point of the interior 
M,)=M-— OM, then H,(M, M —x)=H,(R", R” 
—0) is Z for g=n and is 0 for g#n. We define 
a local orientation o, for M at xe Mo to bea 
choice of one of the two possible generators 
for H,(M, M — x), and we then define an orien- 
tation for M to be a function which assigns to 
each xe M, a local orientation o, which varies 
continuously with x in the following sense: For 
each x there should exist a compact neighbor- 
hood N c Mo and an element op€ H,(M, M — 
N) such that i,.(0y)=0, for each ye N, where 
i,:(M,M—N)<(M, M —y). If there is an ori- 
entation for M, then M is said to be orient- 
able, and the pair of M and an orientation is 
called an oriented manifold. If M is a nonorient- 
able manifold without boundary, the set of 
local orientations for M forms an orientable 
manifold doubly covering M, called the orien- 
tation manifold of M. 

If M is an oriented n-dimensional manifold, 
then for any compact subset K of M there is a 
unique element og €H,(M,(M — K)U0M) such 
that i,,.(0x)=0, for each xe KN Mo, where 
i,:(M,(M —K)U0M)c(M, M — x). The ele- 
ment ox is called the fundamental homol- 
ogy class around K. In particular, if M is it- 
self compact, o u € H, (M, 6M) is usually de- 
noted by [M] and is called the fundamental 
homology class of M. A connected compact 
n-dimensional manifold M is orientable if 
and only if H (M, 0M)+#0, and in this case 
H,(M,0M) is a free cyclic group generated by 
a fundamental class [M]. If M is an orientable 
compact n-dimensional manifold, then ôM is 
an orientable compact (n — 1)-dimensional 
manifold without boundary, and the boundary 
homomorphism ¢,,: H,(M,0M)—H,_, (0M) 
sends a fundamental class [M] to a funda- 
mental class [0M]. 

An n-dimensional manifold M is orientable 
if and only if there exists an element Ue H"(M 

x M, M x M—dM) such that, for each xe Mo, 
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j*(U) is a generator of H"(M, M — x), where 
dM is the diagonal in M x M, and j,:(M,M 
—x)>(M x M, M x M —dM) is given by j,(y) 
=(x, y) (ye M). In fact, U corresponds to an 
orientation which assigns to each xe My a 
local orientation o, such that < j*(U),0,5=1. 
The element U is called the orientation coho- 
mology class of M. If M is a compact manifold 
without boundary, it holds that <d*(U),[M]> 
= 7(M), where d*: H"(M x M, M x M —dM)—> 
H"(M) ts induced by the diagonal mapping. 
The element d*(U)e H"(M) is called the Euler 
class of M. 

If we work with the (co)homology groups 
with coefficients in Z,, the fundamental classes 
are defined for an arbitrary manifold without 
making any assumption of orientability. If 
M is connected and compact, then H,(M, 

OM; Z,)=Z, is generated by [M]. 


O. Duality in Manifolds 


Let M be a compact n-dimensional manifold, 
and let M,, M, be compact (n— 1)-dimensional 
manifolds such that M, UM,=¢M and M, N 
M,=6M,=6M.,. Assume either that M is 
oriented or that G = Z,. Then for each q, an 
isomorphism D: H4(M, M,;G)= H,_,(M, 
M,;G) is defined by D(€)=€ ~(M]. In par- 
ticular, there are the isomorphisms D: HM, 
6M; G)= H„,-,(M; G) and H4(M; G)= H,_,(M, 
OM; G), where the cap product is taken with 
respect to the homomorphism G ® ZG 
defined by multiplication. This theorem is 
called the Poincaré-Lefschetz duality theorem, 
and the special case for 0M = Ø is often 
referred to as Poincaré duality. 

Poincaré duality implies the following con- 
sequences for a compact n-dimensional mani- 
fold M without boundary. If M is orientable, 
then the gth Betti number is equal to the (n— 
q)th Betti number, and the gth torsion coef- 
ficients are equal to the (n—q — 1)th torsion 
coefficients. If n is odd, then y(M)=0, and if M 
is orientable and n=2 mod 4, then y(M) is 
even. 

Poincaré duality generalizes to the following 
duality theorem. Let M be an n-dimensional 
manifold without boundary, and let K be a 
compact subset of M. Assume either that M is 
oriented or that G=Z,. Then there ts an iso- 
morphism D: H4(K; G)H,_,(M, M — K; G) for 
any q, which is given as follows: For each open 
neighborhood W of K, define Dy: HW; G)> 
H,,-4(M, M — K; G) by Dy) =k, ky’ (0x), 
where k, is the excision isomorphism induced 
by k:(W, W— K)<(M, M—K). Now D is de- 
fined to be the limit of Dy, where W varies 
over open neighborhoods of K. The inverse of 
D up to sign is given in terms of the slant 
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product with the fundamental cohomology 
class U of M as follows: For each open neigh- 
borhood W of K, we define a homomorphism 
Yw: H,_4(M, M—W;G)>H4(W; G) by yy(a)= 
j*(U)/a, where j*: H"(M x M, M x M—dM)> 
H"(W x(M, M — W) is induced by inclusion. 
Then, passing to the limit, these py define the 
desired one. 

If we take an n-sphere S” as M in the above 
duality theorem and use the homology exact 
sequence of (S", S”— K), then we have the fol- 
lowing Alexander duality theorem (Trans. 
Amer. Math. Soc., 23 (1922)): If K is a closed 
subset of $", then the qth reduced Cech coho- 
mology group of K is isomorphic to the (n— 
q—1)th reduced singular homology group of 
S"— K for any coefficient group G and any q. 
In particular, if K is a tneighborhood retract, 
AK; G)= H,_,-,(S"— K; G) holds. This shows 
that H,(S"—K) depends only on K and not on 
the way K is embedded in S". The Alexander 
duality theorem for n=2 and K =S' gives the 
classical tJordan curve theorem. 

In view of the duality theorems, certain 
classical definitions in the homology of mani- 
folds can be given in terms of cohomology. 
For example, if f: M—M’ is a continuous 
mapping of oriented closed manifolds, then the 
Umkehr homomorphism or Gysin homomor- 
phism f': H,(M’; G)>H,,4(M; G) (d=dim M — 
dim M’) (W. Gysin, Comment. Math. Helv., 14 
(1941)) can be defined by Do f'=f* oD. In co- 
homology we have fi: HM; G)> H7 ?(M';G). 
Similarly, if M is an oriented n-dimensional 
closed manifold and ae H,(M), be H,(M), then 
the intersection product a: beH,,,-,(M) of 
Lefschetz can be defined by a-b=D°-'an~b= 
D(D“'a Db). If p+q=n, the number a- 
be H)(M) = Z is called the intersection number 
of a and b. The classical definitions are still 
meaningful today, since they are closer to 
geometric intuition and therefore possess con- 
siderable heuristic value. For example, the 
following fact serves to compute cup products 
in manifolds. If M is an oriented closed differ- 
entiable manifold and a, be H,,(M) are repre- 
sented by closed submanifolds N,, Na which 
intersect ttransversally, then +a-b is repre- 
sented by Ni N N, [11]. See [12] for a rigor- 
ous discussion of classical intersection theory. 


P. Cohomology with Compact Supports 


Let X be a topological space. A subset V of X 
is said to be cobounded if X —V is compact. A 
singular qg-cochain ue Hom(S,(X), G) is said 
to have compact support if there exists a co- 
bounded set V such that u(o)=0 for every 
singular q-simplex c in V. The singular co- 
chains with compact support form a subcom- 
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plex of the cochain complex Hom(S(X), G). 
The cohomology group of this subcomplex is 
denoted by H(X; G) and is called the singu- 
lar cohomology group of X with compact sup- 
ports. There is an isomorphism H*(X;G)= 
lim H*(X, V; G), where V varies over co- 
bounded subsets of X. 

Let K be a simplicial complex. A q-cochain 
ue Hom(C,(K), G) ts called a finite cochain 
of K if u(o)=0 except for a finite number of 
oriented q-simplexes o of K. If K is a tlocally 
finite simplictal complex, then finite cochains 
of K form a subcomplex of the cochain com- 
plex Hom(C(K), G) whose cohomology group 
is isomorphic to H*(|K|;G). 

Let X be a tlocally compact Hausdorff 
space, and let X U {20} denote the tone-point 
compactification of X. Then the Cech coho- 
mology group of X with compact supports, 
denoted by H*(X;G) is defined to be the re- 
duced Cech cohomology group of X U {œ} 
with coefficients in G. There is an isomorphism 
H*(X; G)=lim H*(X, V; G), where V varies 
over cobounded subsets of X. If X is a mani- 
fold or a CW complex, then H*(X;G)= 
H*(X;G). If X is a compact Hausdorff space 
and A is closed in X, then H*(X — A:G)& 
H*(X, A; G). The Alexander-Kolmogorov con- 
struction gives a direct approach to H*(X;G) 
[10,13]. 

A tproper continuous mapping f: X—>Y of 
locally compact Hausdorff spaces induces 
homomorphisms f*: H*(Y;G)— H*¥(X; G) and 
f*: H*(Y; G)-> H*(X; G), and if f, f’:X > Y are 
properly homotopic, then they induce the 
same homomorphisms. 

The cohomology with compact supports is 
useful in order to extend results in the coho- 
mology of compact spaces to noncompact 
spaces. For example, the conclusion of the 
duality theorem on a compact set K c M in 
Section O generalizes to the case of a closed 
set K CM as follows: There is an isomorphism 
H3(K; G)& H,_,(M, M — K; G) for any q. This 
implies the following generalization of Poin- 
caré duality: H7(M; G)= H,,_,(M; G) holds for 
an orientable n-dimensional manifold M with- 
out boundary. 

There are homology theories associated 
with the cohomology theories with compact 
supports [13]. 


Q. Eilenberg-Steenrod Axioms 


Let H* be a collection of the following three 
functions: (1) A function assigning to each 
topological pair (X, A) and each integer q an 
Abelian group H(X, A). (2) A function assign- 
ing to each continuous mapping f:(X, A)> 
(Y, B) and each integer g a homomorphism 
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f*:H4(Y, B)> H4(X, A). (3) A function as- 
signing to each topological pair (X, A) and 
each integer q a homomorphism 6*: H4(A)> 
H4*1(X, A). Then H* is called a cohomology 
theory on the category of topological pairs if 
the following seven axioms are satisfied [7]. 
(i) 1 =1, where 1 ts identity. (ii) (go f)* = 
f*og*: HZ, C)>H4(X, A) for continuous 
mappings f:(X, A)-(Y, B) and g:(Y, B)> 

(Z, C). (tit) Homotopy axiom: If f, f’:(X, A)> 
(Y, B) are homotopic, then f, = f,:H(Y, B)> 
HX, A). (iv) Exactness axiom: The Req uenice 

S H(X, A) 5 HX) HUA) H(X, AS... 
is aS where i: Ac X and j:(X, @)c(X, A). 
(v) f*06* =6* o( f| A)*: H4(B) +H! (X, A) for 
a continuous mapping f:(X, A)->(Y, B). (vi) 
Excision axiom: If U is an open set of X such 
that U cInt A, then i*: H(X, A)= H(X —U,A 
— U), where i is the inclusion. (vii) Dimension 
axiom: H4(pt)=0 if q #0. Axioms (i)-(vii) are 
called the Eilenberg-Steenrod axioms, and the 
group H°(pt) is called the coefficient group of 
the cohomology theory H*. 

A cohomology theory on the category of 
pairs of compact Hausdorff spaces is defined 
similarly. A cohomology theory on the cate- 
gory of CW pairs (or finite CW pairs) is de- 
fined similarly except that axiom (vi) is re- 
placed by the following excision axiom: If 
{X,, X2} is a couple of subcomplexes of a CW 
complex, then i*: H4(X,U X,, X,)=H%(X,, 
X,X,), where i is the inclusion. Two coho- 
mology theories H* and H’* on the same 
category are isomorphic if there is an isomor- 
phism h,: H(X, A)= H(X, A) for each (X, A) 
and each q, and they commute with f* and 
ô*. A homology theory on various categories 
is defined similarly by dualization. 

A singular (co)homology theory with coeffi- 
cients in G is an example of a (co)homology 
theory on the category of topological pairs. 
The Cech cohomology groups with coefficients 
in G can be made into a cohomology theory 
on the category of topological pairs. However, 
the Cech homology groups do not constitute a 
homology theory on the category of topolog- 
ical pairs; the homology sequence of any pair 
(X, A) is defined, but it can be proved only 
that the composite of any two successive 
homomorphisms is zero. The Cech homology 
groups with coefficients in a field constitute a 
homology theory on the category of compact 
Hausdorff pairs. The Alexander cohomology 
groups constitute a cohomology theory on the 
same category of topological pairs, and it is 
isomorphic to the Cech cohomology theory if 
their coefficient groups are isomorphic [10]. 

The Cech (co)homology constitutes a (co)- 
homology theory on the category of CW 
pairs, and it is isomorphic to the singular 
(co)homology theory on the same category if 
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the coefficient groups are isomorphic. (Co)- 
homology theories on the category of finite 
CW pairs are determined, up to isomorphisms, 
by their coefficient groups. This fact is called 
the uniqueness theorem of homology theory 

on the category of finite CW pairs. Coho- 
mology theories on the category of pairs of 
compact Hausdorff spaces which satisfy the 
following continuity axiom are determined, up 
to isomorphisms, by their coefficient groups: If 
{(X,, A,)} is an inverse system of pairs of com- 
pact Hausdorff spaces, then H “(lim X,,; lim A,) 
~lim H"(X,, A;). The Cech cohomology the- 
ory satisfies this axiom. 

During recent years, many (co)homology 
theories have been developed which satisfy the 
first six Eilenberg-Steenrod axioms but fail to 
satisfy the dimension axiom. These are called 
generalized (co)homology theories, and include 
various *K-theories, tbordism theories, and 
‘stable homotopy theories (— 202 Homotopy 
Theory). 


R. Homology with Coefficients in Local 
Systems 


N. E. Steenrod (Ann. Math., (2) 44 (1943)) in- 
troduced the (co)homology group with coef- 
fictents in a local system of Abelian groups, 
which is useful in tobstruction theory and in 
the homology theory of tfiber spaces. 

A local system © of Abelian groups on a 
topological space X is a set of Abelian groups 
G,, one for each xe X, together with an iso- 
morphism /*: Groy >G) for each fpath /:(0, 1] 
>X subject to the following conditions: (1) 

If two paths l and I are homotopic with 
endpoints fixed, then /* = /’*. (2) If l and m 
are paths such that /(1)=m/(0), then (l - m)}* = 
m* ol*, where l: m denotes the tproduct of | 
and m. An example is provided by the thomo- 
topy groups 2,,(X,x) forn>2. Let M be an 
n-dimensional topological manifold. Then 
x—>H,(M, M —x) is a local system of infinite 
cyclic groups. It is called the orientation sheaf 
of M.‘A local system G is said to be trivial if 
I* =l'* for any paths l, I’ with the same initial 
and final points. 

Given a local system © of Abelian groups 
on a topological space X, a chain complex 
S(X; G)={S,(X; G), é,} is defined as follows: 
If q <0, then S,(X; ©)=0, and if q 20, then 
S,(X; G) is the Abelian group of formal finite 
sums È g,0, where o:A?->X are singular q- 
simplexes in X and g,€G,.1.9,....o); the bound- 
ary operator 0,:S,(X; ©)—>S,_,(X; G) is given 
by alge 0) = IF (gq)o 08 + Li-y(—1)'g,0 08, 
where c og; is the ith face of g, and 1: [0,1] 
>X is given by /,(t)=o(1 —t,t,0,...,0). The 
homology group of the chain complex S(X; ©) 
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is denoted by H(X; ©) and is called the sin- 
gular homology group of X with coefficients in 
©. 

Similarly, a cochain complex S*(X; 6)= 
{S7(X; ©), 6%} is defined as follows: If q<0, 
then S7(X; ©)=0, and if g>0, then S$4(X; ©) is 
the Abelian group of functions u assigning to 
every singular qg-simplex o in X an element 
u(a)€ Go 0,...,0)) the coboundary operator 
64:$4(X; 6)>S1+' (X; ©) for g>0 is given by 
(54u) (a) =1* l ulo oeo) + D4} (—LDiu(o oe), 
where ø is a singular (q+ 1)-simplex in X. The 
cohomology group of the cochain complex 
S*(X; ©) is denoted by H*(X; ©) and is called 
the singular cohomology group of X with coef- 
ficients in ©. 

If © is trivial, then the (co)homology group 
with coefficients in © coincides with the (co)- 
homology group with coefficients in G2 G,- 
The various notions and theorems on the 
ordinary (co)homology can be extended to 
(co)homology with coefficients in ©. The Cech 
(co)homology group with coefficients in © is 
also defined [10]. The cohomology groups 
with coefficients in © are generalized to the 
cohomology groups with coefficients in a sheaf 
[10, 14] (— 383 Sheaves). 
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A. General Remarks 


Given a topological space X, we utilize the 
concept of homotopy to define the ‘funda- 
mental group, homotopy groups, and co- 
homotopy groups of X. These groups, to- 
gether with (co)homology groups, are useful 
tools in topology. 

Since the research of H. Hopf, W. Hurewicz, 
and H. Freudenthal in the 1930s, homotopy 
theory has made rapid progress and now plays 
an important role in topology. 


B. Homotopy 


Ifa family f,:X > Y (tel={t|O<t<1}) of 
tcontinuous mappings from a ttopological 
space X into a topological space Y is also 
continuous with respect to t, that is, if the 
mapping F from the product space X x I into 
Y defined by F(x, t)= f(x) (xe X, tel) is con- 
tinuous, then { f,} or F is called a homotopy. In 
this case, fọ and f are said to be homotopic. 
This relation between fọ and f; is indicated by 
fouf,i: XY, or simply fa ~ fı, and is called 
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the relation of homotopy. Denote by Y* the 
set of all continuous mappings from X into 

Y. The homotopy relation is an fequivalence 
relation on Y*, and the equivalence class [ f ] 
of a mapping f: X > Y is called the homotopy 
class (or mapping class) of f. The set of all 
homotopy classes of mappings of X into Y is 
called the homotopy set and is denoted by 
n(X; Y) or [X, Y]. A function y of continuous 
mappings fe Y* is called a homotopy invariant 
if f~g implies y(f)=y(g). When X consists of 
a point x we write z(*; Y)=7,(Y). If all con- 
tinuous mappings in Y* are homotopic to each 
other, we write 7(X; Y)=0; z)(Y)=0 means 
that Y is tarcwise connected. A mapping f 
from a compact space into an n-dimensional 
sphere S” is called essential if any mapping g 
homotopic to f satisfies g(X)= S”. A mapping 
is inessential if and only if it is homotopic to 
the constant mapping. 

These concepts are generalized as follows: 
Let A; and B; (i=1,2,...) be subspaces of X 
and Y, respectively, and denote by ¥Y*(A,, A), 
...;B,, B,,...) the set of continuous mappings 
feY* satisfying f(A) < B;. Ifa homotopy { f} 
is such that f,¢ Y*(A,; B}, then { f} is called a 
restricted homotopy with respect to A;, B; ora 
homotopy from a system of spaces (X, A4, 
A,,...) into a system of spaces (Y, B,, B,,...). 
The notation fo ~fi :(X, A,, Az,...)°(Y, B,, 
B,,...) and the homotopy set z(X, A,, A),...; 
Y, B,, B,,...) are defined accordingly. 

For the composite go fe Z*(A;; Cì) of 
feY*(A; B) and geZ"(B,;C), f= f' and 
g=g' imply go f ~ g'o f’. Thus the composite 
Boa=[gof ]en(X, A; Z, C) of [f]=ae 
r(X, Ai; Y, B;) and [g]=ferx(Y, B; Z, C) is 
defined. By putting g,[ f]=[go f]=f/*[g] 
we induce two mappings, 


Jx: T(X, An Y, B)>2(X, Aj; Z, Ci), 
Jr : n(Y, B;; Z, C)>n(X, A; Z, C;). 


Then f~ f’ implies f* = f’* and g~g’ implies 
Ix = Gx: Also (GO f),=940f,; (Gof)*=f*o 
g*, and h*og,=g, oh*, where he X” (D;; A)). 
The category of pointed topological spaces 
is defined to be the tcategory in which each 
object X, which is a topological space, has a 
point fixed as a base point and each mapping 
X >Y carries the base point of X to. the base 
point of Y. In this category, we define a homo- 
topy set, denoted by 2(X; Y)o or [X; Y]o, as 
follows: Denoting the base points by *, we 
have n(X, A; *; Y, B;,*)=2(X, A; Y, Bo. A 
continuous mapping f homotopic to the 
constant mapping X > +e Y is said to be 
homotopic to zero (or null-homotopic). This 
is indicated by f~0, and 2(X; Y),=0 means 
that all continuous mappings are homotopic 
to zero. Let S° be a set of two points; then 
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n(X; S°),=0 means that X is connected. In 
contrast to these specific homotopies, the 
usual homotopy is sometimes called a free 
homotopy. 

Suppose that a homotopy { f;} ({,:X > Y) is 
such that the restriction of f, to a subspace A 
of X is stationary, that is, f,(a)= f,(a) (ae A, 
tel). Then fo, fı are said to be homotopic 
relative to A, indicated by fọ ~ fı (rel. A). Ifa 
homotopy { f;} (f: X > Y) is such that each 
f, is a homeomorphism into Y, then { f;} is 
called an isotopy and fo is called isotopic to f, 
(— 235 Knot Theory). 

Research done by L. E. Brouwer, H. Hopf, 
W. Hurewicz, K. Borsuk, L. S. Pontryagin, 
and S. Eilenberg has contributed to the theory 
of homotopy, an important field of topology 
still in the process of development. 


C. Mapping Spaces 


We endow the set Y* of all continuous map- 
pings f: XY, with the tcompact-open topol- 
ogy. The topological space Y* is called a 
mapping space. In particular we denote Y’ (0,1; 
*,*)(*e Y) by Q(Y)=Q(Y, x) and call it the 
space of closed paths (or loop space) of Y. Two 
points f, g of Y* are connected by a tpath 

in Y* if and only if f~g:X—Y. Thus z,(¥*) 
=n(X; Y) and m9(Y*(A;; B))=2(X, A;; Y, B). 


D. Retracts 


Let A be a subspace of a topological space X. 
If there exists an fe A* such that the restric- 
tion f |A is the identity mapping of A, then A 
is called a retract of X, and f a retraction. If A 
is a retract of X, any continuous mapping of A 
into any topological space can be extended to 
a continuous mapping of X. If A is a retract 
of some neighborhood U(A), A is called a 
neighborhood retract or NR of X. If for any 
‘homeomorphism of a metric space A onto a 
closed subspace A, of any metric space X, Ao 
is a retract (neighborhood retract) of X, then A 
is called an absolute retract or AR (absolute 
neighborhood retract or ANR). For example, 
an n-dimensional simplex or an n-dimensional 
Euclidean space is an AR. If a retraction f is 
homotopic to the identity mapping of X (resp. 
U(A)), we call A a deformation retract (neigh- 
borhood deformation retract) of X. Moreover, 
if f~ 1, (rel. A), then A is called a strong de- 
formation retract. In particular, if a point xq is 
a (strong) deformation retract of X, we say 
that X is contractible to the point x,. For 
example, any tpolyhedron P and any com- 
pact n-dimensional ttopological manifold are 
ANRs; any polyhedron P, contained in Pina 
strong deformation retract of some neighbor- 
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hood in P. X is called locally contractible if 
each point x of X has a contractible neighbor- 
hood U of x. 


E. The Extension Property 


Let X, Y be topological spaces, Ac X, fo, 

fice Y*, and {g,:A-Y} a homotopy such that 
gi=f,| A (i=0, 1). We can extend {g,} to a 
homotopy { f;} of X if and only if the mapping 
F:(X x Q)U(A x DU(X x 1) Y defined by 
F(x, i) = f(x), F(a, t)=g,(a) can be extended to 
a continuous mapping sending X x / into Y. 
Therefore the problem of whether fọ ~ fı can 
be reduced to the problem of whether a con- 
tinuous mapping defined on a subspace can be 
extended to the whole space. If for any homo- 
topy {g,: AY} and any continuous mapping 
fo: X —>Y into any topological space Y satisfy- 
ing fol A =go there exists a homotopy { f;:X > 
Y} satisfying f,| A=g,, then we say that (X, A) 
has the homotopy extension property. This 
occurs if and only if (X x 0)U(A x J) is a re- 
tract of X x I. A pair (X, A) of ANRs, where 

A is closed in X, and a pair (P, P,) with Pa 
*CW complex and P, a subcomplex of P have 
this property. Given a continuous mapping 

h: B>A of a subspace B of a topological space 
Y into a topological space A, we identify be B 
with h(b)e A in the tdirect sum AU Y and 
obtain the tidentification space denoted by 
AU, Y, which is called an attaching space 
under h. If (Y, B) has the homotopy extension 
property, then (Y x X, B x X) and (AU, Y, A) 
also have the same property. When A consists 
of a point *, we write x U, Y= Y/B and call 

the space Y/B a space smashing (shrinking or 
pinching) B to a point. If Y= Bx I, B=B x0, 
then we call AU,(B x I) a mapping cylinder of 
h, (A U,(B x I))(B x 1) a mapping cone of h, 
and the mapping cylinder and mapping cone 
of h: B— x the cone over B and suspension of B, 
respectively. 


F. Homotopy Type 


For systems (X, A;), (Y, B;) of topological 
spaces, if there exist fe Y*(A;; B), ge X*(B;; A)) 
such that go f and fog are homotopic to the 
identity mappings of (X, A;) and (Y, B;), respec- 
tively, then we say that (X, A,) and (Y, B) have 
the same homotopy type or are homotopy 
equivalent. Such mappings f and g are called 
homotopy equivalences. For a homotopy 
equivalence f, the induced mappings f, and 
J* are bijective. Therefore, in homotopy 
theory, systems of spaces having the same 
homotopy type are considered equivalent. 

If A is a deformation retract of X, then A 

and X have the same homotopy type, and the 
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injection of A into X and the retraction of X 
onto A are homotopy equivalences. A con- 
tractible space has the same homotopy type 
as a point. Spaces having the same homo- 
topy type have isomorphic homotopy groups 
and *(co)homology groups. Since the mapping 
cylinder Z;= YU,(X x I) of fe Y* contains Y 
ag its deformation retract, it has the same 
homotopy type as Y. By this homotopy equiva- 
lence, f can be replaced by the injection of X x 
1 into Z,. If to each topological space there 
corresponds a value (which may be some ele- 
ment of R or some algebraic structure) and 
the values are the same for homotopy equiva- 
lent spaces, then the value is called a homotopy 
type invariant. A homotopy type invariant is a 
topological invariant; for example, 2(X; Y) is 
a homotopy type invariant of X. If a continu- 
ous mapping f: X — Y induces isomorphisms 
of the homotopy groups of each tarcwise con- 
nected component, then f is called a weak 
homotopy equivalence. Conversely, if X and Y 
are CW complexes, then a weak homotopy 
equivalence is a homotopy equivalence (J. H. 
C. Whitehead). 

Now we consider the category of pointed 
topological spaces. Let A and B be pointed 
topological spaces. Then the ‘direct sum in this 
category is the one-point union (or bouquet) 

Av B obtained from the disjoint union AUB 
by identifying two base points x4 and *p. 
Av B is identified with the subspace (A x 
*p)U (*4 x B) in A x B. The reduced join (or 
smash product) of A, B is the space obtained 
from A x B by smashing its subspace A v B 
to a point and is denoted by AA B. We call 
A ^ S’ the (reduced) suspension of A and de- 
note it by SA. Repeating the suspension n 
times, we have the n-fold reduced suspension of 
A. We call CA=AnI (I=[0, 1]) the reduced 
cone of A (I has the base point 1). For a con- 
tinuous mapping f: X — Y, the space obtained 
by identifying each point (x, 0) of the base of 
CX with f(x)e Y is called the reduced mapping 
cone and is denoted by C; = YU,CX. The 
reduced join (or smash product) of mappings 
f:Y>X and f’: Y’>X’ is the mapping f^ f’: 
Ya Y'>X ^ X’ induced from the product 
mapping f x f’: Yx Y’+X x X’. The reduced 
join of f: YX and 1:S'-=S! (identity map- 
ping) is written as Sf= fa 1 and is called the 
suspension of f. 


G. Puppe Exact Sequences 


For f:X—Y and g: YZ, we have gof ~0 if 
and only if g can be extended to a continuous 
mapping from C; into Z. In other words, the 
sequence 


(Cri Zo mY: Z)p>-m(X3Z)q 


776 


is exact (i.e., Imi* = Ker f* = f *71{0), where 

i: YC, is the canonical inclusion and 0 is the 
class of the constant mapping). The inclusion 
i: Y—>C, gives rise to the reduced mapping 
cone C;. We also have the canonical inclusion 
i':C,->C,. Adding the term n(C;:Z o> to the 
left-hand side of the sequence above, we have a 
new exact sequence. Continuing this process, 
we obtain an exact sequence of infinite length. 
If X, Y satisfy a suitable condition (e.g., X, Y 
are CW complexes), then C; has the same 
homotopy type as the reduced suspension 

SX of X; i’* is equivalent to p*:2(SX;Z))—> 
m(Cy; Z)o induced by a mapping p:C;>SX 
smashing Y to a point; and furthermore C, has 
the same homotopy type as SY, and the inclu- 
sion ig: SX >C, is equivalent to the suspension 
Sf:SX —SY of f. Thus the following Puppe 
exact sequence is obtained: 


So m(SCy Z) Š a(SY; Z) > m(SX3 Z)o 
B (Cy; Zon ¥; Z) (Xs Z)o. 


In this exact sequence, if Y is a CW complex, 
X is a subcomplex of Y, and f is the inclusion 
i: X +Y, then C,=C,= YU Cy is homotopy 
equivalent to the space C,/CX = Y/X obtained 
by smashing CX to a point, and an exact 
sequence of the following type is obtained: 


3 m(SX; Zj 30 ¥/X; Z >n(Y; Z)o 
Sax: Z)o- 


A sequence equivalent to the sequence 
xSyvSC 7 is called a cofibering, for which a 
similar exact sequence is obtained. For a con- 
tinuous mapping f: X -> Y, consider the sub- 
space E,={(x, g)| f(x) = (0)} of the product 
space X x Y'. By identifying X with {(x, o)l 
o,(1) = f(x)}, we can regard X as a deforma- 
tion retract of E,. By putting p,(x, )= (1), 
we obtain a ‘fiber space (E,, p1, Y). The fiber 
T; =p; '(*) is called a mapping track of f. 
Using the ‘covering homotopy property, we 
see that the sequence 


n(W; T) > a(W; X) nW; Y)o 


is exact, where p(x, ¢)= x. This sequence is 
also extended infinitely to the left as 

-> 1(W3 OX) 4 n( Ws OYN SW; Tr), 
where i is the inclusion of the loop space QY 
into T, and Qf: QX >QY is the correspon- 
dence of the loops induced from f. 


H. Homotopy Sets that Form Groups 


If X =SX’' or Y=QY' (or, generally, if Y isa 
‘homotopy associative tH-space having a 
thomotopy inverse), then z(X; Y) forms a 
group. In the general case the product of the 
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loops induces the product of 1(X;QY’),. We 
represent a point of SX by (x,t) (xe X, tel) 
and define the mapping 9g: X ~QY for each 
g:SX >Y by Qog(x)(t)= g(x, t). Hence an 
isomorphism Q,):2(SX; Y) =2(X;QY), is 
obtained. Each of the following pairs of homo- 
morphisms is equivalent: f, :2(SX; Y)ọo > 
n(SX; Y )o and Of, :1(X; QY) 9 > a(X;QY"); 
and h*:2(X';QY),>2(X;QY), and 

Sh* :2(SX'; Y)o—>2(SX; Y)o. 

If S” is an n-dimensional sphere, then ,(X) 
=17(S"; X)p is the n-dimensional homotopy 
group (— Section J). Let 2"(X)=2(X; S”). If 
X is a CW complex of dimension less than 2n 
— 1, 2"(X) is the cohomotopy group isomor- 
phic to 2(X;QS"*"), (— Section I). Let K, be 
an tEilenberg-MacLane space of type (H, n). 
Then we have K =QK,,,,, and if (X, A) isa 
pair of CW complexes, then z(X/A; K„)o coin- 
cides with the cohomology group H"(X, A; II). 
For the ‘classifying space Bo(By) of the infinite 
orthogonal group O (infinite unitary group U) 
(— Section V), 2(X/A; Bo) (1(X/A; By)) may be 
considered the KO-group KO(X, A) (K-group 
K(X, A)) (— 237 K-Theory). 


I. Cohomotopy Groups 


K. Borsuk defined a sum of mapping classes of 
X into S” (1936), which was named Borsuk’s 
cohomotopy group by E. Spanier. Spanier also 
studied the duality of the cohomotopy group 
with the homotopy group and its relations to 
the usual cohomology groups. A cohomotopy 
group of (X, A) is defined to be 2"(X, A)= 
n(X, A; S", *), which forms a group if dim X/A 
<2n—1. A mapping F: X/A-S" x S" given 
by F(x) =(/(x), g(x)) with f, g: X/A>S" is 
homotopic to a mapping into S” v S". If we 
compose F with a folding mapping of S” v 
S” onto S”, we obtain a mapping that repre- 
sents the sum [ f]+[g]. With each homo- 
topy class of a continuous mapping f of an 
n-dimensional tpolyhedron K” into an n- 
dimensional sphere S”, we associate the image 
f *(u) of the fundamental class ue H"(S"; Z) 
under the induced homomorphism f*: 
H"(S"; Z)> H"(K"; Z). We then obtain a bijec- 
tive relation 2"(K")>H"(K"; Z), called Hopf’s 
classification theorem. 


J. Homotopy Groups 


Let X be a topological space with a base point 
$2 ={t= (tist, 58,051. 1350x250, 1) be 
the unit n-cube, and /” its boundary. Write 
Q"(X, *) =X? (/", x) (in particular, Q1(X, +) is 
the loop space), and denote by z,(X, *) or 
simply z,,(X) the set of arcwise connected 
components of 0”"(X, «), i.e., the homotopy 


202 K 
Homotopy Theory 


classes [f]. Using the notation of homotopy 
sets, we have 2,(X, *)=2(I", I"; X,+). If we 
choose the constant mapping as the base 
point » of O"(X, +), then Q"(O"(X, *), *) = 
Q(X, x). Thus 2,,(Q"(X, *), #)=Rmin(X, *). 
Since z, is the tfundamental group, 2,,(X, *)= 
m,(Q""'(X), *) is also a group, called the n- 
dimensional homotopy group of X with base 
point «. “Multiplication” in homotopy groups 
is defined as follows: Given f,, f2EQ"(X, *) 
we define fi + f,€2"(X, *) by 

filtistzs -3 tn) 0 
TESA) heen \ 
(Fig. 1). Then the product or sum of [f, ] and 
[f2] is given by [f, +f,]. The identity is the 
class of the constant mapping (denoted by 9), 
and the inverse of [ f] is [f], represented by 
f(D=fUl—t,, ty, ...,t,). The space Q"(X, «) is 
an tH-space, where multiplication is given by 
the correspondence (f1, f2) > f, +f. Since the 
fundamental group of an H-space is commuta- 
tive, z,(X,*) is an Abelian group for n>2. 


Q 1/2 1 


Fig. 1 


Fig. 2 


Let S*={t=(¢,,.:-5t,4;)| È t7 =1} be the n- 
sphere, and take » =(1,0,...,0) as its base 
point. Suppose that we are given a continuous 
mapping y,:(1", /")+(S", *) such that yn: I" — 
["~+§" — x is homeomorphic. Then the corre- 
spondence w*:7(S"; X)) >7,(X, *) determined 
by W*[g]=[g oy, ] is bijective. Thus we can 
identify the homotopy group z,(X, *) with 
a(S"; X)o. 


K. Relative Homotopy Groups 


Suppose that we are given a topological space 
X and a subspace A of X sharing the same 
base point +. Identify I"~' with the face t, = 
0 of I", and let J"! be the closure of J”— 
I~! (Fig. 2). Denote by z,,(X, A, *) the set of 
homotopy classes of continuous mappings 
fi", I", J") (X, A, *). Let Q"(X, A, *) be 
the mapping space consisting of such map- 
pings f, and let 2,(X, A, #)=79(Q"(X, A, *)). 
Since Q"(Q"(X, A, *), *) is homeomorphic to 
Q(X, A, *), we have 7,,(Q"(X, A, *), #) = 
Rmin(X,A,*). Thus 2,(X, A, *) is a group for 
n>2 and an Abelian group for n> 3. This 
group is called the n-dimensional relative 


202 L 
Homotopy Theory 


homotopy group of (X, A) with respect to the 
base point *, or simply the n-dimensional 
homotopy group of (X, A). In the same man- 
ner as in Section J multiplication in this group 
can be defined using f, + f2. Since Q"(X, *, *) 
and Q"(X, *) are identical, we have 2,,(X, *, *) 
=7,(X,*). Hence homotopy groups are spe- 
cial cases of relative homotopy groups. 

Let g:(X, A, *)>(Y, B, *) be a continu- 
ous mapping. Then a correspondence g4: 
1,(X,A,*)—>7,(Y, B, +) is obtained by g,[ f]= 
[gof], with gẹ a homomorphism of homo- 
topy groups for n>2 and forn=1, A=*. We 
call g, the homomorphism induced by g. Let 
E"={t=(t,,...,t,)| Dt? =1} be the unit n-cell 
with boundary S"~'. Utilizing a suitable rela- 
tive homeomorphism y,,:(I", J""!)(E", *), 
wii") =S8""!, we obtain a one-to-one corre- 
spondence Wi*:1(E", S""!; X, Ajo >n, (X, A, *), 
and 0"(X, A, *) is homeomorphic (via y4) to 
the mapping space X*"(S""!, *; A, *). 


L. Homotopy Exact Sequences 


Given an element «=[ f]ez,(X,A,*), and 
letting da=[f|I" 'Jez,_,(A, *), we obtain 

a homomorphism (n> 2) é:12,(X, A, *)> 
T,-,(A, *), which is called the boundary 
homomorphism. Furthermore, we have the 
following exact sequence involving homomor- 
phisms i, ją induced by two inclusions i:(A, *) 
>(X,*), J(X, *, *) (KX, A, *): 


5, (A, #) 1, X, +) Sn (X, A, *) 
2 n, (X,+) 51, (X, A, +) n(A) S m(X). 


This sequence is called the homotopy exact 
sequence of the pair (X, A). A system of topo- 
logical spaces X > A > Bə» is called a triple. 
In this homotopy exact sequence, if we replace 
(A, *), (X, *) by (A, B, *), (X, B, *), respectively, 
we obtain an exact sequence, called the homo- 
topy exact sequence of the triple (X, A, B}. 

The homotopy group z,(A x B) of the prod- 
uct space is isomorphic to the direct sum 
7,(A)+7,,(B), and the projections p(p’):A x B 
— A(B) of the product space induce the pro- 
jections from z,{A x B) onto the direct sum- 
mands 7, (A), z,,(B). This is a special case of the 
tHurewicz-Steenrod isomorphism theorem in 
fiber spaces (— 148 Fiber Spaces). Setting 
Av B=(A x *)U(* x B), we obtain a direct 
sum decomposition z,(A v B)=z,(A)+7,(B)+ 
Tn+1(A x B, A v B). Next we consider a fixed 
pair (X, A) and move the base point « to inves- 
tigate its effect on the elements of the homo- 
topy group. Suppose that we are given a path 
h:I1—A with terminal point x =h(1) and an 
element wex,(X, A,*) (@=(f 1, f:0", 2,0" 4) 
-(X,A,*)). By the homotopy extension prop- 
erty, we can construct a homotopy fg:(1”, I") 
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—(X, A) satisfying fa(J"71)=h(0) and f, =f. 
Then the homotopy class [ fo] of fo with re- 
spect to the base point *’=h(0) is determined 
only by « and the homotopy class w of the 
path h. We denote the homotopy class [ fọ] by 
a°Een,(X, A, *’). The correspondence «—>g® is 
a group isomorphism, and (a@°)° =«°®. Thus 
if A is arcwise connected, z,,(X, A, *) is isomor- 
phic to z,(X, A, *’). Hence, in this case, we may 
simply write z,(X, A) instead of z,(X, A, *). 
When * =x’, the correspondence «->«® deter- 
mines the action of the group z,(A,*) on 
m,(X, A, *). Given an element «e7,(X,*) anda 
class œw of paths in X, we define «°e7,(X, x’) 
as for relative homotopy. Specifically, if we 
m,(X, *), then «° —a coincides with the White- 
head product [@, «] (when n=1, we have «®: 

a '=[w,«4]=waw 'a~')(— Section P). 

A pair (X, A) consisting of a topological 
space X and an arcwise connected subspace A 
of X is said to be n-simple if the operation of 
T (A) on 2,(X, A) is trivial. Similarly, an arc- 
wise connected space X is called n-simple if the 
operation of n; (X) on z,(X) is trivial. For 
example, a pair (X, A) consisting of an H-space 
X and an H-subspace A is simple, i.e., n-simple 
for each n. If a topological space X satisfies 
m(X)=0 (0<i<n), then X is said to be n- 
connected. 0-connectedness coincides with 
arcwise connectedness and 1-connectedness 
means tsimple connectedness. S$” is (n— 1)- 
connected. A pair (X, A) is said to be n- 
connected if 7)(A)=19(X)=7,(X, A)=0 (1 < 
i<n), and (E", S"7!) is (n— 1)-connected. 


M. Homotopy Groups of Triads 


Let (X; A, B, *) be a system, called a triad, 

of a topological space X and its subspaces 

A, B satisfying AM Bə% (base point). Let 
T,(X; A, B,*)=7,_,(Q'(X, B), Q'(A, ANB), *) 
(n> 2); 2,(X; A, B, *) is a group for n>3 and an 
Abelian group for n>4. We call 2,,(X; A, B, *) 
the homotopy group of the triad. From the 
homotopy exact sequence of the pair, we ob- 
tain the following homotopy exact sequence of 
the triad: 


nA, ANB, *)-57,(X, B, *) 
B n (X; A, B,*)Šn; (4, ANB, *)-5.. 


Assume for simplicity that AN B is simply 

connected, X = Int A U Int B (Int A is the tin- 

terior of A), (A, AN B) is m-connected, and 

(B, AN B) is n-connected. Then (X; A, B) is 

(m+ n)-connected, i.e., 7;(X; A, B, *)=0 (2 < 

j <m +n) (Blakers-Massey theorem). 
Furthermore, in this case we have a replica 

of the texcision isomorphism in homology 

theory for j< m+n; that is, we have the iso- 

morphism i, :72,(A, AN B, *)=7,(X, B, *) in- 
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duced by the inclusion i:(A, AN B)>(X, B). On 
the other hand, 7,,4,,+,(X; A, B, *) is isomor- 
phic to z,,4,(A, ANB, *) ® 2,,4,(B, ANB, *). 
This shows that the excision isomorphism 
does not always hold for homotopy groups, an 
important difference from homology theory. 
However, if we replace the excision axiom by 
the Hurewicz-Steenrod isomorphism theorem, 
which is valid for fiber spaces (— 148 Fiber 
Spaces), then we can construct homotopy 
theory axiomatically in the same manner as 
homology theory (— 201 Homology Theory). 


N. The Hurewicz Isomorphism Theorem 


The Hurewicz homomorphism t of z,,(X, A) 
into the n-dimensional integral homology 
group H,(X, A) is defined by t([f])=/,.(é,) 
(where ¢, is a generator of H,(J", I”)). Then we 
have the Hurewicz isomorphism theorem: Sup- 
pose that the pair (X, A) is n-simple (e.g., A 

= * ) and (n — 1)-connected. Then we have 
H,(X, A)=0 (i<n) and the isomorphism t: 
1,(X, A)=H,(X, A) (for n=1 — 170 Funda- 
mental Groups). Let X, Y be simply connected 
topological spaces, and let f: X > Y be a con- 
tinuous mapping. Then the following two con- 
ditions are equivalent: (1) f,:2;(X)>72,(Y) is 
injective for i<n and surjective for i<n. (2) 
fa: H({X)— H,(Y) is injective for i<n and sur- 
jective for i<n(J. H. C. Whitehead’s theorem). 

J.-P. Serre generalized these theorems as 

follows: A family @ of Abelian groups satisfy- 
ing condition (i) is called a class of Abelian 
groups: (i) If a sequence F>G—H of Abelian 
groups is exact and F, He@, then Ge@. Fur- 
thermore, we consider the following condi- 
tions: (ii) The tensor product G © F of an arbi- 
trary Abelian group F with an element Geg 
also belongs to @. (u') If both F, Ge@, then 

F QG, Tor(F, G)e@. (iii) If Geg, then its 
thomology group H,(G)e@ (i> 0). Condition 
(ii’) is implied by (ii). A homomorphism f: 

F Gis called -injective if Ker feg, @- 
surjective if Coker f= G/Im fe, and a g- 
isomorphism if f is @-injective and g- 
surjective. Two Abelian groups G and G’ 

are called -isomorphic if there exist @- 
isomorphisms f:F—>G and f’: F->G’. In 
particular, if the class G consists of only the 
trivial group 0, then concepts such as @- 
isomorphism coincide with the usual concepts 
of isomorphism, and so on. Let @, be the class 
of finite Abelian groups whose orders are 
relatively prime to a fixed prime number p. 
Here, instead of the terms @,-isomorphism and 
so on, we use the terms mod p isomorphism 
and so on. Let 2 be a class of finitely gen- 
erated Abelian groups. Then @, satisfies con- 
ditions (ii) and (iii), and 2 satisfies (it) and (ili). 
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We have the following generalized Hurewicz 
theorem: (A) Suppose that a class @ satisfies 
(ii) and (iii) and we are given a 2-connected 
pair (X, A) of simply connected spaces X, A. 
If n;(X, A)E€ (i<n), then H(X, A)e@, and 
TiN, X, A)>H,(X, A) is a €-isomorphism. (B) 
Suppose that A = «, @ satisfies conditions (ii’) 
and (iii), and X is simply connected. Then an 
assertion similar to (A) holds. In particular, a 
simply connected space X having finitely 
generated homology groups (e.g., a simply 
connected finite polyhedron) has finitely gen- 
erated homotopy groups. As a corollary to 
theorem (A), we obtain a generalized White- 
head theorem. In particular, applying the 
theorem to the class 9N@,, we obtain the 
following frequently used theorem: Suppose 
that we are given simply connected spaces X, 
Y whose homology groups are finitely gen- 
erated and f:X—Y satisfies f,x,(X)=7,(Y). 
Then the following two conditions are equiva- 
lent: (1) f,:2%(X)>7,(Y) is a mod p isomor- 
phism for i<n and a mod p surjection for i=n. 
(2) fx: H(X, Z,)>H,(Y, Z,) is an isomorphism 
for i<n and a surjection for i=n (where Z, = 
Z/pZ). The theory above, which makes use 
of the notion of class @, is an example of 
Serre’s @-theory. Concepts such as tspectral 
sequences for fiber spaces and tn-connective 
fiber spaces are important tools in Serre’s @- 
theory (— 148 Fiber Spaces). 

To calculate homotopy groups, we use 
notions such as exact sequences, fiber spaces, 
(co)homology groups of n-connective fiber 
spaces, and tPostnikov systems. Given an 
arbitrary group (more generally, a Postnikov 
system), there exists a CW complex having the 
given group (system) as its homotopy group 
(Postnikov system) (realization theorem of 
homotopy groups). For an arbitrary arcwise 
connected topological space X there exist 
topological spaces (X,n) and continuous map- 
pings p,:(X,n+ 1)>(X,n) (n= 1, 2,...) satisfy- 
ing the following two conditions: (i) ((X,n-+ 1), 
Pa (X,n)) is a fiber space whose fiber is an 
tEilenberg-MacLane space. (ii) (X, 1)= X, and 
((X,n+1), p,o...op,, X) is an n-connective 
fiber space. The method of obtaining the 
homotopy group z,(X)= H,((X,n)) by com- 
puting (co)homology groups of (X,n) is called 
a killing method. 


O. Homotopy Operations 


Let X, Y, X’, Y’ be topological spaces. If to 
each continuous mapping fe Y~* there corre- 
sponds a homotopy class ®( f)e2(X’; Y’) that 
is a homotopy invariant of f (satisfying a 
certain naturality condition), then ® is called 
a homotopy operation. More generally, we 
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may consider the case where ® is a mapping 
from 2(X,; ¥,)x ... x 2(X,; Y,) into 2(X’; Y’). 
The naturality of ® is defined as follows: Con- 
sider the tcategory @ of topological spaces (or 
its subcategory). Let Y= Y’ be an arbitrary 
tobject of €, and fix X and X’. In this case, 
the naturality of ®,:2(X; Y)>2(X’; Y) is de- 
fined to be the commutativity of the diagram: 


n(X; Y)—>0(X": Y) 
14 a 1 
n(X;Z)—>n(X’; Z) 


i.e., gą ° Dy =, 0g, for an arbitrary tmor- 
phism (i.e., continuous mapping) g: YZ of 
the category @. Similarly, when objects Y, Y’ 
of the category @ are fixed and X = X’ is an 
arbitrary object of @, to say that a homotopy 
operation ®,:2(X; Y)>7(X; Y’) is natural 
means that h*o®,=@, oh* for an arbitrary 
morphism h: W> X. 

We have the following theorem: In the 
category of topological spaces and continu- 
ous mappings, the homotopy operations ®y: 
n(X; Y)—>2(X’; Y) and the elements of x(X’; X) 
are in one-to-one correspondence. The corre- 
spondence is obtained by associating a homo- 
topy operation ®(f)= Bou (Bex(X; Y)) with 
each aen(X’; X). Similarly, the homotopy 
operations ®,:2(X; Y)>n(X; Y’) and the ele- 
ments of z(Y; Y’) are in one-to-one corre- 
spondence. This theorem holds also for the 
case involving several variables if we consider 
m(X'; Xiv X,v...) or m(Y, x Yx ...; Y’) in- 
stead of x(X’; Y) or x(X; Y’). The theorem 
remains valid if we replace the spaces X, Y by 
systems of spaces. 


P. Homotopy Operations in Homotopy Groups 


(1) If X, X’ are spheres S”, SP with base points 
and Y, Y’ are topological spaces with base 
points, a homotopy operation ®,:2,(Y)> 

1p( Y) is said to be of type (n, p). By the theo- 
rem in Section O, the homotopy operations 
of type (n, p) are in one-to-one correspondence 
with the elements of the homotopy group of 
the sphere 7,,(S"). 

(2) As an example of the 2-variable homo- 
topy operations ®y:7,,(Y) x 2,(¥)—72,(Y) of 
type (m, n; p) we have the Whitehead product 
defined as follows: Suppose that «e7z,,(Y), 
fen, (Y) are elements represented by f:(I”, I”) 
(Y,*) and g:(I", 1")>(Y, x), respectively. 
Define a continuous mapping F from the 
boundary /"*"=(1" x I")U(I" x I”) of "+" = 
I™x I” into Y by F(x, y)= f(x) for (x, y)e I" x 
I" and F(x, y)=g(y) for (x, ye l™ x I". Since 
I*" is homeomorphic to S"*"!, we can iden- 
tify them. The homotopy class represented by 
F is an element of z,,,,-,(Y) determined by « 
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and f, denoted by [«, 8] and called the White- 
head product of « and £ (J. H. C. Whitehead, 
Ann. Math., (2) 42 (1941)). The Whitehead 
product is a homotopy operation of type 
(m,n;m+n—1). Let w,,:(1", 1")(S", *) be a 
mapping that smashes /” to a point. The pro- 
duct of Yn and Y, defines a mapping y,, ,: 
smn t_, S™ v Sm =(S™ x *)U(* x S”). Let re 
Tml S” v S”), r E7,(S” v S") be the homotopy 
classes of the natural inclusions of S$”, S” into 
S” v S”; then the homotopy class of Ym, n is 
[r]. G. W. Whitehead showed that a direct 
sum decomposition z,(S" v S")=1,7,(S") + 
¥,.7,(S") + [e JanS) (tys Les [60], are 
injective) holds for 1 < p <m +n + min(m, n)— 
3. Furthermore, P. J. Hilton showed that for 
general p> 1, z (8™” [v S") is the direct sum of 
the images of injections t4, t4, [1,7'],. [E 

r], ily, [Er], rly, etc. The homotopy 
operations of type (m, n; p) are in one-to-one 
correspondence with the elements of z,(S” v S”); 
hence such operations can be constructed by 
means of composition and the Whitehead 
product. The last proposition is also valid for 
homotopy operations of type (m,,...,m,;p). 
The Whitehead product [a, P] (xe7z,,,(X), Be 
7, (X)) is distributive with respect to « (resp. £) 
for m> 1 (n> 1), and we have [ B, «] =(—1)”" 
[x BJ and fo, B= fa fab] for f:X > Y. 
Moreover, for yez,(X) the Jacobi identity 
holds: (—1)™ [[«, 8], y] +(—1)™ CLA, y], 2] + 
(—1)"LLy, x], B] =0 (M. Nakaoka and H. Toda; 
H. Uehara and W. S. Massey; Hilton). 


Q. Suspensions and Generalized Hopf 
Invariants 


We denote by an Ben(X a X’; YA Y’), the class 
of the reduced join of f, g, where f represents 
xen(X; Y) and g represents Be n(X’; Y')o. 
We call œ a f the reduced join of « and £. In 
particular, if Y= Y’=S', ß is the identity 
mapping of S', and « is represented by f, then 
aA f is called the suspension of « and is de- 
noted by Sa. Sx is the class of the suspension 
Sf of f and belongs to 2(SX;SY)), where SX 
indicates the reduced suspension of X. The 
suspension Sa is often denoted by Ea in ref- 
erence to the German term Einhdngung. The 
identity mapping 1 of SY gives rise to an in- 
jection i=Q,1 sending Y into the loop space 
Q(S Y) determined by the formula i(y)(t)=(y, t). 
Then we have 


ig =QQOS:2(X; Y) >n(SX; SY) 
Sx(x; QSY)o, 


and S and i, are equivalent. Let Y, be the 
identifying space Y*/~, where Y* is the prod- 
uct space Y x... x Y of k copies of Y and ~ 
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is the equivalence relation determined by 


(# Yis Y2s ees Yk) ~ (Y1 *; V253 Vana) ~- 


~ (Yis -3 Yk-13*). 


Denote by Y,, = |); Y the limit space with 
respect to the injection Y¥,_,— Y, given by 

(Yis -3 Yk- (Y1; -> Vea» *) and call it the 
reduced product space of Y. Let Y be a CW 
complex of 0-tsection «. The mapping i: Y= 
Y, >QSYcan then be extended to i: Y, > 
QSY, where i is a weak homotopy equiva- 
lence. If X is also.a CW complex, then Q5 +o 
ipi n(X; Yo)o>n(SX; SY) is bijective. By 
smashing the subset Y of Y,, we have YA Y= 
Y,/Y. This smashing mapping can be extended 
toh: Y,,>(Ya Y),, (I. M. James). Utilizing 

hy: n(X; Y,.)o > T(X; (Y^ Y),.)o and the bijec- 
tion Qo! 0 i,, we obtain a correspondence 
H:n(SX;SY),>2(SX;S(YA Y))o. We call H(a) 
the generalized Hopf invariant of «x. When X = 
S22, Y=S""', H is equivalent to the Hopf 
invariant y:z>,-,($")->Z (— Section U). In 
general, we have Ho S=0O, and the exactness 
of 5 4 holds under various conditions. 
Denote also by o the composition of homo- 
topy classes; then we have S(x o f)=Sao Sfp 
and H(aoSf)= HaoSf. Also, H(Sa«o p)= 
S(aAa)o HB. Under the condition i<3n— 3, 
we have (a, +a,)0oB=a,08+4,08+[a,, 
a,JoH(f) for «,, y En, (X) and Bex,(S") (G. 
W. Whitehead). Thus the composition «o f is 
not always left distributive but is always right 
distributive, and ao f is left distributive if B= 
Sf’. The composition is defined over the stable 
homotopy groups G, of spheres (— Section U): 
ao BeG,,, (XE Gp, PEG). It is distributive and 
satisfies Boa=(—1)"4% aofp. 

When Y and Y’ are Eilenberg-MacLane 
spaces, Bo, and By, we have tcohomology 
operations on cohomology groups H"( ; I), 
KO groups, and K groups, respectively. As 
typical examples there are tSteenrod square 
operations Sq':H"(X; Z,)H"*'(X; Z,), *Steen- 
rod pth power operations ': H"(X;Z,)—> 
H"* "D(X: Z), Chern characters ch": K(X) 
— H?"(X; Q) (Q: rational field), ‘Adams opera- 
tions ®;: KO(X)— KO(X) (K(X) K(X)). They 
are all homomorphisms (— 64 Cohomology 
Operations; 237 K-Theory). 


R. Secondary Compositions 


Suppose that xo B=0, Boy=0 for yex(W; 
X)o, BER(X; Y)o, E2(Y; Z)o. In the com- 
mutative diagram of Puppe exact sequences 


ŽS MSW; Yh EnC; Yo 2X5 Yo 
dae es, te E 


s n(SW; Z)o>2(C,; Z) > 0(X; Z) 
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the set of elements $ in n(SW; Z) such that 
p*(B)ea,i* ‘(B) is denoted by {a, 8, y} and 

is called a secondary composition or Toda 
bracket. If 0, y are elements of (SW; Y)o, 
m(SX;Z)o, respectively, then we have {a, B, y} 
+a,0={a, By}, {a, B, y} + Sy*n = {a, By}. 
Hence we may consider the set {a, $, y} to be a 
residue class modulo a submodule generated 
by a, 2(SW; Y)o and Sy*x(SX; Z)o- 

The secondary composition {a, $, y} has the 
following properties: (i) {«, £, y} is linear with 
respect to a, ß, y (if the sum is defined); (ii) 
xo {B,y, 0} = (a, B, y} o(— Sô); (iii) S{«, 8, y} = 
— {Sa, SB, Sy}; (iv) ao {8,7,8} = {ao B, y, ô}, 
{ao B,y,6}={a, Boy, d},...5(v) {{a, By}, 

Sò, Se} + {a, {B, y, ô}, Se} + {a B, {y, 6, eh} = 

0. Suppose that the spaces X, Y, Z, W are 
spheres. Then by (iit) the secondary composi- 
tion fa, B, y} € Gyegrrsi (a O Ggtr+1 HYO Gps gt) 
is defined in the stable homotopy groups G, = 
lim,,.0 Zn+r(S") of spheres. From this we obtain 
(vi) {75 bsa} =(—1)P rt fa, B, y} and (vii) 
(=D {a B, 7} + (1) B, 7,03 +( D9, B} 
=(). 


S. Functional Operations 


Let ® be an operation corresponding to a and 
y be the class of f. We put ®,(2)={a, $, y} and 
call ®, a functional ®-operation. When ® is a 
cohomology operation, ®, is called a func- 
tional cohomology operation. Then ®,() is 
defined for B satisfying f*(B8)= (8) =0, and 
,(f) is determined modulo Im Sf* + Im®. 
For f:S"** 8", k=2i(p—1)—1, we denote by 
H (f) Entra e Het (sr +1, Z) the image 
of a generator £, of H"(S"; Z,) under the 
functional A; operation. Then the Hopf in- 
variant modulo p (or mod p Hopf invariant) 

Hy :Tn44(S")> Z, is obtained (we use Sq”! for p 
= 2). The following statements are equivalent: 
(i) The mod 2 Hopf invariant is not trivial 

(H, #0); (ii) there exists a mapping: S?**! 
S**1 of Hopf invariant 1; (iti) S* is an H-space; 
(iv) the Whitehead product [1,7] of a genera- 
tor z of z,(S*) vanishes. Also, H, #0 if and 
only if k=2, 4, 8 (J. Adams), and for an odd 
prime p, H, £0 if and only if k=2p—3 (A. L. 
Liulevicius; N. Shimada and T. Yamanoshita). 


T. Stable Homotopy Groups and Spectra 


The homotopy set 2(S"X,S"Y), for n-fold 
iterated suspensions S”X = X AS"=SS" 1X 
and S"Y, forms a group (an Abelian group) if 
n> 1 (n>2). The limit 2*(X; Y)=lim 2(S"X; 
S"Y) with respect to the suspension homo- 
morphisms S:2(S"X; S"Y))—2(S"*1X; S"*1Y), 
is called a stable homotopy group of X and 
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Y. For an r-connected space Y and a CW- 
complex X, S:2(X; Y)o>n(SX; SY) is bijec- 
tive if dim X <2r and surjective if dim X < 

2r + 1 (generalized suspension theorem). Thus, 
if X is a finite-dimensional CW-complex, 
m°(X, Y) is isomorphic to 2($"X, S”Y), for suf- 
ficiently large n. To discuss stable homotopy 
groups more generally, the following concept 
of spectra is used. A system E = {E}, ¢,} which 
consists of CW-complexes E, and continuous 
mappings ¢,:SE,—£,,, is called a spectrum. 
When E, = S* and e= 1,4,:SS*>S**!, S= 
{S*, 1,41} is called a sphere spectrum. When 
E, = K(G, k) (tEilenberg-MacLane complex) 
and g, induces a homotopy equivalence K(G, 
k)~QK(G,k +1), HG={K(G,k), €,} is called 
an Eilenberg-MacLane spectrum. As in the 
latter, a spectrum E in which g, induces a 
homotopy equivalence E, ~QF, ,, is called an 
Q-spectrum. By Bott’s periodicity, Q-spectra 
KU={Z~xB,,U,Z x B,,U,...} and KO = 

{Z x B,, U/O, Sp/U, Sp, Z x Bsp U/Sp, SO/U, 
O, Z x Bẹ»... } are obtained (— Section V). 
Also, using ‘Thom complexes, Thom spectra 
MU, MO, etc. are obtained. Given a spectrum 
E, by putting E"(X, A)=lim 2(S*(X/A); Ex41)o 
for each pair (X, A) of CW-complexes, we 
obtain a generalized cohomology theory with 
E-coefficient; and by putting E,(X, A)= 

lim 2,(E,-,\(X/A)), we obtain a generalized 
homology theory with E-coefficient. tGener- 
alized (co)homology theory on (finite) CW- 
complexes can be represented by a suitable 
spectrum (G. W. Whitehead, E. H. Brown, 
Adams). Corresponding to E=S, HG, KU, 
MU, etc., we have stable (co)homotopy 
groups, G-coefficient (co}homology groups, 
K-groups, '(co)bordism groups, etc., respec- 
tively (— 201 Homology Theory). 


U. Homotopy Groups of Spheres 


The spheres S” and their homotopy groups are 
basic objects in homotopy theory. Although 
much research has been done concerning these 
objects, there are still open problems. 

S" is (n— 1)-connected: z,(S")=0 (i<n). The 
fact that 2,(S”")= Z (infinite cyclic group) was 
obtained from the Brouwer mapping theorem. 
Also, 2;(S')=0 (i> 1) follows from the fact that 
the tuniversal covering space of S! is contrac- 
tible. Suppose that we are given a continuous 
mapping f:S?""'—+S". We approximate it by a 
tsimplicial mapping ọ. Then the inverse image 
o`! (x) of a point « in the interior of an n- 
simplex of $” is an (n— 1)-dimensional tpseudo- 
manifold which is orientable by means of 
a suitable generator ee H,_,(@ '(*)). The 
boundary isomorphism 6: H,(S?""1, 7 '(*«))= 
H,_,(g~'(*)) and the homomorphism ¢, : 
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H,(S?""!,.@7'(*)) > H,(S", *) give rise to an in- 
teger y(~) determined by the relation pô +(e) 
=(g)eé, (£, is an orientation of S”). This inte- 
ger ts independent of the choice of p, so we 
can set y(f)=y)(@). Then f ~g implies that 
y(f)=y(g). We call y( f) the Hopf invariant of 
fH. Hopf defined y and showed y:23(S?)=Z 
(1931); y(z2,-,(S")) =0 for odd n; y(2,,-,(S"))> 
2Z for even n; and y(7>,-,(S"))=Z for n=4, 8 
(1935). H. Freudenthal defined a homomor- 
phism E:7,(S")>7;,,(S""'), EL f]=[Sf], and 
proved the Freudenthal theorem: (1) E is an 
isomorphism for i<2n—1; (2) E is a surjection 
for i=2n—1; and (3) the image of E coincides 
with the kernel of y for i= 2n. Furthermore he 
obtained z,,,,(S")& Z, (n> 3) (1937). For n= 
2, 4, 8, a mapping f:S?""'—S" (‘Hopf map- 
ping) such that y(f)= 1 (given by Hopf) is the 
projection of a tfiber bundle S?"~! over the 
base space $", and the correspondence (a, B)> 
Ea+f,B gives an isomorphism z,_,(S""')+ 
n,(S?"~1) (direct sum) = x;(S"). Hence we ob- 
tain z,(S*)=Z,. It was shown by G. W. 
Whitehead and L. S. Pontryagin that z,,, (S") 
(n> 3) is isomorphic to Z, (1949). Whitehead 
also defined a generalized Hopf homomorphism 
H:2,S")>7,(S*" ') for a range of i< 3n— 3, 
and this restriction on the dimension was 
removed by P. J. Hilton and I. M. James. 
Using H, many nontrivial results concerning 
m,(S") have been obtained. Serre obtained the 
following (1951-1953): n;(S”) is finite except 
when i=n or i=4m—1 and n=2m. Further- 
more, 14m—1(S7”) is the direct sum of Z and 

a finite group. Let p be an odd prime and 

n be even. Then 7,(S") is G,-isomorphic to 
T;,(S" ')+2,(S?" 1). Let n be odd. Then 
Tn+4(S")E GE, (k<2p— 3), and 7,42p-3(S") ts 6,- 
isomorphic to Z,. Serre and H. Toda deter- 
mined z,,,,(S") for k= 3, 4, 5, and Serre further 
determined it for k =6, 7, 8. Utilizing the re- 
duced product space of S", James gave the 
sequence 


o MS) 1544 (S) Sna (S2"*7) 
>n- (S>... 


and showed that it is an exact sequence 1f n is 
odd and an exact sequence mod 2 if n is even 
(1953). Using this exact sequence and the 
secondary composition, Toda determined 
T,+4(9") for k< 19 (— Appendix A, Table 6.VI). 
By the Freudenthal theorem (1), the z,,,,(S”) 
(n>k+1) for a fixed k are isomorphic to each 
other. We call z,,,,(S”") (n> k + 1) the stable 
homotopy group of the k-stem of the sphere 
and denote it by G,. For k=0,1,2,..., 15,..., 
G, 2 Z, Zz, Z2, £44,9,0, Z2, Z240; Z2 + Z2, 
Z+ Za + Z3, Le, Z504, 0, Z3, Z2 + Z2, Lago + 
Z3, .... For the computation of G,, the notion 
of n-connective fiber spaces is important. By 
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utilizing the Adams spectral sequence, we can 
show that G, is closely related to the cohomol- 
ogy of the *Steenrod algebra. Let p be an odd 
prime. There exist the following sequences 

of elements of order p: {a,€G, (k =2i(p—1)— 
1)}, {Bre G, (k=2(ip +i— 1)(p—1)—2)} and for 
p> 3{y,€ G, (k= 2(ip? +(i— Ip +i—2)(p—1)- 
3)}. The p-component of G, is determined for 
k <2p*(p—1)—3 by using Steenrod algebra. 
To compute G, for higher k, relations such as 
a, BP =0, 6B? =0 (i> 1) are necessary. In gen- 
eral, each element of G; (k #0) is nilpotent 

(G. Nishida). Let 2,(S": p) be the p-component 
of 2,(S"). To survey this group for the nonstable 
case (i= 2n— 1), we utilize Serre’s mod p direct 
sum decomposition (for n even), and we have 
the following two exact sequences for the case 
of odd n: 


OTS") 4 (S882) 3 ,(Q2(S"*), SS... 
PT (SPP p) STi (SPP :p) 
> (Q?(S"*?), S": p)> na (S tt: pS... 


where E? = Eo E and AE? (a)= pa (— Ap- 
pendix A, Table 6. VI). 


V. Homotopy Groups of Classical Groups 


Consider the classical group U (n, A), which is 
either the orthogonal group O(n) (A =R); the 
unitary group U (n) (A =C); or the symplectic 
group Sp(n) (A =H). The infinite classical 
group U (%, A) is defined to be the inductive 
limit group of {U (n, A)|n=1,2,...} with re- 
spect to the natural injection U (n, A)c U (n+ 
1, A). We call U (2%, A) the infinite orthogonal 
group, infinite unitary group, and infinite sym- 
plectic group for A =R, C, and H, respectively. 
The dimensions of the cells of U (%0, A)— 
U(n, A) are >A(n+ 1)—1, where A=dimgA 
(=1(A=R), =2(A=C), =4(A=H)). It 
follows that z,(U(n, A)) ts isomorphic to 
m,(U (œ, A)) for k< A(n+ 1)—2, which is called 
the kth stable homotopy group of the classical 
group. Let O = U(co, R), U = U (œ, C), Sp= 
U(œ,H). The homotopy groups of the clas- 
sical groups are periodic (k > 0): 
mU n, (U)2Z Z, k odd, 

~0, k even, 
7,(O) = 2, 44(Sp) = 2.48 (O), 

=Z, k=3,7(mod8), 

~Z,, k=0,1(mod8), 
=0, k #0, 1,3, 7 (mod 8). 


This is called the Bott periodicity theorem. The 
relations are deduced from weak homotopy 
equivalences U>Q(B,), By x Z7Q(U), Bo x Z 
2 Q(U/O), U/O >Q(Sp/U), Sp/U > Q(Sp), Sp 


203 A 
Hopf Algebras 


Q(Bsp), Bsp x Z>Q(U/Sp), U/Sp>Q(O/U), 
O/U>Q(O). This result is applied to non- 
stable cases; for example, 25,,(U(n)) is a cyclic 
group of order n! (— Appendix A, Table 6.VJ). 
The 2-dimensional homotopy group z,(G) of 
any Lie group is trivial. 

Let xen,(O(n)), where a=[ f], f:S' O(n). 
We define f: S* x St >S"! by f(x, y)= f(x) y 
and identify S**" with the boundary (E**' x 
S"™ jU(S* x E") of EX*! x E". We extend f to 
f:S**" S" so that it maps E**! x S"! Sk x 
E" into the upper and lower hemisphere of 
S"(S" | =the equator), respectively. Let J(a)e 
Tn+x(5") be the class of the mapping thus ob- 
tained. This homomorphism J:2,{O(n))—> 
Ten +4(5") is called a J-homomorphism of Hopf 
and Whitehead. For the stable case, J:2,(QO)— 
G, is injective for k=0, 1 (mod 8), and the 
order of the image of J is the denominator 
of B,,/4t (Ba, is a ‘Bernoulli number) or its 
double for k= 4t — 1 (Adams). 
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A. General Remarks 


The concept of Hopf algebras arose from two 
directions. First in the field of algebraic topol- 
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ogy the notion of Hopf algebras arose from 
the study of homology and cohomology of Lie 
groups or, more generally, H-spaces. It was 
introduced by H. Hopf [1], whose basic struc- 
ture theorem was generalized and applied to 
several problems by A. Borel [2]. These Hopf 
algebras are graded as algebras and coalge- 
bras, and they are now used as standard tools 
in algebraic topology. On the other hand, 
Hopf algebras without grading were studied in 
connection with affine algebraic groups and 
formal groups. The study of nongraded Hopf 
algebras as an algebraic system was initiated 
by M. E. Sweedler [3], and many results on 
Hopf algebras of this type have been applied, 
not only to the theory of algebraic groups but 
also to the Galois theory of field extension and 
to combinatorial theory. 

Although these two types of Hopf algebras 
have similar structures, and the same termi- 
nology is used to describe their properties, 
they are somewhat different from each other. 
So to avoid confusion in this article, we dis- 
tinguish between graded Hopf algebras and 
Hopf algebras. 


B. Graded Algebras 


A igraded module A= 2,9, over a field k is 
said to be of finite type when each A, is finite- 
dimensional, A is connected when an isomor- 
phism n:k= A, is given. The ftensor product 
of two graded modules A and B is a graded 
module with A @ B=, (A ® B),, (A @ B), = 
2X pA, © Br-p We call A* = È Až (where Až 

is the tdual module of A,) the dual graded 
module of A. When A and B are of finite type, 
AQ Band A* are also of finite type, and we 
have (A ® B)* = A* © B* and A** = A. When 
A and B are connected, A* and A © B are also 
connected. 

Let A be a graded module. If there exists a 
degree-preserving linear mapping g: A @ A> 
A, we call (A, y) a graded algebra, whereas if 
there exists a degree-preserving linear mapping 
y:A>A Q A, we call (A, y) a graded coalgebra. 
We call ọ a multiplication, and y a comultipli- 
cation (or diagonal mapping). Usually we write 
ola © b)=ab (the product of a, be A), and call 
y(a) the coproduct of a. Multiplication and 
comultiplication are dual operations. If A is of 
finite type and (A, ọ} is a graded algebra, then 
(A*, o*) (where y* is the dual mapping of ø) is 
a graded coalgebra, and vice versa. A multi- 
plication ọ is called associative (commuta- 
tive) if p(1 © y)=9(e @ 1) (po T= p), where 
T:AQ A>AQ Ais the mapping defined by 
T(a @ b)=(—1)""b @ a for ae A, and be Ag. 
Associativity and commutativity of a comulti- 
plication are defined dually. Let (A, y) be a 
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graded connected coalgebra of finite type, 

and identify k with Ap via n. Then the graded 
algebra (A*, w*) has the unity of k as unity if 
and only if y satisfies y(1)=1@ 1 and (x)= 
1@x+x@1+X;x; @ x (0<degx;<degx) 
for degx >0. In this case we say that w has the 
unity of k as counity. For graded algebras 

(A, ọ) and (B, ¢’), if p” =(9 @ po @T@ 1): 
A®B@AS BAB, then (A @ B, ¢”) is 
also a graded algebra, which we denote by 

(A @ B, ¢")=(A, ¢) @(B, g’). The tensor prod- 
uct of graded coalgebras is defined as the 
dual notion of (A, @) © (B, 9’). 


C. Graded Hopf Algebras 


For simplicity we assume that graded mod- 
ules are defined over a field, connected, and 
of finite type. Let a graded module A be 

equipped with a multiplication ọ and a co- 


“multiplication y. If @ and y have the unity 


of k as unity and w:(A, )(A, ») @ (A, p) is an 
algebra homomorphism, then we call (A, p, W) 
a graded Hopf algebra. The last condition for a 
graded Hopf algebra is satisfied if and only if 
o:(A, 4) ® (A, W){A, Y) is a homomorphism 
of graded coalgebras. The dual (A*, w*, *) is 
also a graded Hopf algebra, called the dual 
Hopf algebra of (A, p, y). 


D. H-Spaces 


Let X be a topological space. The tcoho- 
mology group H*(X) (thomology group 
H,,(X)) considered over a field k has a multi- 
plication d* (comultiplication dą), which is 
induced by the diagonal mapping d: X >X x X 
and becomes a commutative and associative 
graded algebra (coalgebra). The groups H*(X) 
and H,(X) are dual to each other (— 201 
Homology Theory I, J). When X is equip- 
ped with a base point x, and a base point- 
preserving continuous mapping h: X x XX 
such that ho~ 1x (thomotopic) for i=1 and 
2 (where 4 (x)=(x, Xo) and 1, (x) =(Xo, x)), we 
call (X, h) an H-space, h a multiplication, and 
Xo a homotopy identity of X. Then h induces, 
through a tKiinneth isomorphism, a co- 
multiplication h*: H*(X)>H*(X) ® H*(X) 
(Hopf comultiplication) and a multiplication 
h: H,(X) @ H,(X)—>H,,(X) (Pontryagin 
multiplication). Then h*(a) (xe H*(X)) is 
called the Hopf coproduct of x, and h,.(f © y) 
(B, y€ H,{X)) is called the Pontryagin product 
of $ and y. When X is tarcwise connected and 
H,,(X) is of finite type, (H*(X), d*, h*) and 
(H,,(X),h,,d,) are graded Hopf algebras dual 
to each other. In particular, when h is homot- 
opy associative, i.e., ho(h x 1ly)~ho(ly xh) 
(homotopy commutative, i.e., h~ ho T, where 
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T(X1,X2)=(X2,X,) for x;¢X), then h* and hy 
are associative (commutative). ‘Topological 
groups and ‘loop spaces are homotopy as- 
sociative H-spaces. If a continuous mapping 
g:X—X satisfies ho(1y x g)Zhol(g x ly)=c 
(constant mapping X >{xo}), then g is called a 
homotopy inverse for X, h. 

Suppose that a graded Hopf algebra A is 
defined over a field k of characteristic p and 
equipped with associative and commutative 
multiplication, and A is generated by a single 
element ae A,. Then A is a tpolynomial ring 
k[a] (n is even when p #2) or a tquotient ring 
ka] (a2) (n is odd when p¥2) or k[a]/(a’”) 
(only when p40; n is even when p #2). These 
are called elementary Hopf algebras. Every 
graded Hopf algebra over a tperfect field k 
with associative and commutative multiplica- 
tion is isomorphic (as a graded algebra) to a 
tensor product of elementary Hopf algebras 
(Borel’s theorem) [2]. In particular, the coho- 
mology algebra over a field of characteristic 0 
of a tcompact connected Lie group is isomor- 
phic to a tGrassmann algebra generated by 
elements of odd degrees [1]. 


E. Steenrod Algebras 


The *Steenrod algebra s, over Z, is generated 
by ‘Steenrod operations Sqi (p=2), P’ (p>2), 
and the *Bockstein operation A, (p> 2), with 
composition of operations defined as multi- 
plication. Then .o/, is a connected associative 
graded algebra of finite type (not commuta- 
tive). Defining a comultiplication w of ./, by 
YS) =E Sq‘ Q Sq", PNE P OP", 
and W(A,)=1@A,+A, @ L, ., becomes a 
graded Hopf algebra with an associative and 
commutative comultiplication. Thus its dual 
£3 is a graded Hopf algebra with an associa- 
tive and commutative multiplication, and we 
can apply Borel’s theorem to ./¥ in order to 
investigate the structure of A, [4]. 

Let (A, p, Y) be a graded Hopf algebra 
with associative multiplication and comulti- 
plication. Putting c(1)=1 and c(a)= —a — 
DX a; c(a;) for dega>0 (where w(a)=1@®at 
a®1+X0 a; ® a), we obtain a linear map- 
ping c:A—A satisfying cp=y(c ®c)T. We 
call c the conjugation mapping of A. When the 
multiplication or comultiplication is commuta- 
tive, we obtain the relation c? = 1, and ¢ is a 
bijection. The conjugation mapping is utilized 
in studying Steenrod algebras [4, 5]. 


F. Coalgebras 


Now we turn to nongraded cases. Let A be a 
vector space over a field k, and let p:A@®,A—-> 
A and »:k—A be linear mappings. Then the 
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triple (A, u,n) is said to be an algebra over k if 
HOo(u®@ 1la=no(ly@ yp) and po(l,@n)= 
Lo(y @ 1,)=1,, where 1, ts the identity 
mapping of A, and A®,k and k @,A are 
identified naturally with A. We call u the multi- 
plication and 7 the unit mapping of the alge- 
bra. Dually a triple (C, A, e) with C a vector 
space over k, linear mappings A:C>C @,C, 
and ¢:C-+k is said to be a coalgebra over k if 
(1. @ A)oA=(A@ 1,)oA and (lc ®e)oA= 
(e@ 1-)oA=1¢. We call A the comultiplica- 
tion or the diagonal mapping and e the aug- 
mentation or the counit of the coalgebra. 

An algebra (A, yz, n) is called commutative 
if po T= pu, where T is the twist mapping 
a®bt+b @a. A cocommutative coalgebra is 
defined dually. Definitions of the tensor prod- 
uct of two algebras or coalgebras are similar 
to the definitions in the graded case. If D is a 
subspace of a coalgebra (C, A, £) over k satisfy- 
ing A(D)< D @ ,D, then (D, A] p, Elp) is a co- 
algebra and is said to be a subcoalgebra of C. 
A subspace I of a coalgebra (C, A, £) over k is 
called a coideal of C if A)<C@,1+1@,C 
and e(1)=0. Then the quotient space C/I has a 
coalgebra structure induced naturally from 
(C, A, €) and is said to be a quotient coalgebra 
of C. The intersection and the sum of sub- 
coalgebras of C are again subcoalgebras of 
C. If S is a subset of C, then the intersection 
of all subcoalgebras containing S is said to 
be the subcoalgebra generated by S. The 
subcoalgebra generated by any finite set or 
finite-dimensional subspace of C is finite- 
dimensional. 

If (C, A, £) is a coalgebra over k, then the 
dual space C* of C has an algebra structure 
over k with multiplication u and unit mapping 
n defined naturally from the tdual mappings of 
A and e respectively. (C*, u, n) is called the dual 
algebra of (C, A, e). Suppose that (A, p, n) is an 
algebra over k, and let A° be the subset of the 
dual space A* of A consisting of elements f 
whose kernel contains an ideal 7 such that A/I 
is finite-dimensional. Then (4°, A, £) is a co- 
algebra over k, where A and e are the linear 
mappings induced from the dual ones of u and 
n, respectively. (A°, A, £) is called the dual co- 
algebra of (A, u,n). The functors ( )* and ( )° 
are adjoint to one another tn the sense that 
there is a natural bijective correspondence 
between the set of algebra homomorphisms 
of A to C* and that of coalgebra homomor- 
phisms of C to A° for any coalgebra C and 
algebra A, where coalgebra homomorphisms 
are defined as the dual notion of falgebra 
homomorphisms. 

A nonzero subcoalgebra D of a coalgebra C 
is called simple if D has no nonzero proper 
subcoalgebras, and the sum of all stmple sub- 
coalgebras of C is called the coradical of C. If 
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C coincides with its coradical, then C is said to 
be cosemisimple. If C has only one simple 
subcoalgebra, then C is called irreducible. C is 
called pointed if all simple subcoalgebras of C 
are 1-dimensional. An element g of a coalgebra 
(C, A, £) is called grouplike if A(g)=g ® g and 
é(g)=1. The set G(C) of grouplike elements in 
C is linearly independent over k. 


G. Bialgebras 


A system (H, u,n, A, £) with an algebra structure 
(H, u,n) and a coalgebra structure (H, A, €) is 
said to be a bialgebra over k if A and e are 
algebra homomorphisms. This last condition 
is equivalent to saying that y and y are coalge- 
bra homomorphisms. If K is a subspace of a 
bialgebra (H, u,n, A, €) which is simultaneously 
a subalgebra and a subcoalgebra of H, then 
we call K a subbialgebra of H. An ideal J of 
(H, u,n) which is also a coideal of (H, A, £) is 
called a biideal of H and the quotient space 
H/I has a bialgebra structure which is said to 
be a quotient bialgebra of H. A linear mapping 
between bialgebras is a bialgebra homomor- 
phism if it is simultaneously an algebra homo- 
morphism and a coalgebra homomorphism. A 
bialgebra (H, u,n, A, £) is called commutative 
(cocommutative) if (H, u, n) ((H, A, £)) is com- 
mutative (cocommutative). 


Examples. Let kG be the vector space with a 
set G as free basis over a field k. If we define 
linear mappings A:kKG>G Q ,kG by A(x)= 
x@®x and e:kG—k by e(x)=1 for x in G, 

then (kG, A, £) is a cocommutative coalge- 

bra over k such that the set G(kG) of group- 
like elements is equal to G. Moreover if G 

is a tsemigroup with unit element, then 

(kG, u,n, A, £) is a cocommutative bialgebra 
over k, where u(n) is the multiplication (unit 
mapping) of the tsemigroup algebra kG. This 
bialgebra is called a semigroup bialgebra over 
k. If Lis a tLie algebra over k and U(L) the 
‘universal enveloping algebra of L with multi- 
plication u and unit mapping y, then Lie alge- 
bra homomorphisms x>x ® x (L>L@ L) 
and x0 (L>{0}) induce algebra homomor- 
phisms A: U(L)> U(L@ L)= U(L) @ U(L) and 
£:U(L)-k, respectively. Then (U (L), 4,1, A, £) 
is a cocommutative bialgebra over k, called the 
universal enveloping bialgebra of L. 


H. Hopf Algebras 


Let (A, p,n) be an algebra over k and (C, A, £) 
a coalgebra over k. If f and g are in R= 
Hom,(C, A), then f*g=po(f @g)oA is 
called the convolution of f and g. Defining 
K:RO,R>R and n’:kR by u'(f @g)=fx*g 
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and n‘(a)=an oe, then (R, x,y’) is an algebra 
over k. If H is a bialgebra over k with underly- 
ing coalgebra H“ and algebra H^, then Ry = 
Hom,(H*, Hô) is an algebra over k in the 
same manner as above. If the identity mapping 
ly of H has the inverse S in Ry under the 
multiplication yp’, then H is said to be a Hopf 
algebra over k with antipode S. Then S satisfies 
the following: S(gh) = S(h)S(qg) for g, h in H, 
Son=n, eoS=eand To(S @S)oA=AoS, 
where T is the twist mapping a Q b>b Qa. If 
H is commutative or cocommutative, then 
SoS=1,. 

If H’ is another Hopf algebra over k with 
antipode S’, then a bialgebra homomorphism 
f of H to H’ such that S'f= fS is called a Hopf 
algebra homomorphism. If H and H’ are both 
commutative (cocommutative), then any bi- 
algebra homomorphism of H to H’ is a Hopf 
algebra homomorphism. The ‘category whose 
objects are commutative and cocommutative 
Hopf algebras over a field k and whose mor- 
phisms are Hopf algebra homomorphisms is 
an tAbelian category (A. Grothendieck). If H is 
a commutative Hopf algebra over a field of 
characteristic zero, then the underlying algebra 
H^ has no ‘nilpotent elements (P. Cartier). 

If G is a fgroup, then the group bialgebra 
kG given above has an antipode S defined 
by S(x)=x7! for x in G, and hence kG is a 
cocommutative Hopf algebra over k. Another 
example of Hopf algebras is the tcoordinate 
ring of an talgebraic group defined over k. 
More generally, let X =Spec(A) be an taffine 
group scheme over k. Then algebra homomor- 
phisms A: A>A @,A, €: A—>k, and S:A>A 
are naturally induced from the group structure 
of X, and (A, 7, A, £, S) is a commutative 
Hopf algebra over k, where u and q are the 
multiplication and unit mapping of A, respec- 
tively. Conversely, if (A, u, 1, A, £, S) is a com- 
mutative Hopf algebra over k, then X = 
Spec(A) is an affine group scheme over k 
with the group structure induced from A, €, 
and S. Hence a commutative Hopf algebra 
over k is nothing but a cogroup object of the 
category of commutative algebras over k (i.e., 
a tgroup object of the tdual category). Dually 
a cocommutative Hopf algebra over k is 
nothing but a group object of the category of 
cocommutative coalgebras over k. 


I. Hyperalgebras 


If (C, A, £) is a pointed irreducible coalgebra 
over k, then C contains a unique grouplike 
element g, and kg is the unique simple sub- 
coalgebra of C. An element a of C satisfying 
A(a)=a ®g+g® ais called primitive. The set 
P(C) of primitive elements in C is a vector 
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subspace of C. A cocommutative coalgebra C 
is called colocal if C is irreducible, i.e., if the 
dual algebra C* of C is a tquasilocal ring. The 
tdimension of P(C) of a colocal pointed coalge- 
bra C is finite if and only if the dual algebra 
C* of C is a tNoetherian complete local ring. 
A bialgebra (H, p,n, A, £) over k is said to be a 
hyperalgebra over k if the underlying coalgebra 
is colocal. Then the unique simple subcoalge- 
bra (grouplike element) of H is 4(k) (y(1)), and 
P(H) has a Lie algebra structure defined by 
[x, y}=xy—yx for x, y in P(A). 

The universal enveloping bialgebra U(L) of 
a Lie algebra L over k is a hyperalgebra over k 
such that the set P(U(L)) of primitive elements 
in U(L) is equal to L. Conversely, if the char- 
acteristic of k is zero, any hyperalgebra H over 
k is isomorphic to the universal enveloping 
algebra U(P(A)) of the Lie algebra P(H). But 
in positive-characteristic cases U(P(H)) is 
generally a proper subbialgebra of H. Another 
important example of hyperalgebras is the 
dual coalgebra hy(X)= A?° of the tstalk A at 
the neutral point of the tstructure sheaf of an 
talgebraic group scheme X over k. In addition, 
hy(X) has an algebra structure defined from 
the group structure of X and is a hyperalgebra 
over k such that P(hy(X)) is equal to the Lie 
algebra L(X) of X. Although L(X) plays an 
important role in the infinitesimal theory of 
algebraic groups over a field of characteristic 
zero, it does not give any information on in- 
finitesimals of orders higher than p in the case 
of positive characteristic p. In the case of char- 
acteristic zero we see hy(X)= U(L(X)) and 
L(X)= P(hy(X)), and so hy(X) is a natural 
substitute for L(X) in positive-characteristic 
cases. From this viewpoint many interesting 
results on hy(X) of an algebraic group scheme 
X which are parallel to those on L(X) in the 
case of characteristic zero have been obtained 


[6-8]. 
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A. General Remarks 


Mathematical analysis of the motion of fluids 
(— 205 Hydrodynamics) gives rise to various 
kinds of mathematical problems; the equations 
that govern flows are amongst the most im- 
portant and most extensively studied of the 
nonlinear partial differential equations. Here 
we review only basic and noteworthy results. 
Sections B—E are concerned with incompress- 
ible fluids, while Sections F and G deal with 
compressible fluids. 


B. Nonstationary Solutions of the Navier- 
Stokes Equation 


Let Q be a bounded domain in R” (m=2 or 3) 
occupied by a fluid, with smooth boundary 
6Q. If the fluid is viscous and incompressible, 
its motion can be described by means of the 
velocity u=u(t, x) and the pressure p= p(t, x) 
with t>0 and xeQ the time variable and the 
space variable, respectively. For simplicity, we 
assume that external forces are absent. Then u 
and p satisfy the Navier-Stokes equation 


Ou 
apt Nea VP (1) 


and the equation of continuity 
divu=0, (2) 


where the positive constant v stands for the 
(kinetic) viscosity. On 0Q, u is subject to the 
boundary condition 


ulea = Blt, x). (3) 
The initial value of u is also prescribed: 


u|:=0 = uo(x). (4) 
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If Q is unbounded, e.g., if Q is an exterior 
domain outside compact surfaces, then 


u(t,x)>Up(t) (Ix|>%) (3) 


is imposed in addition as the boundary con- 
dition at infinity. 

The problem of finding u and p that satisfy 
(1)—(4) for given $ and up is called the Navier- 
Stokes initial value problem (abbreviation, NS 
initial value problem). Pioneering mathemat- 
ical studies of this problem were initiated by 
J. Leray, and since the 1950s various contri- 
butions have been made by many authors, 
including E. Hopf, O. A. Ladyzhenskaya, H. 
Fujita, T. Kato, and S. Ito (— [13,17,21]). We 
state here the result in terms of regular (class- 
ical) solutions under the simplifying assump- 
tion that Q is bounded, 8 =0, and ug is sole- 
noidal (divu,=0) and smooth. The situation 
depends nontrivially upon the dimension m. 
Namely, if m= 2, then a regular solution of the 
NS initial value problem exists uniquely and 
globally, i.e., for all time. If m = 3, we can prove 
only a local existence theorem (i.e., one hold- 
ing in a finite interval) of a regular solution. 
This solution is unique in the internal of its 
existence; however, it can be extended over 
the whole interval when the Reynolds number 
is sufficiently small. In other words, the ques- 
tion of well-posedness of the 3-dimensional 
NS initial value problem is open at present. 


C. Weak and Strong Solutions of the Navier- 
Stokes Equation 


In 1951 Hopf introduced the notion of weak 
solutions of the NS initial value problem and 
succeeded in proving their global existence 
(without uniqueness). We here give the defini- 
tion of Hopf’s weak solution: Let Co, be the 
set of vector functions ue C? (Q) with divu=0. 
By H we denote the closure of Co’, under the 
L?-norm, and by V the closure of C, under 
the W,!(Q)-norm (or equivalently, the Dirichlet 
norm for bounded Q). Then an H-valued func- 
tion u=u(t) is a weak solution of the NS initial 
value problem with f=0 in [0, T) (0< T< 
+00) if 

(i) ue L” (0, T; H)N L7(0, œ; V), 

(ii) u is weakly continuous from [0, T) to H, 
and 

(iii) u satisfies the weak equation 


T 
| f Pìr —v(Vu, Volo 


(6) 


+((u: V)ọ, Diot di= — (uo, P0) (5) 


for all ge CZ ([0, T) x Q) with div ọ =0. 
Note that the pressure p has been eliminated 
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in the weak equation. In general, the unique- 
ness of Hopf’s weak solution is not known 
except for m=2. However, when a regular 
solution does exist, then the weak solution 
coincides with it (a.e.), and is unique. Solutions 
of the NS initial value problem which are 
stronger than the weak solution to the extent 
that the uniqueness can be proved have been 
introduced, for instance, by A. A. Kiselev and 
Ladyzhenskaya [12] and Fujita and Kato 
[5]. Actually the existence and uniqueness 
theorems of the aforementioned regular solu- 
tions are proved using existence theorems 

of such strong solutions. Recently, Y. Giga and 
T. Miyakawa succeeded in generalizing the 
Fujita-Kato theory from L? to L”, and thus 
they have shown that if uo € L” (Q) and is sole- 
noidal, a unique strong solution exists at least 
locally for any m. 

Those strong solutions that satisfy unique- 
ness, and hence are regular solutions of the NS 
initial value problem, turn out to be smooth 
for t>0; they are analytic in t>0 and xeQ. 


D. Stationary Solutions of the Navier-Stokes 
Equation 


If the flow is steady, u and p are solutions of 
the boundary value problem consisting of (1) 
with ĝu/ôt omitted, (2) and (3) with f indepen- 
dent of t (and, in addition, (3’) with a constant 
U, if Q is an exterior domain). The existence 
of solutions of this boundary value problem 
for the case of bounded Q was established by 
Leray as one of the earliest applications of 
his fixed-point theorem. For the case of un- 
bounded Q, Leray’s study was completed and 
extended by R. Finn, H. Fujita, and others to 
yield theorems on existence, regularity, and 
asymptotic behavior in the wakes of solutions 
(— [4, 13,21]). These stationary solutions are 
unique if the Reynolds number is sufficiently 
small. On the other hand, under certain cir- 
cumstances that involve Quette flow, non- 
uniqueness or bifurcation of stationary solu- 
tions for large Reynolds numbers has been 
positively proved. 


E. Euler’s Equation 


If the fluid is inviscid and incompressible, the 
Navier-Stokes equation is reduced to Euler’s 
equation 


Cu 
z +(u: Vju= — Vp. (6) 
ct 


4 


Then the boundary condition ts replaced by 
the frictionless boundary condition, which, tn 
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the homogeneous case, takes the form 
Un | aT 0, (7) 


where u, is the normal component of u. Regu- 
lar solutions of the initial value problem con- 
sisting of (6) with (2), (4), and (7) have been 
proved to exist for all t if m=2 and in a finite 
time if m=3 [1,11]. 


F. The General Navier-Stokes Equations 


If the fluid is compressible, viscous, and heat- 
conductive, its motion is described in terms of 
the density p, the velocity u, and some thermo- 
dynamic quantity, say the absolute tempera- 
ture 8, and is governed by the following system 
of equations, sometimes called the general 
Navier-Stokes equations: 


A 

P 4 div(pu) =0, (8) 
ot 

ou H lei l 
—+(u-V)u=—| A+—Vdiv }u——Vp, (9) 
ôt p 3 p 


A 


0 
— +0 V0=— 
ot 


Cyp 


Us x6. P 
A0 +——(divu)p, +—. 
cyp cyp 
(10) 


Here the pressure p is regarded as a function 
of p and @ through the equation of state. The 
viscosity coefficient u, the coefficient of heat 
conduction k, and cp, the specific heat at 
constant volume, are positive constants. We 
have for simplicity assumed that the external 
force is absent, and that the Stokes condition 
for viscosity is satisfied. Finally, ¥ is the dis- 
sipation function: 


Die am, wpm fa 8 Y 
P= pidiv É F (4+, ) 
gneve 2. (2 “ a") 


If Q is the whole space, then the initial values 
Po» Up, Oo of p, u, 0 are given at t=0, and we 
obtain the Cauchy problem for the general 
Navier-Stokes equation. Mathematical study 
of this Cauchy problem has become active 
since J. Nash [19] and N. Itaya [8] proved the 
existence of unique regular solutions local in 
time. Following Itaya’s argument, A. Tani 
constructed a unique regular solution local in 
time for the initial boundary value problem 
consisting of (8)—(10) and boundary conditions 
imposed on u and 0. When it comes to the 
global existence of solutions, only restricted 
results are known. The global existence of a 
regular solution in the 1-dimensional version 
of the Cauchy problem has been established 
by Ya. Kanel’ [10] and Itaya under certain 
simplifying assumptions, such that the fluid 

is a barotropic gas, i.e., one obeying p= Cp”, 
where C and y >1 are positive constants. The 
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1-dimensional initial boundary value problem 
can also be solved globally if the gas is ideal 
and polytropic, i.e., one for which p= Rp0, and 
with the internal energy proportional to 0 and 
[9]. For the 3-dimensional case, we can only 
refer to [18], where existence of global solu- 
tions of the Cauchy problem has been proved 
for initial data close to constants under the as- 
sumption that the gas is ideal and polytropic. 


G. Equations for Inviscid Ideal Gases 


When the gas under consideration is inviscid, 
ideal, and barotropic, we put u=0 and p= Cp” 
in (9). The equation thus obtained is combined 
with (8) to yield the following quasilinear 
hyperbolic system, which admits conservation 
laws: 


L rs 
ae u}=Q, 
Ot i 


(11) 
a C 
y(u- Vu +—Vp’=0. 
ôt p 


It the initial data pọ and ug are smooth to 
some extent, then the Cauchy problem for (11) 
has a regular solution local in time. Generally, 
we cannot expect existence of regular solu- 
tions global in time, namely, discontinuity is 
likely to take place in a finite time, which cor- 
responds to the occurrence of shock waves. 
Therefore we have to introduce weak solutions 
that admit discontinuity. By definition a piece- 
wise continuous function {p, u} is a weak 
solution of (11) if it satisfies (11) in the distri- 
bution sense and if its discontinuity is sub- 
ject to a certain jump condition, called the 
Rankine-Hugoniot relation as well as another 
condition, called the entropy condition, which 
two conditions allow us to distinguish a physi- 
cally realizable solution among many possible 
discontinuous solutions [3, 14]. Global exis- 
tence of the weak solution has been proved 

so far only for the 1-dimensional problem with 
initial data close to constants in the sense that 
their oscillations and total variations are suffi- 
ciently small. Actually in this case we can 
apply J. Glimm’s method [6] to construct 
weak solutions by means of a difference ap- 
proximation which involves random numbers. 
Little is known regarding the uniqueness of 
weak solutions. Finally, if we are concerned 
with steady-state solutions of an inviscid com- 
pressible fluid and assume that the flow is 
irrotational, then we are led to a quasilinear 
partial differential equation of mixed type for 
the velocity potential ®, which is elliptic in the 
subsonic region and hyperbolic in the super- 
sonic region. Classical results concerning these 
equations may be found in [2, 3]. 
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A. General Remarks 


Gases and liquids are easily deformed, and 
they share many kinetic properties. They are 
examples of fluids. By definition, a fluid is a 
continuous substance having the property that 
when it ts not moving, any part of the sub- 
stance separated from the rest by a surface 
exerts an outward force that is perpendicular 
to the given surface. 

Hydrodynamics (or fluid dynamics) is con- 
cerned with the equilibrium and the motion of 
gases and liquids without considering their 
molecular structure. In particular, the branch 
of the theory concerning fluids in equilibrium 
is called hydrostatics, and hydrodynamics 
sometimes refers to the branch concerning 
fluids in motion. 

There are two methods of describing the 
motion of a fluid. One regards a fluid as a sys- 
tem consisting of an infinite number of par- 
ticles and discusses the motion of each parti- 
cle as a function of time. This is Lagrange’s 
method. For example, suppose that a fluid 
particle with the coordinates (x, y, z)=(a, b, c) 
at the moment t=0 has coordinates x= 
fila, b,c, t), y= f(a, b,c, t), z= f3(a, b,c, t) at 
an arbitrary time t. Then the motion of the 
fluid is perfectly determined by the functions 
Si. fz, and fy. 

The other is Euler’s method, which discusses 
the values of the velocity v(u, v, w), the density 
p, the pressure p, etc., of the fluid at arbitrary 
times and positions. From this standpoint 
each quantity of the fluid is regarded as a 
function of a space-time point (x, y, z, t). 

The rate at which any physical quantity F 
varies while moving with the fluid particle is 
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the Lagrangian derivative DF /Dt, which is 
related to the ordinary partial derivatives by 


DF GF OF OF CF 
= +u- 


fej 
Ox Oy Oz 


The three components (u, v, w) and the two 
state quantities (p, p) (in general, other state 
quantities, for example, the temperature T and 
the tentropy S, are assumed to be determined 
by equations of state such as T= T(p, p), S= 
S(p, p)) are determined by five (=1+3+1) 
relations derived from the conservation laws of 
mass, momentum, and energy, namely, the 
equation of continuity, which corresponds to 
the conservation of mass, 


ĉp/ôt + div( py) =0; (1) 


the equation of motion, which corresponds to 
the conservation of momentum, 


O(pv)/ct + div(py ® v—p)= pK, (2) 


where K is the external force per unit mass, p 
is the stress tensor, and ® denotes the ftensor 
product, while tdivergence is applied to each 
row vector, and by virtue of (1), the equation 
(2) can be expressed component-wise as 


Du ôP Pyy OP xz 
p iea OPa y Mia 


Dt ôx êy = a Fa 


REVE oo 
Dt ox êy @z 


+ pK,, (2) 


Dw ĉ Zx G z ÔPzz 
Pa ban, 


+pK,; 
Dt ôx dy oz” 


p 
and the energy equation, which corresponds to 
the conservation of energy, 


G(pv?/2+ pE)/dt 
+div(pv(v?/2+E)—v-p+h)=pv-K, (3) 


or the equation of entropy production, which is 
another expression of (3), 


pTDS/Dt= —divh+Q, (3’) 


where E is the internal energy per unit mass, Q 
the heat generated per unit time and volume, 
and h the heat flux. Here K, p,,, h, and Q or 
their relations with other quantities (e.g., h= 
—x« grad T, where x is the thermal conductiv- 
ity) are assumed to be known. 


B. Perfect Fluids 


When there is a velocity gradient in the flow, 
a tangential stress appears which tends to 
make the velocity uniform, so that p is not a 
diagonal tensor (— p6,,, i-e., pressure). This 
property is called fluid viscosity. Generally, Q 
and h do not vanish in this case. However, in 
order to simplify the problem we consider a 
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nonviscous (sometimes also adiabatic) fluid, 
which is called a perfect fluid and is a good 
approximation in the large of actual fluids. 
The motion of a perfect fluid is determined by 
Euler’s equation of motion 


p Dv/Dt = — grad p+ pK, (4) 


which is obtained from (1) and (2) by replacing 
Pix by the pressure only, and also by the ther- 
modynamic relation DS/Dt =0 obtained from 
(3’) by putting Q =0 and h=0 or its integral 
S=constant in homentropic flow, which is 
governed by the adiabatic law poc p”, where y 
denotes the ratio of specific heat at constant 
pressure to that at constant volume. In partic- 
ular, for a liquid, the density variation can be 
neglected. Putting =constant in (1), we have 


divv=0, (5) 


which, in conjunction with (4), determines four 
unknowns (u, v, w, p) as functions of (x, y, z, t). 

A fluid of constant density is called an in- 
compressible fluid, and one of variable density 
a compressible fluid. Even though it might 
seem natural to consider gases as examples of 
compressible fluids, they can be treated as 
incompressible fluids if the speed of the flow 
of the gas g=|v| is small compared with the 
velocity c=./dp/dp of sound propagating in 
the gas. We call g/c = M the Mach number. 

The vector @æ(č, n, 0), which is derived from 
the velocity vector v as œ =rotv, is called the 
vorticity. A small part of the fluid rotates with 
angular velocity w/2. If œ =0, the flow is called 
irrotational, otherwise rotational. The curves 
dx:dy:dz=u:v:w and dx:dy:dz=€é:n:C are 
called, respectively, stream lines and vortex 
lines. The line integral $,v,ds along a closed 
circuit C is called the circulation around C. 

In irrotational flow, the velocity is expressed 
as v= grad ®, where ® is called a velocity poten- 
tial. When the external force K has a potential 
Q (K = — grad Q) and p is a definite function 
of p, we have the pressure equation 
o® 


1 d 
- 13a [2+ a= constan, 
ot 2 P 


which is valid everywhere in the flow. In a 
steady flow, 


paa [Zo constan (6) 
2 p 

is valid along each stream line or each vortex 
line; this is called the Bernoulli theorem. These 
two equations correspond to tenergy integrals 
of the equation of motion. Furthermore, cor- 
responding to the conservation of tangular 
momentum, we have Helmholtz’s vorticity 
theorem: When K = — grad Q and p= f(p), 
vorticity is neither created nor annihilated in 
the fluid. 
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For the irrotational motion of an incom- 
pressible fluid, tLaplace’s equation AD =0 is 
derived from (5). Hence the problem reduces to 
the determination of a tharmonic function ® 
under appropriate boundary conditions (e.g., 
for a fixed wall, normal velocity v, =0®/dn= 
0). For the 2-dimensional problem a stream 
function ¥ is introduced to satisfy (5) by the 
relation u = 0¥/2y, v= — 0'Y/0x. Since the 
tCauchy-Riemann equations 6®/dx = 0 /éy, 
0@/dy = —0'Y/6x are valid in this case, f= 
P +iY is an fanalytic function of z = x + iy. 
Therefore the theory of 2-dimensional irro- 
tational motion is essentially equivalent to the 
theory of complex tanalytic functions, and 
consequently the theory of tcomformal map- 
ping is a powerful method in the theory of 
such fluid motion. 

For irrotational steady flow of a compress- 
ible fluid in which Q=0, c is determined from 
(6) as a function of g. Then (1) and (4) yield a 
‘nonlinear partial differential equation for ®: 


i u? oro i v? To i w?\02® 
c? j 6x? c?) dy? c? } dz? 


vw 07D wu’ _ uv 67 % 


eee ee Ag 
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This equation is ‘elliptic or thyperbolic (— 326 
Partial Differential Equations of Mixed Type) 
according as M is less than 1 (subsonic) or 
greater than 1 (supersonic). 

For 2-dimensional flow, we can introduce a 
stream function ¥ from (1) by u = ô®/ôx = 
(1/p)(0V/0y), v= 0@/dy = —(1/p)(EP/0x). By 
utilizing the idea of tLegendre transformation, 
this system of nonlinear equations for ® and Y 
can be reduced to a system of linear equations 
in the hodograph plane (q, 0): 


ab Topa 1—M? ôP 


ôq dq\pq/ 00 pq 30” 
om qF 
00 p oq 


(d(pq)/dq = p (1 — M”)), where the independent 
variables q and 0 are the magnitude and the 
inclination of the velocity, respectively. The 
treatment of 2-dimensional compressible flow 
on the basis of this system is called the hodo- 
graph method. For a flow of small M, there is a 
method of successive approximation (M?- 
expansion method) which starts from Laplace’s 
equation, neglecting the terms of O(M7) in 
(7). For uniform flow (velocity U in the x- 
direction) past a thin wing or slender body 
where v and w are small, we have thin wing 
theory or slender body theory, whose first 
approximation is 

Ceo p oO 


Ste a) 8 
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1—M? 
( x3 y? 


792 


IfM <1 or M>1 (although not too large) a 
linearization (Prandtl-Glauert approximation) 
is possible by replacing M by the Mach num- 
ber at infinity M,,=U/c,,. For M>1, (8) has a 
tcharacteristic surface, which is the Mach cone 
whose central axis makes an angle arcsinc/q = 
arcsin 1/M with the flow. This can be inter- 
preted also as an envelope produced by spher- 
ical sound waves with velocity c from a source 
drifting with velocity q. For M~ 1, we put D= 
c,x+@ (c, is the fluid velocity when g=c). 
Then for an adiabatic gas, (8) is approximated 
by a partial differential equation of tmixed type: 


8’ o a 7+1 ĉo @o@ 
dy? dx Ox?” 


ôz? Cx 0) 
Such a flow in which both domains M 2 1 co- 
exist is called the transonic flow, and exact 
solutions by the hodograph method are 
known. However, continuous deceleration 
from M>1 to M <1 generally tends to be 
unstable or impossible, and the appearance of 
a Shock wave, i.e., a discontinuous surface of 
state quantities, is not unusual. This can be 
considered as the tweak solution of (1), (2) (3) 
for a perfect fluid. In particular, in the coordi- 
nate system fixed to the surface, its integrated 
form can be obtained as follows: [pv,] =0, 
[Poin + pvjv,] =0, [4q7/2+ E+ p/p]=0(L Jis 
the jump of the quantity at the surface, and n 
is the normal component). Supplemented by 
the entropy increase, these formulas give rela- 
tions between the fluid velocity and the state 
variables at the front and back of the shock. 
In an ideal gas they are called the Rankine- 
Hugoniot relation. Entropy is not uniform 
behind a curved shock, and the flow is not 
irrotational. For a weak shock starting from 
the tip of a pointed slender body, however, the 
discontinuity is small and approaches the 
*characteristic surface of (8), i.e., the Mach 
wave (compressive wave, in this case). Rare- 
factive Mach waves are found in the super- 
sonic flow of acceleration around a convex 
surface. Such waves contribute to the drag on 
an obstacle placed in supersonic flow. 


C. Viscous Fluids 


A body moving uniformly in a fluid at rest 
(with velocity less than that of sound) suffers 
no drag as long as the viscosity of the fluid is 
negligible and the flow is continuous (d’Alem- 
bert’s paradox). Hence we must take the vis- 
cosity into account in order to discuss the 
creation and annihilation of vortices, the gen- 
eration and structure of shock waves, and 

the drag acting on obstacles. For this purpose, 
we extend Newton’s law stating that frictional 
stress is proportional to the velocity gradient 
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and assume that the stress tensor p is a linear 
function of the rate-of-strain tensor e: 


2 
i= =p +p divy + Hex, —sudivy 


Ou , 2 : 
= —p+2u—-+| w—<=p |divy,..., 


ôx 3 
= = (ew dv 
Pyz = Hey, =H TAT serre 


The proportionality constants p and x are 
called, respectively, the coefficients of (shear) 
viscosity and bulk viscosity. The bulk viscosity 
is sometimes neglected (Stokes’s assumption) in 
the usual hydrodynamics, and then the mean 
value of the normal stress components equals 
the pressure. When a fluid satisfies this linear 
relation between p and e, it is called a New- 
tonian fluid. Otherwise, it is called a non- 
Newtonian fluid. Except for a few cases, such 
as colloid solutions, fluids can be regarded as 
Newtonian. 

If we take the viscosity into account, the 
equation of motion of an incompressible fluid 
becomes 


p Dv/Dt = pK — grad p + pAv. (10) 


This is called the Navier-Stokes equation. A 
nondimensional quantity R= pUL/p formed 
by representative length L, velocity U, density 
p, and viscosity u of a flow is called the Rey- 
nolds number. In order for two flows with 
geometrically similar boundaries to share 
similar kinetic properties, their Reynolds num- 
bers must be equal. This is called the Reynolds 
law of similarity. 

For small R, we can approximate the equa- 
tion of motion (10) by replacing the accelera- 
tion Dv/Dt by év/ét (Stokes approximation) or 
by dv/et + Udv/dx (Oseen approximation) for a 
body placed in the uniform flow of velocity U 
in the x-direction. 

For large R, the flow can be regarded as 
that of a perfect fluid, since we can neglect pAv 
as long as the velocity gradient is not too 
large. In the vicinity of a fixed wall, however, 
the velocity gradient becomes large, because in 
a very thin layer the velocity decreases rapidly 
from the value U of a perfect fluid to zero at 
the wall. This layer is called the boundary 
layer. For the boundary layer, Prandtl’s bound- 
ary layer equation 


ou ðu éu_aU | dU uu 

at a D at ax p dy?’ 

ôu dv 

OE = 11 
ôx dy my 


is valid, where x and y are the coordinates 
parallel and perpendicular to the wall, respec- 
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tively, and U is the velocity outside the bound- 
ary layer. 

If GU/dx <0, it sometimes happens that the 
boundary layer separates from the surface of 
the body. In this case a vortex is generated in 
the flow, as large vorticities in the boundary 
layer are carried into the flow. For a body 
without separation of the boundary layer, the 
d’Alembert paradox holds and is no longer a 
“paradox,” and the drag is small. Such bodies 
are called streamlined. 

In compressible flow the new problem arises 
of the interaction of shock waves with the 
boundary layer. A rapid increase in pressure 
due to the shock wave formed on the surface 
of a body invalidates the assumption of a 
boundary layer and causes its separation. If 
the Mach number becomes sufficiently large 
(M = 5, hypersonic flow), the bow shock ap- 
proaches the body and interferes with the 
boundary layer. The generation of heat at the 
boundary layer (e.g., viscous dissipation in Q) 
requires the consideration of heat transfer as 
well as viscosity. In this manner, it becomes 
necessary to treat a complete system of equa- 
tions which take into account the energy 
equation (3) as well as the temperature de- 
pendence of x, u, and pw. 


D. Laws of Similarity 


For such complicated systems, fdimensional 
analysis is often useful (— 116 Dimensional 
Analysis). As laws of similarity, we can con- 
sider not only those like the Reynolds law but 
also others for bodies which transform similar- 
ly by taffine transformations. Corresponding 
to equation (8), the Prandtl-Glauert law of 
similarity for subsonic flow is famous: The 
pressure coefficient (nondimensional pres- 
sure change) for a thin wing of chord (i.e., the 
length in the direction of flow) 1, span L, and 
thickness t is 


CAL, 1) =AC,, (4/1 — MZ, L,1/,/1— M2 4), 


when 4 is an arbitrary constant and C,, is Cp 
for a body of scaled length and thickness 
placed in an incompressible flow. Correspond- 
ing to (9), an extension of the famous von 
Karman transonic similarity is possible: 


C,(L, =178 (y+ 1) 


x f(y |1- Mal L, (7+ 1)r/11 — Mz”). 


E. Turbulence 


For low Reynolds numbers, the flow generally 
has smooth streamlines. For high Reynolds 

numbers, however, extremely irregular motion 
in space and time appears. The former is called 
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laminar flow, and the latter turbulent flow (— 
433 Turbulence and Chaos). The transition 
from laminar to turbulent flow is considered to 
be due to the instability of the laminar flow, 
and this transition has been studied by the 
method of small oscillations. Recently, non- 
linear effects have also been examined. Regard- 
ing the internal structure of turbulence, statis- 
tical theories originated by T. von Karman 
and G. I. Taylor (Proc. Roy. Soc. London, 

151 (1935)) and A. N. Kolmogorov (Dokl. 
Akad. Nauk. SSSR, 30 (1941)) are of central 


importance. 


F. Water Waves 


*Surface waves that occur on the free surface of 
water (or other liquids) are called water waves; 
their restoring forces are gravity and surface 
tension. If we consider waves generated on still 
water (assumed to be inviscid and incompress- 
ible) in equilibrium, we can regard the flow 
field associated with the wave motion as tirro- 
tational by virtue of *Helmholtz’s vorticity 
theorem. Hence the flow velocity can be de- 
rived from the tvelocity potential ® which 
satisfies fLaplace’s equation AP =0, together 
with the boundary conditions 


DUETA Od Oe as. 
Dt 0x dx dy dy éz 
at z=—H(x,y), (12) 
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at z=h(x,y,t), (14) 
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where the Cartesian coordinates x and y are 
taken in the undisturbed horizontal free sur- 
face, while the positive z-axis is vertically 
upward. The equations z= — H(x, y) and z= 
h(x, y, t) denote, respectively, the bottom sur- 
face (assumed to be known) and the elevation 
of the free surface measured from the un- 
disturbed level z=0, while g stands for the 
gravitational acceleration, o the surface ten- 
sion, po the atmospheric pressure, and 1/R,, 
the tmean curvature of the disturbed free 
surface expressed as 


MANCAT GL 
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The conditions (12) and (13) imply, respec- 
tively, that the fluid does not cross the bottom 
and the free surface, while (14) expresses the 
fact that the difference between atmospheric 
and fluid pressures at the free surface is equal 
to the normal force (per unit area) due to the 
surface tension. Thus the problem is formu- 
lated as a nonlinear boundary value problem 
for Laplace’s equation including the unknown 
boundary z=A(x, y, t). 

Let us first consider linear waves for which 
the wave amplitude of the surface elevation is 
much smaller than any other characteristic 
linear dimension such as the wavelength or 
the water depth H (for simplicity, we assume 
hereafter H =constant, i.e., a flat horizontal 
bottom). Linearizing the boundary conditions 
(12)—(14) with respect to h and grad ®, and 
assuming a sinusoidal wave proportional to 
exp[i(k-r—ot)], k(k,,k,) and w being respec- 
tively the (2-dimensional) twave number vector 
and the fangular frequency and r=(x, y), we 
obtain the dispersion relation 


ok? 


In a layer of still water the waves are isotropic 
in the horizontal plane and the dispersion 
relation involves only the magnitude k of 

the wave number vector. It is readily seen 
from (15) that the water waves are typical 
tdispersive waves in which the tphase velocity 
C,(=«o/k) depends on the wave number k or 
the wavelength A( = 27/k). It is also evident 
from (15) that the quantity ok?/(pg) measures 
the relative importance of surface tension and 
gravity. Hence for waves with wavelengths 
much larger than 4,,=22,/a/(pg) (~1.7 cm 
for water), the effect of surface tension ts negli- 
gible, and we have gravity waves. Conversely, 
when 4 << Åm the effect of surface tension 
becomes dominant, and we have capillary 
waves or ripples. When the water depth H ts 
much larger than the wavelength 4, we can 
approximate (15) by wm? =gk +ok?/p, since 
tanh(kH)~ 1. We call such waves deep water 
waves. On the other hand, if H is much smaller 
than the wavelength (kH << 27), we have shal- 
low (or long) water waves for which (15) can be 
approximated by m*=gHk?[1 + {a/(pgH?)— 
1/3} (KH)? +...] if o/(pgH?)=O(1). In partic- 
ular, if we neglect O(kH)?, we recover the 
well-known dispersionless long gravity waves 
whose phase velocity is simply ./gH. In all the 
cases mentioned above, the amplitude function 
of the velocity potential ® is proportional to 
—iwcosh {k(z+ H)}/{ksinh(kH)}, so that the 
flow velocity due to deep water waves de- 
creases exponentially as one proceeds verti- 
cally downward from the free surface. In the 
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limiting case of long gravity waves, however, 
the fluid motion is nearly horizontal through- 
out the fluid layer. 

When the wave amplitude becomes larger, 
nonlinear effects are no longer negligible. For 
such waves, various ‘singular perturbation 
methods provide powerful tools. For example, 
the basic system of equations for weakly non- 
linear (1-dimensional) shallow water waves can 
be reduced to a simple solvable nonlinear 
equation called the ‘Korteweg-de Vries equa- 
tion, whose solitary wave solution is known 
as a prototype of solitons (— 387 Solitons). 
Another classical example is a (1-dimensional) 
deep gravity wave called the Stokes wave, 
which can be obtained as a power series in the 
wave steepness (amplitude x wave number). 
The first term of the series is of the form of a 
linear sinusoidal wave, and the higher-order 
terms correspond to the higher harmonics, 
while the angular frequency is shifted from the 
linear case and depends not only on the wave 
number but also on the amplitude. Similar 
singular perturbation methods have also been 
applied to various kinds of resonant interac- 
tions such as nonlinear self-modulation, higher 
harmonic resonances, and multiwave interac- 
tions. Finally it should be mentioned that an 
exact solution representing a (1-dimensional) 
deep capillary wave was obtained by G. D. 
Crapper (J. Fluid Mech., 2 (1957) 532-540; 
extended later to the case of finite depth by 
W. Kinnersley, J. Fluid Mech., 77 (1976), 229— 
241). This is one of the few realistic exact solu- 
tions obtained so far; a famous exact solution 
of Gerstner’s trochoidal wave does not satisfy 
the irrotationality condition. 
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A. Hypergeometric Functions 


The *power series 


ro) 2 T(a+nr(p+n) 

F(a, B,y52)= 2" 
POM Aa a!n) 

in the complex variable z is called the hyper- 

geometric series or Gauss’s Series. 

It is convergent for any a, f, and y if |z|<1, 
and is convergent for Re(a+ B—y) <0 if |z|=1. 
If z=1, its sum is equal to T (p) (y —&— Pyr (y 
—a)I'(y — f) (except when y is a nonpositive 
integer). The hypergeometric functions are 
obtained as analytic continuations of the 
functions determined by hypergeometric series 
that are single-valued analytic functions 
defined on the domain obtained from the 
complex plane by deleting a line connecting 
branch points z=1 and z= œ (— Appendix A, 
Table 18.1). 

A hypergeometric function is a solution of 
the differential equation 


2 


dw dw 
z(1 Se a SR I)2)7 — aBw=0, 


(1) 


which is called the hypergeometric differential 
equation or Gaussian differential equation. This 
equation is a differential equation of tFuchsian 
type with fregular singular points at 0, 1, and 
œ, whose solutions are expressed, in terms of 
the +P-function of Riemann, by 


0 oe) 1 
w=P 0 a 0 Zz. 
I-y B y-a—-8 


If any one of the values of y, y—a— fp, or a— 
ß is integral, there exists a series containing 
logz, representing a solution of the differentia! 
equation (1) in a neighborhood of the corre- 
sponding singular points. When none of the y, 
y—a—f, or «—f values is integral, since the 
linear transformations z’ =z, 2’ = 1/z, z’=1—z, 
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z’=2z/(z —1), 2’ =(z—1)/z, z’=1/(1 —z) permute 
singular points, there exist 24 particular solu- 
tions around the singular points. The latter 
fact was first proved by E. E. Kummer (1836). 

There exist various curves C for which the 
integral 


v=| ut *(1—u)’ * 1(1—zu) F du 
C 


is a solution of (1). Among them we can take 
the segment [0, 1] when Rea>0, Re(y—«)>0. 
Then the corresponding solution is holomor- 
phic in the interior of the unit circle, and 


ry) 


PP B= Oro 


x | ut H1 — up 211 — zu)? du. 
C 


Since the integrand has branch points at 0, 1, 
and 1/z, we have the following expression 
when y is not an integer: 


F(x, B, y; 2) 


7 l ro) 
(1-2?! )(1 — e?) rT — a) 


(1+,0+,1-,0-) 
z u*(1—u)’ ®t (1 — zu)’ du, 


where Rea> 0, Re(y—«)>0; whereas if y is an 
integer, then 


1 My) 


F(a, B, y; 2) ~ (1—e 2") P(x) Py —a) 


(1+,0+) 
<$ u*(t — up {1 — zu) f du, 


where the contour in the first expression en- 
circles successively each of 1, 0, 1, and 0 once 
with indicated directions. These expressions 
can be adopted as a definition of the hypergeo- 
metric functions for the general value of z. 
Other integral expressions are also known 

(— Appendix A, Table 18.1). 


B. The Ladder Method 


A linear ordinary differential equation of the 
second order having three regular singular 
points on the complex sphere is easily trans- 
formed into an equation of the form (1). To 
solve such an equation with a parameter, it is 
often useful to decompose, in two different 
ways, the main part of the equation into two 
factors of the first order, and find a recurrence 
formula involving the parameter, as we shall 
see in the following example. This method 
is called the ladder method or factorization 
method. 

For example, tLegendre’s differential 
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equation 
L,[w]=(1—27)(((1 -zwy +n(n+1)w)= 
is decomposed as follows: 


L,=S,° T +n? = That Siar t(at i), 


Peis) ws S,=(1 —2z?)——nz 

" dz ( i dz l 
If w, is a solution of L, [w] =0, then multiply- 
ing both sides of S,- T [wp] +n” w,=0 by Th, 
we find that T,S,(7,[w,]) +n (T Ew,])=0, 
that is, 7,[w,] is a solution of L,- [w] =0. 
Similarly, we see that S,,, [wp] is a solution of 
L+1[w]=9. In this sense, S, and T, are called, 
respectively, the step-up operator, or up-ladder, 
and the step-down operator, or down-ladder, 
with respect to the parameter n. 

The above relation constitutes a recurrence 

formula for Legendre functions (— Appendix 
A, Table 18.1]). 


C. Extensions of Hypergeometric Functions 
J. Thomas (1870) proposed the series 

fe ee + (Op) z", 

n=l ta rie (Bin 
(A), =AA+ 1)... A4+n—V) 


as an extension of the hypergeometric series. 
The sum of this series satisfies the hth-order 
differential equation 


d'w bw 
(I-z) t4 ==B 2) dt} 
h-2 
+{(A, Tea Rete +(A,—B,z)w=0, 


t=logz. 


When h=2 and f, =1, it reduces to the ordi- 
nary hypergeometric series. The notation 


pig, %25 ++ E Op; Bis B2 -0 Pa; 2) 
2 (A1). <- (Q p)n 
z”, 2 
EO a 


which is due to L. Pochhammer and modi- 
fied by E. W. Barnes, is used to denote the ex- 
tended hypergeometric series, and the function 
defined by (2) is often called Barnes’s extended 
hypergeometric function. For example, Gauss’s 
series in this notation is F; (a, $, y; z}- 

Corresponding to Barnes’s integral expres- 
sion for hypergeometric functions, it is known 
that the integral 


1 c+iœ 
W(z)=— 
@) =e 


where 


K(Q)=K(C+)) 


K(Q)H(Q)z = d, 
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T(E +a (E+)... T(E a) 
T(E+1+BDr(E+1+82)... 


is a solution of the hth-order differential equa- 
tion at the beginning of this section. The 
hypergeometric function expressed by the 
definite integral 


HE= 


x —1)’({— z) d¢ 
has an obvious formal extension 
le —a, (E — a3)? ... (C — am)” (E — zY dk. 


On the other hand, the equation 


(—1)"71—” 


(h+m—2)...(h+ 1h 


oP) pen 
m—v—1 


Q(z) = 


+(e?) Peal) 
m—y 


Po(z)=(z—a1)(z — 4p)... (Z — am), 


P (z)= rof E- Bs +4 + Ae), 


—ayz z—a 


called the Tissot-Pochhammer differential 
equation, has a solution 


vte)= | (mar Saat he, 


X (E — am)" (E ~ 2)" de. 


After Pochhammer (1870), this is sometimes 
called Pochhammer’s generalized hypergeo- 
metric function. 


As another extension of Gauss’s series, H. E. 


Heine (1846) introduced Heine’s series: 
C= 
(1—4) (1 —4°) 
(1-490 -4-4 =A 
(1—g)(1—4*)(1—4‘)(1—q°"") 
Setting g=1+e, z=(1/e)log x, and letting e0, 


we obtain Gauss’s series as the limit of Heine’s 
series. 


o(a, b,c; q;z)=1+ 


D. Hypergeometric Functions of Several 
Variables 


P. Appell (1880) formally extended Gauss’s 
series to the case of two variables and defined 


r++ B) 


ia ae 
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four kinds of functions [3]: 
F (06 p, BS yx, y) 


=> 52 (@)m+n(P)m rl Bn em yn 


n=0 m'n\(Y)mn+n 


F(a; B, B; Y» y5 X, y) 


kd 


— (a) m+n(P)m (Ba m.n 
=o a T 


F(a, a’; P, BS ys x, y) 
(2) n(n Bn Bn myn 


m!n!(Y)m>n 


> 


alo; By, 7's x, y) 
= = = (&)mtn(B)mtn min 
m=o n=0 MIA! (ValY’)n 


They are called Appell’s hypergeometric func- 
tions of two variables. Each satisfies a cor- 
responding system of partial differential 
equations: 


x(1—x)r+ yl —x)s + (7 —cx)p— Byq 


—afz=0, 
Fi sy(l—y)t+x(l—y)s+(y—c'x)q—p'xp 
—ap’z=0, 


(x—y)s— B'p + Bq=0, 
x(1—x)r—xys+(y—cx)p—Byq—aBz=0, 
F, t —y)t—xys + (y —c'y)q— p'xp 
—ap’z=0, 


x)r + ys+(y—cx)p—afz= 
y(l— yit +xs+(y—c"yjqg— a p’'z=0, 


x(1 —x)r—y*t—2xys+(y—cx)p—cyq 


—afz=0, 
Fy 
y(i —y)t—x?’r—2xys+ (y —cy)q— exp 
—afpz=0, 
where 
c=a+B+1, cC=atf't+l, c"=0'+P' +1, 
p=0z2/6x, q=0z/dy, r=072/dx’, 
s=0°z/ðxðy, t=6*z/dy?. 


Appell’s hypergeometric functions can also be 
represented by integrals: for example, 


rarer =i" 
 T(BMBITO—B-B) 


x (1—u ~v) EH] —ux— vy) *dudv, 


fe rro’) |, p-1 ppt 
27 u 
(AMBIT —- BV’ -p Jo 


x (1—u)? 811 —v)" TET —ux—vy) dudo, 
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r (y) oye opie 
pran n tt. B-18 -1 
; norane smli : 
x(l—u—v)" E E1 — ux) (1 — vy) ” dudv, 


where, for F, and F,, the domain of integra- 
tion is u> 0, v20, 1—u—v>0. E. Picard 
(1881) showed that F, can also be expressed 
by a single integral: 


) 1 
= roy) | day 
F&T x-y) Jo 


x(1— ux) f(1— uy) ® du. 


G. Lauricella (1893) extended the foregoing 
functions to the case of more than two vari- 
ables. More general hypergeometric series of 
several variables were defined by R. Mellin, 
J. Horn, and J. Kampé de Fériet [3,4]. Every 
algebraic equation can be solved analytically 
in terms of the foregoing functions (Mellin, 
R. Birkeland) [4]. Also there are studies con- 
cerning ‘Riemann’s problem and *automor- 
phic functions derived from F; (Picard; 

T. Terada [5]). 


E. Hypergeometric Functions with Matrix 
Argument 


For symmetric matrices Z of degree m, C. S. 
Herz defined hypergeometric functions with 
matrix argument as follows [5]: Denoting by 
etr Z the exponential exp(tr Z) of the ttrace of 
Z, let 


oFo(Z)=etr Z, 


pr Faltis Sp isos hei) 
1 

= etr(— A) Fă, -3 €p; By, -»- > Bas 

Ial) Ja>o i p Pi j 


AZ)(det A} PdA 1, dfa2...dmm (3) 
pFari (1s -0 Xp Bis s Bp nA) 


_ Ent”) oe | 
niye ReZ=X jae pP alas +++ %p; 
eh o” 


Bisons Bs AZ~')(det Z) dz, ,dZ32-.-dZmm> (4) 
where 


ASU, 


Wiisiacwe 

Z=((1 + 6;)24j/2);, jot... 

Tay =n” VATE (y— 1/2)... 
x P(y—(m—1)/2), 


and A >0 means that A is tpositive definite. 
The integral (3) converges for — Z >0 if Rey 
>(m—1)/2. If Rey is sufficiently large, then for 
suitably chosen Xo, (4) converges in a domain 
of the space of A and represents an analytic 
function of its argument. In particular, we 
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have 
1Fo(a; Z)=(det(E—Z)) *. 


Based on this definition, many special func- 
tions and formulas are extended to the case of 
a matrix argument. For example, 


A5(Z)= oF (6 +(m + 1)/2; —Z)/Tn(O + (m + 1)/2) 
(5) 


is an extension of the *Bessel function, and this 
reduces to 


(t/2) °Js(t) = Ao((t/2)”) 


when m= 1. Formula (5) is applied to the 
tnoncentral Wishart distribution in mathe- 
matical statistics. 
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A. Ideal Boundaries 


For a given Hausdorff space R, a tcompact 
Hausdorff space R* that contains R as its 
dense subspace is called a compactification of 
R, and A= R* — R ts called an ideal boundary 
of R. In the present article, we deal mainly 
with properties (in particular, function- 
theoretic properties) of ideal boundaries of 
*Riemann surfaces R. 


B. Harmonic Boundaries 


By R we mean an open Riemann surface. 
The set I’ of points p* in A such that 

lim infp,,+p* P(p)=0 for every tpotential P (i.e., 
a positive fsuperharmonic function P for 
which the class of nonnegative tharmonic 
functions smaller than P consists only of the 


constant function 0) is a compact subset of R*. 


The set T is called the harmonic boundary of R 
with respect to R*. For an arbitrary compact 
subset K in A—T, there exists a finite-valued 
potential P, with limg,,.,* Px(p)= œ (p* €K). 
From this, various kinds of tmaximum prin- 
ciples are derived. For instance, if u is a har- 
monic function bounded above for which 
lim sup,.ru(p)<M holds, then u< M on R. 
There are infinitely many compactifications 
of R. For two compactifications R¥ (i= 1, 2) 
of R, we say that R* is greater than R¥ or, 
equivalently, lies over R3, if the identity map- 
ping of R can be extended to a continuous 
mapping of R* onto R¥. In order that deep 
function-theoretic studies of R* can be carried 
out, various conditions must be imposed on 
R*. A compactification R* is said to be of 
Stoilow type (or of type S) if for every tcon- 
nected open subset G* in R* whose boundary 
in R* is contained in R, G* —A is also con- 
nected. Next suppose that R¢O, (— 367 Rie- 
mann Surfaces E). For a given real-valued 
function f on A, let UR?" (U#*’) be the class 
of tsuperharmonic functions s bounded from 
below (‘subharmonic functions s bounded 
from above) such that lim infp,,.,*5(p) > f(p*) 
(lim SUPrap>p* S(p) <f(p*)) for every p* eA. If 
these classes are nonempty, then Hf-*"(p)= 
inf{s(p)|seU?-®"} and HE-*"(p)=sup{s(p)|se 
We" are harmonic on R, and HÈ ®* < HRP 
In particular, if HF?" = H?-*", then the com- 
mon function is denoted by Hf-*", and the 
function f is said to be resolutive with respect 
to R*. A compactification such that every 
bounded continuous function on A is resolu- 
tive is called a resolutive compactification. In 
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such a case, a point p* in A is said to be regu- 
lar with respect to the fDirichlet problem if 
liM spp» HF?" (p)= f(p*) for every bounded 
continuous function f on A (— 120 Dirichlet 
Problem). The set A, of regular points in A 

is contained in T. If R* is a resolutive com- 
pactification, then there exists a unique posi- 
tive Borel measure 1, such that Hf-®"(p)= 
fa f(p*) du,(p*) for every bounded continu- 
ous function f on A. This measure is called 
the harmonic measure with respect to pe R. 
There exists a function P(p, p*) on R x A with 
dut,(p*) = P(p, p*) du,(p*) for an arbitrary fixed 
point o in R satisfying the following three 
conditions: (1) P(p, p*) is harmonic on Rasa 
function of p; (ii) P(p, p*) is Borel measurable 
as a function of p*; (iii) k(o, p) | < P(p, p*)< 
k(o, p), with the Harnack constant k(o, p) of 
{o, p} relative to R [9]. 


C. Compactifications Determined by Function 
Families 


A family F of real-valued continuous functions 
on R admitting infinite values is called a sep- 
arating family on R if there exists an f in F 
such that f(p)# f(q) for any pair of given 
distinct points p and q in R. A compactifica- 
tion R* is called an F-compactification, de- 
noted by R#, if every function in F can be 
continuously extended to R* and the family 
of extended functions again constitutes a sep- 
arating family on R*. The correspondence 

gy: FR} defines a single-valued mapping 

of all separating families F on R onto all F- 
compactifications of R. If F, > F,, then ọ(F,) 
lies over ọ(F,). For any R*, p~'(R*) contains 
infinitely many separating families, among 
which the separating families constituting 
tassociative algebras are important. The fol- 
lowing are typical examples of compactifica- 
tions determined by function families: 

(1) The Aleksandrov compactification is the 
U-compactification Rý with the family U of 
bounded continuous functions on R with 
compact support. It is the smallest compacti- 
fication of R and is often used in function 
theory in discussing Dirichlet problems for 
relatively noncompact subregions in reference 
to relative boundaries. 

(2) The Stone-Cech compactification is the 
€-compactification Rě with the family € of 
bounded continuous functions on R. It is the 
largest compactification of R. It is rarely used 
in function theory, but an application is found 
in the work of M. Nakai [8]. 

(3) The Kerékjarté-Stoilow compactification 
is the S-compactification RE with the family 
S of bounded continuous functions f on R 
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such that there exist compact sets K, with the 
property that the f are constants on each 
connected component of R—K,. This is the 
smallest compactifitcation of Stoilow type. 
Many applications of this compactification 
can be found in function theory, among which 
the investigation done by M. Ohtsuka on the 
Dirichlet problem and the theory of conformal 
mappings is typical. 

(4) The Royden compactification is the R- 
compactification R# with the family R of 
bounded C®% functions f on R with finite 
Dirichlet integrals ffp df ^ * df. It was intro- 
duced by H. L. Royden and developed further 
by S. Mori, M. Ota, Y. Kusunoki, Nakai, and 
others. This compactification has been used 
effectively in the study of HD-functions and 
the classification problem of Riemann surfaces 
(— 367 Riemann Surfaces). 

(5) The Wiener compactification is the W- 
compactification R}, with the family YB of 
bounded continuous functions f on R such 
that {Hf} converges to a unique harmonic 
function independent of the choice of exhaus- 
tions {G,} of an arbitrary fixed subregion G¢ 
Og, where the G, are relatively compact sub- 
regions of G. It is the largest resolutive com- 
pactification, and compactifications smaller 
than Rẹ are always resolutive. This compac- 
tification was introduced independently by 
Mori, K. Hayashi, Kusunoki, and C. Constan- 
tinescu and A. Cornea and is useful for the 
study of HB-functions and the classification of 
Riemann surfaces. 

(6) The Martin compactification is the W- 
compactification R}, with the family W of 
bounded continuous functions f on R such 
that there exist relatively compact regions 
R; with the property that f= HA PRORA Rs 
Hf Rr Ra ®s on R—R,. Here f* coincides 
with f on R; and equals 0 on R3—R, and 1* 
is similarly defined. The set R&,— R is called 
the Martin boundary of R. If fGreen’s func- 
tion g exists on R, then the function m(p, g)= 
g(p, q)/g(o, q) for an arbitrary fixed oe R can 
be extended continuously to R x R&, which ~ 
is called the Martin kernel. By the metric 
dy (4,7) =SUPper, MCP, q)/(1 + m(p, g)) —m(p,r)/ 
(1+m(p,r))| with a parametric disk Rp in R, 
Rj, is tmetrizable. This compactification was 
introduced by R. S. Martin, and many appli- 
cations of it to the study of H P-functions, 
potential theory, Markov chains, and cluster 
sets were obtained by M. H. Heins, Z. Kura- 
mochi, J. L. Doob, Constantinescu and Cor- 
nea, and others. 

(7) For a function f on R, (R)df/én=0 
means that there exists a relatively compact 
subregion R, such that f is of class C” on R 
outside R, and the Dirichlet integral of f over 
R— R; is not greater than those of functions 
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on R—R, that coincide with f on the bound- 
ary of R,. The Kuramochi compactification is 
the R-compactification Rë with the family & of 
bounded continuous functions f on R satisfy- 
ing (R)ôf/ôn =0. The continuous function 
k(p,q) on R such that (R)¢k/én =0 vanishes in 
a fixed parametric disk Ry in R and is har- 
monic in R — R, except for a positive tlogarith- 
mic singularity at a point g can be extended 
continuously to R x R&, which is called the 
Kuramochi kernel. By the use of this kernel, 

R§ is metrizable, as in the case of Martin 
compactification. This compactification was 
introduced by Kuramochi, and its important 
applications to the study of HD-functions, 
potential theory, and cluster sets were made 
by Kuramochi, Constantinescu and Cornea, 
and others. 

Among compactifications (1)-(7), no bound- 
ary point in (2), (4), or (5) satisfies the tfirst 
countability axiom (hence they are not metriz- 
able), while the others are all metrizable. In (4) 
and (5), A, =T. Fig. 1 shows the relationship 
among the seven examples. Here 4— B means 
that A lies over B, and A#B means that in 
general neither A> B nor B>A. 


por 
RRS AX SRR 
RRA 


Fig. 1 


D. Remarks 


In contrast to topological compactifications 
(1)-(3), (4)-(7) can be regarded as potential- 
theoretic and have enough ideal boundary 
points so that one can solve the Dirichlet 
problem and introduce various measures 
there. Utilizing Green’s functions, R. S. Martin 
[6] deduced the first important compactifi- 
cation and gave the integral representation of 
positive harmonic functions (the extension of 
the tPoisson integral). Z. Kuramochi [3] ob- 
tained his compactification similarly by using 
N-Green functions introduced by himself 
instead of the usual Green’s functions. In this 
compactification, the ideal boundary points 
can be considered to be interior points of the 
surfaces in a potential-theoretic sense. In case 
of finitely connected domains with smooth 
boundaries, both the Martin and Kura- 
mochi boundaries coincide with the usual 
boundaries. 

The Royden and Wiener compactifications 
were introduced as the maximal ideal spaces 
of respective function algebras. Their ideal 
boundaries contain extremely many points. 
However, these compactifications have elegant 
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properties and many applications. An analytic 
mapping ọ of a Riemann surface R into an- 
other R’ is called a Dirichlet (Fatou) mapping 
if ọ can be extended to a continuous map- 
ping of R&(Rẹ) to Rot (R). For example, a 
Lindelofian mapping (AD-function) is a Fatou 
(Dirichlet) mapping. The Dirichlet and Fatou 
mappings were investigated by Constanti- 
nescu, Cornea, and others. 

By using the Martin boundary, Z. Kuramo- 
chi and M. Nakai proved the extension of the 
Evans-Selberg theorem to parabolic Riemann 
surfaces [3,8]. The normal derivatives of HD- 
functions on the ideal boundaries and their 
applications were studied by Constantinescu, 
Cornea, F. Maeda, Y. Kusunoki, and others. 
The theory of compactification can be gen- 
eralized to domains in R”, tGreen spaces, and 
harmonic spaces. 
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A. General Remarks 


Historically, a function y of x was called an 
implicit function of x if there was given a 
functional relation f(x, y)=0 between x and y, 
but no explicit representation of y in terms of 
x(— 165 Functions). Nowadays, however, 
the notion of implicit function is rigorously 
defined as follows: Suppose that a function 
SXi -Xp Y) is of tclass C' in a domain G in 
the real (n+ 1)-dimensional Euclidean space 
R”*! and that f(x}, ..., x8, y°)=0, dX aes 
x?, y°) 40 at a point (x9, ..., x8}, y?) in G. Then 
there is a unique function g(x,,...,x,) of class 
C! in a neighborhood of the point (x9, ..., x? 
that satisfies f(x,, ...,X_,9(X1,---5X,)} =, 

y? =g(x°,...,x°) (implicit function theorem). 
The function g is called the implicit function 
determined by f =0. The partial derivatives of 
g are given by the relation 


dg/Oxj= —(ðflðx Nfld), 


where y=g(x,,...,X,). If the function f is of 
class C” (1 <r < œ or r=q), then the function 
g is also of class C”. In particular, when n= 1, 
letting x, be x, we have dg/dx = —f,/f,. 


B. Jacobian Matrices and Jacobian 
Determinants 


A mapping u from a domain G in R” into R” 


u(x) SAM sce Xn) de sth gl M92 eb 


KK assy kale 


is called a mapping of class C” if each compo- 
nent u,,...,U,, is of class C (O<r<o@ orr=w) 
in G. Given a mapping u of class C! from G 
into R”, we consider the following matrix, 
which gives rise to the differential du, of the 
mapping u (— 105 Differentiable Manifolds I): 


6(u)/O(x) =(0u,/OX,)1 cjcmiaick<n (1) 


This matrix is called the Jacobian matrix of 
the mapping u at x. If there is another map- 
ping v of class C’ from a domain containing 
the trange U of u into R', then we have the 
law of composition: 


(0(v)/E(u)) (O(u)/6(x)) = 6(v)/0(%). 


When n =m, the fdeterminant of the matrix (1) 
is called the Jacobian determinant (or simply 
Jacobian), and is denoted by D(u)/ D(x}, 


D(u,,...,U,)/D(xX,,...,X,) OF 
D(uy,...,U,) 
D3 


803 


Sometimes the notation ĉ is used instead of D; 
but in the present article we distinguish the 
matrix from the determinant, using ô for the 
matrix and D for the determinant. 

If m=n and D(u)/D(x) never vanishes at any 
point of the domain G, then u ts called a regular 
(or nonsingular) mapping of class C'. If the 
Jacobian D(u)/D(x) is 0 at x, we say that u is 
singular at x. A mapping that is singular at 
every point in a set SG is said to be degen- 
erate on S. For a regular mapping u, the sign 
of the Jacobian is constant in a connected 
domain G. If it is positive, the mapping u 
preserves the orientation of the coordinate 
system at each point in G, while if it is nega- 
tive, the mapping changes the orientation. A 
point where u is degenerate is called a critical 
point of the mapping u, and its image under u 
is called a critical value. In general, the image 
of the mapping is “folded” along the set of 
critical points. The set of critical values of a 
mapping u of class C! (sending a domain in R” 
into R”) is of Lebesgue measure 0 in R” (Sard’s 
theorem). If u is a regular mapping, then each 
point in the domain G of u has a neighbor- 
hood V such that the restriction of u on V is a 
‘topological mapping. Its inverse mapping x(u) 
is also a regular mapping of class C! and 
satisfies the relation 


D(u) D(x) _ 
D(x) Dlu) 


(inverse mapping theorem). If u is of class C” 
(!<r<cc orr=o), then so is its inverse 


mapping. 


C. Functional Relations 


A function F(u,,...,u,) defined on a domain B 
in R” is called a function with scattered zeros if 
F has a zero point (i.e., there exists a point u 
for which F(u)=0) and if every open subset of 
B contains a point u such that F(u) 40. Every 
tanalytic function #0 has scattered zeros. Let 
u(x) be a mapping from a domain G in R” into 
BCR". Suppose that there exists a function 
F(u) defined in B, of class C", with scattered 
zeros. If F(u(x))=0 for every x in G, then we 
say that the components u,,...,u, of the 
mapping u have a functional relation of class 
C” or are functionally dependent of class C”. In 
such a case, we sometimes say simply that 
u,,...,U, are functionally dependent or that 
they have a functional relation. If the compo- 
nents u,,...,u, of a mapping u of class C! are 
functionally dependent of class C°, then the 
Jacobian D(u)/D(x) of u must vanish. Con- 
versely, if the Jacobian D(u)/D(x) of a mapping 
u of class C! is identically 0 in the domain G, 
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the components u,,...,u, are functionally de- 
pendent of class C” on every compact set in G 
(Knopp-Schmidt theorem) [1]. 


D. Implicit Functions Determined by Systems 
of Functions 


Suppose that the trank of the Jacobian matrix 
of (1) is r<m everywhere in G. Suppose that 
u(x) is a mapping of class C! from a domain 
G in R” into R”, the Jacobian determinant 
D(u,,...,u,)/D(x,,...,x,) never vanishes in G, 
and further D(u,,...,u,,u,)/D(x,, ...,X,,Xq) Is 
identically 0 in G for each p, o with r <p <m, 
r<o <n. Then the values u, ,, (x), ...,u,,(x) are 
determined by the values u(x), ...,u,(x), and 
each u, is represented as a function of class C! 
OF uj, ..., Up. 

Let u(x) be a mapping of class C! from a 
domain G in R” into R" and V the tinverse 
image of a point u°. To study the properties of 
the set V, we assume, for simplicity, that u? is 
the origin. Suppose that the rank of the matrix 
0(u)/C(x) is r for every point x in G and each 
Of u,.1,---,U IS functionally dependent on 
Uis, u,. Then each u, (r <p <m) is a func- 
tion u,(u,,...,u,) Of u,,...,u,. The set V is 
empty if there is a p such that u,(0,...,0) 40. 
On the other hand, if u,(0,...,0)=0 for all p 
(r<p<m), then V is the set of common zero 
points of the functions u; (x), ...,u,(x). There- 
fore, to study the set V, we can assume that r= 
mn. If m=n, V consists of isolated points 
only. If m<n, then, changing the order of the 
variables x,,...,X, if necessary, we can assume 
that D(u,,...,U,,)/D(x,,.-.,X,) #0 at a point 
(x°) in V. In this case, there is a unique func- 
tion E,(Xm415+++,X,) of class C! (1<p<m)ina 
neighborhood of (x?) satisfying the following 
two conditions: (i) x? =€,(x),41,-.., x2); (ii) if 
the point (x,,41,---,X,) is in a neighborhood of 
(x941 ---, X9), then the point 


(Ei (Xm+1> nag Xa) SARS Gl Mais ares Xn), 


Xm+1> -9 BEV. 
Each function ¢,, is called an implicit function 
Of X45 +++, Xp determined by the relations 
u,=...=u,,=0. The ttotal derivatives of the 

¢, are determined from the system of linear 
equations 


Ou, n Ou; 
X —dé,+ Yo —*dx,=0, j=1,...,m 
k=1 OX, l=mt+1 0X} 


The foregoing implicit function theorem is a 
local one. Among the global implicit function 
theorems, the following one, due to Hadamard, 
is useful: Let x> y(x) be a mapping of class C! 
from R” into R” such that the inverse ®~!(x) 


208 E 
Implicit Functions 


of its Jacobian matrix ®(x) is bounded in R”; 
then the mapping is a tdiffeomorphism of class 
C! from R” onto R”. 

The implicit function theorem holds also in 
complex spaces. Let f(X1, -3 Xans Yis -+> Yph 
1 <i<p, be a system of tholomorphic functions 
in a domain in the complex (n+ p)-dimensional 
space C"*?. If (i) f(x}, ..., x9, V9... Y= 
0, 1 <i<p, and (ii) D( fis -s fp/D(Y1> -5 
Vox, y)=(x9, yo) #9, then there exists a unique 
holomorphic solution y;=y,(x,,...,X,) (1 < 
i<p) ina neighborhood of the point x° = 
(x?,..., xe) that satisfies f;i(x1, <<, Xps Y1 (X1; 

Les Xy) aera YplXis 3 X= OLLS TS p), 


E. Linear Relations 


Suppose that u(x) is a mapping of class C”™? 
from R! into R”. [ts components (uy, ..., Um) 
are functions of class C”~'. Then the 
determinant 


gem yP- yr) 
is called the Wronskian determinant (or simply 
the Wronskian) of the functions u,,...,u,, and 
is denoted by W(u,,u5,...,u,,). If the func- 
tions u,,...,U, are flinearly dependent, i.e., if 
there exist constants c; not all zero satisfying 
È; Cju(x)=0 identically, then the Wronskian 
vanishes identically. Therefore, if W(u,,...,u,) 
#0, then the functions u,,...,u,, are linearly 
independent. Conversely, if W(u,,...,4,,)=90 
identically, and further if there is at least one 
nonvanishing Wronskian for u,,..., 4-1, Uj41, 
...,U, (1 <ix<m), then the functions u,,...,u,, 
are linearly dependent. The necessity of the 
additional condition is shown by the follow- 
ing example: u, = x? and u, =|x|? are of class 
C! in the interval [ —1, 1] and linearly inde- 
pendent, but they satisfy W(x?,|x|*)=0 iden- 
tically. However, the additional condition is 
unnecessary if the functions u,,...,u,, are 
analytic. Similar theorems are valid in a do- 
main in a complex plane. 

Furthermore, if u(x) is a continuous map- 
ping from an interval [a,b] in R! into R”, the 
determinant 


(1,1)... (tym) 
G(uy, -s Um) = (2,1) (2, m) , 


(m, 1) (m, m)| . 


b 
(j,k) -| u,(x)U,(x) dx 


is called the Gramian determinant (or simply 
Gramian). The Gramian is the fdiscriminant of 
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the quadratic form 


| (È em) dx 


of €,,...,€,, and is equal to 


b b 
a | 7 | (det(u,(x,)))? dx, ... dX m: 
m! Ja a 


We always have G(u,,...,u,,) 20, and G(u,, 
...,U,,)=90 if and only if u,,...,u,, are linearly 
dependent. The Gramian is defined if the 
functions u,,...,u,, are tsquare integrable 

in the sense of Lebesgue. In that case, the 
condition G(u,, ...,u,,)=9 holds if and only if 
U,,--.,U, are linearly dependent talmost every- 
where, i.e., there are constants ¢,,...,¢,, not all 
zero such that the relation c,u,(x)+...+ 
CmUm(X)=0 holds except on a set of Lebesgue 
measure 0. 
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India was one of the earliest civilizations, but 
because it has no precise chronological record 
of ancient times, it is said to possess no his- 
tory. Indian mathematics seems to have de- 
veloped under the influence of the cult of 
Brahma, as did the calendar. It may also have 
some relation to the mathematics of the Near 
East and China, but this ts difficult to trace. 
The word ganita (computation) appears in 
early religious writings; after the beginning of 
the Christian Era, it was classified into pati- 
ganita (arithmetic), bija-ganita (algebra), and 
ksetra-ganita (geometry), thus showing some 
systematization. The Buddhists (notably Na- 
garjuna) had a kind of logic, but it had no re- 


805 


lation to mathematics. Unlike the Greeks, the 
Indians had no demonstrational geometry, 
but they had symbolic algebra and a position 
system of numeration. 

Indian geometry was computational; Arya- 
bhata (c. 476—c. 550) computed the value of z 
as 3.1416; Brahmagupta (598—c. 660) had a 
formula to compute the area of quadrangles 
inscribed in a circle; and Bhaskara (1114- 
1185) gave a proof of the Pythagorean theo- 
rem. In trigonometry, Aryabhata made a 
table of sines of angles between 0° and 90° 
for every 3.75° interval. The name “sine” is 
related to the Sanskrit jya, which referred to 
half of the chord of the double arc. 

The Indians had a remarkable system of 
algebra. At the beginning they had no oper- 
ational symbols and described in words the 
rules for solving equations. Brahmagupta 
worked on the *Pell equation ax? +1 = y’. 
Bhaskara knew that a quadratic equation can 
have two roots that can be positive and nega- 
tive, but did not assign any meaning to the 
negative root in such cases. Bhaskara also 
introduced algebraic symbols. 

The symbol 0 was used in India from about 
200 B.c. to denote the void place in the posi- 
tion system of numeration; 0 as a number is 
found in a book by Bakhshali published in the 
3rd century A.D. The number 0 is defined as a 
—a=0O in our notation, and the rules a +0= 
a,0xa=0, /0=0, 0+a=0 are mentioned. 
Brahmagupta prohibited division by 0 in 
arithmetic, but in algebra he called the “quan- 
tity” a+0 taccheda. Bhaskara called it khahara 
and made it play a role similar to that of our 
infinity. 

Some historians assert that the Indians had 
the ideas of infinity and infinitesimal. Some 
hold that the Indian position system of numer- 
ation arose from the circumstance that the 
names of numbers differed according to their 
positions. The Indian numeration system was 
exported to Europe through Arabia, and 
had great influence on the development of 
mathematics. 
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Inductive Limits and 
Projective Limits 


A. General Remarks 


Inductive and projective limits can be de- 
fined over any preordered set J and in any 
‘category. We first explain the definition of 
these limits in the special case where J is a 
tdirected set and the category is that of sets, of 
groups, or of topological spaces. The simplest 
case is when / is the ordered set N of the 
natural numbers. 


B. The Limit of Sets 


Let J be a directed set. Suppose that we are 
given a set X; for each ie I and a mapping 
(ji: X; > X; for each pair (i,j) of elements of I 
with i<j, such that g; = 1x, (the identity map- 
ping on X;) and Pi = Pu O P; (i<j <k). Then 
we denote the system by (X;, @;;) and call it 
an inductive system (or direct system) of sets 
over I. Let S be the tdirect sum H ;X; of the 
sets X; (ie J), and define an equivalence re- 
lation in S as follows: xe X; and ye X; are 
equivalent if and only if there exists a ke] 
such that i<k,j<k, g,:(x)=¢,,(y). Let D be 
the tquotient set of S by this equivalence rela- 
tion, and let f,;:X;-D (ie I) be the canonical 
mappings. Then we have I(1) fo g= fi (i<j); 
I(2) for any set X, and for any system of map- 
pings g;:X;>-X (ie I) satisfying gjo P; =g; 
(i<j), there exists a unique mapping f:D>X 
such that fo f;=g; (ie J). We call (D, f;) the 
inductive limit (or direct limit) of the inductive 
system (X;, @;;) over I, and denote it by lim X; 
or ind lim X; (more precisely, by liM;er X; or 
ind lim,.; X;). 

Suppose, dually, that we are given a set X; 
for each ie J and a mapping y;:X;>X; for 
each i<j, such that y; = 1y, and Yir = Yio Wy 
(i<j<k). Then we denote the system by (X;, 
ýy) and call it a projective system (or inverse 
system) of sets over 7. Let P be the subset 
of the Cartesian product [LX; defined by P= 
{œl pyx )=x; (i <j)}, and let p;: PX; be 
the canonical mappings. Then we have P(1) 
Yop; = P; (i<j); P(2) for any set X, and for 
any system of mappings q;: X >X; satisfying 
W;04;4; (i<j), there exists a unique mapping 
p:X—P such that p;o p=q; (ie). We call 
(P, p;) the projective limit (or inverse limit) of 
the projective system (X;, ,;) over I and de- 
note it by lim X; or proj lim X;. 
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Note that we may replace I by any coftnal 
subset of J without changing the limits. 


C. The Limit of Groups and of Topological 
Spaces 


If, in the notation of Section B, X; is a group 
and ¢;; (Yy) is a homomorphism, then we say 
that (X;, p) ((X;, W;)) is an inductive (projective) 
system of groups. The inductive limit (as a set) 
D =lim X; has the structure of a group for 
which the canonical mappings f; are homo- 
morphisms. With this group structure, D is 
called the inductive limit (group) of the induc- 
tive system of groups. It satisfies properties 
I(1) and [(2) with group X and homomor- 
phisms g; and f. Similarly, the projective limit 
(as a set) P= lim X; has a unique group struc- 
ture such that each p;: PX; is a homomor- 
phism, namely, that of a subgroup of the direct 
product group []X;. The group P is called the 
projective limit (group) of the projective system 
of groups. When each X; is a module over a 
fixed ring A, we get entirely similar results by 
considering A-homomorphisms instead of 
group homomorphisms. 

Next, let X; be a topological space and @;; 
and w;; be continuous mappings. Then (X,, ~;;) 
((X;, Y;)) is called an inductive (projective) 
system of topological spaces. If we introduce in 
D =lim X, the topology of a quotient space of 
the ttopological direct sum of the spaces X; 
(ie 1), then the f; are continuous, and I(1) and 
1(2) hold with sets and mappings replaced by 
topological spaces and continuous mappings. 
Similarly, if we view P=lim X; as a subspace 
of the tproduct space HX, then the p; are 
continuous and P(1) and P(2) hold with the 
same modification as before. The spaces D and 
P are called the inductive limit (space) and the 
projective limit (space) of the system of topo- 
logical spaces, respectively. The projective 
limit of Hausdorff (compact) spaces is also 
Hausdorff (compact). 

Furthermore, if the X; (ie I) form a topolog- 
ical group and @;;, Yj are continuous hom- 
omorphisms, then lim X; and lim X; are topo- 
logical groups, and Properties 10), I(2)}, P(1), 
and P(2) are satisfied for topological groups 
and continuous homomorphisms (— 423 
Topological Groups). In particular, projec- 
tive limits of finite groups are ttotally discon- 
nected compact groups and are called profinite 
groups; they occur, e.g., as the ring of *p-adic 
integers and as the tGalois group of an infinite 
tGalois extension. Conversely, the tgerms of 
continuous functions at a point x in a topolog- 
ical space X, and other kinds of germs (— 383 
Sheaves), are important examples of inductive 
limits of groups. 


806 


D. Limits in a Category 


Let I be a preordered set and © a category. If 
we are given an object X; of a category @ for 
each iel and a tmorphism g4: X; >X; of € 

for each pair (i,j) of elements of J with i<j, 
and if the conditions u= lx, Qi = Pry Pi 
(i<j <k) are satisfied, then we call the system 
(X;, @;;) an inductive system over / in the cate- 
gory G. A projective system in @ is defined 
dually: It is an inductive system in the ‘dual 
category 6°. If we view I as a category (— 52 
Categories and Functors B), then an tnductive 
(projective) system in @ over the index set 7 is 
a tcovariant (tcontravariant) functor from 7 to 
€. Now if an object Deg and morphisms 
J: X; 0D (ie I) satisfy conditions I(1) and [(2) 
with the modification that X is an object and 
gi, f are morphisms in @, then the system 
(D, fi) is called the inductive limit of (X;, g;,) and 
is denoted by lim X;,. Similarly, if an object 
Pe@ and morphisms p;: P—>X; (ie I) satisfy 
P(1) and P(2) with a similar modification, then 
(P, p;) is called the projective limit of (X;, y;;) 
and is written lim X;. By I(2) and P(2), these 
limits are unique if they exist. 

In the categories of sets, of groups, of 
modules, and of topological spaces, inductive 
and projective limits always exist. Note that if 
the ordering of I is such that i<j implies i=j, 
i.e., if there is no ordering between two distinct 
elements of J, then the inductive (projective) 
limit is the direct sum (fdirect product) (— 52 
Categories and Functors E). 

Let (X;, p;i) (X;, pq) be two inductive sys- 
tems over the same index set J, and let 9;: 
X;— X; (ie 1) be morphisms satisfying p}; 0 @; 
= 9,0 P; (i<j). Then the system (;) is called 
a morphism between the inductive systems. 
Such a morphism is a fnatural transformation 
between the inductive systems viewed as func- 
tors 1+. If lim X; and lim X; exist, then (@,) 
induces a morphism lim pi: lim X,>lim X; in 
a natural way, and similarly for projective 
limits. 

For the more abstract notion of limit of a 
functor — [5]. For the theory of procategories 


— [4]. 
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211 (X.3) 
Inequalities 


A. General Remarks 


In this article we consider various properties 
of inequalities between real numbers. An in- 
equality that holds for every real number (e.g., 
x? >0) is called an absolute inequality; other- 
wise it is called a conditional inequality. When 
we are given a conditional inequality (e.g., 
x(x — 1)<0), the set of all real numbers that 
satisfy it is called the solution of the inequality. 
The process of obtaining a solution is called 
solving the conditional inequality. 


B. Solution of a Conditional Inequality 


Suppose that a conditional inequality is given 
by f(x)>0 (or f(x) >0), where f is a continu- 
ous function defined for every real number. 

If the equation f(x)=0 has no solution, then 
we have either f(x)>0 or f(x)<0 for all x. 
On the other hand, if « and £$ (x< f) are adja- 
cent roots of the equation f(x)=0, the sign 

of f(x) is unchanged in the open interval (a, f). 
Therefore the solution of the given inequality 
depends essentially on the solution of the 
equation f(x)=0. If inequalities involve two 
variables x, y and are given by f(x, y)>0, 

g(x, y)>0 for continuous functions f and g, 
the solution is, in general, a domain in the xy- 
plane bounded by the curves f(x, y)=0 and 
g(x, y)=0. Similar results hold for the case of 


inequalities involving more than two variables. 


C. Famous Absolute Inequalities 


(1) Inequalities concerning means (or averages): 


Suppose that we are given an n-tuple a= 


(a,,...,@,), 4,20. We set 
1 n 1/r 

M,=Ma)=("¥ a) | 
Ny=1 


If at least one a, is 0 and r<0, we put M,=0. 
In particular, we put 


A=M,, 
n ljn 
G=lim m-(Å a) , 
P20 v=1 
H=M_; 
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these are called the arithmetic mean, geometric 
mean, and harmonic mean of a, (v= 1, ...,n), 
respectively. Except when either all a, are 
identical or some a, ts 0 and r <0, the func- 
tion M, increases ‘strictly monotonically as r 
increases, and M,>mina, (r> —0o), M,> 
max a, (r> +00). Therefore we always have 
mina,<M,<maxa,. In particular, we have 
H<G<A ifthe a, are all positive and not 
all equal. 

Let p(x) (> 0), f(x) (20) be fintegrable func- 
tions on a measurable set E. We put 


l/r 
min-(| wax] | pdx) , rÆ0. 


Furthermore, if M,(f) is strictly positive for 
some r>0, we put 


Mo(f)= lim M,(f) 


=exo( | plog sax) | pdx), 
E E 


We call M,(/) the mean of degree r of the 
function f(x) with respect to the weight func- 
tion p(x). It has properties similar to those of 
M,(a). In particular, when the weight function 
p=1, the means M,(f), Moa( J), M_,(/) are 
called the arithmetic mean, geometric mean, 
and harmonic mean of f, respectively. 

(2) The Hélder inequality: Suppose that 
p#0, 1 and (p—1)(q—1)=1; that is, 1/p + 
1/q=1, and a,>0, b, >0. Then, in general, 
we have the Hélder inequality: 


1/p 1/4 
Janeza) (ze) — 


where the inequality signs in the first inequal- 
ity are taken in accordance with p<1 or p>1. 
The summation may be infinite if the sums 
are convergent. The inequality sign is replaced 
by the equality sign if and only if there exist 
constant factors À and u such that Aa? = ub? 
for all v. The Hölder inequality for p=q=2 
is called the Cauchy inequality (or Cauchy- 
Schwarz inequality). 

For two measurable positive functions f(x), 
g(x), we have the Hélder integral inequality: 


1/p 1/4 
| naz, f prax) (| grax) >, PSl, 
E E E 


except when there exist two constant factors 4 
and u ((A, u) 4(0,0)) such that Af? = yg? holds 
‘almost everywhere. The above inequality is 
replaced by equality if and only if we are in 
this exceptional case. The case where p=q=2 
is called the Schwarz inequality (or Bunyakov- 
skii inequality). 

(3) The Minkowski inequality: Suppose that 
p#0, 1 and a,>0, b,>0. Then we have the 
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Minkowski inequality: 
1/p 1/p i/p 
(Zarar) 2(za) z)". 


except when {a,} and {b,} are proportional. 
The inequality is replaced by equality if and 
only if we are in the exceptional case. 

The corresponding ee inequality for 
positive functions f(x), g 


(veer) "al ra 


grax) > psi, 
E 


psi, 


except when f(x)/g(x) is constant almost every- 
where. The inequality is replaced by equality if 
and only if we are in the exceptional case. 


D. Related Topics 


Absolute inequalities are important in anal- 
ysis, especially in connection with techniques 
to prove convergence or for error estimates. 
However, there seldom are general principles 
for deriving such inequalities, except for a few 
elementary theorems. 

For other famous inequalities — Appendix 
A, Table 8. For related topics — 88 Convex 
Analysis; for convex functions and their ap- 
plications — 255 Linear Programming; for 
linear inequalities — 212 Inequalities in 
Physics. 
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212 (XX.36) 
Inequalities in Physics 


A. Correlation Inequalities 


Let be a probability measure on a space X 
(with o-field 2) and f; be measurable functions 
on X, and write 


Sd = E 


A number of inequalities among such expec- 
tations under a variety of conditions on the 
measure y and functions f; are known as cor- 
relation inequalities, after their original occur- 
rence for correlation functions in the statistical 
mechanics of lattice gases. 

The earliest results were obtained by Grif- 
fiths (J. Math. Phys., 8 (1967)), for the case 
where X is the product []}., I; of two point 
sets J,={1, —1}, the f are kth coordinate func- 
tions o, =9,(x) (= +1) of xe X, du(x) is the 
counting measure multiplied by a Gibbs factor 
Z'e7F# with a ferromagnetic Hamiltonian 


fax) du(x). 


H(x)= Dial a(x)o(x), J 


i<j 


=J; 20, 


ij 
P20, and the normalization factor Z = 
£e’, His conclusions are 
<6,6,>20 (Griffiths’s first inequality), 
{DF 10m On) > LOKT) Fm Fn > 


(Griffiths’s second inequality). 


These were extended by Kelly and Sherman 
(J. Math. Phys., 9 (1968)) as 


<o4> =0 
(O49g) > (04> Og» 


(GKS first inequality), 


(GKS second inequality), 


where A and B are subsets of {1,..., N}, o4= 


Hies.g;,and H= —Y,-yJ,o, with J,>0.A 
further generalization (for example, J;=R) can 
be found in [1,2]. 
Under the same situation with 
S595) -Ý har J20, h;,>0, 
i<j =1 


the following GHS inequality by Griffiths, 
Hurst, and Sherman (J. Math. Phys., 11 (1970)) 
holds: 


LOOO? — (5) LOOK) — (Gj LOO) 
— (Ox) (G5; + 2G; (Gj) (Oj <O. 
If h,=0 for all i, then 
(6,6040,> — (6,0, L0; — (0,0, (o,0,) 
— 0401) <0,5,> +20:01) (60,) (o,0,> <0. 


809 


Further generalizations can be found in [2, 3], 
and the references quoted therein. 

Let X be a finite distributive lattice, u be 
a positive measure satisfying the condition 
u(x ay)u(x vy) > u(x) u(y), and f and g be both 
increasing or both decreasing functions on X. 
Then the following FKG inequality by For- 
tuin, Kasteleyn, and Ginibre (Comm. Math. 
Phys., 22 (1971)) holds: 


<fa ><? <9- 


B. Inequalities Involving Traces of Matrices 


` Let tr A denote the trace of a matrix A. Some 
of the earlier results applied to statistical 
mechanics are as follows, where p 20, c >0, 
A* = A, B* =B: 


tr(plogp—ploga—p+oa)20 
(Klein mequality), 
tr(e4**)/tre4 >exp(tre4B/tre%) 
(Peierls-Bogolyubov inequality), 
tr(e4e) > tr(e4t) 
(Golden-Thompson inequality). 


The following inequality is related to the last 
inequality and was proved for a general m>0 
by Lieb and Thirring (Studies in Mathematical 
Physics, Lieb, Simon, and Wightman (eds.), 
Princeton Univ. Press, 1976): 


tr(pa)"<tr(p"o™), p>0, o>0. 


For the Hilbert-Schmidt norm ||A|jy5 = 
(tr A*A)"? and the trace-class norm |A |lu = 
tr|A|, where |A|=(A* A)", the Powers-Stormer 
inequality (Comm. Math. Phys., 16 (1970)) 
holds: 


lo? 97 lr >l- olas) p20, o20. 


Araki and Yamagami (Comm. Math. Phys., 81 
(1981)) give 


ICI- IDI as. </2I'C —Dllns.- 


If C* =C and D* =D, then 
removed. 

The entropy S(p)= —trplogp is a concave 
function of p > 0 satisfying the triangular 
inequality (Araki and Lieb, Comm. Math. 
Phys., 18 (1970)): 


S(p1) + S(p2) > S(p1 2) 2 18(p1) — S(p2)| 


and strong subadditivity (Lieb and Ruskai, J. 
Math. Phys., 14 (1973)): 


S(P123)—S(P12)—S(P13) + S(p1) <9, 


where p;23 is a matrix on the tensor prod- 
uct space H, © H, Q Hz, py =ttiP123 Pi= 

tr tr, Py23 ({i,,k} = (1, 2, 3}) and tr; is a 
(partial) trace taken only on the space H;. The 


2 can be 
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last inequality is based on the concavity of 
the function f(p)=trexp(A + log p) in p>0 
(A* = A) proved by Lieb [4] (also see Epstein, 
Comm. Math. Phys., 31 (1973)), who also 
proved the joint concavity of tr(C*p*Co’) 
(r>0,s>0,r+s<1)in p>0 and c >20. This 
concavity for the case r=s=1/2 was previ- 
ously proved by Wigner and Yanase (Proc. 
Nat. Acad. Sci. US, 49 (1963)), and the general 
case has been conjectured by Wigner, Yanase, 
and Dyson. It leads to the joint concavity of 
the relative entropy S(p,a)=tr p(log p —loga) 
(defined to be +œ if ov =0 and py £0 for 
some vector w) in p20 and o >0 as well as its 
monotonicity S(a(p), afo) < S(p, 0) for any 
trace-preserving expectation mapping «. (For 
entropy, see also [5].) 

The above results have generalizations in 
the context of von Neumann algebras (Araki, 
Publ. Res. Inst. Math. Sci., 11 (1975); 13 (1977); 
Uhlmann, Comm. Math. Phys., 54 (1977); [6]). 


C. Operator Monotone and Operator Convex 
Functions 


A real-valued function f(x) defined on an 
interval J (finite or infinite; open, half-open, or 
closed) is called matrix monotone increasing 
(decreasing) of order m if f(A)> f(B) whenever 
m xm Hermitian matrices A and B with their 
eigenvalues contained in J satisfy A> B (A < B) 
and is called operator monotone if it is matrix 
monotone of order n for all positive integers 
n. f(x) is called matrix convex of order m if 
f[G—HA+tB] <(1—t)f(A)+¢(B) for 0<t< 
1 and all m x m Hermitian matrices A and B 
with their eigenvalues in J and is called opera- 
tor convex if it is matrix convex of order n 

for any positive integer n. The functions x* 
(0<a<1), logx, and —(x+a)™' (a>0) are 

all operator monotone increasing in the half- 
line interval x >0. The functions (x +a) +! 

and xlogx are operator convex in the same 
interval. 

A function f(x) is operator monotone in- 
creasing in an open interval (a, b) if and only if 
it is analytic in (a, b) and has an analytic con- 
tinuation to the whole upper half-plane where 
the function has a nonnegative imaginary part 
[7]. Another necessary and sufficient condition 
is 


3 SEC) ECA +k+ 1)! 20 
i,k=0 


for all xe(a,b), for all real ¢, and for all posi- 
tive integers N. An operator monotone func- 
tion f(x) on an open interval (— R, R) has the 
integral representation 


fe)= f+ F'0)| 0 —tx)du(t), 
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where u is a probability measure with its sup- 
port contained in [— R~ !, R™}] and, if f is 
not a constant, is uniquely determined by f. 
The set of all extremal points of the set of all 
operator monotone increasing functions on 
(—1, 1) satisfying f(0)=0 and f’(0)= 1 is exact- 
ly the set of functions x/(1 — tx), |t|<1, which 
appear as integrands in the above formula. An 
operator monotone function on R must be 
linear. 

An operator convex function tn an open in- 
terval (— R, R) has the integral representation 


F= f(0) + FO) x +(F'"(0)/2) [ea —tx)du(t), 


where u is a probability measure with its sup- 
port contained in [—R~',R~'] and, if f is 
not linear, is uniquely determined by f. An 
operator convex function in R must be at most 
a quadratic polynomial satisfying f”(0)>0. 
For a continuous real function on an inter- 
val [0, x) (0 <x < œ), the following four condi- 
tions are mutually equivalent: (i) f is operator 
convex and f(0) <0, (it) f(a*Aa)<a*f(A)a 
for any self-adjoint operator A with its spec- 
trum in [0,«) and any operator a with its 
norm not exceeding 1, (iti) the preceding condi- 
tion with a limited to projections, (iv) x‘ f(x) 
is operator monotone increasing in an open 
interval (0, æ). (See [8—11] and Hansen and 
Pedersen, Jensen’s inequality for operators and 
Lowner’s theorem, Math. Ann., 258 (1982), 
229-241. 
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A. General Remarks 


The mathematical theory of information trans- 
mission in communication systems, first devel- 
oped by C. E. Shannon [1] and now called 
information theory, is one of the most impor- 
tant fields of mathematical science. It consists 
of two main themes, channel coding theory and 
source coding theory. The purpose of channel 
coding theory is to ascertain the existence of 
schemes for transmitting information or data 
reliably over a noisy channel at a fixed trans- 
mission rate. The purpose of source coding 
theory is to show the existence of schemes for 
compressing data emitted from an information 
source and reproducing them within tolerable 
limits of distortion. Both theories are based 
profoundly on the theory of probability, statis- 
tics, and the theory of stochastic processes. 


B. Entropy 


In information theory it is customary to call 
a finite set A= {a,,...,%,} an alphabet and to 
call its elements the letters of the alphabet. The 
simplest model of information sources consist- 
ing of an alphabet A and a tprobability dis- 
tribution p over A is denoted by X =[4A, p], 
which can be regarded as a ‘finite probability 
space, where the probability of outcome of a 
letter «;¢ A from the source is denoted by p;= 
p(a;). A real-valued function defined by I(«,;)= 
— log p(«;) is called the self-information of 

the event «,¢ A. The tmean value of the self- 
information, i.e., 


H(X)= py — p(a)log pla), 
= 
is called the entropy of the source. This can 

be interpreted as a measure of the average a 
priori uncertainty as to which letter will ema- 
nate from the source, or a measure of the 
average amount of information one obtains 
upon receiving a single letter from the source. 
The unit of information or entropy for base e 
logarithms is called a nat, while that for base 2 
logarithms is called a bit. 
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Let A={a,,...,0,} and B={f,,...,B,,} be 
two alphabets. Let r(«;, 8;) be a tjoint proba- 
bility distribution defined on the product AB, 
and denote the *probability space by X Y= 
[AB,r]. Then the joint distribution gives rise 
to tmarginal distributions p(«,;)= X74, r(a;, B)) 
and q(8)=X}-, r(%;, B)) and to tconditional 
distributions p(a;| B;)=r(a;, B)/q(B) and q(B;| «;) 
=r(a;, B;)/p(a,). Probability spaces X =[A, p] 
and Y=[B,q] are subspaces of the probability 
space X Y=[AB,r]. The entropy of XY = 
[AB,r] is defined by 


HUXY)=> Ý —r(x;, Blog r(a;, B) 


i=1 j= 
as well. A real-valued function I(a;| £) = 
— log p(%;|ß;) is called the conditional self- 
information. The average of I(a;| B;) over a; 
defined by 


n 


H(X |p)= 2, PI CAR) 


is called the conditional entropy of X for given 
P;E B. The average of H(X | B;) over p; defined 
by 
HX IY)= È aH 

is called the conditional entropy of X for given 
Y. The conditional entropy of Y for given X, 


denoted by H(Y|X), is defined similarly. Then 
it can be shown that 


H(XY)=H(Y)+H(X|Y)=H(X)+H(¥|X), 


H(XY)<H(X)+H(Y), 
H(X| Y)<H(X), 
H(Y|X)<H(Y), 


where equalities in the last three expressions 
hold if and only if r(«;, 8) = p(a,)q()) for all 
a,€A and fe B. 


C. Information Sources 


Given a finite alphabet A, we consider the 
finite product set AN=A x... x A and the 
doubly infinite product A* = JI - ,, A,, where 
A,=A,k=0, +1, +2,.... Let Z be the o- 
algebra generated by all fcylinder sets in 47. 
Given a tprobability measure P over ¥,, an 
information source is defined as a probability 
space [A%, P] or as a trandom process X = 

... X_,X)X,X,.... When P is invariant under 
the tshift transformation T on A%, i.e., P(E)= 
P(TE) for any Ee ¥,, then [ AZ, P] is said to 
be a stationary source. In particular, if P(E)= 
0 or 1 whenever TE = Ee F,, then the infor- 
mation source is said to be ergodic. If X is an 
tindependently and identically distributed 
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random process, the source X =[A%, P] is said 
to be memoryless. 

For a given X, denote the subsequence 
X,...Xy of X by X”. Then a probability mea- 
sure P on A“ and a finite probability space 
X^ =[AN, P] are naturally induced. The en- 
tropy of the stationary information source 
X =[A%, P] is defined as H(X)=limy ,,, H(X*)/ 
N or as H(X)=limy,_,,, H(Xy|X™ t), because 
both limits exist and are identical. If X is 
memoryless, the entropy of X is sae 
to that of X, = rI i.e., H(X)= Eż — 
pilogp;, where p;=p' (X, =a;) and H(X*)= 
NH(X!)=NH{(X). 


D. Source Coding Theorem 


Let x =x, ... xy be a sequence of N consecu- 
tive letters from the source [ A”, P]. Suppose 
that we wish to encode such sequences into 
fixed-length code words u” =u, ... uz, consist- 
ing of L letters from a code alphabet U of 
size v. The number R=(logyv")/N is called the 
coding rate per source letter. A mapping 9: 
ANU is called an encoder and 9: U> A a 
decoder. The set U with specified encoder- 
decorder pair [ọ, y] is called a code with rate 
R=(Llogv)/N. The error probability of the 
code is defined by 


P.[o.w]=PLW{o(X)#X"}]. 


The fixed-length source coding theorem 
[1,2] states: Let X=[A%, P] be a stationary 
ergodic source. Then for any 6>0, if R > H(X) 
+ò, there exists a code [g, w] with rate R such 
that the error probability P,[ọ, Y] can be 
made arbitrarily small by making N suffi- 
ciently large. Conversely, if R < H(X)—6 then 
for any code with rate R, P,[@, Y] must be- 
come arbitrarily close to 1 as N> oœ. 

This theorem is trivial if R>logn. For mem- 
oryless sources, the theorem follows immedi- 
ately from the tweak law of large numbers, 
which implies that for arbitrary e>0 and 6>0 
there exists an integer N,(e, ô) such that for all 


N >N, (e, ò) 
ke 


N N 
al —log P(X”) 
The validity of this property for stationary 
ergodic sources was proved by B. McMillan 
[2]. In case of memoryless sources, the exact 
asymptotic form of the error probability for 
an optimal code with rate R was given by 
F. Jelinek [3], I. Csiszár and G. Longo [4], 
Blahut [5], and Longo and A. Sgarro [6] as 


H(X) > 


1 
lim ——logP,[o,W]= min D f 
lim ——-log eLp, y] in, (qlip) 


where p is the source distribution, q denotes a 
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probability distribution on A, and 
qi 


Diq\lp)= > q:log 
i=] i 


which is called Kullback’s discrimination in- 
formation [7] or the divergence. 

Source sequences x™ can be encoded into 
variable-length code words consisting of letters 
from alphabet U of size v. A set of n code 
words is called a prefix condition code if there 
is no code word which is equivalent to the 
prefix of any other code word. Denote the 
` length of the code word corresponding to a 
source sequence x” by L(x) and the average 
length of the code words per source letter by 


1 7 l Ni N N 
N SAN” (x*) L(x"). 
The variable length source coding theorem 
states: Given a memoryless source X with 
entropy H(X), there is a prefix condition code 
such that the average length of the code words 
per source letter satisfies 


> 1 
H(X)< Llogv<H(X)+ 5 logy. 


This theorem is valid for stationary ergodic 
sources if (log v)/N in the last term is replaced 
by e(N), where e(N)—0 as N> œ. 

These two theorems are referred to as noise- 
less source coding theorems. 


E. Source Coding Theorem with Fidelity 
Criterion 


The noiseless source coding theorem implies 
that the average number of code letters per 
source letter can be reduced to the source 
entropy H(X) under the requirement that the 
source sequence be exactly reproduced from 
the encoded sequence. If an approximate re- 
production of the source sequence to within a 
given fidelity criterion is required, the coding 
rate per source letter must be reduced further 
to a certain value below the source entropy. 

Suppose that a distortion measure d(«;, æ) is 
defined for 2;,«;€A, where it is assumed that 
d(a;,«;)>0 and d(a,,%,)=0. For blocks x* = 
X4 -Xy and p*=,... Yy, define 


1 N 
d(x", y= D A(X, Vx). 
k=1 


Any set @={y,..., yar}, YA e A“, of reproduc- 
ing words is called a source code of block 
length N. Each source sequence x” of the 
source X =[A%, P] is mapped into whichever 
code word yñ eg minimizes d(x”, y%), Le. 


dy(x", Ym) = min dy (x, y®), 
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and the suffix m is transmitted. Hence the 
coding rate per source letter is R =(log MYN, 
and the average distortion is 
dy(@)= J, P*(x")mindy(x”, y"). 

xNeAN vse 
The problem is how far we can reduce the rate 
under the condition that the average distortion 
keeps satisfying a given fidelity criterion, which 
is specified as a maximum tolerable value d for 
the average distortion. 

The source coding theorem with a fidelity 
criterion states: Let X =[A%, P] be a memory- 
less source. For any specified d>0, any e>0, 
and 6>0, there exists a source code ¢ with 
rate R > R(d)+ 6 and with sufficiently large 
block length N for which the average distor- 
tion satisfies 


dy(@)<dte, 


where R(d) is the rate distortion function de- 
fined by 


R(d)= „min I(p; W), 
n n W il? 
I(p;W)= 2 2 Pi wgllog ŽE, 
Pee py pW li) 


pqs 


wid) =WW: È nwulida,a)<a}, 
i=1 j=1 


where W(j|i) denotes a conditional proba- 
bility distribution referred to as the test chan- 
nel, and where F(p; W) is called the mutual in- 
formation. It should be noted that R(0)= H(X). 
The rate distortion function R(d), which was 
first defined by Shannon [1,8], is closely re- 
lated to the -entropy introduced by A. N. 
Kolmogorov [9]. R(d) is a monotonically de- 
creasing and convex function. 

The theorem was first proved by Shannon 
[8], and was extended by R. G. Gallager [10] 
to the case of stationary ergodic sources with 
discrete alphabets and by T. Berger [11] to 
stationary ergodic sources with abstract al- 
phabets. More recently, R. G. Gray and L. D. 
Davisson [12] have proved source coding 
theorems without the ergodic assumption 
for stationary sources subject to a fidelity 
criterion, The proof was based on Rokhlin’s 
ergodic decomposition theorem [13]. Other 
important source coding theorems have been 
obtained by F. Jelinek [14] for tree codes, A. J. 
Viterbi and J. K. Omura [15] for trellis codes, 
and Gray, D. L. Neuhoff, and D. S. Ornstein 
[16] for sliding block codes. 

The rate distortion function for memoryless 
Gaussian sources subject to squared error 
distortion was given by Shannon [8], and that 
for autoregressive Gaussian sources was deter- 
mined by Kolmogorov [9], M. S. Pinsker 
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[17], Berger [11], Gray [18], and T. Hashi- 
moto and S. Arimoto [19]. 


F. Channel Coding Theory 


A mathematical model for a channel over 
which information is transmitted is specified in 
terms of the set of possible inputs, the set of 
outputs, and a probability measure on the 
output events conditional on each input. The 
simplest channels are the noiseless ones, for 
which there is a one-to-one correspondence 
between input and output and no loss of in- 
formation in transmission through the chan- 
nel. The second simplest channels are discrete 
memoryless channels (DMCs) which are de- 
fined as follows: The input and the output are 
sequences of letters from finite alphabets (say, 
a,€ A and f,e B), and the output letter at a 
given time depends statistically only on the 
corresponding input letter. That is, a DMC is 
characterized by a fixed conditional proba- 
bility distribution W(j|i)= W(f;|a,) because 
the probability measure on the input and 
output sequences satisfies 


N 
WO ea I] W(y,|X,). XE, EB. 
=1 


By U ={1,..., M} we denote the set of integers 
each of which is assigned to each correspond- 
ing possible message from the source. A map- 
ping p: U >A“ induces a collection @ = {x¥, 
...,Xjyt, called the block code with rate R = 
(log M)/N; each element is called a code word. 
Only code words are transmitted over the 
channel. A mapping Ņ:B~—U is called the 
decoding. Thus, given an encoding and decod- 
ing pair [ọ, y], the error probability is defined 
by 


1 M 
P [oy] a 2* W~(y™lo(m)), 


where the summation È * is taken with respect 
to all y” such that y(y™)#m. The capacity for 
a DMC is defined by 


C=max I(p; W) 
p 


2 re W(jli) 
=max iW log. 
at A PMU Doss WTO 

The fundamental theorem of channel coding 
theory states: Given a DMC with capacity C > 
0, there exist a block code with rate R below 
capacity C, a pair [g, Y], and a function E(R) 
>0 for O< R <C such that the error proba- 
bility satisfies 


P.[e,w]<exp{—NE(R)}. 


This was first discovered by Shannon [1], 
and its precise proof was first given by A. 
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Feinstein [20]. The precise expression of E(R) 
was subsequently given by P. Elias [21], R. M. 
Fano [22], and Gallager [23]. The best ex- 
pression of E(R), due to Gallager [23], is 


E(R)=max{E,(R), E.,(R + O(N ~*))}, 


where 


E,(R)= max max| -oR 


O<p<l Pp 
ltp 
tog {Zawya} | 
j i 


E.,(R)=sup max l —pR 


1/p 
= logy PiPx iy WU|D)WG| B} | 


The converse of the fundamental theorem 
states: Given a DMC with capacity C, for any 
block code with rate R above C and any pair 
Ly, Y], the error probability satisfies 


P.Le,#]> 1—exp{—NE(R)}, 


where E(R) is a function positive for R >C., 
This was first proved by J. Wolfowitz [24], 
and the precise expression, found by S. Ari- 
moto [25], is described by 


-1<p<0 p 


E(R)= max min | —pR 


l+p 
-e3 {gawo} "| 


The fundamental theorem of coding theory 
and its converse imply that the capacity is a 
critical rate; above the capacity, information 
cannot be transmitted reliably through the 
channel. Unfortunately, there is, in general, no 
direct method for computing the capacity, and 
therefore an iterative method was proposed by 
Arimoto [26]. Another iterative method for 
computing the rate distortion function was 
given by R. E. Blahut [27]. 

A discrete channel with memory is, in gen- 
eral, defined by a list of probability measure 
{W,,xeA“} on a tmeasurable space {B7, Fp} 
such that for each F€. Fg, W,(F) is a measur- 
able function of x. A channel W is called sta- 
tionary if W,,(TF)= W, (F) for all xe A” and 
all Fe Fp. Given an input source [ A7, P] and a 
channel W, connecting the input to the chan- 
nel induces a joint process of input and out- 
put, denoted by [A7 x BŽ, PW] where PW is a 
measure on {A* x B”, ¥,, 2}. Ifa joint process 
[A7 x B7, PW] is stationary the mutual in- 
formation between input and output is defined 
by 


1(X; Y) = H(X)+ H(¥)—H (XY). 


A channel W is called ergodic if for every 
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ergodic input [A%, P] the induced joint process 
[A7 x B*, PW] is ergodic. The channel capac- 
ity for a discrete stationary channel is defined 
in various ways. The important ones are the 
ergodic capacity defined by C,=sup. of I(X; Y} 
with respect to all stationary ergodic sources 
X=[A%, P], and the stationary capacity de- 
fined by C,=sup. of I(X; Y) w.r.t. all station- 
ary sources. K. R. Parthasarathy [28] proved 
C, =C, for all discrete stationary channels. 

In another way J. Nedoma [29] introduced 
the operational source/channel block coding 
capacity C,.,, which is defined as the supre- 
mum of the entropies of all admissible sta- 
tionary ergodic sources in the sense that there 
exist source/channel block codes such that 
the error probability P,>0 as block length 
N- oo. Nedoma [29] also pointed out an 
example of a stationary channel where C, > 
Ce»: Hence block coding theorems have been 
proved for various channels: for finite memory 
channels by A. I. Khinchin [30], K. Takano 
[31], Nedoma [29], and Feinstein [32], for 
d-continuous channels by Gray and Ornstein 
[33], and for almost finite memory channels 
by D. L. Neuhoff and P. C. Shields [34]. 
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A. General Remarks 


Insurance is a system in which a large number 
of people contribute a small precalculated 
amount of money (called a premium) to fill the 
economic need that arises when a person 
meets adversity. The amount of economic need 
filled by this system is called the amount of 
insurance (or amount insured). The insurer is 
the one who implements the system. Actuarial 
mathematics is the branch of applied mathe- 
matics that studies the mathematical basis of 
insurance, one of the first cases in which math- 
ematics was successfully applied to a social 
question. Actuarial mathematics can be di- 
vided into two branches according to its ap- 
plication. The first includes the calculation of 
various values of each individual policy, such 
as premiums or reserves. The second is mainly 
connected with management of an insurance 
business and includes the study of reinsurance 
systems, of the maximum amount of insurance, 
of the contingency fund, or the analysis of 
profits. There is only one basic principle in 
actuarial mathematics, called the principle of 
equivalence. It determines the premium and 
reserve in each year so that the present value 
of future premium income of the insurer is 
equal to the present value of future benefits for 
each policy. 

The basic factors of actuarial calculations 
are (1) probabilities of contingencies, (2) an 
expected rate of interest in the future (often 
referred to as the assumed rate of interest), 
and (3) cost of administration of the system. 


214 A 
Insurance Mathematics 


Premiums are calculated using these factors 
and the principle of equivalence. The follow- 
ing is an example of the classical method 

of calculation for a life insurance policy 

with the use of “commutation symbols,” 
which is an old device for the convenience 
of calculations. 

We write P for the net premium (in which 
the cost of administration is disregarded), P’ 
for the gross premium, 7; for the amount of 
death benefits payable in the tth year after the 
policy is issued, E, for the amount of survival 
benefits payable at the beginning of the tth 
year, n for the period for which the insurance 
is effective, and m for the period for which pre- 
miums are to be paid. Let a, $, and y stand for 
three positive constants determining the initial 
expenses=a(7, or E,), the premium collection 
expenses = $P’, and the general expenses for 
maintenance = y(T, or E,). The factor that 
comes into consideration next is a model of 
human death and survival (measurement). 
Assume that /, is the number of lives attainmg 
age x, and write q, for the probability that a 
life of x years will end within one year. Then 
d,, the number of lives ending within one year 
out of l, is l qx, and 1,.,, the number of lives 
remaining after one year at age x + I, is 1, — 
d,=1,(1 —q,). The commutation symbols 
commonly employed are defined as follows: 
Write v=1/(1+i), where i is the assumed rate 
of interest; then 


For a policy issued at an insured person’s age 
x, the present value of the insurer’s future 
income can be expressed as P’(N, — Ny+4m)/Dy; 
and the present value of his future payments 
can be expressed as 


1 n n+1 
Ae? TCs -1 + > EDs 
x \t=1 t=1 


+a(T, or E,)D.+y $} (T, or E) Dyas 
t=0 


+ BP(N,— Nera) 


By assuming that the present value of the 
future income is equal to the present value of 
the future payments, the value of the gross 
premium P’ is obtained. (The P’ obtained from 
the assumption «= ß =y =0 is denoted by P 
and is called the net premium. The difference 
P’— P is called the loading.) For a policy in 
which benefits are payable on disability or 
contingencies other than death, we have only 
to obtain a model of contingencies and apply a 
similar calculation. 
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B. Liability Reserve 


During the term of an insurance contract, it 
often happens that the present value of the 
future income is less than the present value of 
the future payments. If this is the case, the 
difference is to be held by the insurer as the 
liability reserve. The source of this fund is the 
past premium income plus interest. The net 
premium reserve, which disregards expenses, is 
calculated as 


1 n nti 
Y= ( > i Ores > E,Dy+y-1 


Dy r=t+1 r=t+1 


=T P(N, 41 = Neon) . 


Between the net premium P and the net 
premium reserve V, we have the relation 


P=(vV— Vi tT Wart Er 


The first term of the right-hand side of this 
formula is called the savings premium, since it 
is the amount left out of the premium income 
of the tth year and added to the reserve. The 
second term is called the cost of insurance or 
risk premium and is applied to cover the dif- 
ference between the amount of insurance and 
that of the existing reserve in case the contin- 
gency of death arises. The third term is applied 
to the payment of the survival benefits (or 
annuities in case of an annuity contract). If 7;— 
V,, the amount of risk insured by the insurer, 
is positive for all values of t during the period 
of insurance, the policy is called death insur- 
ance. On the other hand, if the value of T— V, 
is negative for all values of t, the policy is 
called survival insurance. If the value of T;—V, 
varies between positive and negative according 
to the different values of t, the policy is called 
mixed insurance. If 7; is always equal to V,, the 
policy constitutes mere savings. Most of the 
insurance policies issued today are one or 
another type of death insurance, while life 
annuity policies are a type of survival insur- 
ance. For a long time, studies have been made 
of the effect on premiums and reserves of 
changes in the three basic factors (1), (2), and 
(3) in Section A. 


C. Risk Theory 


Risk theory occupies a special position in the 
field of actuarial mathematics. Actuarial math- 
ematics was first born from the theory of prob- 
ability. Since the modern theory of proba- 
bility based on measure theory was developed 
by A. N. Kolmogorov and other mathema- 
ticians (— 342 Probability Theory), new ap- 
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proaches have inevitably been made to ac- 
tuarial mathematics. An outstanding example 
is risk theory. 

Risk theory can be divided into two 
branches. One is called classical risk theory 
(or individual risk theory), in which the profit 
or loss that may result during a certain term 
of an insurance contract is regarded as a tran- 
dom variable. Since the insurer’s profit equals 
the sum of these random variables over all the 
individual contracts, various probability func- 
tions can be obtained by applying the theory 
of probability. The second, called collective 
risk theory, pays no attention to each indiv- 
idual contract but studies changes in the 
insurer’s balance as a whole with the lapse of 
time. The basis of collective risk theory was 
given by F. Lundberg, H. Cramér, and other 
mathematicians. 

We explain the collective risk theory follow- 
ing Cramér [5]. For simplicity we consider an 
insurer who issues no policies other than death 
insurance and makes no expenditures except 
the policy claims. Suppose that during the 
time interval (t,t + At) contingency occurs with 
probability 2At + o(At) independently of the 
past. We denote by F(x) the fdistribution 
function of the amount to be paid by the in- 
surer when a contingency occurs. Then the 
number of contingencies in (0, t), N(t), is a 
*Poisson process with parameter å, and the 
total expenditure of the insurer during the 
time interval (0, t), X(t), is a fcompound Pois- 
son process (— 5 Additive Processes) such that 


E(e?*) =exp (af (e ~ naro); 

o 
If p is the premium income per unit time, then 
we have p=14f¢ xdF(x), because E(X(t))= pt 
holds by the principle of equivalence. If u 
denotes the initial fund of the insurer, then the 
fund reserved at time t, Y(t), equals u + pt — 
X(t). Y(t) is a tLévy process (— 5 Additive 
Processes) such that 


E(e??7) 
=exp (ia +At | (e7*—1— zdr) . 
0 


The probability that Y(t)<0 for some time 
t< œ is called the ruin probability, which de- 
pends on the initial fund u, denoted by y(u). 
This satisfies the following tintegral equation 
of Volterra type: 


P yw) ie Q(x) dx-+ | Wo)Q(u—x) dx 


(Q(x)=1—F(x)), 


from which one can derive an asymptotic rela- 
tion p(u)~ Ce ®“ as uo, where R and C 
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are positive constants depending only on À 
and F. See [6] for recent developments in risk 
theory. 
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A. General Remarks 


Integer programming, in its broadest sense, 
addresses itself to either minimization or maxi- 
mization of a functional f over some discrete 
set S in R”, but it is usually understood as 
dealing with questions related to linear pro- 
gramming problems (— 255 Linear Program- 
ming) with additional integrality conditions 

on the variables, namely, the problem Po: 
Minimize {c’x| Ax =b, x >0, x, an integer, j = 
1,...,n,(<n)}, where AER™*”, be R”, ceR” 
are given data and x=(x,,...,x,)'eR" is a 
vector. P, is called a pure integer programming 
problem or an all-integer programming prob- 
lem if n=n,, and a mixed integer program- 
ming problem if n; <n. In particular, P) is called 
a 0-1 integer programming problem if al! the 
integer variables are restricted to be equal to 
either 0 or 1. We write 


X°= {xeR"| Ax=b, x,;20,j=1,...,n}, 
X' = X°N {xeR" |x, is an integer, j=1,...,n,}, 


and assume for simplicity that (i) X7 # Ø, (ii) 
X° is bounded, and (iii) all the components of 
A and b are integers. Py arises not only as a 
mathematical model for an optimization prob- 
lem where some of the decision variables 

have indivisible minimum units but also as one 
for many optimization problems with some 
logical and/or combinatorial constraints [1]. 
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Algorithms for solving P, can be classified into 
algebraic methods and enumerative methods. 
Both are based upon linear programming and 
relaxation techniques [2] in which some of 

the constraints of P) are temporarily relaxed 
(— 264 Mathematical Programming). 


B. Cutting Plane Methods 


The origin of this class of algorithms is the 
fractional cutting plane algorithm proposed 

in 1958 by R. E. Gomory [3], the outline of 
which is described as: 

(1) Let X =x”. 

(2) Solve a linear programming problem: Mini- 
mize {e’x|xe X}, and let x be its toptimal 
solution. If the x; are integer for all j<n,, then 
stop (X is optimal to P,). Otherwise, go to (3). 
(3) Generate a half-space H = {xe R” |n'x > no} 
(neR”, zoeR!), where H (i) contains X! and (ii) 
does not contain X. Return to (2) by replacing 
X by XANH. 

Here, a linear programming problem ob- 
tained by relaxing integrality conditions is 
solved, and as long as X¢ X’, an inequality 
satisfying the two conditions (i), (ii) of (3) is 
introduced. Such an inequality n’x 2 no or 
equality m’x = nọ is called a cut or a cutting 
plane. Gomory devised a Gomory cut using 
(relaxed) integrality conditions on the vari- 
ables, and showed that the algorithm above 
produces a point of X” in finitely many steps. 
Some of the other algorithms using cutting 
planes are the all-integer algorithm (Gomory, 
1963) and the primal all-integer algorithm 
(R. D. Young, Operations Res., 16 (1968)). 
These algorithms, however, are generally slow 
and behave erratically, so that it is believed 
that they cannot in practice serve as general- 
purpose algorithms. 


C. Other Algebraic Methods 


Gomory [4], again in 1965, proposed a group- 
theoretic approach to P). This method is based 
upon the following observation: Let x, be the 
vector of tbasic variables (— 255 Linear Pro- 
gramming) associated with a tdual feasible 
basis of a linear programming problem P,: 
Minimize {c'x|xe X°}, and let X! be the set 
generated from X” by relaxing the nonnegativ- 
ity constraints on xz. Then X! can be shown 
to have a ‘cyclic group structure, so that a 
group minimization problem P,,: minimize 

{ex |xeX/} can be solved as a tshortest path 
problem on a directed graph with a special 
structure (— 186 Graph Theory). If the opti- 
mal solution X of Pg satisfies X, >0, then it is 
optimal for P). If, on the other hand, x, #0, 
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then a branch and bound algorithm described 
later can be applied for a systematic search 
of integer points near xX. This algorithm is 
reported to produce good results when the 
size of the associated graph ts not excessively 
large (G. A. Gorry et al., Management Sci., 17 
(1971)). Some of the other important results 
in this area are: (1) the theory of subadditive 
cuts (R. Gomory et al., Math. Prog., 3 (1972)) 
and disjunctive cuts (E. Balas, in Nonlinear 
Programming 2, O. Mangasarian et al. (eds.), 
Academic Press, 1975), (ii) research on facial 
structures of the integer polyhedron coX! 
(convex hull of X’) for some of the more im- 
portant integer programming problems, such 
as that of the corner polyhedron co X! [5], 
knapsack polytopes (E. Balas, Math. Prog., 8 
(1975)), and traveling-salesman polytopes 
(M. GrGtschel et al., Math. Prog., 16 (1979)). It 
should be pointed out, however, that more 
difficulties of Py) have been revealed rather 
than resolved through the intensive research 
in this area. Incidentally, the 0-1 integer 
programming problem is known to be tNP- 


complete (— 71 Complexity of Computations). 


D. Enumerative Methods 


Another large class of algorithms for solving 
P, consists of the branch and bound methods, 
first proposed by Land and Doig [6] in 1960. 
An outline of the improved version (R. J. 
Dakin, Computer J., 8 (1965)) is as follows. 
(1) Let P={P)}, z* = œ, x* = undefined. 
(2) If A= Ø, then stop (x* ts optimal to Po). 
Otherwise, choose from P the problem P;: 
Minimize {e’x|xeX/}. 
(3) Solve P,: Minimize {c’x|x¢X,}, in which 
the integrality condition is relaxed from the 
constraints of P,. If P, has an optimal solution 
X, then go to (4). Otherwise, return to (2). 
(4) If Xe X} and e’x <z*, then let Kx*, "x> 
z*, and return to (2). If ğe X{ and e’x >2z*, 
then return to (2). Otherwise, go to (5). 
(5) Choose j <n,, for which X; is not integral 
and generate the two subproblems P,* : mini- 
mize {c'x|xe X}, x,>[X,]+1}, and P7: mini- 
mize {e’x|xeX/,x,<[X,]}, in both of which 
[X,] represents the largest integer not exceeding 
X,. Let PUP, , P; } > and return to (2). 
The best point of X” identified during the 
preceding steps is denoted by x* and called 
an incumbent. In summary, the branch and 
bound method chooses one subproblem P, 
fromthe problem list Y and estimates the 
lower bound of its optimal objective func- 
tional value. If the lower bound is worse than 
the current incumbent, then P, is discarded, 
whereas P, is separated into two subproblems 
if no conclusion can be reached. This process 
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is continued until the problem list P becomes 
empty, thereby implicitly checking all points of 
X'. The above method is called an LP-based 
branch and bound method because linear 
programming techniques are employed to 
obtain a lower bound. The branch and bound 
method tends to require a large amount of 
storage, but many engineering improvements 
on the method of choosing (i) a subproblem P, 
and (it) a branching variable x,, and several 
improvements of the bounding techniques, in 
addition to the substantial progress made in 
linear programming codes, enable us to solve 
a problem of size nx 100. In particular, an 
improved version of the implicit enumera- 
tion method, proposed by E. Balas [7] for 0-1 
integer programming problems which uses 
logical conditions for obtaining lower bounds, 
is known to be able to solve rather large 0-1 
integer programming problems (A. Geoffrion, 
Operations Res., 17 (1969)). 


E. Other Topics 


The partitioning algorithm [8], in which in- 
teger variables are varied parametrically, is 
reported to work well for 0-1 mixed integer 
problems with relatively few integer variables. 
As people begin to realize the intrinsic dif- 
ficulty of P), they pay more attention to heu- 
ristic algorithms or approximate algorithms 

to obtain a good but not necessarily optimal 
solution. Among heuristic methods, the inte- 
rior path method [9], which elaborates simple 
ideas such as rounding of the optimal solution 
of P), has been reported to work well for prob- 
lems in which X° has a nonempty interior. 
Also, more emphasis is being placed on special 
purpose algorithms for solving practical prob- 
lems, such as tset partitioning and the ttravel- 
ing salesman problem, etc. [1]. Po is a typical 
nonconvex programming problem, and no 
practically useful tduality theorem is avail- 
able. Hence it is difficult to perform sensitivity 
and/or post-optimality analysis. Some research 
in this area has emerged recently (e.g., C. J. 
Piper et al., Management Sci., 22 (1976)), but 

it looks as if it will be several years before a 
reasonably good procedure becomes available. 
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A. The Riemann Integral 


Let f(x) be a bounded real-valued function 
defined on an interval [a,b]. We shall divide 
this interval J =[a, b] into subintervals 1; = 
[x;-,,X,] (@=1,...,n) by a finite number of 
points x; (a=Xxg<x,<...<x,=5). This divi- 
sion into subintervals is uniquely determined 
by the set D= {x,}, called the partition of I. We 
set M;=sup,<), f(x), m;=inf,., f(x), and put 
o(D)= Li=1 Mi(x;—x,-1), @(D) = X- m(x;— 
X;~-1). Considering all possible partitions D 
of I, we set |? f(x)dx =inf, a(D), f? f(x)dx = 
supp a(D), which are called the Riemann upper 
integral and Riemann lower integral of f, re- 
spectively. If they coincide, then the common 
value is called the Riemann integral of f on 
[a,b] and is denoted by fg f(x)dx. In this case, 
we say that f is Riemann integrable (or simply 
integrable) on [a,b] and call f the integrand; a 
and b are called the lower limit and the upper 
limit, respectively. In this case, by integrating f 
from a to b we mean the process of obtaining 
the value f? f(x) dx. 

Darboux’s theorem: For each £ >0 there 
exists a positive ò such that the inequalities 


<E 
<E 


an- f f(x)dx 
b 
a(D)— | f(x)dx 
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hold for any partition D with ô(D) = max;(x; 
—x;_,)<6. In other words, we have 


lim @(D)= | f(x)dx, 


6(D)>0 å 


b 
„üm g(D)= f f(x)dx. 
From Darboux’s theorem it follows that a 
necessary and sufficient condition for f(x) to 
be integrable on [a,b] is that for each positive 
e there exist a positive 6 such that ô(D) <ô im- 
plies o(D)—a(D) = X7-,(M;—m,)(x; — x;-1) <e. 
We call M;— m; the oscillation of f on I; and 
a(D) and a(D) the Darboux sums. Obviously, if 
f is integrable on [a, b], then for each positive 
é there exists a positive 6 such that the follow- 
ing inequality holds for every partition D= 
{x;} with 6(D) <6 and for every set of points 
čel (j=1,...,n): 
<E. 


n b 
$ Ea- | fle) dx 


The sum Yj_, f(€))(x;—x;-1) is often called a 
Riemann sum (or sum of products). A function 
that is continuous on [a,b], or bounded and 
continuous except for a finite number of points 
in the interval, is integrable. Furthermore, a 
bounded function that is continuous on [a, b] 
except for an infinite number of points x, is 
integrable if for an arbitrary positive number € 
there exist a finite number of intervals 1; of 
which the total length is less than ¢ and if the 
set {x,} of exceptional points is contained in 
y I,, Generally, a necessary and sufficient 
condition for a bounded function defined on 
[a,b] to be integrable is that the set of points 
where the function is not continuous be of 
tmeasure 0 (in the sense of Lebesgue). A func- 
tion that is either tmonotonic on [a, b] (and 
consequently bounded) or of tbounded vari- 
ation is integrable. A function that is inte- 
grable on [a,b] is integrable on any sub- 
interval of [a,b], the integrand being the 
restriction of the given function to this 
subinterval. 


B. Basic Properties of Integrals 


Let I be the set of all functions integrable 

on [a,b]. If f, gel, then for any numbers 

a, P, we have af + Bgel, fgel, min{ f,g}et, 
max{ f,g}el, and f/gel provided that there 
exists a positive constant A such that the 
inequality |g|> A holds. Furthermore, if fel, 
then | f|eI; and if f el (n=1,2,...) and f, 
converges ‘uniformly to f, then fel. Cor- 
responding to these properties, the following 
formulas hold. 
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(1) Linearity: 


b 
| (af(x) + Bg(x))dx 


-a| fods | f(x)dx, 


where a, 8 are constants. 
(2) Monotonicity: If f(x) >0, then 


b 
| S(x)dx 20. 


If, further, f is continuous at a point x,¢[a, b] 
and f(x )>0, then f? f(x)dx >0. 

(3) Additivity with respect to intervals: If a, 
b, and c are points belonging to an interval on 
which f is integrable and a <c <b, then 


f fisyax= | fasa f f(x)dx. 


Adopting the conventions that fë f(x)dx =0 
and ff f(x)dx = —{? f(x)dx, the additivity 
formula holds independently of the order of a, 
b, and c. 

It follows from (2) that |f} f(x)dx] < 
{21 f()| dx if a <b. Further, if f(x) converges 
to f(x) uniformly on [a, b], then 


b b 
lim | f(x)dx -Í f(x)dx. 
Replacing f,,(x) by partial sums of a series, 

we obtain the following theorem: Let È a,(x) 
be a series in which each term a,(x) is inte- 
grable on an interval [a, b]. If the series con- 
verges uniformly on [a,b], then the sum s(x) is 
integrable on [a, b], and the series is termwise 
integrable, that is, 


| S(x)dx = 5 


Also, the series Dy), | a,(t)dt converges uni- 
formly on [a,b] to the integral {x s(t)dt. As- 
sume that È a„(x) is convergent but not uni- 
formly convergent. If all a,(x), together with 
s(x) =) a(x), are integrable and there is a 
constant M independent of n such that |s,(x)| 
<M (xé[a, b]) for all n, where s,(x) are par- 
tial sums, then the series is termwise integrable 
(C. Arzela). 

The first mean value theorem: If f(x) is con- 
tinuous on [a,b] and (x) is integrable and of 
constant sign on [a, b], then there exists 8 (0 < 
0<1) such that 


b 
a„(x)dx. 


a 


b b 
| fisyotsidx=fla+0b—a) | e(x)dx. 
When ¢(x)=1, we have 


| flodx= flor 0(b—a))(b—a). 


a 
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The second mean value theorem: If ; (x) is a 
positive, monotone decreasing function de- 
fined on [a,b] and (x) is an integrable func- 
tion, then there exists y (a<n <b) such that 


b n 
| f(x)o(x)dx = f(a + of ọ(x)dx. 


In the hypothesis of the second mean value 
theorem, if f(x) is assumed to be monotonic 
but not necessarily positive, then there exists 
n (a<n <b) such that 


b 
| F(x)p(x)dx 


a 


P b 
= f(a +o ọ(x)dx +000) f p(x)dx. 

a n 
H. Okamura (1947) proved that the condition 
a<n <b can be replaced by a <q <b. 

In the case f(x)2>0 on [a,b], we consider the 
figure F bounded by the graph of f(x), the x- 
axis, and the lines x=a and x =b. Then a(D) 
and o(D) are areas of polygons of which one 
encloses F and the other is enclosed by F, as 
shown in Fig. 1. Hence it can be shown that 
the integrability of f(x) in the sense of Rie- 
mann is equivalent to the measurability of F 
in the sense of Jordan. The Riemann integral 
f? (x) dx is the area of F with respect to its 
tJordan measure. 


Fig. 1 


C. Relation between Differentiation and 
Integration 


Suppose that f(x) is integrable on an interval 
I. We fix a point a of J and consider the in- 
tegration F(x)=z f(t)dt, where x varies in I. 
The function F(x) is called the indefinite in- 
tegral of f(x). In contrast with this, the integral 
on a fixed interval, as considered in the previ- 
ous sections, 1s often called the definite in- 
tegral. The indefinite integral F(x) is continu- 
ous on the interval J and of bounded varia- 
tion. If f(x) is continuous at a point xq in J, 
then F(x) is differentiable at xy and F'(xọ)= 

J (Xo). In general, if a function G(x) satisfies 
G'(x)= f(x) everywhere in J, then G(x) is called 
a primitive function of f(x). If f(x) ts continu- 
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ous, the indefinite integral of f(x) is one of 

the primitive functions of f(x). Furthermore, if 
a function G(x) is a primitive function of f(x), 
then any other primitive function can be writ- 
ten in the form G(x)+ C, where C is a con- 
stant, called an integral constant. For a con- 
tinuous function f(x) on [a,b] and any one 

of its primitive functions G(x), we have 


| I (x)dx = G(b)— Ga) =[G(x) Je 


(fundamental theorem of calculus) (—> Ap- 
pendix A, Table 9). From the differentiation 
formulas we obtain the following integration 
formulas: 

Integration by parts: If f(x) and g(x) have 
continuous derivatives on [a, b], then 


b b 
| F(x)g'(x)dx =f won- | I'(x)g(x)dx. 


More generally, if f(x) and g(x) are integrable 
on [a,b], then 


[aof roa)a 
b b b x 
=| sa| sod- | ri | att) 


Change of variables: If f(x) is integrable on 
[a,b] and x= p(t) and ọ'(t) are continuous on 


[o, f]. where a= p(x), b=(f) (a< p(t) <b), 
then 


b B 
| fisyax=| Slot) o'(t)dt. 


D. Improper Integrals 


The concept of the integral can be generalized 
to the case where the integrand or the interval 
on which integration is accomplished is not 
bounded. Assume that f(x) is not bounded on 
[a, b) but is bounded and integrable on any 
interval [a, b—e] (€ [a, b)). If [? *f(x)dx has a 
finite limit for 0, the limit is denoted by 
fé f(x)dx and is called the improper Riemann 
integral (or simply improper integral) of f(x) on 
[a, b). For example, if f(x) is continuous on 
[a, b) and f(x)=O((b—x)*) for some « (O>a> 
—1), where O is the tLandau symbol, then the 
improper integral f? f(x)dx exists. On the 
other hand, if f is integrable on [a+ e, b] for 
each ¢>0 but not bounded in any neighbor- 
hood of a, we can define the integral on (a, b] 
in the same way. If f is not bounded in any 
neighborhood of a or b and if there exists a 
point c (a<c<b) for which the improper in- 
tegrals f; f(x)dx and f? f(x)dx exist, then we 
define fa f(x)dx = f; f(x)dx + f? f(x)dx, which is 
independent of the choice of the point c. Fur- 
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thermore, assume that f(x) is not bounded in 
any neighborhood of each point c; (j=1,...,n) 
(a<c,<...<c,<b). Then we define 


| fisyax= |" foyax+ |” foods a 


[f fisyax+| f(x)dx, 


provided that all improper integrals 


i Fods, a | fe 


exist. Suppose that f(x) is defined on [a, b] 
and bounded outside any neighborhood of 
cé(a, b) but not bounded in either [c — e, c] or 
[c,c +e] for any ¢>0. It may well happen 
that, although neither lim, „o f4 ® f(x)dx nor 
lim, .o JP.» f(x) dx exists (accordingly, the 
improper integral f? f(x)dx does not exist), if 
we put ¢=¢’, the limit 


=E b 
im( | fisyax- | fesydx) 


does exist. This limit is called Cauchy’s prin- 
cipal value and is denoted by p.v. f? f(x)dx 
(v.p. in French). For example, p.v. {1 ,(dx/x)= 
lim, .o(f =} (1/x)dx + fi (1/x)dx)=0. 


E. Integrals on Infinite Intervals 


Suppose that we are given a function f(x) 
defined on an infinite interval [a, œ) and 
integrable on any finite interval [a,b]. If 
limy.,.. J? f(x)dx exists and is finite, then this 
limit is called the improper integral of f on 

[a, œ) and is denoted by |? f(x)dx. We define 
similarly f? ,, f(x)dx =lim,.,_,, f? f(x)dx, where 
f is defined on (—oo, b] and integrable on any 
interval [a, b]. Furthermore, f2 f(x)dx is, by 
definition, f% ,, f(x)dx + f£ f(x)dx, which is 
independent of the choice of c. Suppose that 
f(x) is integrable on [a, b] for a fixed a and an 
arbitrary b larger than a. If f(x) = O(x’) for 
some x < —1, then |? f(x)dx exists. Generally, 
for x, $ such that —œ <a<oo and —œ < 

P< œ, if the improper integral ff f(x)dx exists, 
we say that the integral is convergent; other- 
wise, it is divergent. Improper integrals also 
satisfy the three basic properties of integrals 
(1), (2), and (3) (— Section B). However, the 
existence of an improper integral of a func- 
tion f on an interval J does not imply the 
existence of the improper integral of |f] on 
the same interval J. For example, let f be 

a function determined by f(0)=0, f(x)= 
(1/x)sin(1/x) for 0<x <x. Then [§ f(x)dx 
exists, but fg] fC) dx does not. On the other 
hand, if the improper integral of | f(x)| exists, 
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then the improper integral of f(x) exists, and 
we have 


B 
| f(x)dx 


where —œ <a<f< +o. In this case, we say 
that f is absolutely integrable on the interval 
[x, 8]. Assume now that f(x) is defined on 
(—œ, œ) and integrable on any finite interval. 
If lim, |. f(x)dx exists, then it is called 
Cauchy’s principal value of the integral of f in 
(—00, 00). 

If f(x) is a monotone decreasing, positive, 
and continuous function defined on [k, œ) 
(where k is an integer), then according as 
Zo f(v) converges or diverges, so does 
SP f(x) dx. 

Suppose that a series B f,(x), where all 
the functions f,(x) (n=1,2,...) are defined and 
nonnegative on an infinite interval [a, œ), 
satisfies (MDP hdx = D2, fo f(x)dx 
for arbitrary b> a. Then according as 
E fe f(x) dx converges or diverges, so does 
I? (Cam, fr(x))dx. When they converge, the 
following equality holds: {2 E f,(x)dx 
= Era |? f(x) dx. In this theorem, if 
fe EZ lfOodldx or E2, [E fol dx con- 
verges, then the same conclusion as above will 
follow even when the f,(x) are not necessarily 
positive (— Appendix A, Table 9). 


B 
<| | f(x)| dx, 


F. Multiple Integrals 


Suppose that f(x, y) is a function defined and 
bounded on an interval I = {(x, y)|a<x<b, 
c<y<d} in the xy-plane. Partitions {x,} and 
{y,} of [a,b] and [c,d] with a=xọ <x; <... 
<Xmn=b and c=yo<y,<...<y,=d determine 
a “partition,” denoted by D, of J into subinter- 
vals of the form I, = {(x, y)|Xj-1 SX S Xj, Yr-1 S 
y<y,} G=1,...,m; k=1,...,n). Writing 

M= sup f(x,y), 


(x, pe Tjk 


Mj = inf f(x, y), 


(x. yel jy 


we set 


o(D)= y M (Xj — Xj—1) Vie — Yr-1) 


fe 
ll 
= 
~ 
ll 
pa 


a(D)= X, 2 M(X; — Xj-1) Ve — Yr-1). 
j= k=l 

Then we obtain infp E(D) >suppa(D). If 
inf, o(D)=suppa(D), then f(x, y) is called inte- 
grable on J, and the common value is called 
the double integral of f on J and is denoted by 
JJ, f(x, y)dx dy. Analogously, we can define n- 
tuple integrals (or multiple integrals) and the 
integrability of functions of n variables. 

Let K be a bounded set in the xy-plane and 
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I an interval containing K. Let g(x, y) be the 
characteristic function of K defined on 7, that 
is, ọ is determined by 


g(x,yJ=1 for (x,y)eK, 
g(x,y)=0 for (x,yjel—K. 


Replacing f(x, y) by this @(x, y), we consider 
inf, (D) (supp, a(D)). These values can be 
shown to be independent of the choice of such 
an interval J and are called the outer area 
(inner area) of K, respectively. When these two 
values coincide, K 1s said to be of definite area, 
and the common value is called the area of K. 
A necessary and sufficient condition for K to 
be of definite area is that the outer area of 

the tboundary of K be zero. Now consider a 
bounded function defined on a set K of de- 
finite area. Then, taking an interval I contain- 
ing K, define an extension (x, y) of f(x, y) as 
follows: 


(x, y)=f(x,y) for (x, yJeK, 
g(x,y)=0 for (x,yjeI—K. 


If ọ(x, y) is integrable on I, then f(x, y) is called 
integrable on K, and the integral of f on K is 
defined by f fk f(x, y)dxdy =f f; p(x, y)dxdy, 
which is independent of the special choice of I. 
The set K is called the domain of integration. 
Since K is of definite area, the set of boundary 
points of K at which (x, y) is not continuous 
can be contained in a union of intervals whose 
total area can be made smaller than any preas- 
signed positive number. Consequently, a func- 
tion bounded on K and continuous at each 
interior point of K is integrable on K. Like 
integrals of functions of a single variable, 
multiple integrals satisfy the three basic prop- 
erties of integrals (— Section B). 


G. Multiple Integrals and Iterated Integrals 


Suppose that we are given a function f(x, y) 
that is continuous on an interval I = {(x, y)| 
axx<b,c<y<d}. Then, for a fixed y in 
[c,d], the function f(x, y), regarded as a func- 
tion of x, can be integrated with respect to 

x on the interval [a,b], and the integral thus 
obtained is a continuous function of y. The 
integral of the function defined on [c,d], 
namely, J2({? f(x, y)dx) dy, is called the iterated 
integral (or repeated integral) of f(x, y) and is 
often written as [¢dy |? f(x, y)dx. The following 
formula gives a representation of a double 
integral by iterated ones: 


d b i 
{| fis yiaxdy= | a| f(x, y)dx 
b d 
-| ax| I(x, y)dy. 
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More generally, o; (x) and p(x) being con- 
tinuous on [a, b] and o, (x) < @2(x), consider 
the following subset K = {(x, y)|a<x <b, 

0 (x)<y <¢@,(x)! of the xy-plane. Suppose 
further that f(x, y) is continuous on K. Then 
the following equality holds: 


b P(x) 
{| I(x, y)dx iy=| dx f(x, y)dy. 


a p(x) 


In the case of unbounded integrands or 
unbounded domains of integration, we can 
still define integrals under suitable restrictions. 
For instance, assume the following two prop- 
erties: (1) There exists a sequence {K,,} of sets, 
each of which is of definite area, satisfying 
K,c Kc... and K=|)”, Kn (2) f(x, y) is 
bounded and integrable on each K, (n= 
1,2,...). Ifa finite limit limp | fx, f(x, y)dxdy 
exists and is independent of the choice of {K,}, 
then f(x, y) is called integrable on K. This limit 
is called the integral of f(x, y) on K and is 
denoted by f fx f(x, y)dxdy: 


lim i fos. nacay=| | f(x, y)dx dy. 
eae Kn K 


When the integral thus defined exists, we say 
that the integral is convergent. If a finite limit 
liM, | Sx,| f(x, y)ldx dy exists for some se- 
quence {K,,} with property (1), then f is inte- 
grable on K. Let f(x, y) be continuous and 
nonnegative on K ={(x, y)|Ja<x<f,y<y<od}, 
where -œ <a < f <œ, —o <y <ô <œ. Fur- 
thermore, let f(x, y) be integrable on K, and 
assume that the improper integral F(x)= 

P f(x, y)dy=lim,,, ats fe f(x, y)dy exists and 
converges uniformly with respect to x as cy, 
dt. Then [f F(x)dx is well defined, and we 
have 


B é 
[i fændxdy=f ax| f(x, y)dy. 
K a Y 


In particular, if «=a, y =b, f = ô= œ, then we 
have 


| | fos yrbvdy=| af f(x, y)dy. 
a b a b 


H. Interchanging the Order of Differentiation 
and Integration 


If both f(x, y) and ôf (x, y)/ðy are continuous 
on an interval {(x, y|a<x<b, yo—n<y< 
Yo +n}, then we can interchange the order of 
differentiation and integration as follows: 


d b b 6 : 
— | fix,y)dx= fy) dx for y=yo. 
dy Ja a Oy 


Assume further that this equality holds for 
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every b( >a), the improper integral 


æ b 
| fs, ypa= im | I(x, y)dx 
converges, and the improper integral 


oe b A 
Of (x, y) actin Of (x, y) dx 
a oy b-+00 oy 


a 


converges as b> co uniformly for y with |y— 
Vol <n. Then 


d {® 2g 
— f(x, y)dx= TOY i for y=yo. 
dy Ja a oy 


Several other similar theorems are known. 
Though the previous theorems are written in 
terms of two variables, analogous theorems 
hold for n variables. 


I. Change of Variables in Multiple Integrals 


Let G be a bounded domain of definite area in 
an n-dimensional Euclidean space R”(x). As- 
sume that a mapping x y(x)=(y,(X1,.--5 Xn} 
wey Va(X15 «+5 Xp)) is Of class C! from an open 
set containing the closure G of G into an n- 
dimensional Euclidean space R"(y). We de- 
note the image of G under this mapping by B. 
If f(y,,---,¥,) is continuous on B, then the 
following formula on change of variables 
holds: 


ie SOY is -+s Vad dyi- dYa = 


D(Yi, -+3 Yn) 
seh Gee Xs) 
G 


DX i 5 -3 Xp) 

where g(Xi, -s Xa) = f1 (X15 +++ Xah + Val 
...,X,)) and D(y,,...,,)/D(x,,.-.,X,) is the 
tJacobian determinant of the mapping y(x). 
This formula is usually utilized in the case 
where y,,..., Y, are functionally independent, 
though otherwise both sides vanish and the 
formula still holds. For improper integrals, 
a similar formula will hold under suitable 
restrictions, for example, if the integrals con- 
verge absolutely. 

For related topics — 94 Curvilinear In- 
tegrals and Surface Integrals, 221 Integration 
Theory, and 270 Measure Theory. 


dx, ...dx,, 
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A. General Remarks 


Equations including the integrals of unknown 
functions are called integral equations. The 
most studied ones are the linear integral equa- 
tions, i.e., linear in unknown functions. 

Let D be a domain of n-dimensional Eu- 
clidean space and f(x) and K(x, y) be func- 
tions defined for x=(x;, X3, ..., Xp}E D, y= 
(Yis Y2 ---, Yn)E D. Integral equations of Fred- 
holm type (or Fredholm integral equations) [1] 
are those of the forms 


| K(x, y)o(y) dy = f(x), (1) 
ọ(x)— | K(x, yyp(y) dy=f(x), (2) 
D 
A(x) @(x)— | K(x, y)o(y)dy = f(x), (3) 
D 


where (x) is an unknown function and Jody 
means the n-fold integral |... [pdy,...dy,. 
Equations of the forms (1), (2), and (3) are 
called equations of the first, second, and third 
kind, respectively. Equations of the second 
kind have been investigated in great detail. 
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Equations of the third kind, in many cases, can 
be reduced formally to those of the second 
kind. The function K(x, y) is called a kernel (or 
integral kernel) of the integral equation. 
Integral equations of Volterra type (or Vol- 
terra integral equations) are those of the forms 


[Kes nowdr= ft, (1’) 
ots) | K(x, ye(y)dy= f(x), (2’) 
A(x) p(x) — | K(x, yyp(y) dy = f(x), (3) 


where g(x) is an unknown function. Equations 
of the forms (1’), (2’), and (3’) are also called 
equations of the first, second, and third kind, 
respectively. Integral equations of Volterra 
type can be regarded as integral equations of 
Fredholm type having kernels equal to 0 for 
x<y, but these two types of equations are 
usually treated separately, since they have 
considerably different characters. 

The kernels in equations (1)—(3) and (1’)— 
(3’) are frequently written in the form AK (x, y) 
with a parameter A, in particular when the 
equations are related to eigenvalue problems, 
which is explained in Section F. 

The theory of integral equations was orig- 
inated in 1823 by N. H. Abel, who investigated 
the relationship between time and the path of 
a falling body in the field of gravitation. Let 
p(t) be a quantity varying with time, which is 
connected by some law with its values in some 
time interval of the past or the future. Then 
the law of variation of y(t) can be described 
mathematically by an integral equation. The 
situation ts the same even when the variable t 
is not time but a coordinate of the space. In 
this way, various problems in physics can be 
reduced to solutions of integral equations. 


B. Relation to Differential Equations 


Many problems in differential equations can 
be reduced to problems related to integral 
equations. Such reduction often makes the 
problems easier to handle and clarifies the 
nature of the solutions. For example, consider 
the problem of finding a solution of the ordi- 
nary second-order linear differential equation 
d? y/dx? + Ay =0 with the boundary condition 
y(0) = y(1)=0 [4]. Let d? y/dx? = u(x). If we 
integrate the equation twice, change the order 
of integration, and make use of the boundary 
condition, then we have 


=| (x—)u(d)dé -f x(1— ¢)u(d) dé, 
0 0 


825 


from which we see that the given differential 
equation can be written in the form 


1 x 
u -af x(1 — ends 2 f (x — Jul) dě. 
0 0 

Decomposing the first integral on the right- 
hand side into the sum of an integral over 

(0, x) and one over (x, 1), and combining the 
integral over (0, x) with the second integral on 
the right-hand side, we obtain a Fredholm 
integral equation of the first kind as follows: 


ozi | G(x, QulOdé, 


Clearly, the solution of this integral equation 
is equivalent to that of the original differential 
equation. The function G is called *Green’s 
function for the boundary value condition 
y(0) = y(1)=0 in the theory of boundary value 
problems. Differential equations of higher 
orders can be treated analogously (— 315 
Ordinary Differential Equations (Boundary 
Value Problems)). tInitial value problems of 
linear ordinary differential equations can be 
reduced to the solution of Volterra integral 
equations in a similar way. 

As another example [5, 6], consider the 
tDirichlet problem on a plane, 1.e., the problem 
of finding a function u satisfying the conditions 
(i) u is tharmonic in the interior of the region D 
bounded by a closed curve C(ë = ¢(s),47 =(s), 
O<s<lJ); (ii) u(x, y)> F(s) uniformly with re- 
spect to (Xo, Yo) as (x, y) approaches (xo, Yo) 
from the inside of D, where (x9, Yo) is an arbi- 
trary point on C, F(s) is a continuous func- 
tion given on C, and s is the arc length along 
C. Put f(s)= F(s)/x and 


M-Y Op- lols- EMH) 
m(p(s)— POF +- — 


Then it is known that a solution u of the prob- 
lem can be given in the form 


K(s;t)= 


l ô 1l 
u(x, y)= | a(s)}—log-ds, 
o On r 
where r? =(¢(s)— x)? +(W(s)— y)’, n is the inner 
normal of C, and p(s) is a continuous solution 
of the following Fredholm integral equation of 
the second kind: 


I 
wo=s0-| K (s; t)u(t) dt. 

0 
We can treat the Neumann problem similarly, 
i.e., the problem in which condition (ii) is re- 
placed by (it) (@u/0n)(x, y)—> F (s) uniformly 
with respect to (Xo, Yo) as (x, y) approaches 
(Xo, Yo) from the inside of D. In the Neumann 
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problem, put f(s)=F(s)/x and 


((s)— vi) e'(s)—(e(s)— eM) HS) 
n(p(s)— PO) +(W(s)-— WO) — 


Then we have a solution u in the form 


L{s; t)= 


; 1 
u(x, y)= — | u(s)log—ds+c, 
r r 
where ji(s) is a solution of the following Fred- 
holm integral equation of the second kind: | 


I 
u(s)=f(s)— | L(s; t) y(t) dt. 

0 
A solution of this integral equation, however, 
exists when and only when fh F(s)ds=0. In the 
expression of a solution u(x, y) of the Neu- 
mann problem, ¢ is an arbitrary additive con- 
stant, up to which a solution of the problem is 
determined uniquely. We can also treat par- 
tial differential equations of telliptic type in 
an analogous way. 


C. Integral Equations with Continuous Kernel 


We describe some results for integral equa- 
tions with m-dimensional independent vari- 
ables, i.e., equations in which D is an m- 
dimensional closed domain. We assume that 
K(x, y) and f(x) are continuous in Sections 
D-H. 


D. The Method of Successive Iteration 


Among methods of solving Fredholm integral 
equations of the second kind, the simplest is 
the method of successive iteration, sometimes 
called the method of successive approximation 
[7]. In the method of successive iteration, we 
rewrite (2) in the form 


p(x)=f(x)+ | K(x, y)o(y) dy 


D 


and replace the function ¢(y) on the right- 
hand side by the function 


f(y)+ | K(y, z)@(z)dz. 
D 


If we repeat the process successively, then we 
have 


p(x) = f(x) + J | K(x, y) f(y) dy 
t= D 


+| Kisii, yo y)dy, 
D 


where 


K(x, y)= K(x, y), 


K,(x, n= | K;-;(x, s)K (s, y)ds. 
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The functions K(x, y) are called the iterated 
kernels. Assume that >, K,(x, y) converges 
uniformly. Then, putting 


R(x, y)= 3 K,(x,9), (4) 


we obtain a solution of (2) in the form 


p= | R(x, y) f(y) dy. (5) 


The series (4) is called a Neumann series. 
For a given kernel K(x, y), a function R(x, y) 
satisfying 


K(x, -Ræ | K(x, s)R(s, y)ds =0 


and 


K(x, y)— R(x, n+] R(x, s)K(s, y)ds =0 
D 


is called a resolvent of K(x, y) (in some cases 
— R(x, y) is taken as the resolvent). If a re- 
solvent of K(x, y) exists, the solution of (2) 
can be given uniquely by (5). If a Neumann 
series converges uniformly, (4) gives a resolv- 
ent of K(x, y). 

If we apply a similar process to Volterra 
integral equations of the second kind, then we 
have the iterated kernels defined by 


Kaiten f K{x,s)K(s,y)ds (i=1,2,...). 
y 

For these iterated kernels, a Neumann series 
defined by (4) always converges uniformly. 


E. Fredholm’s Method 


Let D be a bounded closed domain and K(x, y) 
a continuous kernel. A Neumann series (4) 
converges uniformly and gives a resolvent if 
|K(x, y)| or the region D is sufficiently small, 
but otherwise it does not necessarily converge. 
E. I. Fredholm [1,7] gave a method of con- 
structing a resolvent for the more general case. 
Write a kernel in the form 2K(x, y), and put 


ae K(x;,¥1) K(X, Yn) 
K = ; 
Pieria. Rect ai) K (Xp, Yn) 
Define D(A) and D(x, y; 2) by 
D(A)= 
wo (— A)" vive Be 
145! {| K(® dnd 
na. M! Jp D Siz scssi 
and 


D(x, y;A)= K(x, y) 


æ% (—A)" Si jess Se 
Er ) | I K(* a Jasi ds, 
n=1 n! D D Ys Sy5-++55y 
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The series in these two equations both con- 
verge uniformly and hence define tentire func- 
tions of 7. The functions D(A) and D(x, y; A) are 
called Fredholm’s determinant and Fredholm’s 
first minor of the kernel K(x, y), respectively. 
For small |2|, we have 


— = “n-i 
DO Yå [Ku s)ds, 


where the K,,(x, y) are iterated kernels corre- 
sponding to K(x, y). Now if D(A) 40, a resolv- 
ent AR(x, y; A) of the kernel AK(x, y) can be 
given by 
D(x, y;) 


If D(A) =0, some extension of the method in 
this section is needed. Fredholm introduced 
for this purpose Fredholm’s rth minor 


aes 
Yis Ve 
defined by 
Xis... Xp 
oon 
Yis- Vp 


=K("™ y y (—Ay" 


Pisces Me! wa A! 
Nyy cory Xps Sy, 00-58 
D D Virerrs Veo Siste Sy 


F. Eigenvalue Problems and Fredholm’s 
Alternative Theorem 


Consider a homogeneous integral equation of 
the second kind 


TE K(x, y)o(y)dy=0, (6) 


where D is a bounded closed domain and 

K(x, y) is continuous in D x D. When (6) has 

a nontrivial solution g(x) for some 4, then å 

is called an eigenvalue corresponding to the 
kernel K(x, y), and the corresponding non- 
trivial solution g(x) is called an eigenfunction 
corresponding to the kernel K(x, y). If D(A) #0, 
then (6) has no nontrivial solution, from which 
it follows that eigenvalues must be zero points 
of the entire function D(A). For an arbitrary 
eigenvalue À, there is a set of linearly inde- 
pendent eigenfunctions corresponding to À 
such that any etgenfunction corresponding to 
å can be written as a linear combination of 
the eigenfunctions belonging to the set under 
consideration. Such a set of linearly indepen- 
dent eigenfunctions corresponding to an eigen- 
value / is called a fundamental system corre- 
sponding to the eigenvalue 4. The number of 
elements of the fundamental system is called 
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the index of the eigenvalue 4. The index of an 
eigenvalue is always finite. The homogeneous 
integral equation of the form 


wf K(x, y)(x)dx =0 (6') 
D 


is called an associated (or transposed) integral 
equation of (6). The associated equation has 
the same eigenvalues as the original equation; 
moreover, the index of a common eigenvalue is 
the same for both equations. For any eigen- 
value A, the order of the zero point 4 of the 
entire function D(A) is called the multiplicity of 
the eigenvalue 4. If an eigenvalue å is a pole of 
R(x, y; 4), then we have p+ 1 >r-+q, where r is 
the order of the pole, p is the multiplicity of A, 
and q is the index of 4. In particular, if A is a 
simple pole of R(x, y, 4), we have p=q, namely, 
the multiplicity is equal to the index. An exam- 
ple with this particular property is the integral 
equation with a symmetric kernel to be dis- 
cussed in Section G. For the set of eigenvalues 
there is no finite accumulation point even if 
there are infinitely many eigenvalues. 

If å is not an eigenvalue, the inhomogeneous 
equation 


(x) -a| K(x, yjø(y)dy = f(x) (7) 
D 


can be solved uniquely for any continuous 
function f(x). In this case we have D(A) # 

0, and the resolvent R(x, y; 2) of the kernel 

AK (x, y) exists. If A is an eigenvalue, we have 
D(A) =0, and equation (7) has a solution if and 
only if 


| (x) f(x)dx =0 


for all solutions W(y) of (6) (linearly indepen- 
dent solutions w(y) are finite in number). The 
last statement is called Fredholm’s alterna- 
tive theorem (— 68 Compact and Nuclear 
Operators). 

A kernel of the type 


K(x,y)= 2 Xx) YO) 


is called a separated kernel, degenerate kernel, 
or Pincherle-Goursat kernel. For such a kernel, 
we have D(A)=det(d,,— 4 |? X,(t) ¥,(t)dt), and 
hence we can easily obtain eigenvalues and 
eigenfunctions. A nondegenerate kernel can be 
studied using the results obtained for sepa- 
rated kernels, since we can regard a kernel of 
the general form as the limit of a sequence of 
separated kernels. 


G. Symmetric Kernels 


A kernel K(x, y) is called a symmetric kernel 
if it is real and K(x, y}= K(y, x). Let D bea 
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bounded closed domain and K(x, y) be a con- 
tinuous symmetric kernel. In this case the as- 
sociated equation (6’) clearly coincides with the 
original equation (6) [5, 6, 8]. 

Corresponding to any nontrivial symmetric 
kernel K(x, y), there exist at least one eigen- 
value and one eigenfunction. The eigenvalues 
are all real, and the eigenfunctions correspond- 
ing to distinct eigenvalues are mutually ortho- 
gonal. If we orthonormalize the eigenfunctions 
belonging to all fundamental systems and 
number them according to the order of in- 
creasing absolute values of the corresponding 
eigenvalues, then we have an orthonormal 
system {@,(x)}, called a complete orthonormal 
system of fundamental functions or simply a 
complete orthogonal (or orthonormal) system. 
If we number the eigenvalues taking their 
multiplicities into account and according to 
the order of increasing absolute values, then 
we have the equality 


s 1 
X zÍ | K(x, y)dx dy. 
i DJD 


i=1 4 


Corresponding to an iterated kernel 
K m(x, y), we have the eigenvalues {4;"}, and we 
can choose the corresponding orthonormal 
system so that it coincides with the one corre- 
sponding to K(x, y). Eigenvalues and eigen- 
functions corresponding to an iterated kernel 
can be obtained in the following way: Put 
[p K,,(s, s)ds=u,; then the following limit 
exists: 


lim u,,/Uy_42=A4°< +0, 
n> æ 


which gives an eigenvalue of the iterated ker- 
nel K,(x, y). A function g(x, y) defined by 


lim 4" K(x, = (9) 


(uniformly convergent) gives the correspond- 
ing eigenfunction (x, c) for any constant c 
satisfying ọ(x,c)Æ0. 

Let 4" be an eigenvalue corresponding to an 
iterated kernel K,(x, y) and ọ(x) be a corre- 
sponding eigenfunction. Consider the func- 
tions w,(x) (j=0, 1,...,n— 1) defined by 


n-1 
HO=o()+ $ oat K,(x, ye(y)dy 


D 
(j=90,1,2,...,n—1), 


where e is one of the nth primitive roots of 1. 
The for at least one value of j, e74 is an eigen- 
value corresponding to the kernel K(x, y), and 
yw, is a corresponding eigenfunction. This rela- 
tionship between eigenvalues and eigenfunc- 
tions corresponding to an iterated kernel and 
those corresponding to the original kernel is 
valid even for kernels that are not necessarily 
symmetric [2]. 
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Let a kernel K(x, y) be defined by 


K(x, Y= K(x, y) SOO), 
i=1 i 

where the /; (i= 1,2,...,n) are the eigenvalues 
corresponding to the kernel K(x, y) and the 
(x) (i=1,...,n) are the corresponding ortho- 
normalized eigenfunctions. Then eigenvalues 
and eigenfunctions corresponding to K(x, y) 
are those corresponding to K(x, y), with the 
exception of A,,...,4, and @,(x), ..., @,(X). 

Let (x) be any function that satisfies 
Jo(y(x))? dx = 1. Then the integral 


=| | K(x, yp(x) p(y) dx dy 
DJD 


assumes the maximum value when ọ(x) is an 
eigenfunction corresponding to K, (x, y) with 
the smallest eigenvalue 4?. Let the eigenvalues 
2, Of K(x, y) be numbered tn order of increas- 
ing absolute values, so that |/,,|<|Z,,.,|. Let 
p(x) be any function satisfying 


| Wi(x)p(x)dx=0 (i=1,2,...,n), 


| (p(x)? dx=1 


for given functions w(x) (i=1,...,n). Then the 
maximum value of the integral J above is the 
least when the set of all linear combinations of 
{p (x), ...,W,(x)} coincides with the set of all 
linear combinations of {@, (x), ...,@,(x)}, and 
in this case the maximum of J is attained by 
some eigenfunction (x) corresponding to 
K,(x, y) with the eigenvalue 4, ,. The results 
in this paragraph show that we can obtain 
eigenvalues by solving a variational problem 
concerning the integral J. 


H. Expansion Theorems 
Let K(x, y) be a continuous symmetric kernel 
and h(x) be a function square integrable on a 


bounded closed domain D. Then a function 
J (x) such that 


I(x)= i K(x, y)h(y)dy 

can be expanded in the form 

$0) = Yeu 

where {@,(x)} is a complete orthonormal sys- 


tem of fundamental functions corresponding 
to K(x, y) and 


c=| f(x@,(x)dx  (n=1,2,...). 
D 
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The series in the expansion of f(x) converges 
uniformly. These facts are the content of the 
Hilbert-Schmidt expansion theorem [5, 6, 8]. By 
using this theorem, for a å that is not an eigen- 
value, we can obtain a solution (x) of the 
Fredholm integral equation (7) with a sym- 
metric kernel in the form 


wd=fej+25, 20 


| I(x) @j(x) dx. 
D 


For m2 2, iterated kernels can be expanded in 
the form 


(uniformly convergent). If AR(x, y; A) is a re- 

solvent of a symmetric kernel åK (x, y), then 

R(x, y; 4) can be expanded as 

. & Gx) Ply) 

R(x, y;A)= K(x, y)+ —~. 

wy D=K +AA Fo 
If a symmetric kernel K(x, y) satisfies the 

inequality 


| | K(x, y)p(x)e(y) dx dy 20 
DJD 


for all (x), it is called a positive (semidefinite) 
kernel. If in this inequality the equality holds 
only for g(x)=0, K(x, y} is called a positive 
definite kernel. For a positive definite kernel, 
eigenvalues are all positive, and the kernel can 
be expanded in the form 
K(x, y)= 5 Pe 
i=1 i 

(uniformly convergent). This result is called 
Mercer’s theorem. 

When a real continuous kernel K(x, y) is not 
symmetric, we consider two positive kernels 
K'(x, y) and K"(x, y) defined by 


| K(x, s)K (y, s)ds= K(x, y) 
D 


and 


| K(s,x)K(s, y)ds=K"(x, y). 
D 


The eigenvalues corresponding to these ker- 
nels are the same, and they are all positive. Let 
A; (i=1,2,...) be these eigenvalues and {@,(x)} 
and {w,(x)} be the corresponding complete 
orthonormal systems corresponding to K’ and 
R”, respectively. Then we have 


1 K(y, x)p;(y)dy = Yx), 
D 


Ài | K(x, y)Wi(y) dy = p(x). 
D 
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Let f(x) be an arbitrary function such that 


s=] K(x, y)h(y)dy, 


where h(x) is a function square integrable on 
D. The function f(x) can then be expanded in 
the form 


S= È este (8) 


where 


=| S(x@(x)dx  (i=1,2,...). 
D 


The series in the expansion (8) converges 
uniformly. 

The Fredholm integral equation (1) of the 
first kind with a general kernel (i.e., a kernel 
that is not necessarily symmetric) has a square 
integrable solution (x) if and only if f(x) has 
a uniformly convergent expansion (8) and 

24(c;A,))? < +00. When this condition is 
satisfied, equation (1) has a solution given by 
Enoi CnAn Q(X) that converges in the sense of 
tmean convergence. 

It should be noted that the theory concern- 
ing symmetric kernels can be extended to 
complex-valued Hermitian kernels, i.e., kernels 
such that K(x, y)= K(y, x). Also, we can obtain 
the theory in Section G and this section, con- 
cerning Fredholm integral equations with 
continuous kernels, by using the methods of 
functional analysis that treat fp K(x, y)y(y)dy 
as a ‘compact operator in the space of con- 
tinuous functions (— 68 Compact and Nuclear 
Operators). 


I. Kernels of Hilbert-Schmidt Type 


Kernels of Hilbert-Schmidt type are kernels 
which are square integrable in the sense of 
Lebesgue over D x D, where D is an arbitrary 
domain. Most of the results mentioned in the 
previous section concerning integral equations 
with kernels continuous on bounded domains 
are valid also for equations with kernels of 
Hilbert-Schmidt type, because every operator 
mentioned in the previous section is also a 
compact operator in the space concerned, i.e., 
the space L,(D) [6] (— 68 Compact and 
Nuclear Operators). 


J. Singular Kernels 


For general kernels that are not necessarily 
continuous, the theory described in the previ- 
ous sections does not apply properly, but 
when an iterated kernel K,,,(x, y) has a resolv- 
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ent R,,,(x, y), we can find a resolvent R(x, y) of 
K(x, y) in the form 


R(x, y) = Ry A(X, y) + Al, y) 


ef R m(x, 8) H,,(s, y) ds, 
D 


where H,,(x, y)=07",' K‘(x, y). When a kernel 
is of the form AK (x, i the relationship be- 
tween eigenvalues and eigenfunctions corre- 
sponding to an iterated kernel and those cor- 
responding to the original kernel, which was 
stated in Sections G and H, is still valid for 
general kernels. If a kernel K(x, y) is continu- 
ous for x #y and has a singularity of the form 
|x—y|-*(0<a<1)onx=y, the iterated ker- 
nels K,,(x, y) are continuous provided that 

(1 —a)m>1. tGreen’s functions of partial dif- 
ferential equations of elliptic type have this 
property. 

A kernel that is not square integrable is 
called a singular kernel. An integral equation 
whose domain of definition is unbounded or 
whose kernel is singular is called a singular 
integral equation [9]. Singular integral equa- 
tions have some particular properties that are 
not seen in ordinary integral equations, i.e., 
integral equations with kernels continuous in a 
bounded closed domain. For example [10], 
consider the identity 


= y 
sin Y d 
E eo ( fiera) 
= Leeg a, 
fk a? + x? 


where « is an arbitrary real number. This 
equality shows that for the continuous kernel 
/2/rsin xy, 1 is an eigenvalue and /n/2 e 
+x/(a* + x?) is a corresponding eigenfunction. 
Since «& is arbitrary, the index of the eigenvalue 
1 is evidently infinity. As another example, 
observe the equality 


ts @) 2 
—lz~yl emio dy = ~iax 
e e = ev 
fe ETE 


where « is an arbitrary real number. From this 
equality, we see that for the continuous kernel 
e '*~™ defined on (—oo, 00), and number A= 
(1+.«7)/2 greater than or equal to 1/2 is an 
eigenvalue. In this example, the spectrum, i.c., 
the set of eigenvalues, is a continuum. Such a 
spectrum is called a continuous spectrum. 

In applications, an important role is played 
by integral equations with kernels of Carleman 
type: 


K(x, y)= G(x, y)/(y — x), 


where G(x, y) is a bounded function. In inte- 
gral equations with such kernels, the integral 
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is taken in the sense of the tCauchy principal 
value [5,9, 11,12]. For example, the Riemann- 
Hilbert problem is the following: We are given 
a simple closed and smooth curve L in the 
complex plane and real-valued smooth func- 
tions a, b, and c defined on L, a+ib never 
vanishing. The problem is to find a function 
(z) which is holomorphic in the exterior of L, 
at most of polynomial growth at infinity, and 
continuous up to L with boundary value + 
such that Re[(a+ib)pt]=c on L. This prob- 
lem is reduced to that of finding a function p 
defined on L satisfying an integral equation 


G(z, 
uo | NGS) eae Fy er) 
L z—€ 


where G ts a smooth kernel determined by 
a+ib, f is a known function depending on c, 
and the integral is taken in the sense of the 
Cauchy principal value. The integer x defined 
by K=(1/n)f, d(arg(a—ib)) is called the in- 
dex of this problem. The full solution of the 
Riemann-Hilbert problem was given by I. N. 
Vekua [11]. 

A multidimensional analog of the Cauchy 
principal value is the singular integral of A. P. 
Calderon and A. Zygmund (— 251 Linear 
Operators). A smooth function k(x) defined 
in R” except at x =0 is called a kernel of 
Calderon-Zygmund type if k(x) is positively 
homogeneous of order —n and if its integral 
mean on the unit sphere is zero. Then the 
operator K defined by 


Kozim | | k(y) f(x — y)dy 
yl>e 

is called a Calderon-Zygmund singular integral 
operator [13]. K is a bounded linear operator 
in L?(R") if 1<p< +o. Ifn=1 and k(x)= 
1/{xix), K is nothing but the ‘Hilbert trans- 
formation. The pseudodifferential operator (— 
345 Pseudodifferential Operators) is an exten- 
sion in some sense of the singular integral 
operator. 


K. Systems of Integral Equations 
A system of Fredholm integral equations of 
the second kind can always be reduced to a 


single equation. In fact, as is seen easily, a 
system of integral equations 


1 
air] K;(x, yoy) dy = f(x) 
i Jo 


can be reduced to a single equation 


D(x)—A i K(x, y)®(y)dy= F(x) (O<x<n), 
o 
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where 


®(x)=p(x—i+1)  F(x)=f({x—i+1), 


K(x, yy=K,(x—i+ 1,y—j+1) 
(i—1l<x,j-l<y<jij=1,2,...,n). 


A system of Volterra integral equations of the 
second kind can be reduced to a single equa- 
tion by eliminating the unknown functions 
successively. 


L. Integral Equations of Volterra Type 


Consider a Volterra integral equation of the 
first kind 


| K(x, y)e(y)dy= f(x) 
such that K(x, x)40 and K,(x, y) and f'(x) are 
continuous. If we differentiate both sides of the 
equation, then we have a Volterra integral 
equation of the second kind: 


*K,(x, y) — £09) 
ots | K(x) (y)dy = K(x) 


a 


Abel’s integral equation of general form is 


*G 
| CoN) a= fe) (0<a<1). (9) 
o (x—y) 


If G, G,, and f’ are continuous and G(x, x) 40, 
equation (9) can be reduced to the equation 


\ iu sonar | f(x)(u—x)** dx, 
0 0 
where 

“G(x, y)dx 


a Ce een 


. Since H(u, u)=(z/sinaxz)G(u, u) 40, it follows 


that 


“H ; 
TH Te o(y)dy= at) (9a) 


where 


g(u)= H(u, uy" = [rwu =x) dx 
du Jo 
=H(u,u) ' (ro 


-f u-i dx); 
0 

Clearly (9a) is a Volterra integral equation of 
the second kind. Abel’s problem (— Section A) 
was to find the path of a falling body for a 
given time of descent. The problem then can 

be reduced to the solution of equation (9) with 
G(x, y)=1 and a= 1/2. When G{x, y)=1, we 
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can solve equation (9) explicitly to get 


sin a7 


p(x) 


(o+ | -AO i) 


T 0 


[5]. 


M. Nonlinear Integral Equations 


When a nonlinear integral equation includes a 
parameter å, it may happen that the parameter 
has a bifurcation point, i.e., a value 4) such that 
the number of real solutions is changed when 
A varies through /, taking real values. For 
example, consider the equation 


1 
TE p*(y)dy=1. 
This equation has real solutions ọ(x)=(1 + 
./1—4A)/24 for 4< 1/4 but no real solution 
for A> 1/4. Hence A, = 1/4 is a bifurcation 
point (— 286 Nonlinear Functional Analysis). 
Among nonlinear integral equations, Ham- 
merstein’s integral equation has been studied 
in detail [5, 14]. It is an equation of the form 


ot | K(x, y) f(y, e(y)) dy =0. (10) 
D 


If K(x, y) and f(y, 0) are square integrable and 
f(y. u) satisfies a tLipschitz condition in u with 
a sufficiently small coefficient, then the inte- 
gral equation (10) can be solved by successive 
approximations. If K(x, y) is a square inte- 
grable positive kernel and {p|K(x, y)|? dy is 
bounded, then we can prove the existence and 
uniqueness of a solution of (10) under a con- 
dition on f(y, u) weaker than a Lipschitz con- 
dition. We can prove similar results for equa- 
tion (10) with a nonsymmetric kernel when 
K(x, y) is continuous in the mean, that is, 


lim | | K(x’, y)— K(x, y)|? dy =0, 
x >x D 


im | |K(x, y’)—K(x, y)? dx =0. 
YY Tp 


A nonlinear Volterra integral equation of the 
form 


o=- | F(x, y; e(y))dy 
can be solved by successive approximations 

if F(x, y;u) and f(x) are square integrable, 

F(x, y;u) satisfies a Lipschitz condition 

| F(x, y; w) — F(x, y, u”)| < k(x, y)|u'—u”| with 
some square integrable function k(x, y), and 

Jz F(x, y; f(y))dy is majorized by some square 
integrable function of x. When F(x, y; u) and 
f(x) are continuous, we can obtain theorems 
on the existence and uniqueness of continuous 
solutions and tcomparison theorems similar to 
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those for initial value problems in ordinary 
differential equations [16]. 


N. Numerical Solution 


For the numerical solution of integral equa- 
tions, we assume throughout that the func- 
tions appearing are all continuous and the 
solution of every equation is unique. Methods 
of numerical solution can be divided roughly 
into two classes. Methods of one class try 

to evaluate numerically the analytical solu- 
tion described in the preceding sections, and 
methods of the other try to obtain a solution 
by transforming the problem to one that is 
numerically solvable. 


(1) A Method Based on Numerical Quadra- 
ture. Consider the integral equation 


b 
| F(x, y, p(x), e(y)) dy =0. 
Let a=x,<x,<...<x,=b be points on the 
interval [a,b] and @,, @,...,@, be the values 
of (x) at x1, X,,...,X,. By the use of numer- 
ical quadrature, we then have the following 
system of equations in 9;: 


2 a, F(x;,x;,9;,9)=90 (i=1,2,...,n). 
fa 


The method corresponds to that of solving 
ordinary differential equations by their dif- 
ference equation analogs. Hence the errors 
involved in the solutions obtained by this 
method can be analyzed similarly to those in 
the case of numerical solution of ordinary 
differential equations (— 303 Numerical Solu- 
tion of Ordinary Differential Equations). If 
the given integral equation is a Fredholm 
equation of the second kind, then we have a 
system of linear equations in 9,. If we apply 
quadrature formulas to the integral appearing 
in the integral equation by using the values of 
Q; obtained, then we have a formula by which 
the solution can be evaluated directly, that is, 
without using an interpolation formula. 


(2) A Method Utilizing Recurrence Formulas. 
Let d, and d,,(x, y) be the respective coefficients 
of 4” in the expansions of Fredholm’s determi- 
nant D(A) and Fredholm’s first minor D(x, y; A). 
They satisfy the recurrence formulas 


b 


d,(X, y)=d, K(x, | K(x, s)d„-1 (s, y)ds, 


a 


1 b 
ap i d,,(s, s) ds, 


n+1 7 


dọ=1, do(x, y)=K(x, y). 


By the use of these formulas, we can compute 
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d,, and d,(x, y) successively and hence evalu- 
ate D(A) and D(x, y; 2) approximately, and by 
means of these recurrence formulas we can 
readily obtain a solution of a Fredholm equa- 
tion of the second kind. 


(3) A Method Utilizing Approximate Kernels. 
If we replace a kernel by an approximate one 
in a Fredholm integral equation of the second 
kind, then we have an integral equation that 
has a solution approximately equal to the 
solution of the original equation. Hence if we 
can find an approximate kernel for which 

an integral equation can be solved numeri- 
cally or analytically, then we can find an ap- 
proximation to the desired solution by solving 
the modified equation. For such solutions, a 
method of error estimation was given by F. G. 
Tricomi [17]. 


(4) An Iterative Method. Consider the integral 
equation 


b 
(p(x) = | F(x, y, p(x), e(y)) dy. 


Let (x) be an adequate function, and define 
P(x) successively by 


b 
Pari (x)= | F(x, Y, Pn(X)s Pn(Y)) dy. 
If the sequence {¢,(x)} converges, then the 
limit lim,- Qn(X) = @(x) is a solution of the 
given equation, and hence we can obtain an 
approximation to a solution by calculating 
P,(x) for some finite n. This method can be 
used effectively for Fredholm integral equa- 
tions of the second kind with a parameter 
A, provided that the absolute value of å is 
smaller than the least absolute value of the 
eigenvalues. 


(5) Variational Method. If some conditions are 
fulfilled, an integral equation of the form 


b 


G(x, p(x))+ | F(x, y, p(x), p(y))dy=0 


a 


can be regarded as an fEuler-Lagrange equa- 
tion for a variational problem 


b fb 
s=] | E(x, y, u(x), u(y))dx dy 


+ \ A(x, u(x))dx =extremal. (11) 


In this case, we can find a solution of the given 
integral equation numerically by solving the 
variational problem (11) numerically [18]. 


(6) Enskog’s Method. Suppose that {¢,,(x)} is 
a complete orthonormal system for the Fred- 
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holm integral equation (7). If we put (x)= 
n(x) — å fa K(y, x)@,(¥)dy, then from (7) we 
have 


b b 
| potid- | I(X) Palx) dx, (12) 
and furthermore we see that {w,(x)} can be 
orthonormalized to yield a complete ortho- 
normal system {y,(x)}. The equality (12) then 
shows that the Fourier coefficients of a solu- 
tion ~(x) with respect to the system {y,(x)} 

can be obtained readily from the Fourier 
coefficients of f(x) with respect to the system 
{p,(x)}. This method of obtaining a solution is 
called Enskog’s method. 


For Volterra integral equations, methods (1) 
and (4) can be used effectively. We usually 
transform equations of the first kind into 
equations of the second kind by differentiation 
and then apply the above numerical methods. 
This is done for the sake of securing stability 
of the numerical methods. 
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A. General Remarks 


Integral geometry, in the broad sense, is the 
branch of geometry concerned with integrals 
on manifolds, but the problems considered in 
integral geometry are usually of a more limited 
nature. If a *Lie group G acts on a tdifferenti- 
able manifold M as a tLie transformation 
group, G also acts on various figures on M, 

by which we mean geometric objects such as 
fsubmanifolds of M, ttangent r-frame bundles 
on M, etc. Let ¥ bea set of such figures on M 
invariant under G (i.e, gF EF for geG, FEF). 
Consider the following problems: (i) to know 
whether any G-invariant tmeasure u on F 
exists, and how to determine w if it exists; (ii) to 
find the integral | p(F)dy(F) of functions p on 
F with respect to the measure p. 

The term integral geometry was introduced 
by W. Blaschke, who considered the special 
case of problem (ti) in which (F) is a function 
representing geometric properties of F and the 
integral is to be evaluated by means of the 
geometric invariants concerning ¥#[1]. Prob- 
lems of so-called geometric probability (such 
as the problem of Buffon’s needle) belong to 
this category. The measure yp is called the 
kinetic measure (or kinetic density), and du(F) 
is also denoted by dF. If F has the structure of 
an n-dimensional differentiable manifold and 
the measure u is given by a tvolume element w 
(i.e., a positive tdifferential form of degree n), 
we denote @ also by dF. Problem (i) is simple: 
If G acts transitively on F, then F =G/H, 
where H is the tisotropy subgroup of G. In 
this case F has the structure of a differentiable 
manifold, and if a G-invariant t(Radon) mea- 
sure exists, it is unique up to a multiplicative 
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constant. A condition for the existence of a G- 
invariant measure u can be given by means of 
tHaar measures of G and H (— 225 Invariant 
Measures). We now consider some examples. 


B. Crofton’s Formula 


Let G(p, 0) be a straight line defined by the 
equation x, cos6+x,sin@=p with respect to 
orthogonal coordinates in a Euclidean plane. 
Let n(p, 0) be the number of intersections of 
G(p, 0) with a curve C of length L. Then we 
have Crofton’s formula, 


fro 0)dpd@=2L, (1) 


where dpd® is the fexterior product of the 
differential forms dp, d0 of degree 1, and the 
integral is extended over pe(—o, 00) and 
ef, 2x]. 


C. Poincaré’s Formula and the Principal 
Formula of Integral Geometry 


The kinetic density dF of a figure F congruent 
(with the same orientation) to a fixed figure in 
a Euclidean plane is defined as follows: Let R 
be an orthogonal frame attached to F, (x,, x) 
be the coordinates of the origin of R with 
respect to a fixed orthogonal frame Rọ, and 

0 be the angle between the first axis of R and 
the first axis of Ro. If we put dF =dx, dx,d0 
(exterior product), dF has the following in- 
variance properties: (i) dF is not changed by 
displacements of F; (ii) dF is not changed if 
instead of R we take another orthogonal 
frame R’ attached to F. 

Let two plane curves C,, C, of length L,, 
L,, respectively, be given, and suppose that C, 
is fixed and C, is mobile. If the number of 
intersections of C, in an arbitrary position 
with C; ts finite and equal to n, then the in- 
tegral of n extended over all possible positions 
of C, is given by 


| ndC,=4L,L, (2) 


(Poincaré’s formula). L. A. Santaló applied this 
result to give a solution of the fisoperimetric 
problem (1936). 

Let C,, C, be two plane ‘Jordan curves of 
length L,, La, respectively, and let S,, S, be 
the areas of the domains bounded by C,, C3, 
respectively. Suppose that C, is fixed and C, 
mobile, and let y be the number of connected 
domains common to the domains bounded by 
C,, C for C, in an arbitrary position. Then 
the integral of y extended over all possible 
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positions of C, intersecting C, is given by 


| xdcy= tytn +205, 45) (3) 


(Blaschke [1]). This is the principal formula of 
integral geometry. Many formulas can be 
derived from it as special cases or limiting 
cases. 


D. Generalization to Dimension n 


The kinetic density of subspaces of dimension 
k in a Euclidean space and in a spherical space 
of dimension n was given by Blaschke, while 
the generalization of the principal formula 
(3) to a Euclidean space of dimension n was 
given by S. S. Chern, applying the methods of 
E. Cartan. 

Let (e,,...,@,) be a positively oriented ortho- 
normal frame with vertex A. The infinitesimal 
relative displacements are then given by 


dA=} we, de,= > ae; 
i=l j=1 
where œ;=(dA, e;), w= (de; e)= — w; are 


differential forms of degree 1 in the orthogonal 
coordinates of A and the n(n — 1)/2 variables 
that determine e,,...,¢,. For various posi- 
tions of a figure, we take an orthogonal frame 
(A, e,,.-.,@,) fixed to this figure and form the 
texterior product 
dK = NoN oy 

i i<j 
of all the œw; and œ; (i<j). This has the invar- 
iance properties (i) and (ii) of Section C and 
is, by definition, the kinetic density of the fig- 
ure in an n-dimensional Euclidean space. 
Moreover, the kinetic density of dE of k- 
dimensional subspaces E can be obtained by 
considering the orthogonal frames such that 
the vertex A and e4, ..., €, lie on E and by 
forming the exterior product of the corre- 
sponding w,, w,, (#=1,...,k;2=k+1,...,n), 


dE =A, /\\@,,;- 


Let 2 be a compact orientable hypersurface 
of class C? in an n-dimensional Euclidean 
space, and let k, (a=1,...,n—1) be the fprin- 
cipal curvatures at a point on X. Denote by S; 
the telementary symmetric form of degree i in 
k, (i=1,...,n—1), and put S,=1. Then con- 
sider the integrals over Z: 


m= | sas/ ("7") i=0,1,...,n—1, (4) 
5 l 


where dS denotes the surface element of X. Let 
D,, D, be the domains bounded by two com- 
pact orientable hypersurfaces 2, 2, of class 
C? with volume V,, V,, respectively, and let 
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M!”, MP be the integrals M; defined by (4) for 
2, 47. If X ts fixed, X, is mobile, and the 
fEuler-Poincaré characteristic y of D, N D, is 
finite, then the generalization of (3) has the 
form 


fraz: =] ae | eae (me, Vz +M?), V 


n—2 
HE MOM 4) 6) 
Nk=0 
(Chern’s formula), where dX, is the kinetic 
density of X, and I, (k=1,...,n—1) is the area 
of the unit sphere in a Euclidean space of 
dimension k+ 1, with the integral extended 
over all positions of X, intersecting 2,. 

Let dE be the kinetic density of the sub- 
spaces E of dimension k intersecting a com- 
pact orientable hypersurface X of class C*, and 
let y be the Euler characteristic of the intersec- 
tion of E with the domain bounded by X. The 
integral f ydE extended over all hyperplanes of 
dimension k intersecting X is proportional to 
M, relative to the hypersurface X defined by 
(4). This fact generalizes (1). Further generali- 
zations were obtained by Chern (1966). 


E. Other Generalizations 


For 2-dimensional spaces of constant curva- 
ture, Santaló derived formulas analogous to 
those in a Euclidean plane (1942-1943) and 
thus solved the isoperimetric problem in these 
spaces. In 1952, he derived a formula corre- 
sponding to (5) in n-dimensional spaces of ` 
constant curvature, following Chern’s method. 
He investigated further integral geometry in 
affine, projective, and Hermitian spaces. 

Chern and others obtained the results of the 
previous sections by Cartan’s method of gen- 
eral moving frames and studied integral geom- 
etry in the setting of the geometry of Lie 
transformation groups in the sense of F. Klein 
(— 137 Erlangen Program). 

Chern, P. Griffiths, and others studied the 
value distribution theory of holomorphic 
mappings in several complex variables from 
the point of view of integral geometry (1961) 
(— 21 Analytic Functions of Several Complex 
Variables, 124 Distribution of Values of Func- 
tions of a Complex Variable). 


F. Radon Transforms 


Another important topic of integral geome- 
try is the theory of Radon transforms. Let ¥ 
be the set of hyperplanes &(, p)={xeR"|(x, œ) 
=p} in the Euclidean space R”, where œ = 
(A,,.-.,4,) iS a unit vector, (x,@)= È x;4;, and 


p is real. For a function f defined in R”, define 
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f(Q)=flo, p) on F by 
f= | Ff (x)dex, (6) 
é 


where d_x is the tvolume element on the 
hyperplane ¢ such that d;x a È A,dx;=dx, with 
dx the volume element of R”. Then f is called 
the Radon transform of f. For example, if f is 
the tcharacteristic function of a bounded 
domain V, the value f (&) of the Radon trans- 
form f of f at Ee F is the volume of the sec- 
tion of V by č. Now the group G of tmotions 
of R” (the tconnected component of the iden- 
tity of the group of isometries) acts on FY tran- 
sitively. For every xe R”, the tisotropy sub- 
group G, of G with respect to x acts transi- 
tively on the set x= {če F |xe č}. Since G, is 
compact, there exists a unique normalized G,- 
invariant measure u on X such that p(x)=1. 
For a function g on F, the conjugate Radon 
transform g can now be defined by 


g(x) -| g(o)du(c) (7) 
xeg 


as a function on R”. 

The determination of g belongs to problem 
(ii) of integral geometry mentioned in Section 
A. In particular, it is important to determine 
g =( fY for g=f and to find the relation be- 
tween ( f )” and f. These problems were solved 
by J. Radon for n=2, 3 and by F. John in the 
general case. The results can be formulated as 
follows: 

In the case of odd n, let Y be the space of 
trapidly decreasing C®-functions (— 168 
Function Spaces). Let /*(F) be the set of 
g(@, pje S(F) such that f? g(~, p)p* dp =0 for 
every natural number k and every w. For every 
feS(R") and every ge ¥*(F), we then have 


f= cA RA, 
where A is the Laplacian in R” and Lg(%, p) 
= d? g(, p)/dp?, c =T (n/)) (27i)! "n", 

In the case of even n, for every fe A(R"), 
geS*(F), 
fea hY), 


where 


g=eL® D), 


g=cJ,((g)"), 


Hifi| SO)x— yl! dy, 
R 


J2(g)(@, p) = |i gio, p)ip—q| "dq. 
These integrals are in general divergent, and 
they must be interpreted as regularizations 
defined by analytic continuation with respect 
to the powers of |x — y| or |p — ql [9]. 

John applied the Radon transform on R” to 
the study of partial differential equations in R” 


218 G 
Integral Geometry 


A formula corresponding to tPlancherel’s 
theorem and an analog of the tPaley-Wiener 
theorem for the Fourier transform are valid 
for the Radon transform (I. M. Gel’fand et al. 


[6]). 


G. Horospheres 


The theory of the Radon transform is also 
important in noncompact tsymmetric Rie- 
mannian spaces M. The connected component 
G of the identity in the group of isometries of 
M is isomorphic to the tadjoint group Ad G 
and can be considered as a linear group. Maxi- 


- mal tunipotent subgroups of G are conjugate 


to each other. If N is such a subgroup, we call 
the torbits on M of gNg™' horospheres on M 
for ge G. These correspond to the hyperplanes 
in R”. If M is the complex upper half-plane 
with the thyperbolic non-Euclidean metric, 
the horospheres are precisely the circles tan- 
gent to the real axis and the straight lines 
parallel to the real axis. 

The group G acts transitively on the set 
F of horospheres on M, and we have ¥ = 
G/M, N. Here G= KAN is an tIwasawa de- 
composition of G, and M, is the tcentralizer 
of A in K. For a horosphere ġe F, let d-x be 
the volume element on € with respect to the 
*Riemannian metric on € induced by the Rie- 
mannian metric on M, and define the Radon 
transform fofa function f on M by (6) asa 
function on F. For every xe M, there exists a 
unique normalized measure on X= {te F | 
xe €} invariant under the (compact) isotropy 
subgroup G, of G at x (u(x)=1). The conju- 
gate Radon transform g of a function g on F 
is defined by formula (7) by means of this 
measure u. Then there exists an integrodif- 
ferential operator A such that if A* is the 
adjoint operator, we have the inversion for- 
mula f=(A/A* f )” and Plancherel’s theorem: 


| fœ] dx= | MfOP d, 


where dx, dë are G-invariant measures on M, 
F , respectively, and f is an arbitrary C®- 
function with compact support. If the *Cartan 
subgroups of G are conjugate to each other, /\ 
is a differential operator; the inversion formula 
can then be written in the form f=L((f)’) 
with some differential operator L on M [9]. 

S. Helgason applied the Radon transform 
on noncompact symmetric Riemannian spaces 
to solve differential equations on these spaces 
(1973). 

Horospheres and Radon transforms can be 
defined not only for symmetric Riemannian 

u i i - 
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X,<b,,kK=1,2,...,n} (—0 <a, <b, < w) is 
defined by m(I) = [Ik= (b; — a,). Let Mo be the 
collection of all sets that can be represented as 
the finite union of disjoint left-open intervals, 
and for such expression A =| }%_, J, in My 
define m(A)= X5_, m(J}). Let M=Mt, U {@} 
and m(@)=0. Then m gives a finitely additive 
measure on Wè that is completely additive. 
Therefore m determines an outer measure p*, 
which in turn determines a measure u. This u* 
is called the Lebesgue outer measure (or simply 
outer measure). Sets that are measurable with 
respect to u* are said to be Lebesgue measur- 
able (or simply measurable), and the measure 
H is called the (n-dimensional) Lebesgue mea- 
sure (or simply measure). Every interval is 
measurable, and its measure coincides with its 
volume. Open sets, closed sets, and Borel sets 
are all measurable. More generally, suppose 
that an outer measure u* defined on a metric 
space X with a metric d satisfies the condi- 
tion: If d(A, B)>0, then u*(A U B)=p*(A)+ 
u*(B). Then every closed subset of X is u*- 
measurable, and therefore so is every Borel 
subset. Here d(A, B)=inf{d(a, b)|ae A, be B} 
denotes the distance between the two sets A 
and B. A measure u defined on the class of all 
Borel subsets of a topological space X is called 
a Borel measure. The cardinality of the set of 
all Lebesgue measurable subsets of R” is 2°, 
while the cardinality of the class of Borel 

sets is c (here c is the cardinal number of the 
‘continuum, i.e., the cardinal number of R). 
Therefore there exists a Lebesgue measur- 
able set that is not a Borel set. It follows 
immediately from the definition that the 
Lebesgue measure is K-regular, so for every 
Lebesgue measurable set A we can find an tF,- 
set B and a tG;-set C such that BC ACC and 
u(C — B)=0. 

Historically, for a bounded subset A of R", 
C. Jordan defined m(A) to be the infimum of 
all possible values m(B), where BEIM and B> 
A, and m(A) to be supremum of all possible 
values m(B), where BEIM and Bc A. He called 
m(A) the outer volume of A and m(A) the inner 
volume of A (in the case of R?, the outer area 
and inner area, respectively). When m(A) = 
m(A), A is called Jordan measurable, and this 
common value is defined to be the Jordan mea- 
sure (Jordan content) of A. Jordan measure is 
only finitely additive, and was found to be un- 
satisfactory in many respects. It was Lebesgue 
who modified this notion and introduced 
completely additive measures. Jordan measur- 
able sets are always Lebesgue measurable. 

Using the faxiom of choice, a set that is not 
Lebesgue measurable can be constructed (G. 
Vitali). For example, a set obtained by choos- 
ing exactly one element from each coset of 
the additive group of all rationals in the addi- 
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tive group of the reals is not measurable [3, 
pp. 67-70]. 


H. Product Measure 


When two a-finite complete measures py and 
fly are defined on completely additive classes 
By and By on X and Y, respectively, an ele- 
ment C belonging to the smallest finitely addi- 
tive class £ in the Cartesian product that 
contains {A x B| Ac®,, Be By} can be repre- 
sented as a finite disjoint union C=(_)*_,(A; 

x B) (A;e By, Bye By). If we define v(C) = 

Liat Hx(4;)uy(B;) (here we agree to put 0: œ = 
0), then this value is independent of the way 
the set C is represented, and v defines a com- 
pletely additive measure on K. By extending v 
by means of Hopf’s extension theorem, we 
obtain a (complete) measure space, called the 
(complete) product measure space obtained 
from the measure spaces (X, By, wy) and (Y, 
By, uy). The measure obtained in this way 

on the space X x Y is called the product mea- 
sure of zy and py and is denoted by uy x py. If 
we denote by M, the class of all Lebesgue 
measurable subsets of the p-dimensional Eu- 
clidean space R” and by m, the p-dimensional 
Lebesgue measure, then the (complete) product 
measure space of (R’, Mt,, m,) and (R9, M, m,) 
is (R?*®, Mp4, Mpt). The product measure 
space of any finite number of measure spaces 
(X; B; u) (i= 1, ...,n) is defined similarly. 

Let X, (A€ A) be spaces with an index set of 
arbitrary cardinality. For the product space X = 
I],-, X,, an n-cylinder set, or simply a cylinder 
set, is a set of the form A x IIg, a3 Xa 
(AcCX, x... X X,). Ifa finitely additive class 
YI, is given for each X,, the class of all subsets 
that can be represented as the union of a finite 
number of cylinder sets of the form A, x A, x 
s X Ag X Dagta, sag} Xa (Ave Uy, J = 1,2, ...,0) 
is a finitely additive class in the space X. 
When each XA, is completely additive, the 
completely additive class 2 generated by this 
finitely additive class is called the product of 
the completely additive classes W, and is de- 
noted by [T,., 2,. When a measure space (X,, 
B,.u,) with 4,(X,)= 1 is given for 2€A, a mea- 
sure u can be defined in the following way on 
the completely additive class B=[],., B, in 
the product space X: To begin with, for a cyl- 
inder set of the form A, x... x A, x X’ (here 
AjeB,, and X'= [ieta 2} Xa), we define 
H(A, X... X A,X X’)= py (Aj)... Ha, (An) If we 
extend y to the finitely additive class € con- 
sisting of all sets that can be represented as 
the finite union of such cylinder sets, then this 
extension gives a completely additive measure 
on &, and therefore, by Hopf’s extension theo- 
rem, there exists a unique extension to a mea- 
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sure u on $B, and yp satisfies u(X)=1. We 
denote this u by u= [zeh Ha. 


I. Radon Measure 


Let X be a tlocally compact Hausdorff space, 
B the topological o-algebra on X, and Co(X) 
the real linear space of all real-valued con- 
tinuous functions f on X having tcompact 
support (i.e., the closure of the set {x| f(x)#0} 
is compact). A (real) tlinear functional @ de- 
fined on C,(X) ts called a positive Radon mea- 
sure if @( f)20 whenever f>0. For such a 
functional o there corresponds a Borel mea- 
sure u on X for which o(f)={y fdu holds for 
every fe€C,(X). Lebesgue measure m, is re- 
garded as a positive Radon measure on R”. If 
X is o-compact (i.e., X can be represented as 
the countable union of compact sets), then the 
property (f)=|x fdy for all feCy(X) defines 
the measure u uniquely on the class of Borel 
sets of X. A linear functional on C,(X) that 
can be written as the difference of two positive 
Radon measures is called a Radon measure. 
For a linear functional @ defined on C)(X) 

to be a Radon measure, it is necessary and 
sufficient that for an arbitrary fe C,(X), the set 
{e(|l9l<If|,9¢Co(X)} be bounded. Equiva- 
lently, it is necessary and sufficient that the 
restriction of o to the subspace of all functions 
in Co(X) having their support in a fixed com- 
pact subset of X must be continuous with 
respect to the ttopology of uniform conver- 
gence. Therefore, if X is compact, an arbitrary 
continuous linear functional on C(X) is a 
Radon measure. L. Schwartz investigated 
Radon measures on spaces that are not locally 
compact [6]. 


J. Measurable Functions 


When a completely additive class B ona 
space X is given, a function f defined on a $- 
measurable set E and taking real (and possibly 
+œ) values is called a 8-measurable function 
on È if for an arbitrary real number a, the set 
{x| f(x)>«} is B-measurable. The condtion 

f >a may be replaced by f >a, f<a, f Sa. 

A function f can also be defined to be 8- 
measurable if the tinverse image under f of 
any Borel set is 8-measurable. When f and 

g are B-measurable, so are af + bg (a,b con- 
stants), f- g, f/g, max(f,g), min( f, g), |f|? (p a 
constant), whenever they are well defined. The 
superior and inferior limits of a sequence of 8- 
measurable functions are also 8-measurable. 
In a complete product measure space of two c- 
finite measure spaces, a function f(x, y) may 
fail to be measurable as a function of two 
variables even if it is measurable with respect 
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to each of the variables x and y separately. For 
example, let Ẹ be the class of all closed subsets 
F of R? satisfying m,(F)>0, F c [0, 1] x [0, 1]. 
Using the fact that the cardinality of § does 
not exceed the cardinal number of the con- 
tinuum, we can prove by ttransfinite induction 
that the elements of % can be indexed as F;, 
E< c, E being the cardinality of €, and that for 
every Fe we can pick two points z; =(X¢, 
yz), Z= (Xz, Yz) in such a way that if F; 4 F,, 
then Xz, Xz, Xy X, are all distinct and yz, yz, Yy» 
y, are all distinct. Furthermore, we can prove 
that the set E consisting of all such z; is not 
measurable. Therefore, denoting the charac- 
teristic function of the set E by f(x, y), f(x, y) is 
not measurable; but if we fix x (resp. y), then as 
a function of y (resp. x), f(x, y) is measurable, 
since f(x,:) (resp. f(-, y)) is always 0 except 
possibly at one point. If 8 is the class of all 
Lebesgue measurable sets or the class of all 
Borel sets, then a B-measurable function is 
called a Lebesgue measurable function (or 
simply measurable function) or a Borel mea- 
surable function, respectively. 

In Euclidean space, the class of all Borel 
measurable functions coincides with the class 
of all Baire functions, and an arbitrary Le- 
besgue measurable function is equal almost 
everywhere to a Baire function of at most the 
second class. For a function f that is finite 
almost everywhere on a Lebesgue measur- 
able set E to be Lebesgue measurable, it is 
necessary and sufficient that for an arbitrary 
é>0 we can find a closed subset F such that 
m(E—F)<e and f is continuous on F (Luzin’s 
theorem). 

If a sequence { fp} of B-measurable func- 
tions on a measure space (X, B, u) converges 
almost everywhere to f on a set E with p(E)< 
oo, then for an arbitrary £ >0 we can find a 
set F (F CE, Fe) such that p(E—F)<e and 
J, converges uniformly on F. If X =R” and B 
is either the class of Borel sets or the class 
of Lebesgue measurable sets, then the set F 
can be chosen to be a closed set (Egorov’s 
theorem). 

For a finite measurable function f(x) defined 
on the real line, there exists a sequence {h,} 
such that lim,- f(x + h) = f(x) almost every- 
where (H. Auerbach). 

The functional equation f(x + y)= f(x)+ 
f(y) has infinitely many nonmeasurable solu- 
tions (G. Hamel; — 388 Special Functional 
Equations). 

Let W, i=1, 2, be c-algebras on X,, i=1, 2, 
respectively. A mapping f: X,— X, is said to 
be measurable YW, /M, if f-'(A,)eM, for every 
A,EU,. When W, generates W,, f: X, >X, is 
measurable YW, /W, if f~'(Aj)eU, for every 
Ae. For example, a mapping f:R!—>R! 
is Borel (Lebesgue) measurable if and only if 
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f is measurable 8'/B1(M, /B!), where B! 

and Jt, denote the Borel subsets and the Le- 
besgue measurable subsets of R'. Measurabil- 
ity of mappings is preserved by compositions, 
namely, if f: X; >X, and g: X, >X, are mea- 
surable WA, /2, and NW, /AM,, respectively, then 
gof:X, >X, is measurable A, /AM,. If the pro- 
jection za: [I;s; X; >X, is measurable, then 
Mier W;/,. Since 2; '(A,) (ie I, A;e U,) generate 
Tie, a mapping f: X >J],., X; is measur- 
able I/TT,.; XG if 2,0 f: X +X, is measurable 
W/W for every ie I, where % is a o-algebra 

on X. 


K. Image Measures 


Let u be a regular measure on a measurable 
space (X, B), i.e., the completion of a measure 
on By. Then every mapping f: X > Y measur- 
able 8, /®By, induces a o-algebra on Y: 


B={BcY|f~'(B) is p-measurable}, 
and a measure on 8: 
v(B)= u( f ~*(B)). 


The measure v is called the image measure of 
p under the mapping f; this is denoted by 

uf ~ or f- u. Obviously, v(Y)= p(X), and v is 
complete. Although 8 includes By by virtue 
of the measurability of f, v is not regular in 
general. However, v is regular if u is a bounded 
measure and if both (X, 8,) and (Y, B,) are 
analytic measurable spaces (= 22 Analytic 
Sets I). Therefore the image measure of a 
regular probability measure under a measur- 
able mapping is also a regular probability 
measure in practically all useful cases. 


L. Related Topics 


(i) Integration. For a nonnegative measurable 
function f(x) on (X, B, m), we can define the | 
integral of f(x) on a set Ee %, denoted by 
Sef u(dx), fef) dul), [ef du, or fef. Fora 
real measurable function f(x), the integral is 
defined to be equal to fg f* du—f,f~ dy, if at 
least one of these integrals is finite, where f * 
and f~ are the positive and negative parts of f 
(— 221 Integration Theory B). 

(ii) Fubini’s theorem for measures. Let (X, 
$B, u) be the complete product measure space 
of (X;, B; u) (i= 1,2), where both u, and u, 
are o-finite and complete. For Ee% we define 
the sections E(x,) and E(x,) by 


E(x1)={X2|(*1,X2)€ E}, 
E(x2)= {x1 (x1, X2)E E}. 


Then we have E(x,)eB, for almost all (u,)x,¢ 


1004 


X,, E(x,)€B, for almost all (,)x,¢X,, and 


H(E)= | Ho (E(X1)) 44 (dx) 
x, 


-| Hi (E(x2))u2(dx2) 
X2 
(— 221 Integration Theory E). 

(iii) The Radon-Nikodym theorem and the 
Lebesgue decomposition theorem for measures. 
Let u and v be o-finite measures on (X, %8). 

If 4(E) =0 implies v(E)=0, then v is said to 

be absolutely continuous with respect to u, 
denoted by v<y. v<y if and only if v is ex- 
pressible in the form v(E)= fz f du, where 

f is a nonnegative 8-measurable function 

(the Radon-Nikodym theorem). Such an f is 
uniquely determined up to z-measure 0 and is 
called the Radon-Nikodym derivative of v with 
respect to u, denoted by dv/du. The concept 
opposite to absolute continuity is singularity. v 
is said to be singular with respect to p if there 
exists an Ee% such that v(E)=y(E)=0. If u 
and v are two arbitrary o-finite measures, then 
v is expressible uniquely as the sum of an 
absolutely continuous measure v; and a sin- 
gular measure v, with respect to u (Lebesgue 
decomposition theorem) (— 221 Integration 
Theory D, 380 Set Functions C). 

(iv) Invariant Measures. Let (X, 8) be a 
measurable space and G a group of Borel 
isomorphic mappings from (X, B) to itself. A 
measure u on (X, B) is said to be invariant 
under G if u(E)=n(g ‘(E)) for every Ee% and 
every geG. For example, the Lebesgue mea- 
sure m on (R", B”) is invariant under the group 
of congruent transformations (— 225 Invariant 
Measures). 

(v) The Lebesgue-Stieltjes measure. Let f be 
a right continuous monotone increasing func- 
tion on R. Then there exists a unique measure 
u on (R, B’) such that u((a, b]) = f(b) — f(a) for 
a<b. This measure is called the Lebesgue- 
Stieltjes measure induced by f (— 166 Func- 
tions of Bounded Variation B). 

(vi) Baire measurable functions and univer- 
sally measurable functions. Let f(x) be a real- 
valued function defined on a topological space 
X.If for every open set O the inverse image 
f(O) has the tBaire property (resp. is mea- 
surable with respect to the completion of any 
o-infinite Borel measure on X), then f is said 
to be Baire measurable (resp. universally mea- 
surable or absolutely measurable). Universally 
measurable functions can be defined similarly 
on a measurable space. 

(vii) Disintegration. Let (S, S) and (T, T) be 
standard measurable spaces, v a o-finite mea- 
sure on (S, S), and p:S—>T a v-measurable 
mapping. If the image measure p=vf ? is o- 
finite, there exists a unique family of measures 
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fv, te T} on (S, S) such that (1) t-v,(E) is 
universally measurable for every Ee GS, (2) v, is 
concentrated on p~'(t) for almost every t with 
respect to u, and (3) the following equality 
holds: 


v(E) -Í uldt)y (E), Ee S. 
T 


This expression is called the disintegration of v 
with respect to u. Every v-measurable function 
f on S is v,-measurable for almost every t with 
respect to u, and the integral of f is expressible 
in the form of iterated integrals as follows: 


| f(s)v(ds)= | (dt) | S(s)v¢(ds). 


This corresponds to Fubini’s theorem on 
product measures. When v is a probability 
measure on (S, S), p(s) is regarded as a (T, )- 
valued random variable on (S, S, v), and v, is 
the conditional probability measure under the 
condition p(s)=t. (— 342 Probability Theory 
E. For disintegration of measures on a topo- 
logical space — [3, ch. 9].) 


References 


[1] H. Lebesgue, Intégral, longueur, aire, Ann. 
Mat. Pura Appl., (3) 7 (1902), 231-359. 

[2] S. Saks, Theory of the integral, Warsaw, 
1937. 

[3] P. R. Halmos, Measure theory, Van Nos- 
trand, 1950. 

[4] N. Bourbaki, Elements de mathématique, 
Intégration Chap. 1-9, Actualités Sci. Ind., 
Hermann, 1965—1969. 

[5] H. L. Royden, Real analysis, Macmillan, 
second edition, 1963. 

[6] W. Rudin, Real and complex analysis, 
McGraw-Hill, second edition, 1974. 

[7] L. Schwartz, Probabilités cylindriques et 
applications radonifiantes, J. Fac. Sci. Univ. 
Tokyo, IA, 18 (1971), 139—286. 


271 (XX.4) 
Mechanics 


A. Newton’s Three Laws of Motion 


The study of laws governing the motion of 
bodies began with the laws of falling bodies, of 
which the first exact formulation was made by 
Galileo. But the general relationship between 
force and acceleration was first described by 
*Newton, who established Newton’s three laws 
of motion; the mechanics based on them is 
Newtonian mechanics. Newton expounded 
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these laws in his famous book Principia mathe- 
matica philosophiae naturalis (1686-1687), 
where the law of gravitation and its applica- 
tion, problems of fluid motion, motions of the 
planets in the solar system, etc., were systemat- 
ically treated. 

Newton’s first law. A body continues its 
state of rest or uniform motion in a straight 
line unless it is compelled to change that state 
by external action (i.e., force). This is also 
called the law of inertia. 

Newton’s second law. The rate of change of 
momentum is proportional to force and is in 
the direction in which the force acts. Here, 
momentum is defined as the product mv of the 
mass m and the velocity v. This law can be 
expressed as d(mv)/dt =F, where F is the force 
expressed in an appropriately chosen system of 
units. Since dv/dt is the acceleration a, the law 
takes the form ma = F, when the mass is con- 
stant. These equations are called equations of 
motion. The second law is often simply called 
the law of motion. 

Newton’s third law. When two bodies 1 and 
2 in the same system interact, the force exerted 
on body 1 by body 2 is equal and opposite in 
direction to that exerted on body 2 by body 1. 
This law is called the law of action and reac- 
tion, or simply the law of reaction. 

Various attempts at a rigorous axiomati- 
zation of Newtonian mechanics have been 
made, beginning with E. Mach’s work. At the 
beginning of the 20th century, it was found 
that Newtonian mechanics requires modifica- 
tion when bodies travel at speeds approach- 
ing the speed of light or when it is applied to 
physical systems of molecular size or smaller. 
These modifications led to the establishment of 
the theory of frelativity and tquantum mechan- 
ics. In contrast to these later theories, New- 
tonian mechanics is called classical mechanics. 


B. Newton’s Law of Gravitation 


Kepler discovered the following three laws for 
the motion of planets around the sun (valid 
within the accuracy in observation available at 
the time): 

Kepler’s first law. The orbit of a planet is an 
ellipse with the sun at one of its foci. 

Kepler’s second law. The area swept per unit 
time by the straight line segment joining the 
planet and the sun is independent of the posi- 
tion of the planet in its orbit. 

Kepler’s third law. The square of the period 
(the time needed for the planet to go around 
the orbit once) is proportional to the cube of 
the major axis of the orbit. 

From these empirical laws, Newton deduced 
his law of universal gravitation: Between any 
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pair of point particles, with masses m, and 

m, and at a distance r, there arises an attrac- 
tive force along the line joining the two points; 
the magnitude of this force is given by F = 
Gm,m,r *, where G is a universal constant 
(approximately 6.670 x 1078 dyne cm? g`?) 
called the gravitational constant. 


C. Kinetic and Potential Energies 


If the force F on a particle is a function of the 
position x of the particle and is the gradient 
— VU of a time-independent (scalar) function 
U(x), called the potential, then 


1 d 
E= mv’ + U(x) (=) 


is a constant of the motion of the particle. 

E is called the total energy, while the first 
and second terms on the right-hand side are 
called the kinetic energy and potential energy, 
respectively. 

For a system of n particles at points x", ..., 
x”) with masses m,,...,m,, suppose that the 
force acting on the particle at x is —V"U 
(the gradient of U relative to x) for a com- 


mon potential function U (x, ..., x®). Then 
n | ax) 
E= —my? 4 U(x®), 0. x™ v= 
2 2 Jd ( ) J dt 


is the constant total energy of the motion. For 
example, Newton’s gravitational force acting 
among a number of particles can be described 
by the Newtonian potential 
U=)) —Gmm,|x® —x9 |), 

i<j 
A potential U which is a sum of functions 
depending only on a pair of coordinates as 
in the above example is called a two-body 
interaction. 


D. Apparent Force 


The coordinate system in which Newton’s 
three laws of motion hold is called an iner- 

tial system. In some cases (e.g., on a rotating 
sphere such as the earth) it is more convenient 
to use a moving coordinate system, in which 
the equation of motion derived from Newton’s 
second law by a coordinate transformation 
(from an inertial system to the moving coordi- 
nate system) takes a form similar to the second 
law except for an additional apparent force to 
be added to the force in the original equation 
of motion. For a coordinate system rotating at 
a constant angular velocity @ (relative to an 
inertial system), the apparent force consists of 
the centrifugal force, of magnitude mo? p (p 
being the distance to the axis of rotation), 
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which pushes the particle away from the axis 
of rotation (along the perpendicular), and the 
Coriolis force 2mv x œ (x denotes the tvector 
product), which bends the motion of the par- 
ticle in a direction perpendicular to both the 
axis of rotation and the velocity of the particle. 


E. Dynamics of Rigid Bodies 


A rigid body is defined as a system of particles 
whose mutual distances are permanently fixed. 
Like a point particle, it is an ideal concept 
introduced into mechanics to simplify the 
theoretical treatment. Actual solid bodies can 
in most cases be regarded as rigid under the 
action of forces of ordinary magnitudes. Since 
a rigid body can be imagined to be made up of 
an infinite number of particles, the equations 
of motion for a system of particles can also be 
applied to it. Thus the motion of a rigid body 
can be completely determined by the theorems 
of momentum and angular momentum. 

The momentum of a rigid body is defined by 

dr 

gai e 
where dm is the mass of the volume element at 
a point r of the rigid body K and dr/dt is its 
velocity. If the external forces acting on K are 
denoted by F, (i=1,2,...), we have 


dQ/dt=> F, 


which expresses the theorem of momentum. 
If the velocity and acceleration of the cen- 
ter of gravity (center of mass or barycenter) 
fr dm/ f dm of the rigid body are denoted by Vg 
and Ag, respectively, and the mass f dm by m, 
we have Q =mVg, and the theorem of momen- 
tum becomes mdVç/dt=mAç=}F,;. 

The angular momentum of a rigid body K 
about an arbitrary point rọ is defined by 


dr 


H= — : 
i (r —ro) x T dm 


If P, is the vector from rọ to the point at which 
F; acts, we have 


dH/dt = (P, x F)=G. 


This is called the theorem of angular 
momentum. 

For the case of a rigid body with one point 
ro fixed, the angular momentum H and the 
angular velocity œ are related by 


H, = Aw, — Fo, — Eo,, 
H,= — Fw, + Bao,—Do,, 
H, = — Ea, — Dow, + Co,, 


where H,, H,, H,; @,, My, @, are the compo- 
nents of H and w in the xyz-coordinate system 
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fixed in space with its origin at the fixed point 
ro, and 


4=[u?+2hdm B= |@e), 


c= fattya, p= [zam 


E= [xam 


with the integrals taken over the whole rigid 
body. We call A, B, and C the moments of 
inertia about the x-, y-, and z-axes, respec- 
tively, and D, E, F the corresponding products 
of inertia. The rotational motion of a rigid 
body with one axis fixed is completely deter- 
mined by the theorem of angular momentum. 
However, for rotation about a fixed point, the 
theorem is not very convenient to use, because 
A, B, C, D, E, and F are generally unknown 
functions of time. 

The tquadric 


F= | xydm 


Ax? + By? +Cz?+2Dyz+2Ezx+2Fxy=!1 


represents an fellipsoid with its center at the 
origin, called the ellipsoid of inertia. If the 
principal axes €, y, and ¢ are taken as coordi- 
nate axes, the equation of the ellipsoid of 
inertia becomes Ač? + By? +r? =1, where A, 
B, T are the moments of inertia about the é-, 
n-, -axes and are called the principal moments 
of inertia, while the €-, 7-, -axes themselves 
are called the principal axes of inertia. If the 
components of the angular momentum H and 
angular velocity in the direction of the prin- 
cipal axes of inertia are denoted by (H,, H}, H3) 
and (w,,@,,@3), respectively, then H, = Aw}, 
H, = Bw, Hy =I w. Furthermore, if the €-, y-, 
¢-components of the resultant moment of the 
external forces G = } (P, x F;) are denoted by 
(Gi, G,, G3), then dH/dt = G becomes 


Adw,/dt=G, +(B-T)o@,0;, 
Bdw,/dt=G,+(T—A)w3@,, 
Tdow,/dt=G,+(A—B)o,@.. 


These are called Euler’s differential equations. 

The study of the motion of a rigid body can 
be reduced mostly to the study of the motion 
of its ellipsoid of inertia, since the latter is 
attached to the rigid body. The method of de- 
scribing the motion of a rigid body by means 
of its ellipsoid of inertia is known as Poinsot’s 
representation. The motions of two bodies 
having equal ellipsoids of inertia are the same 
if the external forces acting on them have 
equal resultant moments, even if their geo- 
metric forms are different. 

When a rigid body moves under no con- 
straint, the motion of its center of gravity G 
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can be determined by the use of the theorem of 
momentum. Also, the rotational motion about 
the center of gravity can be found from dH’/dt 
= G’, which is a modification of the theorem of 
angular momentum. Here H’ and G’ are re- 
spectively the angular momentum and mo- 
ment of external forces about the center of 
gravity. In this case also, simplification can be 
achieved by considering the equation in the 
reference system that coincides with the prin- 
cipal axes of the ellipsoid of inertia with its 
center at the center of gravity. 


F. Analytical Dynamics 


Mechanics as originally formulated by New- 
ton was geometric in nature, but later L. Euler, 
J. L. Lagrange, and others developed the ana- 
lytical method of treating mechanics that is 
now called analytical dynamics. Lagrange 
introduced generalized coordinates q, (j= 
1,2,...,f/ where f is the number of degrees 

of freedom of the system considered), which 
uniquely represent the configuration of the 
dynamical system, and derived Lagrange’s 
equations of motion: 


a (08) aLa ast af 
dt 04; — Ms j= arrose 


where 9;=dq,/dt, and 2= T— U(T=kinetic 
energy, U = potential energy) is a function of q; 
and å; called the Lagrangian function. Later, 
W. R. Hamilton introduced 


p;=0T/ðå;, 
H=} pġ;—£=H(p;, tee 


and transformed the equations to Hamilton’s 
canonical equations: 
dq; CH dp; OH 


LE a Se, j=1,2,...,f. 
dt Op; dt ĉqj 4 f 


»Pr dys -< 9y) 


Here p; is the generalized momentum conjugate 
to q;, and q;, p; are called canonical variables. 
If the functions representing the configuration 
of the dynamical system in terms of q; do not 
explicitly contain the time t, the Hamiltonian 
function (or Hamiltonian) H coincides with the 
total energy of the system T+ U. 

The transformation (p, g)>(P, Q) under 
which canonical equations preserve their form 
is called a canonical transformation. It is given 
by 

oW Š OW kat ow 
riggs = ge eee? 
where W=W(q,,...,475Q1,---,Q,) and K js 
the Hamiltonian of the transformed system. 
The set of canonical transformations forms a 
group, called a group of canonical transforma- 


Pj 
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tions. An infinitesimal transformation is given 
by 


where g is an infinitesimal constant. Here S 
is an arbitrary function and is said to be the 
generating function of the infinitesimal trans- 
formation. Canonical equations can be inter- 
preted to mean that the variations of p and q 
during the time interval ¢=dt are the infini- 
tesimal canonical transformations whose gen- 
erating function is H(p,q, t). 

The variation of an arbitrary function 
F(p, q) under an infinitesimal transformation 
is given by 


dF =«(F,S), 
where 
Ou dv Gu Ov O(u, v) 
(u0)=F( ee D 
j 04; Op; Op; 0q; j aldi P;) 


is Poisson’s bracket. Therefore the time rate of 
change of a dynamical quantity F(p, q) can be 
written as 


dF /dt =(F, H). 


Thus the function F(p,q) that satisfies (F, H)=0 
is an tintegral of the canonical equations. 

If a canonical transformation (p, g)>(P, Q) 
such that P,=«;, Q;= 8; are constant is found, 
the motion of the system can be determined by 


ôW Ww 


P= Og,” A Fa," 


where W is the ‘complete solution of the 
Hamilton-Jacobi differential equation: 


Ow & ow 
H 


—+H{—....,—; s4, t J=O 
ôt 0q, qs ii in) 


(— 82 Contact Transformations). 


G. Theory of Elasticity 


(1) General remarks. Suppose that a solid 
body is deformed elastically by the action of 
external forces. We may inquire about the 
magnitudes of deformation, stress, and strain 
caused by the external forces at each point 
of the body. The theory of elasticity studies 
this problem, assuming the body to be a con- 
tinuum and utilizing classical mechanics as a 
basis, and “endeavours to obtain results which 
shall be practically important in applications 
to architecture, engineering and all other use- 
ful arts in which the material of construction is 
solid” [1]. 

Cartesian coordinates (x, y, z) are employed 
for defining the 3-dimensional space contain- 
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ing the body. We confine ourselves to the 
small-displacement theory of elasticity, in 
which the components of the displacement u= 
(u,v, w) of an arbitrary point P(x, y, z) of the 
body are assumed to be small enough to jus- 
tify the linearization of the governing differen- 
tial equations and boundary conditions. 

(2) Stress. Consider an infinitesimal rectan- 
gular parallelepiped enclosed by the following 
six surfaces: 


x=const, y=const, z=const, 


x+dx=const, y+dy=const, z+dz=const, 


in the body. The stress at the point P(x, y, z) is 
defined as those internal forces per unit area 
acting on the six surfaces of the parallelepiped. 
It has nine components, which are usually 
represented by 


Oy Tyx Tzx 
Txy O. y Ta y |> 
Tye Tyz Oz 


where the o and 7 are called normal and shear- 
ing (or tangential) stresses, respectively. These 
nine quantities form a tensor called the stress 
tensor. Since it can be shown that t,,=1,,, ty 
=T,y, Tex =Txz, the stress tensor is a symmetric 
tensor. By considering equilibrium conditions 
of the parallelepiped, the equations of equilib- 
rium are found to be 


Ga, Oty, OT, 


TET az +X i tdes 
where X,... are body forces per unit volume. 
(3) Strain. The infinitesimal rectangular 
parallelepiped fixed to the body at the point P 
before deformation ts transformed, after de- 
formation, into an infinitesimal parallelepiped 
which is no longer rectangular. The strain at 
the point P is defined as the changes caused by 
the deformation of the parallelepiped: exten- 
sions of the three sides and changes from right 
angles of the three angles formed by the three 
sides of the parallelepiped. Thus the strain has 
six components, usually represented by 


(Ex, Eys Ez» Yyz» Yzx> Vxy)s 


where £ and y are called elongation and shear- 
ing strains, respectively. These six quantities, 
with slight modification, form a symmetric 
tensor called the strain tensor. 

(4) Strain-displacement relations. In small- 
displacement theory, the strain-displacement 
relations are given in linear form by 


ðu dv 
b=, b=, ey Y= H. 
Aig oy Yee ay Ox 


(5) Stress-strain relations. In the theory of 
elasticity, stress and strain are assumed to 
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obey a linear relationship called Hooke’s law: 


{a} =[A] {e}, 
where 


{o}? = [o Oy, Oz, Tyz> Tex Txy], 


{e}7 = [e., Eys Ez» yz» Vzx> Vel 


and [A] is a symmetric positive definite ma- 
trix. For an isotropic body, the relations are 


Ee, = 0x — víc, F oz), t.3 GY xy =T 


y? 


where E, v, and G = E/2(1 +v) are elastic con- 
stants called the modulus of elasticity in tension 
or Young’s modulus, Poisson’s ratio, and the 
modulus of elasticity in shear or the modulus of 
rigidity, respectively. 

(6) Boundary conditions. The surface of the 
body can be divided into two parts with re- 
gard to boundary conditions: the part S$, over 
which the boundary conditions are prescribed 
in terms of external forces and the part S, over 
which the boundary conditions are prescribed 
in terms of displacements. Obviously ôV = 
S,+5S,, where GV is the whole surface of the 
body. 

(7) Small-displacement theory of elastic- 
ity. We have seen that the equations which 
govern the problem are 3 equations of equilib- 
rium, 6 strain-displacement relations, and 6 
stress-strain relations in terms of 15 unknowns, 
namely, 6 stress components, 6 strain compo- 
nents, and 3 displacement components. Thus 
our problem is reduced to a boundary value 
problem in which these 15 field equations are 
to be solved under the specified boundary 
conditions. Since all the field equations and 
boundary conditions are linear with respect 
to the unknowns under the assumption that 
the displacements are small, we obtain linear 
relationships between the external load and 
resulting deformation of the body; this is the 
small-displacement theory of elasticity. It 
should be remembered, however, that the 
assumption of small displacement sets limits 
to the application of the theory to practical 
problems. 

(8) Variational principles. Several variational 
principles have been formulated in the small- 


displacement theory of elasticity. These include - 


the principle of minimum potential energy 
[u], the generalized principle [o, ¢, uj, the 
Hellinger-Reissner principle [o, u], the prin- 
ciple of minimum complementary energy 
[c], and so forth, where the symbols in the 
brackets represent independent functions 
subject to variation. In connection with the 
aforementioned variational principles, vari- 
ational principles with relaxed continuity re- 
quirements have also been formulated by re- 
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laxing the continuity requirements imposed 
on admissible functions. 

One of the practical advantages of these 
variational principles is that they often provide 
the problem with approximate formulations 
and approximate methods of solution, among 
which the Rayleigh-Ritz method is well known. 
Theories of beams, plates, shells, and multi- 
component structures are typical examples 
of such approximate formulations. Recently, 
these variational principles have been found to 
provide effective bases for the formulations of 
the finite element method. 

(9) Notation. Various symbols are used for 
stress and strain. For example, a, t and e, y are 
commonly used in the engineering literature. 
However, in Love’s treatise [6], X,, Y,, Z, and 
exx» xy» €xz are used in place of o, T,,, Tz and 
Exs Vxy> Yxzs respectively. Also, various nota- 
tions are used for elastic constants. E is widely 
used, while G and v are less common. Love 
uses 4 and o for G and v, respectively. In the 
engineering literature the reciprocal of Pois- 
son’s ratio m (= 1/v) ts used and is called the 
Poisson number. 

(10) Finite-displacement theory of elastic- 
ity. When the displacement of the body is 
no longer small (infinitesimal) but is finite, 
we should abandon small-displacement the- 
ory and instead employ finite-displacement 
theory, in which stress and strain are defined 
in a manner different from that in small- 
displacement theory, keeping in mind the 
difference between spatial and material vari- 
ables. Thus equations of equilibrium, strain- 
displacement relations, and the boundary 
conditions on S, become nonlinear equations 
with stress and displacement components as 
unknowns, although the stress-strain relations 
remain linear. Thus the problem is reduced to 
solving a nonlinear boundary value problem, 
sometimes called a nonlinear elasticity prob- 
lem. Variational principles have been formu- 
lated for finite-displacement theory and are 
frequently used in the formulation of approxi- 
mate methods of solution. 

When the stress becomes large enough to 
exceed the so-called elastic limit, where the 
linear stress-strain relationships cease to hold, 
the theory of elasticity is no longer valid and 
should be replaced by the theory of plasticity 
[9, 10]. 
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A. General Remarks 


A single-valued tanalytic function in a domain 
D in the complex plane C is called meromor- 
phic in D if it has no singularities other than 
tpoles. A function that is meromorphic in 

the whole complex plane including the point 
at infinity is a rational function (Liouville’s 
theorem). Specifically, if a function is mero- 
morphic in the domain C, then the function is 
called simply a meromorphic function, and a 
meromorphic function that is not a rational 
function is called a transcendental meromor- 
phic function. A meromorphic function f(z) 
can be represented as a quotient of two tentire 
functions. Let {z,} (k= 1,2, ...) be poles of f(z), 
and let f,(z) =a /(z—z,)"*+... +a (z—z,) 
denote the ‘singular parts of f(z) at z; (k= 
1,2,...). Then f(z) can also be written in the 
form 


fl2)=9(2) 7 (lz) — pul) 


where g(z) is an entire function and the p,(z) 
(k=1,2,...) are rational entire functions 
(Weierstrass’s theorem). Assume that a se- 
quence {z,} (k=1,2,...) converges only to the 
point at infinity and that f,(1/(z—z,)) (k= 

1,2, ...) are rational entire functions of 1/(z— 
z,) which have no constant terms. Then 
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there exists a meromorphic function of z with 
f,A/(z —2z,)) as its tsingular part at z, (Mittag- 
Leffler’s theorem). 


B. Nevanlinna Theory 


The theory of meromorphic functions can be 
considered an extension of the theory of entire 
functions. In particular, value distribution 
theory, originating in Picard’s theorem, was 
studied by many people, and in 1925 R. Nevan- 
linna published a systematic theory unifying 
the results obtained until then. This is called 
Nevanlinna theory. 

We let f(z) denote a meromorphic function 
in |z|< R< +œ, and when we say that f(z) 
takes on a value, the value may be oo. Fora 
value a, n(r, x) denotes the number of -points 
of f(z), i.e., points z with f(z)=a, in |z|<r<R, 
where each «-point is counted with its multi- 
plicity. We set 


N(r, apm [MO +n a)logr, 


0 


1 f7 1 
meg= | log* |__| 40 


if a4 oo, and 


Nr, o= | Me aa =) dt +n(0, 00) logr, 


0 


2n 
mr, xz | log* | f(re'®)| de 
2r Jo 
if x= œ, where log* a=max(loga, 0) for a>0. 
The functions N and m are called the counting 
function and proximity function of f(z), respec- 
tively, and T(r)= T(r, f)=m(r, 00)+ N (r, œ) is 
the order function (or characteristic function) of 
f(z). T(r) is an increasing function of r and a 
tconvex function of logr and is useful for ex- 
pressing f(z) as an infinite product, etc. 

The following relation holds among T(r), 
m(r, a), and N (r, «) for any a: 


T(r)=m(7, a) + N(r, o) + O(1), (1) 


where O(1) is tLandau’s symbol (Nevanlinna’s 
first fundamental theorem). By this theorem, if 
a bounded remainder is disregarded, then 
m(r, a) + N(r,a) is equal to T(r) for all æ. This 
equality thus demonstrates a beautifully bal- 
anced distribution of «-points. 

We see that N{r,a) is in a sense the mean 
value of the number of «-points in |z| <r, and 
m(r, a) is the mean proximity to « of the value 
f(z) on |z|=r. If the term log* in the definition 
of the proximity function is replaced by the 
logarithm of the reciprocal of the chordal 
distance between f(re’’) and x on the com- 
plex sphere, then the remainder term in (1) is 
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eliminated. Hence the definition of the proxim- 
ity function is sometimes given in this form. 


C. The Order of Meromorphic Functions 


For an entire function f(z), the equality 


. logT(r) loglog M (r) 
lim sup = lim sup-——— 

om ogr has logr 
holds, where M(r)= M (r, f)=max,,)_,|f(z)]. 
Since the right-hand side is the order of f(z) 
(— 429 Transcendental Entire Functions), 
we define the order (lower order) p of a mero- 
morphic function f(z) by 


log T(r) . , log T(r) 
lim inf ; 
r r> 


0 ogr 


p=limsup I 
The order of a meromorphic function in {z|<R 
is also defined by 

log T(r) 


= lim sup ————_-—_. 
P= R log(l/(R—N) 


D. Meromorphic Functions on a Disk 


The order function T(r) is bounded if and 
only if f(z) can be represented as the quo- 
tient of two bounded holomorphic functions 
h (z), h2(z) in |z| << R (Nevanlinna). If T(r) is 
bounded, lim,_., f (re) exists and is finite 

for every 0, 0<0< 2r, except possibly for a 
set with tlinear measure zero (P. Fatou and 
Nevanlinna). Among functions f(z) such that 
lim,_.z T(r) = œ, those satisfying 


T(r) 


lim sup ———_——— = co 

ror log(l{R—r)) 
have properties similar to those of transcen- 
dental meromorphic functions. 


E. Meromorphic Functions in the Whole Finite 
Plane 


Any meromorphic function such that 


T(r) 
lim sup-——< K 
ror logr 

is a rational function. If f(z) is a meromorphic 
function of order p and {r(«)}, ra) <r; (2) 
(j=1,2,...) is the set of absolute values of «- 
points, then 17°, (1/r,(a))?** converges for any 
a. Furthermore, 


fle)=zter 
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where p is the smallest integer satisfying p+ 1 
>p, the a, and b, are the zeros and poles of 
f(z), respectively, k is an integer, and P(z) is a 
polynomial of degree at most p (Hadamard’s 
theorem). 

Let «,,...,%, (q> 3) be distinct values. Then 
for any meromorphic function in |z|< R <0, 


(@-2)T<Y N(r,2)—N,()+ DO), 


O<r<R. 


Nyiop= | OM aan Olor 


n,(r) is the number of tmultiple points in |z| <r 
(a multiple point of order k is counted k— 1 
times), and D(r) is the remainder such that if R 
= 0, then D(r)< K(log T(r) +logr) for some K 
except possibly for values of r belonging to the 
union of a countable number of intervals with 
finite total length, and if R < œ, then D(r)< 
K (log T(r) + log(1/(R —r))) except possibly for 
the union of a countable number of intervals 
{L} with £J,,d(1(R —r)) < œ (Nevanlinna’s 
second fundamental theorem). 
Several theorems on value distribution of 
meromorphic functions can be obtained di- 
rectly from this theorem. For instance, if f(z) is 
a transcendental meromorphic function, the 
equation f(z)=« has an infinite number of 
roots for every value « except for at most 
two values called Picard’s exceptional values 
(Picard’s theorem). For a meromorphic func- 
tion of order p, lim, Z,,<r(%,(a)) * (<p) 
diverges for every value « except for at most 
two values (Borel’s theorem). A value « for 
which the series converges is called a Borel 
exceptional value. We call 6(«)=6(a, f)=1 
—limsup,.,,, N(r, «)/T(r) the defect of f. It 
always satisfies 0<6d(a«)<1, and the values 
with d(«)>0 are called Nevanlinna’s excep- 
tional values. The number of values « (may be 
œ) with 6(«)>0 is at most countable for any 
meromorphic function f(z), and 2%, d(«;) <2. 
There are many studies concerning the values 
a with d(a)=0 


F. Julia Directions 


Among functions that have an essential sin- 
gularity at the point at infinity and are mero- 
morphic in the whole plane, there are some 
that possess no tJulia directions. These func- 
tions, called Julia exceptional functions, are 
of order 0. A necessary and sufficient condi- 
tion for f(z) to be a Julia exceptional func- 
tion is that f(z) can be written in the form 

z” TI, —z/a,)/T1,(1 —z/b,) (A. Ostrowski, 
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1925). Concerning zeros a, and poles b, of f(z), 
the following three properties are obtained 
using the theory of tnormal families due to P. 
Montel: (1) There are constants K,, K,, and 
K, independent of r such that |n(r, 00) —n(r, 0)| 
<K,, n(2r, 0©)—n(r, 0)< K,, and n(2r,0)— 
n(r, 0)< K3. (2) There are constants K, and 

K, such that for any p and q, 


a a 
P P 
|a,|" et | I] b <K4, 
la„|<lapl ay lb,i<lapl v 
b b 
ba TT e T <k 
Ibyi<ibgl | By |] layl<lbgl | Ey 


(3) There exists an ¢>0 satisfying |a,/b,—1|> 
é>0 for any p and q. 

G. Valiron gave a precise form of Julia 
directions that corresponds to Borel’s theorem 
(Acta Math., 52 (1928), [3]). Namely, if the 
order p of a meromorphic function f(z) in |z| < 
œ is positive and finite, then there exists a 
direction J defined by argz=« such that the 
zeros z,(a, A) of f(z)—a in any angular domain 
A:|argz—a|<6 containing J have the prop- 
erty 2, |z,(a, A)| °° = œ for any e>0 except 
for at most two values of a. The direction J is 
called a Borel direction. 


G. Relations between Two or More 
Meromorphic Functions 


Borel’s unicity theorem can be stated as fol- 
lows: Let f; (j= 1, ...,n) be nonvanishing entire 
functions satisfying L7_, f;= 1; then for some 
(Cis ---,¢,) A(O,...,0), LF, ¢,f;=0. This is 
contained in the following theorem: Let f; 
(j=1,...,n) be transcendental entire functions 
such that 27-1 j= 1; then Xi- 6(0, f))<n—1. 
If two meromorphic functions f, (z), f>(z) have 
the same «points for five distinct values a; (j 
=1,...,5) (where multiplicity is not taken into 
account), then they coincide everywhere. If the 
*Riemann surface of the talgebraic function 
w(z) defined by a polynomial P(z, w)=0 of z, w 
is of tgenus > 1, it is impossible to find mero- 
morphic functions z = f (¢), w=g(¢) that satisfy 
P(f(¢), g(€)) =0 (tuniformization by meromor- 
phic functions). 


H. Asymptotic Values 


If a meromorphic function f(z) >a as z~-»00 
along a curve C, the value « and the curve 

C are called an asymptotic value and asymp- 
totic path, respectively. Each (Picard’s) excep- 
tional value of f(z) is an asymptotic value. For 
meromorphic functions, no simple relation is 
known between their order and the number of 
their asymptotic values. There exists a merom- 
orphic function of order 0 with an infinite 
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number of asymptotic finite values. Some 
results analogous to those for entire functions 
are obtained for meromorphic functions by 
applying the theory of normal families. F. 
Marty established a systematic theory of 
normal families of meromorphic functions by 
using spherical distance. 


I. Inverse Functions 


Generally, the inverse function of a meromor- 
phic function w= f(z) is infinitely multiple- 
valued. Let P(w, wo) be a function element of 
the inverse function with center at wo, and let 
C be an arbitrary curve starting at wọ and with 
œ its terminal point. For any domain S con- 
taining C, P(w, wọ) can be continued analyti- 
cally in S up to a point arbitrarily near œ 
(Iversen’s theorem). Continue the function 
element analytically along each half-line start- 
ing at its center. Then the set of arguments of 
half-lines along which the analytic continu- 
ation meets a singularity at a finite point is of 
zero linear measure (Gross’s theorem). 

By considering the inverse image of the 
suitably cut Riemann surface of the inverse 
function, the z-plane can be divided into fun- 
damental domains such that each domain is 
the inverse image of the whole w-plane (with 
suitable slits removed) and has a boundary 
each point of which is taccessible from the 
inside of the domain, and the boundary curves 
of fundamental domains cluster nowhere in 
the plane. 

For a meromorphic function f(z), the set 
of functions z’ = (z) defined by f(z’) = f(z) 
(i.e., transformations between points that give 
f(z) the same value) has the property of a 
thypergroup. If @(z) is single-valued, then it 
is a linear entire function, and if it is finitely 
multiple-valued, then it is an algebraic func- 
tion. The fcluster set of the inverse function at 
a transcendental singularity consists of only 
one point, oo, that is, it is an fordinary sin- 
gularity. To an analytic continuation along a 
curve that determines a transcendental sin- 
gularity of the inverse function there corre- 
sponds a curve in the z-plane terminating at 
oo. This curve is an asymptotic path of f(z). 
Namely, the value f(z) tends to the coordinate 
of the transcendental singularity as z— œ 
along this path. Each asymptotic value of a 
transcendental meromorphic function w= f(z) 
corresponds to a transcendental singularity of 
its inverse function z = (w), and if we consider 
two asymptotic paths to be the same if they 
correspond to the same singularity, then there 
exists a one-to-one correspondence between 
the set of asymptotic paths and the set of 
transcendental singularities of the inverse 
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function. The inverse function of any mero- 
morphic function of order p has at most 2p 
‘direct transcendental singularities if p > 1/2 
and at most 1 such singularity if p < 1/2 (L. V. 
Ahlfors). 


J. Theory of Covering Surfaces 


Ahlfors established the theory of covering 
surfaces by a metricotopological method and 
in applying it, obtained Nevanlinna theory 
and many other results on meromorphic 
functions. 

Let F, denote the covering surface of the 
Riemann sphere F with radius 1/2; F, is the 
image of |z|<r under a meromorphic function 
w= f(z). The area of F, divided by z, where z is 
the area of Fo, is given by 


ole 
o-i] \ a aie 


z=pe®, 


and is called the mean number of sheets of F.. 
The length of the boundary of F, is given by 


If'(2)| 
|z|=r 1+ IAEN? 


The relation 


L(r)= |dz]. 


To= (4Oar+ o0 


holds (T. Shimizu, Ahlfors). 

Consider the Riemann surface of the inverse 
function of a meromorphic function w= f(z) in 
|z| < R< +. It has a countable number of 
components Q, over a domain on the w-plane. 
Let A, denote the inverse image of Q, on the z- 
plane. If A, together with its boundary is con- 
tained in |z| < R, the component Q, is called an 
island, and otherwise, a peninsula. 

Let D be a simply connected domain of the 
w-plane, n(r, D) be the sum of the sheet num- 
bers of the islands of F, over D, and m(r, D) be 
the sum of the areas of the peninsulas of F, 
over D divided by the area of D. Then 
m(r, D)+n(r, D)= A(r)+ O(L(r)). 

Let D, (j= 1, ...,q) (q È 3) be disjoint simply 
connected domains on the w-plane. Then 


3 n(r, D)— 3 


nı (r, D)>(q—2)A(r)—O(L(r)), 
where n, (D) is the sum of the orders of branch 
points in all islands of F, over D. 

For a meromorphic function f(z), L(r)< 
A(r)"?**, where r satisfies 0<r< œ except 
forre GF for some intervals J,. Hence in the 
case where D; is a point a,, this inequality 
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yields 


>(q—2)A(r)—O(A()'?*) 


with some exceptional intervals of values of r. 
This latter important inequality corresponds 
to Nevanlinna’s second fundamental theorem. 
Let D, (j=1, ...,q) (q = 3) be disjoint simply 
connected domains on the w-plane. If every 
simply connected island over D; has at least 4; 
sheets, then X4- (1 —(1/y;)) <2 (disk theorem). 
It follows from this theorem that given three 
disjoint disks D; on the Riemann sphere, there 
is at least one D, that has an infinite number 
of islands over it, and also that given five D 
there exists at least one D, that has a 1-sheeted 
island over it (Ahlfors’s five-disk theorem). This 
theorem corresponds to tBloch’s theorem for 
entire functions. These theorems can also be 
obtained for meromorphic functions on a disk. 
Ahlfors established a more important theory 
by introducing a differential metric. 


K. Recent Development 


The Nevanlinna brothers raised several im- 
portant problems, which gave strong motiva- 
tion for later investigations. The first major 
breakthrough after World War II was given by 
A. Goldberg in 1956. He gave an example 
which has infinitely many deficient values 
(Nevanlinna’s exceptional values). W. Hayman 
proved that > d(a, f converges for «> 1/3, 
and there are examples of meromorphic func- 
tions for which the series diverges for a < 1/3. 
Finally, A. Weitsman showed that the series 
converges for «= 1/3. The second major break- 
through was given by A. Edrei and W. Fuchs 
in 1959, whose works concern the following 
Nevanlinna theorem: Let K(f) be 


N(r,0)+ N(r, œ) 
lim sup Tf) 


and x(p)=inf K(f), where inf is taken over all 
meromorphic function f of order p. Then x(p) 
> 0 if p is neither a positive integer nor œ. 
Furthermore, Nevanlinna gave a conjecture 
for an exact value of x(p). This conjecture is 
still open, although several estimates have 
appeared. In the case of entire functions hav- 
ing only negative zeros the conjecture was 
positively solved by S. Hellerstein and J. Wil- 
liamson. Edrei and Fuchs proved that x(p)=1 
forO<p<1/2 and k(p)=sinzp for 1/2<p<l. 
They also proved the ellipse theorem: Let f(z) 
be a transcendental meromorphic function of 
order p (O<p< 1). Put u=1—d(a, f), v=1— 
O(b, f). Then u, ve [0, 1] and u? — 2uv cos np + 
v? 2 sin? xp. If further u < cos zp, then v= 1. 
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For the sum of deficiencies Edrei proved 
that 


Zaans] 


unless the number of deficient values is one. It 
was conjectured that È ô(a, f )=2 implies that 
the order p of f is a half integer, the number v 
of deficient values is at most 2p, and the value 
of d(a, f) is a multiple of 1/p. For the case of 
entire functions this conjecture is true (A. 
Pfluger). Weitsman proved that v< 2p. Other 
cases remain open. All the above results still 
hold even if the order is replaced by the lower 
order. The inverse problem was completely 
solved by D. Drasin. Many of the above re- 
sults depend on the concept of Polya peaks. 

N. V. Govorov and V. Petrenko proved 
independently that 


. . log M({r, f) 
lim inf ———~—— 


Be Te, f) 
for every entire function of order p. This was a 
conjecture made by R. Paley. For p < 1/2, an 
exact upper bound zp/sin zp was given by 
Valiron. Furthermore, the following result was 
proved by Edrei and Fuchs: 


N(r, 0) sin 7p 
ogM(r,f)” np 


t—coszp (O<p<1/2) 
(1/2<p<l1) 


2—sinzp 


<np for p>1/2 


lim sup 


roo | 


(0<p<1). 


L. History 


The value distribution theory of meromorphic 
functions had its inception with the classical 
Picard theorem. It first appeared as the value 
distribution theory of entire functions and was 
developed into a well-organized field by way 
of the Nevanlinna theory and the Ahlfors 
theory of covering surfaces. In recent years, 
emphasis has also been placed on the study of 
meromorphic functions on open Riemann 
surfaces (— 367 Riemann Surfaces). The value 
distribution of a set of several meromorphic 
functions was studied first by A. Bloch and 
developed into the study of meromorphic 
curves by Ahlfors, and H. and J. Weyl [2]. The 
behavior of meromorphic functions in neigh- 
borhoods of general singularities has also been 
studied. An example of results in that field is 
the theory of fcluster sets. 
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A. General Remarks 


The distance between two points x =(x,,..., 
x,) and y=()j,,..., y,) in the n-dimensional 
tEuclidean space R” is defined by p(x, y)= 


JO1—x1) +... +, —X,)?. The function 
p(x, y) is nonnegative for every pair (x, y) and 
has the following properties: (i) p(x, y)=0 if 
and only if x = y; (il) p(x, y)= p(y, x); and 

(iii) p(x, z)< p(x, y)+ p(y, z) for any three 
points x, y, z. Property (iii) is called the 
triangle inequality. 


B. Definition of Metric Spaces 


Abstracting the notion of distance from Eu- 
clidean spaces, M. Fréchet defined metric 
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spaces [1] (1906). A metric on a set X isa 
nonnegative function p on X x X that satisfies 
(i), (ii), and (iii) of Section A, and a metric space 
(X, p), or simply X, is a set X provided with a 
metric p. The members of X are called points, 
p is called the distance function, and p(x, y) is 
called the distance from x to y. The distance 
function is sometimes denoted by d(x, y) or 
dis(x, y). If (i) is replaced by its weaker form (1’) 
p(x, x)=0, the function p is called a pseudo- 
metric (or pseudodistance function), and X is 
called a pseudometric space. 

Examples of metric spaces: 

(1) The n-dimensional Euclidean space R”, in 
particular the real number system R with 
Po{x, y)=|x —y|. (2) The ffunction space 
L,(Q). (3) The tfunction space C(Q). (4) The 
tsequence space s, i.e., the space Rẹ of all se- 
quences of real numbers with metric p(x, y)= 
Enei 2 "Xn Yall + [Xn — Yal), Where x = 
(x,,X2,..-) and y=(yj, Y2, --- ). (5) The tse- 
quence space m, i.€., the space of all bounded 
sequences of real numbers with metric 
p(x, y)=sup,|x, —y,| for x =(x,,X>,...) and 
y=()1,)2.---). (6) A Baire zero-dimensional 
space (QN, p), where Q is a set and the dis- 
tance p(x, y} between x =(x,,X,,...) and y= 
(¥1,¥2,---} is equal to the reciprocal of the 
minimum n such that x,#y,. When the car- 
dinal number t of Q is specified, the Baire 
space is denoted by B(t). (7) For any set X, 
define p by setting p(x, x)=0 and p(x, y)=1 
when x #y. Then (X, p) is a metric space, 
called a discrete metric space. (8) For any set 
X, define p by setting p(x, y)=0 for any mem- 
bers x and y. Then p is a pseudometric, and 
the resulting space X is called an indiscrete 
pseudometric space. 

For a subset M of a metric space X, 
sup { p(x, y)|x, ye M} is called the diameter 
of M (denoted by d(M)), and M is said to 
be bounded if its diameter is finite (includ- 
ing M = Ø). For two subsets A, B of X, 
inf{ p(x, y)| xe A, ye B} is called the distance 
between A and B, denoted by p(A, B). We 
have p{A, B)= p(B, A). When a family W= 
{M,|AeA} of subsets of X is a covering of 
X, i.e., X=|),M,, the supremum of the dia- 
meters d(M,) of M, in WM, sup{d(M,)|4e A}, 
called the mesh of the covering WM. For a posi- 
tive number e, a covering whose mesh is less 
than e is called an e-covering. A metric space 
X is called totally bounded (or precompact) 
(F. Hausdorff, 1927) if for each positive num- 
ber e there exists a finite -covering of X. 

A subset X, of a metric space X becomes a 
metric space if we define its metric p, by set- 
ting p,(x, y)= p(x, y) for x, ye X,, where p is 
the metric of X. The space (X,, p,) is called a 
metric subspace of (X, p). A subset of X is 
called totally bounded (or precompact) if it is 
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totally bounded as a metric subspace. Any 
totally bounded subset is bounded. Converse- 
ly, in the Euclidean space R” any bounded 
subset is totally bounded. 

A bijection f from a metric space (X,, p4) 
onto a metric space (X,, p2) is called an iso- 
metric mapping if f preserves the metric, i.e., 
Pa f(x), JO) = py (x y) for any points x, ye X,; 
and X, and X, are called isometric if there is 
an isometric mapping from X, onto X3. 

Let X be a metric space with metric p, and 
left f be an injection from a set Y into X. 
Then the function p'(y1, Y2)= P(S (y1) f(V2)) 
(V1, ¥2€ Y) is a distance function on Y, and 
with this metric the set Y becomes a metric 
space called the metric space induced by /; 

f is an isometric mapping from (Y, p’) onto 
(SCY), p). 
For a finite number of metric spaces (X,, p4) 
<-> (Xn Pn), we can define a metric p on their 
Cartesian product X = X, x... x X,, by 
setting 


p(x, Y= Px(X4,y4) + Lt PulXns Vn) j 


for two points x =(X,,..-,Xq), Y=(Y1; -> Yn) 

of X. Thus we obtain a metric space (X, p), 
called the product metric space of (X,,/,), ..., 
(Xn Pn). The n-dimensional Euclidean space R” 
is the product metric space of n copies of the 
real line (R, po). 


C. Topology for Metric Spaces 


For a point x of a metric space (X, p) and 
any positive number g, the set U,(x) of all 
points y such that p(x, y)<e is called the g- 
neighborhood (or ¢-sphere) of x. We can intro- 
duce a topology for X by taking the family of 
all e-neighborhoods as a tbase for the neigh- 
borhood system (— 425 Topological Spaces). 
Then the following five propositions hold, any 
one of which can be used to define the same 
topology: (i) A subset O is topen if and only if 
for any point x in O there is a positive number 
e such that the e-neighborhood of x is con- 
tained in O. (ii) A subset F is *closed if and 
only if any point whose every -neighborhood 
contains at least one point of F is contained 
in F. (ut) A subset U is a neighborhood of a 
point x if and only if U contains some e- 
neighborhood of x. (iv) A point x is an finte- 
rior point of a subset A if and only if some £- 
neighborhood of x is contained in A; the inte- 
rior A’ of A is the set of all such points. (v) A 
point x is adherent to a subset A if every £- 
neighborhood of x contains at least one point 
of A; the closure A of A is the set of all such 
points, and xe A if and only if p(x, A)=0. 
Every metric space X satisfies the tfirst 
countability axiom. A metric space X is a 
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tHausdorff space and, more specifically, a tper- 
fectly normal space; it is also tparacompact. 

In the same way, we can define a topology 
for each pseudometric space that satisfies the 
first countability axiom, but a pseudometric 
space is not necessarily Hausdorff. 


D. Convergence of Sequences 


A sequence {x,} of points in a metric space is 
said to converge to a point x (written lim,..,, Xn 
=x) if p(x,, x) tends to zero as n> œ. The 
point x is called the limit of {x,}. This conver- 
gence is equivalent to convergence with re- 
spect to the topology defined in Section C (— 
87 Convergence). As the first countability 
axiom is satisfied, we may define the topology 
by means of convergent sequences of points: 
the closure A of a subset A is the set of all 
limits of sequences of points in A. 


E. Separable Metric Spaces 


For a metric space X the following three con- 
ditions are equivalent: (1) There exists a coun- 
table family D, of open sets of X such that 
each open set of X is the union of members of 
Do (tsecond countability axiom). (ii) X is tsep- 
arable, that is, X has a countable subset that is 
tdense in X. (iii) Every open covering of X has 
a countable subcovering (tLindel6f space). A 
metric space with any of these properties is 
called a separable metric space. The sequence 
space s is separable. Any separable metric 
space can be isometrically embedded in the 
sequence space m, i.e., is isometric to a sub- 
space of m(— 168 Function Spaces B). 


F. Compact Metric Spaces 


For a metric space X, the following five con- 
ditions are equivalent: (i) X is tcompact, that 
is, every open covering of X has a finite sub- 
covering. (ii) X is countably compact, that is, 
every countable open covering of X has a 
finite subcovering. (iii) X is tsequentially com- 
pact, that is, any sequence of points in X has a 
convergent subsequence. (iv) Every nested 
family F, > F, >... of nonempty closed sets 

of X has a nonempty intersection. (v) Every 
infinite subset M of X has an accumulation 
point x, i.e., xe M — {x}. A metric space satis- 
fying any of these conditions is called a com- 
pact metric space (M. Fréchet [1]). Every real- 
valued continuous function defined on a com- 
pact metric space has a maximum and a min- 
imum. A metric space is compact if and only 
if it is totally bounded and complete (— Sec- 
tion J). Every totally bounded metric space is 
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separable. In particular, every compact metric 
space ts separable. 

Let U={U,} be an open covering of a com- 
pact metric space X. There exists a positive 
number ô such that every set with d(A)<6 is 
contained in some U,. The number ô is called 
the Lebesgue number of the open covering UI. 

A subset A of a metric space is said to be 
compact if it is compact as a metric subspace, 
and A is said to be relatively compact if its 
closure is compact. Bounded closed sets in R”, 
in particular closed intervals of real numbers, 
are compact. For these sets, conditions (i), (iv), 
and (y) are called the Heine-Borel theorem (or 
Borel-Lebesgue theorem), Cantor’s intersection 
theorem, and the Bolzano-Weierstrass theorem, 
respectively. 


G. Product Spaces of Metric Spaces 


Let (X,, Pi), ---, (Xn, Pa) be metric spaces. Then 
the Cartesian product X = X, x... x X, has 
distance functions 


PAX, Y= {p1 (x1, yP +- + Pa(Xns Ya” } P, 


p21 
and 
Pa (x, y)= max { py (Xp, Y1), os Pal(Xn Va) 
where x =(x,,...,X,) and y=(y,,..., y,). The 


topology of X induced by each one of these 
metrics coincides with the product topology. 
In particular, for the n-dimensional Euclidean 
space R”, the metrics p, (p> 1) and pa define 
the same topology. 

Let (X,, p1),---, (Xn Pn), --- be a countable 
number of metric spaces. If we define a metric 
p on the Cartesian product X =[T],_, X,, by 
SL Pan Yn) 


X, = > 
n ») 2, 2 1 + palXs Yn) 


where x =(X,,X2,...) and y=(y,, Y2,- ), then 
the topology defined by p is identical with the 
product topology. For the Cartesian product 
of an uncountable number of metric spaces, we 
cannot construct a metric p such that the 
topology induced by p agrees with the product 
topology in general. 


H. Uniformity of Metric Spaces 


Every metric space X is a tuniform space, for 
which we may take a countable number of 
subsets {(x, y)| p(x, y)<2-"}, n=1, 2,..., of 
X x X asa base of tuniformity (— 436 Uni- 
form Spaces). 


1017 


I. Uniform Continuity 


A mapping f from a metric space (X, p) into a 
metric space (Y, ø) is tcontinuous if for any 
point x in X and any positive number e there 
is a positive number 6 such that f(U;(x))< 
V,(f(x)), where U(x) is a 6-neighborhood for 
p and V,(y) ts an e-neighborhood for o; that 
is, p(x, x’) <ô implies a( f(x), f(x’))<e. In this 
case, we must generally choose 6 depending 
on x and g. In the special case where we can 
choose 6 depending only on e, independently 
of x, we call f uniformly continuous in X. (The 
notion of uniform continuity may be general- 
ized to uniform spaces.) Not every continuous 
mapping is necessarily uniformly continuous, 
but every continuous mapping from a com- 
pact metric space into a metric space is uni- 
formly continuous. 


J. Complete Metric Spaces 


A sequence {x,„} of points in a metric space 
(X, p) is called a fundamental sequence (or 
Cauchy sequence) if p(x, x,,) 0 as n, m> œ. 
Every convergent sequence is a fundamental 
sequence, but the converse is not always true. 
A metric space is called complete if every fun- 
damental sequence in the space converges to 
some point of the space (M. Fréchet [1]). A 
topological space that is homeomorphic with a 
complete separable metric space is sometimes 
called a Polish space (— 22 Analytic Sets I). 
The metric spaces introduced in examples (1) 
through (5) of Section B are complete. (In 
example (3) we must assume that the space Q 
is a compact Hausdorff space.) A metric space 
is compact if and only if it is complete and 
totally bounded. A flocally compact metric 
space is homeomorphic to a complete metric 
space. 

For a metric space X, we can construct a 
complete metric space Y such that there is an 
isometric mapping ọ from X onto a dense 
subspace X, of Y (F. Hausdorff, 1914). Such a 
pair (Y, ọ) is called a completion of X. If X has 
two completions (Y,, g,) and (Y2, @,), then 
there is an isometric mapping f from Y, onto 
Y, with œ, = f o @,. In this sense the comple- 
tion of X is unique. By identifying X with 
(X) when (Y, p) is the completion of X, any 
metric space can be regarded as a dense sub- 
space of a complete metric space. For example 
the completion of the rational number system 
Q is the real number system R. A metric space 
is totally bounded (= precompact) if and only 
if its completion is compact. 

Baire-Hausdorff theorem: In a complete 
metric space every set of the ffirst category is a 
tboundary set. That is, every set that can be 
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expressed as the union of a countable number 
of sets whose closures have no interior point 
has no interior point. In other words, if the 
union |_)?_, F, of closed sets F,, F,,... of X has 
an interior point, then at least one of the F, 
must have an interior point. 


K. The Metrization Problem 


A topological space X is called metrizable if 
we can introduce a suitable metric for X which 
induces a topology identical to the original 
one. A tT, -space satisfying the second counta- 
bility axiom is metrizable if and only if it is 
tregular (Uryson-Tikhonoy theorem; P. S. 
Uryson, Math. Ann., 94 (1925), A. Tikhonov, 
Math. Ann., 95 (1925)). However, a metric 
space does not necessarily satisfy the second 
countability axiom. Therefore, the Uryson- 
Tikhonov theorem does not provide a neces- 
sary and sufficient condition for metrizability. 
The following are some necessary and sufficient 
conditions for a topological space X to be 
metrizable: 

(1) There exists a nonnegative real-valued 
function d on X x X satisfying the first two 
axioms given in Section A and the following 
condition: There exists a real-valued function 
(w) that converges to zero as w—0 such that, 
for any three points x, y, z and any positive 
number e, d(x, y)< (e) and d(y, z)< (£) imply 
d(x,z)<e (E. W. Chittenden, Trans. Amer. 
Math. Soc., 18 (1917)). 

(2) X is a T,-space that has a countable 
number of open coverings M,, M,,... satisfy- 
ing the following two conditions: (i) If U,, 

U eM, have a common point, there is a set 
Uewm, with U >U, U U}; (ii) for any point x 
in X, if U, is any member of M, containing x, 
the family {U,},,-1,2,... is a base for the neigh- 
borhood system of x (P. S. Aleksandrov and 
Uryson, C. R. Acad. Sci., Paris, 177 (1923), and 
N. Aronszajn). When X is a uniform space, 
this amounts to saying that X has a metric 
compatible with the uniform structure if and 
only if X is a T,-space and has a countable 
base of uniformity. 

(3) X is a T,-space that admits a countable 
number of open coverings M, Wi,, ... such 
that {S(S(x, M;), Mi) |i j= 1,2, ...} is a base for 
the neighborhood system of x at each point of 
X, where S(A, M) is the ‘star of A relative to 
M (R. L. Moore, Fund. Math., 25 (1935); K. 
Morita, Proc. Japan Acad., 27 (1951); A. H. 
Stone, Pacific J. Math., 10 (1960); A. V. Arkh- 
angel’skii, Dokl. Akad. Nauk SSSR, 2 (1961)). 

(4) X is regular and has a to-locally finite 
open base (J. Nagata, J. Inst. Polytech. Osaka 
City Univ., 1 (1950); Yu. M. Smirnov, Uspekhi 
Mat. Nauk, 6 (1951)). 
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(5) X is regular and has a to-discrete open 
base (R. H. Bing, Canad. J. Math., 3 (1951)). 

(6) X is a tcollectionwise normal Moore 
space (— below; Bing, ibid.). 

(7) X is a tperfect image of a subspace of a 
Baire’s zero-dimensional space (Morita, Sci. 
Rep. Tokyo Kyoiku Daigaku, sec. A, 5 (1955)). 

(8) X is a Hausdorff tM-space such that the 
diagonal is a 'G,-set in the direct product 
X x X (A. Okuyama, Proc. Japan Acad. 40 
(1964); C. J. R. Borges, Pacific J. Math., 17 
(1966); J. Chaber, Fund. Math., 94 (1977)). 

A regular space is said to be a Moore space 
if it has a countable number of open coverings 
M; such that {S(x, Mit,)} is a base for the neigh- 
borhood system of x for any point x. A Moore 
space is not necessarily metrizable. F. B. Jones 
(Bull. Amer. Math. Soc., 43 (1937)) proved 
under the assumption 25° < 2": that every 
separable normal Moore space is metrizable 
and asked whether or not every normal Moore 
space is metrizable. (3) and (6) are partial 
answers to the question. A normal Moore 
space is metrizable if it is locally compact and 
tlocally connected (G. Reed and P. Zenor). The 
existence of a nonmetrizable separable normal 
Moore space is consistent with and indepen- 
dent of the axioms of the usual ZFC set theory, 
the tZermelo-Fraenkel set theory with the 
taxiom of choice (F. D. Tall). W. G. Fleissner 
(Trans. Amer. Math. Soc., 273 (1982)) con- 
structed a normal nonmetrizable Moore space 
assuming an axiom weaker than the contin- 
uum hypothesis, while P. J. Nyikos (1980) has 
proved that every normal space with the first 
countability axiom is collectionwise normal 
from the strong axiom of set theory. 

In connection with (7) the following result is 
known. Let f be a tclosed continuous mapping 
from a metric space X onto a topological 
space Y. Then the following conditions are 
equivalent: (1) Y is metrizable; (2) For each 
ye Y the tboundary ôf! (y) of the inverse 
image is compact; (3) Y satisfies the first coun- 
tability axiom (Morita and S. Hanai, Proc. 
Japan Acad., 32 (1956); Stone, Proc. Amer. 
Math. Soc., 7 (1956); I. A. Vainstein, Dokl. 
Akad. Nauk SSSR, 57 (1947). In particular, 
perfect images of metric spaces are metrizable. 

For quotient topological spaces of metric 
spaces — 425 Topological Spaces CC. 
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A. General Remarks 


Let X be an open set of R”. Then X x (R"\0) 
can be identified with T*X ~0, the tcotangent 
bundle of X minus the zero-section. To every 
locally integrable function (or distribution 

or hyperfunction (— 125 Distributions and 
Hyperfunctions)) f(x) defined on X, one can 
assign closed subsets of X x (R" <0) called the 
wave front set of f and the singularity spec- 
trum (or analytic wave front set or essential 
support) of f. The wave front set (resp. the 
singularity spectrum) of f describes in detail 
the singularity of f modulo the infinitely dif- 
ferentiable functions (resp. the real analytic 
functions). One can further associate with f 
more refined objects defined on X x (R” 0), 
such as microfunctions. In many cases one can 
recover knowledge of the structure of f by 
analyzing these objects defined on X x (R"™.0). 
Such an analysis on the cotangent bundle of X 
is called microlocal analysis. Microlocal analy- 
sis is particularly successful if f is a solution of 
a system of linear (pseudo-)differential equa- 
tions, because in that case one can use vari- 
ous linear transformations, such as differential 
operators, pseudodifferential operators (— 345 
Pseudodifferential Operators), or microdiffer- 
ential operators and Fourier integral opera- 
tors, or quantized contact transformations. 


B. Microlocal Analysis for Distributions 


Let u be a distribution defined in an open 
subset X of R”. The wave front set WF (u) of u 
is defined as the complement in X x (R"\0) of 
the collection of all (x9, čo) in X x (R"\0) such 
that for some neighborhood U of xa, V of čo 
we have for each pe C (U) and each N >0, 


<u, pexp(—itx: č) =0(17™) 


as T> 00, uniformly in če V (L. H6rmander; 
{1]). WF(u) is considered to be a subset of 
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T*X N {0}. If WF(u)= Ø, then u is a C®- 
function. Let 7: X x(R"\0)—X be the natural 
projection. If 7(WF(u)) contains xo, then u is 
not a C®-function in any small neighborhood 
of x9. Thus WF(u) is the obstruction for u to 
be infinitely differentiable. Let p(x, D) be a 
tlinear partial differential operator of order m. 
Assume that p(x, D)u= f. Then 


WF(f)< WF(u)— WF(f)U pm (0), 


where p,,:(x, €)> p,,(x, č) is the tprincipal sym- 
bol of p(x, D). Such a technique of localizing 
the problem on the cotangent bundle has 
been used in the form of the estimation of the 
Fourier transform of f since the advent of the 
tsingular integral operators of A. P. Calderon 
and A. Zygmund [3,4] (see also S. Mizohata 
[5,6]). The formula above contains as a special 
case the classical result that u is a C®-function 
if p(x, D) is telliptic and if f is a C%-function. 
In obtaining useful results of microlocal 
analysis for distributions one often uses Fou- 
rier integral operators and pseudodifferential 
operators (or singular integral operators) 
(— 345 Pseudodifferential Operators). 


C. Fourier Integral Operators [1, 2, 7-9] 


A Fourier integral operator B: CS (R")—> 9'(R") 
is a locally finite sum of linear operators of the 
type 


Afisy=any rm f a(x, 0, y) 


x explie(x, 0, y)) f(y) dy dð. 


Here a(x, 0, y) is a C®-function satisfying the 
inequality 


| Dz D§ D?.a(x, 0, y| < C(1 + [O|y™ FFG -P alld 


for some fixed m and p, 1 >p > 1/2, and any 
triple of tmulti-indices «, $, y, and @(x, 0, y) is 
a real-valued C”-function which is homoge- 
neous of degree 1 in 0 for |6| > 1. The function 
ọ is called the phase function and a the ampli- 
tude function. 

Let C, = {(x, 0, y)| dg g(x, 0, y)=0, 0#0} and 
W = {(x, yje R" x R"|50 #0 such that (x, 0, yje 
Co}. If dxo, yØ(x,0, y) #0 for 0 #0, then the 
kernel distribution k(x, y) of A is of class C” 
outside W. A phase function ¢ is called non- 
degenerate if the d, 4 ,(6(x, 0, y)/08,), j= 1,2, 

., N, are linearly independent at every point 
of C,. In this case, C, is a smooth manifold 
in R”*N*” and the mapping ®:C,,a(x, 8, y)> 
(x, y, on), € =d,.9(x, 8, y), n=d, p(x, 9, y), is 
an immersion of C, to T*(R” x R”)~0, the 
cotangent bundle of R” x R” minus its zero- 
section. The image ®C,= A, is a conic La- 
grangian manifold, i.e., the canonical 2-form o 
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= Lda dx;— 1 dn; Ady; vanishes on A, and 
the multiplicative group of positive numbers 
acts on A,. Let 4,,A3,...,/, be a system of 
local coordinates in A,. These, together with 
6/00,, 09/085, ...,0@/00y, constitute a sys- 
tem of local coordinate functions of R"*‘*" 

in a neighborhood of C,. Let J denote the 
Jacobian determinant 


6p 0p 
OENE geod 
1 N 
D(x, 0, y) 


The function an, = JI alc, ® exp(zMi/4) is 
called the symbol of A. Here alc, is the restric- 
tion of a to C, and M is an integer called the 
Keller-Maslov index [10, 11]. The conic La- 
grangian manifold A,=A,(A) and the symbol 
an, =a, (A) essentially determine the singular- 
ity of the tkernel distribution k(x, y) of the 
Fourier integral operator A. Conversely, given 
a conic Lagrangian manifold A in T*(R" x 
R”)~0 and a function a, on it, one can con- 
struct locally a Fourier integral operator A 
such that A,(A)=A and ay (A)=ay. For 
global construction of such a Fourier integral 
operator one requires detailed consideration of 
the Keller-Maslov index. A globally defined 
Fourier integral operator A with A,(A)=A 
and a, (A)=a, exists if and only if a, is not a 
function on A but a section of the complex 
line bundle Q, @ L, where Q,,, is the bundle 
of square roots of the volume elements of A 
and Lis a Z, bundle over A called the Maslov 
bundle. The factor dd exp(2Mi/4) in the de- 
finition of a, , above appears as the effect of 
trivialization of the bundle Q, ® L. Those 
Fourier integral operators whose associated 
conic Lagrangian manifolds are the graphs 
of thomogeneous canonical transformations 
of T*(R") are most frequently used in the 
theory of linear partial differential equations. 
Let A be a Fourier integral operator such that 
A,(A) is the graph of a homogeneous canon- 
ical transformation y. Then the adjoint of A 
is a Fourier integral operator such that the 
associated conic Lagrangian manifold is the 
graph of the inverse transformation y~!. Let 
A, be another such operator; if A,(A,) is the 
graph of z,, then the composed operator 
A, A is also a Fourier integral operator and 
A,(A, A) is the graph of the composed homo- 
geneous canonical transformation y; 7. 
Consider the kernel distribution k(x, y) of A. 
If the phase function ọ of A is nondegenerate, 
then WF(k) is contained in A,(A). Moreover, 
if the symbol a, (A) does not vanish, then 
WFE(k)=A,(A). Let u be a distribution and A 
be a Fourier integral operator such that A,(A) 
is the graph of a homogeneous canonical 
transformation y. Then WF(Au)c y(WF(u)). 
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A pseudodifferential operator of class 
Sp. 1—)(R") is a particular type of Fourier inte- 
gral operator (— 345 Pseudodifferential Oper- 
ators). In fact, a Fourier integral operator 
A is a pseudodifferential operator of class 
Sp.1—)(R") if and only if A,(A) is the graph of 
the identity mapping of T*(R"). Hence for any 
Fourier integral operator A, A*A and AA* are 
pseudodifferential operators. 

The following theorem is due to Yu. V. 
Egorov [12]: Let P(x, D) be a pseudodifferen- 
tial operator of class S” ,_,(R") with the sym- 
bol p(x, č), and let A be a Fourier integral oper- 
ator such that the associated conic Lagrangian 
manifold A,,(A) is the graph of a homogeneous 
canonical transformation y of T*(R"). Then 
there exists a pseudodifferential operator 
Q(x, D) with the symbol q(x, č)e S% , _,(R”) 
such that P(x, D)A = AQ(x, D) and q(x, €)— 
P(x(x, č))e S3125" (R"). Note that m— 2p + 
l<m. 

Assume that m=1, p=1, and that p,(x, ë) 
is a real-valued C”-function, homogeneous 
of degree 1 in £ for |E|>1, such that p(x, €) 

— Pix, E)ES? o(R”) and d.p, (x°, €°) 40 at 

(x°, €°), where p,(x°, €°)=0. Then one can find 
a Fourier integral operator A such that the 
function q(x, č) of Egorov’s theorem satisfies 
the relation q(x, €)—&, € S? a(R”). 

The boundedness of Fourier integral oper- 
ators in the space L,(R") (or the spaces H*(R")) 
has also been studied in several cases. Some 
sufficient conditions for boundedness can be 
found in [7, 8, 14-16]. 

The theory of Fourier integral operators has 
its Origin in the asymptotic representation 
of solutions of the wave equation, (see, e.g., 
[17, 18}). Fourier integral operators were first 
used by G. I. Eskin [7]. 


D. Essential Support or Analytic Wave Front 
Set of a Distribution 


Inspired by the physical idea introduced by 
C. Chandler and H. P. Stapp, J. Bros and D. 
lagolnitzer introduced the notion of the essen- 
tial support of a distribution, which is a closed 
subset of X x (R"~0) [19]. Let u be a distri- 
bution defined on an open set X of R” and 

x be a C*-function with compact support 
around x€ X which is locally analytic and 
different from 0 at xo. Let X, (u) be the subset 
of R”~0 of which the complement is defined 
as follows. A point y is in the complement of 
2 (u) if there exist a conic neighborhood U 

of y, constants 4, yọ >0, and Cy such that 


Ku zexp{—ix'č—ylčllx— xol} > 
<Cy(l + El) "eT 
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for all če U, 0<7 <7, and all positive integers 
N. 

The essential support X, (u) of u at xo, is 
the limit of Z, (u) when the width |supp y| of the 
support of y around x, tends to 0. The es- 
sential support 5(u) of u is the closed subset 
\Jeex {x} x E(u) of X x R”~0. D(u) is the ob- 
struction for u to be real analytic. L. H6rman- 
der [20] also defined the analytic wave front 
set of a distribution, which is also the obstruc- 
tion for a distribution to be real analytic. The 
definition of an analytic wave front set is quite 
different from that of essential support. How- 
ever they coincide with each other [21]. More- 
over, both of them coincide with the singular- 
ity spectrum of u if the distribution u is re- 
garded as a hyperfunction. 


E. Microlocal Analysis for Hyperfunctions 


[22] 


(1) Microfunctions. Let N be a real analytic 
manifold of dimension n+d and M its sub- 
manifold of codimension d. In what follows, 
Ty N and T% N denote the normal bundle of 
M and the conormal bundle supported by M, 
respectively. Here the normal bundle T, N is 
defined to be the quotient bundle TN | ,,/TM 
of the tangent bundle and the conormal bundle 
Ty N to be the subbundle of the cotangent 
bundle T* N|,, that annihilates TM. Identify- 
ing N with {(x,v)e TN|v=0} or {(x, Ze 
1*N|¢=0}, we define the tangential sphere 
bundle SN and cotangential sphere bundle S* N 
by (TN \N)/R%(= rew(TeN S{0})/R*) and 
(T#N\NY/RE(=aen( TAN ~ {0})/R%), re- 
spectively. The normal sphere bundle S, N = 
(Tu N^5MYRX and conormal sphere bundle 
SiN =(Tă Nx M)/RX are defined in the same 
manner. In parallel with the algebraic geom- 
etry (— 16 Algebraic Varieties) we define the 
real monoidal transform of N with center M to 
be the manifold (N ~ M)U Sy N with boundary, 
in which the center M is blown up to Sy N by 
the polar coordinates. We denote it by MN, 
We mainly use this notion when N is a tcom- 
plexification X of M, regarding X as a 2n- 
dimensional real manifold. In this case we can 
canonically identify Ty X with ft TM, and 
hence S,,X with TÆ SM. We denote by x+ 
Jf ot vO the point in S,,X that corresponds to 
(x, ,/—1v) in \/—1 SM by this identification. 
An open subset W of X ~M is called a conoidal 
neighborhood of a subset U of \/—1 SM if 


WU./—1 SM is a neighborhood of U in “X. 
Let e, è, and t denote respectively the canon- 
ical embedding mappings from X ~M to X, 


from “MX ~./—1SM to™X and from ./—1 SM 
~ ~ ~~ 
to “X. We then define the ‘sheaves @ and ./ 
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by &,¢7'Oy and O| /_1 sx¢(=7'O,). Here Oy 
denotes the sheaf of germs of holomorphic 
functions on X. We also define the sheaf 2 on 
/-1 SM by #1/_ism(t ‘Qy), where t is the 
canonical projection from MY to X and HE 
denotes the pth tderived functor of the functor 
I; of taking the sections with support in S. 
Then 2 is isomorphic to 7 /t~'.of for the 
sheaf £ of real analytic functions on M. The 
sheaf ./ is, so to speak, the sheaf of “boundary 
values” of holomorphic functions. Actually 
there exists a canonical mapping b from to 
t | By, where Zy denotes the sheaf of thyper- 
functions on M (— 125 Distributions and 
Hyperfunctions). Thus we see that 2 describes 
the singularities of the boundary value of a 
holomorphic function. These sheaves ./ and 
2 are easy to understand intuitively. However, 
they are defined on fst SM, while NEE 
S*M is more important in analysis. Our final 
goal, namely, the sheaf of microfunctions, is 
constructed on dal S*M through cohomo- 
logical machinery starting from 2. In order to 
do this, we introduce the disk bundle DM by 


{(x+,/—100, (x, / —1 €00)) 


e /—1 SM xy./—1 S*M| <v, €> <0}. 


Here ./—1 SM xy,./—1 S*M denotes the 


‘fiber product of ./--1 SM and ./—1 S*M 


over M and the symbol čo is used to em- 
phasize that ë designates the codirection, 
which is dual to the infinitesimally small 
quantity v0. The canonical projections from 
DM to Ael S*M and from TE SM to 

M are both denoted by t. Similarly, the pro- 
jections from DM to A SM and from 
gat S*M to M are denoted by z. We denote 
by a the antipodal mapping on fol S*M, 


namely, a(x, J- E00) = (x, =,/ =1 00). For 
a sheaf F on gat S* M, we also denote 

a, F(= F) by F". In the following, Rit, 
etc., denotes the jth tright-derived functor of 
the functor t, of taking the direct image of 
sheaves, etc. (— 383 Sheaves). Now the sheaf 
6 of microfunctions is defined on at S*M 
by (Rtr n IF On lay, where wy de- 
notes the torientation sheaf of M. Note that 
Rit n’ 2=0 holds for j#n—1. 

Remark: Here we have defined the sheaf 6y 
of microfunctions on TE S*M. However, it 
is sometimes more convenient to define the 
sheaf on gt T*M (= T% X) by the follow- 
ing convention: 64.2.) a1 = EM bef 10) if 
€#0, and Zy, x if €=0. Several authors (e.g., 
[23]) call this sheaf Gy, the sheaf of micro- 
functions and denote it by Cy. 


(2) Basic Properties of Microfunctions. The 
sheaf €y defined above has the following 
properties: (i) The sheaf €y is a flabby sheaf 
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on ./—1 S*M. (ii) For each (x, ./ — isc) in 


Jl S*M, there exists a surjective mapping 
from By. tO Cm, Zizo This mapping is 
denoted by sp. The mapping from By to 
1, 6y 18 also denoted by sp. (iii) We have the 
following exact sequence: 0->./,, > By > 
160. (iv) Rn 6u =0 holds for k #0. 
The exact sequence (iii) shows that the singu- 
larities of hyperfunctions are dispersed over 
./—-1 S*M and that the dispersed object 
is described by the sheaf @,, of microfunc- 
tions. For a hyperfunction fe 4,,(M), we call 
f)(€@(,/—1 S*M)) the spectrum of f. We 
denote supp sp(f) by S.S. f and call it the 
singularity spectrum of f or the singular spec- 
trum. It is known [21] that this coincides 
with the analytic wave front set of f and with 
the essential support of f if f is a distribution. 
(v) The following sequence is an exact se- 
quence on fA SM: 0> Zy >t By ont 
—0. In the following, a subset A of the (n — 1)- 
dimensional sphere S”"' is said to be convex 
if @ '(A)U {0} is convex, where w is the ca- 
nonical projection from R”~ {0} to S”~1, and 
if w~'(A)U {0} is convex and includes no 
straight line, A is said to be properly convex. 
A subset Z of /-1 SM (resp., /-1 S*M) is 
also said to be (properly) convex if t7'(x)NZ 
(resp., x ‘'(x)MZ) is so for each x in M. For 
a subset Z of ft SM its polar set Z° is, by 


definition, { (x,./—1€, oO) jeu 1. S*M | 
(Dg E> > i oot for each x+,/—1 v, O in 
Z}. The polar set Z° of a subset Z of J-i 
S*M is defined in the same way. (vi) Let U 

be an open subset of gel SM such that 

t ‘(x)NU is a nonvoid connected set for 

each x in M. Let V denote U°°. Then we 

have (a) The restriction mapping p:I'(V;.7 )> 
['(U;.f ) is a bijection. Here T(V; ), etc. de- 
notes the space of global sections of .«/ over V, 
etc. (b) 0->.A(U) > B(M)36(,/—1 S*M — UÀ’) 
is an exact sequence. (vii) Let f(x) be a hyper- 
function on M. Then the following two state- 
ments are equivalent: (a) SP(P Van /—1eq00) =Q. 
(b) There exist a finite family of open subsets 
U; of at SM whose polar set U; does not 
contain (xo,/—1 & 900) and g, in T(U, A ) 
such that f=%,b(¢,). Then we say that fis 


micro-analytic at (x9,./—! & 00). 


(3) Operations on Microfunctions. Let M and 
N be real analytic manifolds, and let f be a 
real analytic mapping from N to M. We de- 
note by T% M the kernel of the natural map- 
ping from N x, T*M to T*N. It is also called 
a conormal bundle supported by N. The as- 
sociated sphere bundle is denoted by SM. 
Denote by p and w the natural mappings from 


N Xy —1 S¥M~,./—1 SHM to,/—1 S*N 
and from N xy,./—1 S*M~./—1 SEM to 


./ —1 S*M, respectively. (i) Let Zy denote the 


me 
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sheaf {ue By|S.S. uN. /—1 S$ M = Ø}. Then 
we have the following two canonical homo- 
morphisms: f*: By > By and f*: pa! Sy 
@y. Here and in what follows, for a continuous 
mapping ọ from N to M and a sheaf F on N, 
~,F denotes the sheaf on M defined by assign- 
ing {ser (P7 (U); F)|Pleupps?Supps> U is 
*proper} to each open subset U of M. These 
two homomorphisms are consistent. We call 
each of them a substitution (homomorphism) 
and denote it by (f*u)(y)=u(f(y)). (ii) Let 

Vy denote the sheaf of tdensities. Then there 
exist the following two canonical homo- 
morphisms: fy: f(y Qu, nx)? Bu Quy, Um and 
Sy: tp "(Gy Qu, Un) > Cu Qy,, Vm- These two 
homomorphisms are consistent. We call each 
an integration along a fiber and denote it by 
(Jau) (x)= fj- u. (iii) Using the result in (i), 
we can define the product u, u, of two hyper- 
functions u, and uw), if S.S. u, N (S.S. u) = 

Ø. Furthermore, the singularity spectrum 


of u, u, is contained in {(x, ./—1(6€, + (1 — 


O)E2) 0) (x, y —1 č; œ)eS.S. uy, (x,y ~1 6,0) 
ES.S.u2, 0S0 <1} US.S. u; US.S. uz. 


F. Microdifferential Operators [22] 


(1) Microlocal Operators. Let M be a real 
analytic manifold, and define the sheaf 

Ly on ,/—1 S4(M x M)=,./—1 S*M by 

HI stm x mu xm Q Vy). A section K(x, y)dy 
of #y naturally determines an integral oper- 
ator X :u(y)—>f| K(x, y)u(y)dy. An operator 
thus obtained is called a microlocal operator, 
because it acts on the sheaf €, of microfunc- 
tions as a sheaf homomorphism. Usually we 
identify an operator ¥ and a kernel function 
K(x, y)dy. (i) Ly is a sheaf of rings by the 
natural composition. The unit element of 

Ly 18 0(x— y)dy. It acts on Gy as an identity 
operator. (i1) Let K(x, y)dy be the kernel func- 
tion of a microlocal operator X defined near 
(cas J-i €) 0). Then its adjoint operator %* 
is, by definition, the microlocal operator de- 
fined near (Xo, eye čo 00) with the kernel 
function K(y, x)dy. The operation * is a sheaf 
isomorphism between Zy and Z, where a 
denotes the antipodal mapping (— Section E). 


(2) Microdifferential Operators. A micro- 
differential operator is an analog of a micro- 
local operator in the complex domain. By a 
procedure similar to that used to define the 
sheaf of microfunctions we first define the 
sheaf yj, of holomorphic microfunctions for 
a submanifold Y of X (— [22, definition 1.1.7 
on p. 319], where it is denoted by yx). It 
follows from the definition that 6,7, is sup- 
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ported by P*¥(Y x X), which is identified with 

Y x, P*X. Here and in what follows P*X, 

etc., denotes the cotangential projective bundle 
of X, etc. Then the sheaf & of microdifferen- 
tial operators (of infinite order) is, by defini- 
tion, Cixox Qo, Qx, where Qy denotes the 
sheaf of holomorphic dim X-forms. 

Remark. Several notations are used to de- 
note éy in the literature. For example, [22] 
uses the symbol 2 and calls it the sheaf of 
pseudodifferential operators. As in the case of 
microfunctions, some authors use the symbol 
&éy¥ to denote a sheaf on T*X. In this case 
Ex |x is, by definition, 2%, the sheaf of linear 
differential operators (of infinite order). One 
should be careful in these notational confu- 
sions in referring to papers using microdif- 
ferential operators. We note also that the sym- 
bols 2 and £ have nothing to do with the 
symbols in distribution theory. 

We now list the basic properties of micro- 
differential operators. 

(i) When X is a complexification of a real 
analytic manifold M, £% | /—js«y is a subring 
of Zu. 

(ii) Let Q be an open subset of P*X. Using 
a local coordinate system (x) on X, we define 
Ô by {(x, eC" x (C"— {0})| (x, €o0) EQ}. Let 
{ p(x, )}iez be a sequence of holomorphic 
functions on Q satisfying the following condi- 
tions: (1) p,(x, č) is homogeneous of degree j 
in ¢. (2) For each £>0 and each compact sub- 
set K of Q, there exists a constant C, x such 
that sup, |px, č)| < C; xe?/j! (J20) holds. 

(3) For each compact subset K of Ô, there 
exists a constant Rx such that supx|p,(x, &| < 
RgÝ(—j)!(j <0) holds. Then there is a one-to- 
one correspondence between the space of such 
sequences and the space of sections of & over 
Q. 

(iii) A sequence satisfying the conditions 
in (ii) is called a symbol sequence, and the 
corresponding section of & is denoted by 
Ljez P(x, D,). If we define a subsheaf &,(m) of 
by by {P= Lip, DEF p(x, )=0 (j> 
m+ 1)}, it is independent of the choice of the 
local coordinate systems. A microdifferential 
operator belonging to &,(m) is said to be of 
order (at most) m. We denote |), 6 (m) by 
éx and call a section of £y a microdifferential 
operator of finite order. 

(iv) Let ®,(z) denote I'(A)/(—z)*, where 


its branch is chosen so that ®,(—1)=T (4). 


When A=0, —1, —2,..., we consider its *finite 
part. Let Q be a complex neighborhood of 
(X9,4/—1 Egco)e,/=1 S*R"= Rx, /—1 S, 
Using a symbol sequence {p,(z, €)} on Q, we 
define a multivalued holomorphic function 
K(z,w,¢) by 2; p,(z,0)®,4(¢2—w, 0) and 
consider its boundary value from the domain 
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Re<z—w,f><0. We denote the resulting 
microfunction by 


YP 9/1 OO, y/-1 (Kx. 
+,/—10)). 


Then 


K(x, y) =n) | 45 ilo VZT HDX, 
Ja o-oo 


is a well-defined microfunction in a neighbor- 


hood of (Xo. Xy3</—1(€o, — čo) 0) whose sup- 
port is contained in the antidiagonal set A’ = 
f(x, y3,/—1(E,mooye./—1 S*(M x M)|x=y, 
E+n=0}. Here w(€) is the volume element 

of the (n—1)-dimensional sphere S"-*. Hence 
K(x, y)dy defines a microlocal operator. 

The mapping which associates K(x, y)dy with 
{ p,(x, €)} is compatible with the inclusion 
mapping stated in (1). 

(v) By using the tplane wave decomposition 
of the 6-function due to F. John (— 125 Distri- 
butions and Hyperfunctions CC), we find that 
microdifferential operators are a natural gen- 
eralization of tlinear differential operators: A 
linear differential operator corresponds to a 
symbol sequence {p,(x, č)};>o, where p; is a 
polynomial with respect to ¢. 

(vi) (a) Let P=X,p,(x, D,) and Q = 
È; d(x, D,) be microdifferential operators. 
Then their composition R =P oQ is a micro- 
differential operator with the symbol sequence 
{iiez given by 


n= Z Dip, Digs. 
1=jfk- |a| &: 

Here D? = d/Ges: ... 0&2" and a! =a,!...a,! for 
the tmulti-index «=(a,,...,%,). (b) Let P 
=) ,p;(x, D,) be a microdifferential operator. 
Let 6(x — y) denote the residue class [1] of the 
left Ex x x Module Ey x x/(2k=1 Ex x x(%% Yi) 
+E i Exx x(0/0x, + 6/dy,)). “Module” means 
sheaf of modules.) Then there exists a unique 
microdifferential operator R = }Ł,n(y, D,) such 
that P(x, D,)d(x— y)= R(y, D,)ô(x— y). Fur- 
thermore, r(x, č) is given by 

(—1) 


a! 


Dz Dž p;(x, €). 


l=j- lal 


R is called the adjoint operator of P and is 
denoted by P*. When X is a complexification 
of the real manifold M, it coincides with the 
adjoint operator P* € Ly. 

(vii) For a microdifferential operator P in 
&,(m), we define its principal symbol o,,(P) by 
Pm(X, é). The principal symbol o,,(P) is in- 
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dependent of the choice of local coordinate 
system. It gives an isomorphism between 
&y(m)/Ey(m—1) and ©. y(m), the sheaf of 
holomorphic functions on T*X which are 
homogeneous of degree m with respect to č. 
(viii) (a) Let P be in &y(m),,,, <,). Assume that 
d,,(P)(Xq, č) #0. Then its inverse P~? (i.e., 
PP '=P™'P=1)exists in 6y(—m),,,_ ¿oy (b) 
Let P and G be in 6y(m), é) Exo. čo TE- 
spectively. Suppose that H} ¢¢)(Gm(P)) (Xo; So) = 
0(j=0,...,p—1) and that HÈ (g)(Gm(P)) (Xo; čo) 
#0. (Here H,(g) is, by definition, the *Poisson 
bracket { f,g} of f and g (— 82 Contact Trans- 
formations)). Then for each S in &y (.,. 2) We 
can find Q and R in éx xé) so that S=QP + 
p 


R with (ad G} RY2G, [G, ..., [G, R], ...]=0 
holds. This result is usually referred to as the 
Späth-type division theorem (for microdifferen- 
tial operators). In particular, when G =x, 

and (Xo, čo) = (0; 1,0, ...,0), R has the form 
EPZ R(x, D') DE. Here R®(x, D')= R(x, Dy, 
...,D,_,). AS a corollary to this expression 

we find the following (Weierstrass-type) pre- 
paration theorem (for microdifferential oper- 
ators): Let P be as above, and let G=x,. Then 
we can find Q and W in 6, (o:1,0,...,0) Such 
that P=QW with invertible Q and W= D} + 
Ei W(x, D’))D*, where W™ belongs to 
&(p—k) and o, ,(W®)(0; 1,0, ...,0)=0. 

(ix) Quantized contact transformation. (a) Let 
X be an n-dimensional complex manifold and 
Q an open subset of P*X. Let P;(j=1,2,...,n) 
be in &(1) (Q) and Q; (j=1,...,n) in &(0)(Q). 
Assume that [P,, P,]=[0;,0,]=0 and [P;, Q,] 
= 6, hold (1 <j,k <n). Let @ be the contact 
transformation from Q to P*C" defined by 
Pr (Go(Qi)(P), ---, Go(Qn)(P), 71 (Pi )(P), ---> 
o,(P,)(p)). Then there exists a unique C-algebra 
homomorphism ®:@ | &—>&x|g such that 
O(x;)=Q; and O(D)= P; (j= 1, ...,n). Further- 
more, ® is an isomorphism, Dé¢.(m) = &(m) 
holds, and a,,(®(R))=6,,(R) o @ holds for R in 
&cu(m). We call the pair (9, ®) a quantized con- 
tact transformation. In the above situation, 
the x x c»-Module 


A =b 05, Ex x cr(X; — Q) 
j= 


j=1 


+3 b.0(-D,-P)) 


is a simple holonomic system (— Section H) 
whose support is the graph of o°. Let u be the 
canonical generator of Æ, i.e., the residue class 
of 1 in M. Then R*u=@(R)u holds for RE c. 
(b) Conversely, let ọ be a contact transfor- 
mation from a neighborhood of p in P*X to 
P*C", and let u be a generator of a simple 
holonomic system whose support is the graph 
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of o°. Then a C-algebra isomorphism ®: 

o` écr >é is defined in a neighborhood of p 
through R*u=@(R)u (RE éc), and (p, ®) 
becomes a quantized contact transformation. 
(c) In particular, let p be a point in Jai S*M 
and Q its complex neighborhood. Let (p, ©) be 
a real quantized contact transformation de- 
fined on Q (i.e., @ maps gat S*M to Ji 
S*R”). Then, in a neighborhood of (p, o(p)*)e 
yA S*(M x R”), there exists a microfunction 
solution K(y, x) 40 of the equations x,K(y, x)= 
O;K(y,x), —(8/0x) K (y, x)= PK (y, x) (j= 
1,... 7) (xe R”, ye M). Such a microfunction is 
unique up to constant eae The integral 
operator X :v(x)+> | K(y, x)v(x)dx gives rise 
to a sheaf isomorphism between p! €p and 
@y in a neighborhood of p. Furthermore, we 
have #(Rv)=(R) (4%) for ve Cr and Rec. 
This .# is the counterpart of the Fourier inte- 
gral operator (— Section C). 

(x) Algebraic properties of é% and £x. (a) éy 
is ‘coherent as a left &,-Module, and its stalk is 
a tleft Noetherian ring. (b) &Y is *faithfully flat 
over &y. (c) &, is tflat over & (0). (d) &, is flat 
over n` Dy. 


G. Microdifferential Equations [22,24] 

(1) Background. A system æ of microdifferen- 
tial equations (of finite order) is by definition a 
‘coherent left (or right) 6,-Module, i.e., there 
exists locally an exact sequence of the form 

k > Ey M0. For a coherent 6-Module 
M, the support Supp æ of M is an impor- 
tant geometric object associated with æ. It is 
called the characteristic variety of Æ. For 

a coherent Y-Module M, its characteristic 
variety is by definition Supp(é @,,.@). It is 
often denoted by S.S.. Æ. Since a microdifferen- 
tial operator is a microlocal operator, the 
result in (viii) (a) of Section F (3) asserts that 
&at}(M, Cy) is supported by the characteristic 
variety of M intersected with gat S*M. 
Now, it is known that V=Supp./@ is tinvolu- 
tory (=involutive, in involution) in P*X, 
namely, f|y=g|,=O entails { f/g}|,=0 [22, 
theorem 5.3.2 on p. 453]. One of the most 
important problems in microlocal analysis is 
to study how much information V can give 
concerning the structure of M itself, and hence 
that of &-¢i(.M, Cu). The epoch-making dis- 
covery of [22] is that V determines the struc- 
ture of £” Qy M at generic points of V as 
follows. 


(2) Structure Theorems. The fundamental 
result of [22, theorem 5.3.7 on p. 455] is the 
following theorem for coherent &-Modules: 
Structure theorem 1. Let .@ be a coherent 
&y-Module satisfying the following conditions: 
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(1) ath, (M, &)=0 for j#d. (2) V€ Supp“ 
is regular at pe V in the sense that V is non- 
singular near p and that w|,(p)40 for the 
‘canonical 1-form w. Then, through a quan- 
tized contact transformation (p, ©), £” Q M 
is isomorphic to a direct summand of a direct 
sum of finite copies of partial de Rham sys- 
tem Ng = 66 /(}$-, E&0/0z;) with o(p)= 

(0; 0, ...,0, be P*C". 

By studying the canonical form of V under 
real contact transformations, [22] further 
gives the following structure theorem in a real 
domain, i.e., in y- S*M for a real analytic 
manifold M. 

Structure theorem 2. Let X be a complexifi- 
cation of a real analytic manifold M, and let M 
be as in structure theorem 1. Let p be a point 
in VN aal S*M. Suppose that the following 
three conditions are satisfied: (3) VN V is regu- 
lar at p. (4) (VNA T,(V) = (VN V) holds for 
each q in VN V. Here V denotes the complex 
conjugate of V (with respect to gt S*M) 
and T,(V), etc., denotes the tangent space of V, 
etc., at q. (5) The generalized Levi form of V is 
of constant fsignature (a, b) near p, where the 
generalized Levi form of V is, by definition, the 


Hermitian form 
X,4/ —1 ¢) 0/0, 


for the p; such that V=();p; '(0). Then £” @ 
M is isomorphic to a direct summand of 

a direct sum of finite copies of the system 

é” Qg M considered in a neighborhood of 


(x,./-19=(0,./—-1(0,...,0, 1))e./—1 S*R", 


where WV is given by 


L(v)= d {Pi Px} ( 


A A 
O o 
(. T 1 Xrt2s+1 )f=0 
OX, +2541 Xn 


CG 0 
Saf =) Xr+2s+i Fx 
OXr+2s+1 


Here, r=2codim V —codim(VN V) and s= 
codim(VN V)—codim V— (a+ b). 

The first (resp. second) type of equation in 
the above are called (partial) de Rham equa- 
tions (resp. (partial) Cauchy-Riemann equa- 
tions). The third and the fourth are called 
Lewy-Mizohata equations (of type (a, b)) after 
these authors’ pioneering works [25, 26]. Thus 
any system is seen to be microlocally isomor- 
phic to a mixture of these three types of equa- 
tions, generically speaking. As a corollary to 


a+1,... a+b). 
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this, structure theorem 2 clarifies the structure 
of microfunction solutions of .@ as follows: 
Structure theorem 3. Let M, X, M, V, 
and p be as in structure theorem 2. Then 
Ext, (EY O M, Gy) =0(j #a) holds, and the 
remaining cohomology group F = 624%: (6x © 
M, Gr) has the following structure in a neigh- 
borhood U of p: There exists an s-dimen- 
sional complex manifold Y, a real analytic 
manifold N, and a tsmooth mapping @ from 
VOUN/=1 S*M to Yx,/—1 S*N such 
that ¥ = 'G for a sheaf Y on Yx./-1: 
S*N that is a direct summand of #%, with # 
being the solution sheaf of the partial Cauchy- 
Riemann equations associated with Y. 


H. Holonomic Systems 


A coherent (left) 6-Module .@ is called holo- 
nomic if Supp Æ is Lagrangian. A coherent 
(left) Z-Module Æ is called holonomic if 

E Q pM is so. Even though the term “holo- 
nomic” is currently used, another term, “maxi- 
mally overdetermined,” is used to describe the 
same object in some of the literature, including 
[22]. The importance of such a system lies in 
the fact that the space of its microfunction 
solutions is finite-dimensional [27, 29]. In this 
sense it resembles an ordinary differential 
equation. A holonomic system, however, does 
not satisfy condition (2) of structure theorem 1 
of Section G, and its structure ts rather com- 
plicated. A result which corresponds to struc- 
ture theorem 1 in Section G is the following: 
Let V be an involutory submanifold of T*X, 
and let & be the subring of éy generated by 
{Pe&,(1)|o,(P)], =0}. Then a coherent £y- 
Module æ defined on an open subset Q of 
T*X is said to have regular singularities along 
V if for any point p of Q, there exist a neigh- 
borhood U of p and an 6,-sub-Module Mo 

of M defined on U which is coherent over 
6(0), and which generates “æ as an éy-Module. 
A holonomic system is said to have R.S., 
which is an abbreviation for regular singular- 
ities, if it has R.S. along its support. Then for 
an arbitrary holonomic &-Module M we can 
find a holonomic &-Module .@,,, with R.S. 
such that &* Qg Mreg %8” Qg M holds. See 
[28] for the proof of this striking result and 
related topics on holonomic systems with reg- 
ular singularities. 

An elementary class of holonomic systems is 
that of simple holonomic systems. A holonomic 
&-Module M is called simple if there exists a 
left Ideal .4 such that .“=6/¥ and that the 
symbol Ideal {o(P)|Pe4} coincides with the 
defining Ideal of Supp.@. Let u denote the 
generator 1 mod.¥ of a simple holonomic 
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system &/.%. Suppose that A = Supp M is 
nonsingular. Then the principal symbol o,(u) is 
defined as follows: For P= %jp,(x, D,)eé(m), 
let L, denote the first-order linear differential 
operator 


ke i 
L= Hna Pamit T] 


2% x, 


Here, H, denotes the Hamiltonian vector field 
defined by {Pm y- Let Q, and Qy denote the 
sheaf of n-forms of A and X, respectively, and 
let Q2! (resp. Q812 @ QE") denote a line 
bundle L such that L®? is isomorphic to Qy 
(resp. Q, ®@ Q71). Since the line bundles QE"? 
and Q$! @ OF '? do not exist globally in 
general, all the equations among their sections 
should be understood up to a constant multi- 
plicative factor. When A is a purely imaginary 
Lagrangian submanifold of ft S*M for 

a real analytic manifold M, these line bun- 
dles can be constructed globally by using the 
Keller-Maslov index (— Section C). Let v 

and wy denote respectively the first term and 
the second term in the right-hand side of the 
above definition of Lp. Then £p:Q21? 3 

Q12 is given by Lp(s)=(1/2s)L,(s2) + Ws for 
seQ®? where L,(s?) denotes the tLie deriva- 
tive of s? along v. One can then prove that the 
system of equations Lps =0 (Pe.#) admits 
locally one and exactly one nonzero solution 
s in Q©!” up to a constant factor. Then the 
principal symbol o,(u) of u equals s ® dxe 
Q812 @Q2-"? by definition. The principal 
symbol c, (u) is homogeneous with respect to 
č, and its homogeneous degree is called the 
order of u and is denoted by ord,(u). The 
microlocal structure of a simple holonomic 
&-Module with a nonsingular characteristic 
variety is determined by the order of its gen- 
erator as follows: (a) Let éu and &v denote 
simple holonomic &-Modules with the same 
characteristic variety A. Then &u is isomorphic 
to &v if and only if ord,(u)—ord,(v) is an 
integer. (b) Let &u be a simple holonomic 
system, and let x denote the order of u. Then 
through a suitable quantized contact trans- 
formation, &u is isomorphic to gw, where w 
satisfies 


Calar. 0 
i ry i 


ew 


—=0 (j=2,...,n). 
02; 

Thus the microlocal structure of a simple 
holonomic system M = &u is fairly simple at 
nonsingular points of its characteristic variety. 
Moreover a Hartogs-type theorem for micro- 
differential equations [28, ch. I, §2] entails 
that if Supp. has the form A, UA, with 
Lagrangian manifolds A, and A, such that 
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codim, (A, MA,)>2; then æ has the form 

M, ® M, with supp.4@=A, (j= 1,2). Hence 
the case where codim, (A, A,)=1 is impor- 
tant. Suppose A, and A, are nonsingular in this 
case and T,A, #T,A, at a point xeA, NA. 
Then, through a quantized contact transfor- 
mation (w, ®), the system M is isomorphic to 
cW, with w satisfying the following equations 
defined near (0; dz, œ): 


ôw 

—= =3,. 

5 (j=3,...,7), 

where a= ord, (u) (j= 1,2) with f(A,)= 
{z,=0,,=...=&,=0} and w(A,)={z,= 
z,=0,€,=...=€,=0}. For more general cases 


and an application — [30]. 

The theory of holonomic systems and its 
applications are being studied most inten- 
sively, raising the hope of establishing a uni- 
fied theory of special functions of several 
variables; — [23] and references cited therein. 


I. History 


Microlocal analysis means local analysis on 
the cotangent bundle. It emphasizes the im- 
portance of localization in cotangent bun- 
dles in analysis, which was pointed out by S. 
Mizohata [5,6] immediately after the advent 
of singular integral operators in the works of 
A. P. Calderon and A. Zygmund [3, 4]. Since 
then, localization in the cotangent bundle has 
been used frequently in the theory of linear 
partial differential equations. R. T. Seeley [31] 
proved that the symbol of a singular integral 
operator is well defined on the cotangent 
bundle. Works by J. J. Kohn and L. Nirenberg 
[32] and L. H6rmander (Comm. Pure Appl. 
Math., 18 (1965)) strengthened the trend of 
localizing the problem on the cotangent bun- 
dle. Although it seems that the term “micro- 
local analysis” first appeared in the literature 
in 1973 (T. Kawai, Astérisque, 2 and 3 (1973)), 
the basic part of the theory had been con- 
structed during the period from 1969 to 1972 
by M. Sato (Proc. Intern. Conf. Functional 
Anal. and Related Topics, 1969), Yu. V. Egorov 
[12], Hormander [8, 20], J. J. Duistermaat 
and Hormander (Acta Math., 128 (1972)), 

M. Kashiwara and Kawai (Proc. Japan Acad., 
46 (1970)), and Sato, Kawai, and Kashiwara 
[22]. Apparently the work of V. P. Maslov 
[11] had an important influence on the work 
of Egorov. The most important part of Sato’s 
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contribution was the fact that, through the 
construction of the sheaf of microfunctions he 
found that the singularities of hyperfunctions 
can be canonically dispersed over the cotan- 
gent bundle (— Section E) and that a hyper- 
function solution u of a linear differential 
equation Pu =0 is concentrated on the charac- 
teristic variety when its singularities are thus 
dispersed (— Section G). The last-stated fact 
was also formulated by Hormander [8, 20] in 
the framework of distribution theory. The 
most important part of the contribution of 
Egorov [12] was the discovery that one can 
use an integral transformation introduced by 
V. I. Eskin [7] to find a transformation of 
pseudodifferential operators compatible with a 
homogeneous canonical transformation, i.e., a 
contact transformation, so that the commut- 
ation relations and the orders of the operators 
can be preserved. H6rmander (Acta Math., 

121 (1969)) independently introduced integral 
transformations of the same type, calling them 
Fourier integral operators. Egorov [13] and L. 
Nirenberg and F. Treves [33] successfully used 
the transformation of operators to study the 
regularity and existence of solutions. Sub- 
sequently, H6rmander [8] elaborated the 
theory of Fourier integral operators. Kashi- 
wara and Kawai (Proc. Japan Acad., 46 (1970)) 
observed that a pseudodifferential operator in 
their sense (now called a microdifferential 
operator; — Section F) gives rise to a sheaf 
homomorphism on the sheaf of microfunctions 
and that the structure of the microfunction 
solutions of pseudodifferential equations is 
determined by the principal symbol of the 
operator in question if it has simple character- 
istics. Then Sato, Kawai, and Kashiwara [22] 
succeeded in amalgamating these two theories, 
namely, the theory of microfunctions and the 
theory of the transformation of operators. 
(They called the transformation a quantized 
contact transformation in [22]). Such an amal- 
gamation was also done independently by 
H6érmander [8, 20], who introduced the notion 
of the wave front set for distributions as a 
substitute for the support of microfunctions. 
Incidentally, it is noteworthy that C. Chandler 
and H. P. Stapp (J. Math. Phys., 10 (1969)) and 
D. Iagolnitzer and Stapp (Comm. Math. Phys., 
14 (1969)) obtained a notion similar to the 
singularity spectrum in a physical context. 
Their results were later elaborated (around 
1971-1973) by J. Bros and Iagolnitzer [19]. 
With the aid of the above-mentioned amal- 
gamation of the theories, the works of Sato, 
Kawai, and Kashiwara [22], Hérmander [20], 
Duistermaat and Hormander (Acta Math., 121 
(1969)), Kawai (Publ. Res. Inst. Math. Sci., 7 
(1971-1972)) and K. G. Andersson (Trans. 
Amer. Math. Soc., 177 (1973)) have clarified the 
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importance of the bicharacteristic strip, which 
is a submanifold of a cotangent bundle, not a 
base manifold, as a carrier of singularities of 
solutions of pseudodifferential equations (— 
Section G); and the works of Sato, Kawai, and 
Kashiwara [22], Sato (Actes Congr. Internat. 
Math. Nice, 1971) and Kawai (Publ. Res. Inst. 
Math. Sci., 7 (1971—1972)) revealed the hidden 
mechanism of the celebrated counterexample 
of H. Lewy [25]. Among these, the contri- 
bution of Sato, Kawai, and Kashiwara [22] 
was most decisive and fundamental in that it 
first clarified the structure of a general system 
of pseudodifferential equations at generic 
points of the characteristic variety and then 
derived from it the above-quoted results on 
the structure of the solutions (— Section G). 

Microlocal analysis has now become one of 
the most important and basic concepts in the 
theory of linear partial differential equations 
and theoretical physics. For recent develop- 
ments — Hormander [34] and V. Guillemin, 
Kashiwara, and Kawai [23] and references 
cited therein. 
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A. General Remarks 


An immersion f of an m-dimensional manifold 
M with boundary ôM (possibly empty) into a 
Riemannian manifold N is called minimal if 
the tmean curvature vector field H of M with 
respect to the induced Riemannian metric 
vanishes identically. Then M is called a mini- 
mal submanifold of N. This definition comes 
from the following variational problem: By a 
smooth tvariation of f is meant a smooth map- 
ping F:I x M>N, where I =(—1, 1), such that 
each f,=F(t,-): M —N is an immersion, fy = f, 
and f;|OM=f/|0M for all tel. Let dV, be the 
volume element of the metric induced by f,, 
and set V(t)= fu dV, the volume of M at time 
t. Then the first variation of the volume is 


expressed as 
ee 
t=0 


0 
=—mM H, ne 
1=0 M ot 


where d/ét denotes the canonical vector field 
along the J factor in J x M. Thus the mean 
curvature vector field H of f vanishes identi- 
cally if and only if dV/dt|,..,=0 for all vari- 
ations of f. Therefore a minimal submanifold 
gives an extremal of the volume integral, 
though neither necessarily minimal nor of the 
least volume. 

In the case N =R”, an immersion x: M >R” 
is viewed as a vector-valued function, and the 
mean curvature vector field H is expressed as 
H =Ax/m, where A denotes the tLaplace- 
Beltrami operator —gyy. Thus x is minimal 
if and only if each component function of x is 
harmonic. In particular, there is no compact 
minimal submanifold without boundary in R”. 


dV 
dt 
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The history of the theory of minimal sub- 
manifolds goes back to J. L. Lagrange, who 
studied minimal surfaces in the 3-dimensional 
Euclidean space R?. In 1762 he developed 
his algorithm for the tcalculus of variations, 
which can be applied to higher-dimensional 
problems and ts now known as the tEuler- 
Lagrange equation. For instance, let D be a 
domain in the plane R? and z= f(x, y), (x, y)€ 
D, the equation of a surface in R?. As a neces- 
sary condition for the surface to have the least 
area among the surfaces with fixed boundary, 
Lagrange obtained a ‘quasilinear elliptic par- 
tial differential equation of the second order, 
called the minimal surface equation: 


(1 t22)Z ye — 224 ZyZyy + (1 +z2)Zyy =0. 


Before this, in 1744, L. Euler had found that a 
tcatenotd is a minimal surface. In 1766, J. B. 
M. C. Meusnier showed that a right thelicoid 
is a minimal surface. Besides catenoids and 
helicoids, in 1834, H. F. Scherk found that the 
surface defined by z=log(cos y)—log(cos x) is 
a minimal surface, which ts called Scherk’s 
surface. 

In the latter half of the 19th century, *Pla- 
teau’s problem (— Section C; 334 Plateau’s 
Problem) was studied extensively by O. Bon- 
net, B. Riemann, K. Weierstrass, A. Enneper, 
G. Darboux, and others. The problem is stated 
as follows: Given a Jordan curve F in R? (or 
in R”), find a surface of least area having I 
as its boundary. On the other hand, in 1866, 
Weierstrass gave a general formula, called 
the Weierstrass-Enneper formula (— Section B 
(5)), to express a simply connected minimal 
surface in terms of a complex analytic function 
and a meromorphic function with certain 
properties. The formula allows one to con- 
struct a great variety of minimal surfaces by 
choosing those functions. 

The existence of a minimal surface of disk 
type having a prescribed boundary curve was 
first obtained in 1930 by J. Douglas and T. 
Rado independently as a solution to Plateau’s 
problem, admitting singularities. The result 
was improved by R. Courant for the case of 
finitely many boundary curves by the method 
of fDirichlet integrals. The method was carried 
out further by C. B. Morrey for the gener- 
alized Plateau’s problem in a Riemannian 
manifold (— Section C (5)). Indeed, the Euclid- 
ean space was replaced by any complete Rie- 
mannian manifold which is metrically well 
behaved at infinity. For example, any compact 
or any thomogeneous Riemannian manifold ts 
in this class. 

The existence proof of minimal surfaces 
cannot in general be applied directly to the 
case of higher-dimensional minimal submani- 
folds. The notion of varifolds (— Section G (2)) 
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and the corresponding generalization of a 
minimal submanifold, called a minimal variety 
(— Section G (2)), were then introduced by F. 
Almgren, who proved the existence of a mini- 
mal variety in a compact Riemannian mani- 
fold. It was also proved that a minimal variety 
is approximated by regular minimal submani- 
folds. This work has been developed as geo- 
metric measure theory by E. R. Reifenberg, 
W. H. Fleming, H. Federer, F. Almgren, and 
others (— Section G). 

The study of minimal surfaces given as the 
graph of a real-valued function of two vari- 
ables leads to that of solutions of the minimal 
surface equation. In 1915, S. Bernshtein proved 
that a minimal surface z = f(x, y) defined on 
the entire plane R? must be a plane. Sub- 
sequently, this was generalized to the following 
problem: Is a minimal hypersurface x,,,,; = 
f(X1,..-,X,) in R”*! defined on the entire space 
R” a hyperplane? The answer turns out to be 
affirmative for n<7 and negative for n>8 
(— Section F (1)). 

In general, when a bounded domain D in R” 
and a continuous function ¢ on its boundary 
OD are given, the problem of finding a minimal 
hypersurface M defined by the graph of a real- 
valued function f on D with f|@D=@ gives 
rise to a typical fDirichlet problem. The basic 
questions are those of existence, uniqueness, 
and regularity of solutions. These were first 
studied by Rado for n=2 and later by L. Bers, 
R. Finn, H. Jenkins, J. Serrin, R. Osserman, 
and others (— Section D (1)). 

Minimal submanifolds of a sphere have 
interesting properties; some are analogous to 
those in Euclidean space but some are not. 
Among them, J. Simons, in 1968, gave a dif- 
ferential equation that the norm of the second 
fundamental form of a minimal submanifold 
in a sphere should satisfy. He then showed 
that a totally geodesic submanifold is tsolated 
among the compact minimal submanifolds in 
the sphere. S. S. Chern, M. P. do Carmo, S. 
Kobayashi, N. Ejiri, T. Ito, T. Otsuki, N. Wal- 
lach, and others have made further contri- 
butions to this subject (— Section F (2)). 


B. Minimal Surfaces 


(1) Branched minimal surfaces. A minimal 
surface M in R” is an immersed surface with 
vanishing mean curvature vector field. M 
equipped with an atlas of tisothermal coor- 
dinates is viewed as a *Riemann surface. A 
branch point of a tharmonic mapping f:M > 
R” is a point at which the differential df, is 
zero. A harmonic mapping f:M >R" ofa 
Riemann surface M is called a branched (or 
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generalized) minimal immersion if it is con- 
formal except at the branch points, and the 
image {(M) is then called a branched minimal 
surface. The solution to Plateau’s problem 
given by Douglas and Rado is a branched 
minimal surface. 

(2) Maximum principle. When n= 3, the 
folowing maximum principle for minimal 
surfaces holds: If M, and M, are two con- 
nected branched minimal surfaces in R? such 
that for a point pe M, N M,, the surface M, 
locally lies on one side of M, near p, then M, 
and M, coincide near p. 

(3) Convex hull property. In general, every 
branched minimal surface with boundary in R” 
lies in the convex hull of its boundary curve, 
the smallest closed convex set containing the 
boundary. 

(4) Reflection principle. If the boundary 
curve of a branched minimal surface contains 
a straight line y, then the surface can be ana- 
lytically continued as a branched minimal 
surface by reflection across y. Based on this 
principle, the following holds: Let T be an 
analytic Jordan curve in R” and f:M >R" a 
branched minimal immersion with boundary 
F. Then f is analytic up to the boundary, i.e., 
f(M) is contained in the interior of a larger 
branched minimal surface (H. Lewy). The 
smooth version of this theorem was given by 
S. Hildebrandt: If f: M-+R" is a branched 
minimal immersion with smooth boundary 
curve, then f is smooth up to the boundary. 

(5) Weierstrass-Enneper formula. Every 
simply connected minimal surface in R° is 
represented in the form 


x)= Re | hodh Cy k=1,2,3, 
0 


where $, = f(1 —9°)/2, @2=./—1 fll +g°)/2, 
$3 = fg, and c, is a constant. Here, g(f) is a 
meromorphic function on a domain D in the 
complex ¢-plane, and f(¢) is an analytic func- 
tion on D with the property that at each point 
¢, where g(¢) has a pole of order m, f(¢) has a 
zero of order 2m, D being either the unit disk 
or the entire plane. This formula is quite use- 
ful for constructing various minimal surfaces. 
For instance, on setting f=1 and g(f)={, 
Enneper’s surface is obtained: 


3 3 


u v 
(X1 X2:X3)= (u= +u’, aa vu*,u? —v), 


(u,v)ER?. 


Another feature of the formula is that general 
theorems about minimal surfaces can be ob- 
tained by translating statements about analy- 
tic functions into the corresponding minimal- 
surface ones. For example, a surface f: M >R? 
is minimal if and only if the Gauss mapping 
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G:M-S? is antiholomorphic, i.e., the complex 
conjugate of the mapping is holomorphic, 
when S? is viewed as the Riemann sphere with 
complex structure by the stereographic projec- 
tion (from the south pole) onto the complex 
plane. As a consequence, a complete minimal 
surface in R° is a plane or else the normals to 
the surface are everywhere dense in S? (R. 
Osserman). 

(6) Stability. A minimal submanifold M is 
called stable if for every compact region on M 
all the second variations of the volume are 
positive. For instance, if M—R" is a minimal 
hypersurface and fy is a variation vector field, 
v being the unit normal vector field of M and f 
a smooth function with compact support on 
M, then the second variation formula for the 
volume V(t) is given by 


V 
dt? 


-| (Vfl? -l|A[?f7)dvo, 
t=0 M 


where | A| denotes the norm of the second 
fundamental form of M. In particular, when 
n= 3, then —|A|?/2=K, the Gauss curvature 
of M. Therefore a minimal surface M in R°? is 
stable if and only if 


| (IVf +2Kf*)dV>0 


for any smooth function f with compact sup- 
port on M. It follows that the stability of mini- 
mal surface M in R? is related to the bound- 
ary value problem of a linear elliptic operator 
L=A—2K, A being the Laplacian on M. 
Namely, a minimal surface M in R? is stable if 
and only if the first eigenvalue 4,(D) of L on 
any bounded domain D in M is nonnegative. 

In connection with the tGauss mapping, a 
sufficient condition for a domain D in a mini- 
mal surface to be stable was obtained by H. 
Schwarz in 1885: If a minimal surface M in R? 
has one-to-one Gauss mapping G: M >S?, 
then a domain DCM such that G(D) is con- 
tained in a hemisphere of S? is stable. This was 
generalized by J. L. Barbosa and M. do Carmo 
as follows: If the area A(G(D)) of the spherical 
image G(D) is less than 2z, then D is stable. 
This result is sharp in the sense that for any 
&>0 there exists an unstable domain D with 
A(G(D))=2z2+6. As an analog to Bernshtein’s 
theorem, the following holds: A complete and 
stable minimal surface in R? is a plane. H. 
Mori has also obtained results in this regard. 

As for the existence of unstable minimal 
surfaces, it is known that if there are two dis- 
tinct stable minimal surfaces with the same 
smooth boundary curve T in R?, then there 
exists an unstable branched minimal surface 
with T as its boundary (M. Morse and C. 
Tompkins, M. Schiffman). 
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C. Plateau’s Problem (— 334 Plateau’s 
Problem) 


Let T be a trectifiable ‘Jordan curve in R” and 
D={(x, y)eR?|x?+ y? <1}. Then there exists 
a continuous mapping f: D—>R” such that (a) 
f{|éeD maps homeomorphically onto T; (b) 
f{|D is harmonic and almost conformal, i.e., 
(fe f>=0 and | f,|=|f,] in D with |df|>0 
except at isolated branch points; (c) the in- 
duced area of f is the least among the class 

of piecewise smooth surfaces bounding I" with 
(a). This mapping f is called the classical solu- 
tion or the Douglas-Rad6 solution to Plateau’s 
problem for T, which may have singularities 
called tbranch points. The resulting surface S 
is a branched minimal disk. This theorem 
establishes the existence of a surface of least 
area among all surfaces homeomorphic to a 
disk. It has been generalized by R. Courant 
for T consisting of finitely many rectifiable 
Jordan curves. 

For a branched minimal disk S bounded by 
smooth F in R°, there is a relation, called the 
Gauss-Bonnet-Sasaki-Nitsche formula, among 
the ttotal curvature x(I) of T, the total curva- 
ture of S, and the orders of branch points: 


1 1 
1 —| M, +— dAx—«k(T), 
+¥(m,-)+¥ rts | IKI zD 


where K denotes the Gauss curvature of S, 
m,— 1 the orders of the interior branch points, 
and 2M, the orders of the boundary branch 
points, which must be even. 

(1) Regularity. A minimal disk of least area 
in R? has no boundary branch points when I" 
is real analytic (R. Gulliver and F. Leslie) or 
when T is-smooth and the total curvature «(T) 
is less than 4x (J. C. C. Nitsche). In general, a 
classical solution for smooth T in R? cannot 
have infinitely many branch points. A re- 
markable fact is that every classical solution 
to Plateau’s problem in R? is free of branch 
points in its interior, i.e., is a regular immer- 
sion (R. Osserman). 

(2) Embeddedness. A classical solution is 
not necessarily an embedding; it may have 
self-intersections. For instance, if I is knot- 
ted in R>, then every solution must have self- 
intersections. It is known, however, that im- 
mersed minimal disks of least area in R? which 
can self-intersect only in their interiors are 
embeddings. In particular, if I is an extremal 
Jordan curve, i.e., if IT lies on the boundary of 
its tconvex hull, then the classical solution for 
T is an embedding (F. Tomi and A. J. Tromba; 
W. H. Meeks and S. T. Yau). If the topological 
type is not specified, then there always exists 
an embedded minimal surface. That is, if T is 
the union of any finite collection of disjoint 
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smooth Jordan curves in R3, then there exists 
a compact embedded minimal surface with 
boundary F which is smooth up to the bound- 
ary (R. Hardt and L. Simon). However, there 
exists an unknotted Jordan curve that never 
bounds an embedded minimal disk. 

(3) Uniqueness. In general, the classical 
solutions to Plateau’s problem do not have the 
uniqueness property. Rado was the first who 
gave a condition on a boundary curve to 
guarantee the uniqueness of the minimal disk. 
Namely, if a Jordan curve T in R” admits a 
one-to-one orthogonal projection onto a con- 
vex curve in a plane R? in R”, then the classical 
solution to Plateau’s problem for T is free of 
branch points and can be expressed as the 
graph over this plane. When n=3, the solution 
is unique. Another geometric condition on I 
has been given by Nitsche: An analytic Jordan 
curve I in R? with total curvature k(T)< 
4r (or a smooth T with k(T)<4r) bounds a 
unique immersed minimal disk. Moreover, 
the generic uniqueness holds: In the space of 
of all smooth Jordan curves in R” with suitable 
topology, there exists an open and dense sub- 
set Z such that for any I in & there exists 
a unique area-minimizing minimal disk (F. 
Morgan and A. J. Tromba). 

(4) Finiteness. As for the finiteness of the 
classical solutions to Plateau’s problem, 
several conditions on boundary curves are 
known. An analytic Jordan curve in R? 
bounds only finitely many minimal disks of 
least area (F. Tomi). An analytic fextremal 
Jordan curve in R? bounds only finitely many 
minimal disks with relative minima of area (H. 
Beeson). A smooth Jordan curve I’ with total 
curvature k(I)< 6r bounds only finitely many 
minimal disks (Nitsche). Moreover, generically, 
i.e., for an open and dense subset of boundary 
curves, there are at most finitely many minimal 
surfaces with given boundary, relative minima 
or not (R. Böhme and Tromba). 

(5) Generalized Plateau’s problem. C. B. 
Morrey’s setting of the generalized Plateau’s 
probiem is as follows: A homotopically trivial 
rectifiable Jordan curve T is given in an n- 
dimensional Riemannian manifold N. Let D 
denote a disk in R*. Find a mapping f:D>N 
such that (a) f (0D maps homeomorphically 
onto T, (b) the induced area of f is the least 
among the class of piecewise smooth surfaces 
in N bounded by T with (a). Obviously when 
N =R", this is the classical Plateau’s problem. 
A solution was given by Morrey under the 
assumption that N is homogeneously regular; 
i.e., that there exist 0<k<K such that for 
any point ye N there is a local coordinate 
system (V, ®) around y for which ®(V)= 
{xeER"|||xl| <1} and the Riemannian metric 
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È gap dx“ dx” satisfies 
KI OY ÈE gyro SKE 0Y 


for any x and (v,,...,v,)€R”. Any compact or 
any homogeneous Riemannian manifold is 
homogeneously regular. Morrey’s solution ts 
as follows: If N is a homogeneously regular 
Riemannian manifold and if I is a homotopt- 
cally trivial rectifiable Jordan curve in N, then 
there exists a branched minimal immersion /: 
D—>N with least area bounded by I with (a). 
The regularity of Morrey’s solution is simi- 
lar to the classical one. If I is smooth, then 
so is the solution f up to the boundary. If, 
furthermore, dim N = 3, then the solution f 
is an immersion in its interior. If N and T are 
real analytic and dim N =3, then the solution f 
is an immersion up to the boundary. 


D. Existence Problems of Minimal 
Submanifolds 


(1) The Dirichlet problem for the minimal 
surface equation. When the graph of a (vector- 
valued) function is a minima! submanifold 

in some Euclidean space, the function must 
satisfy the so-called minimal surface equation. 
To be precise, let D be a (bounded) domain in 
R”, f: D>R* a (vector-valued) function, and 
put F:D>R"**: F(x)=(x, f(x)) for xe D. Let M 
be the graph of f: M = F (D). Then M is mini- 
mal if and only if f (or F) satisfies one of the 
following equations: 


ifn=2 and k=1, 
Ui ie AE aE a G (1) 
if nis arbitrary and k=1, 
n 0/1 of 

—|—— J= h WH /1I+|Vf/?; 
saz) 0 where +|Vf\°; 


(2) 
if n=2 and k is arbitrary, 
(L+H) S ex 2h hey tA HAI Gy = 3) 


and if n and k are arbitrary, 


nf OF aF 
J r -=0 h = ST 
2 "ado mare ee Ox! Ax?” 


i,j=1 


(4) 


and (g) is the inverse matrix of (g;;). 


The basic problem for this class of equations 
is the tDirichlet problem. For n=2 and k=1, 
the following theorem of Rado and Finn is 
fundamental: There exists a solution f of the 
Dirichlet problem corresponding to an arbi- 
trary continuous function o on the boundary 
OD of D if and only if D is convex. Since the 
difference of any two solutions of eq. (1) satis- 
fies the maximum principle, there can be at 
most one solution f of the Dirichlet problem 
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corresponding to a given continuous function 
g on OD. As for the removability of isolated 
singularities, the following is known: Every 
solution of eq. (1) in0<x*+y? <1 extends 
continuously to the origin. The extended func- 
tion ts smooth at the origin and satisfies eq. (1) 
in the full disk x? + y? <1 (Finn). 

For arbitrary n and k=1, namely, for the 
case of minimal hypersurfaces, the following is 
known: Let D be a bounded domain in R” with 
smooth boundary. The Dirichlet problem for 
eq. (2) has a solution for any continuous func- 
tion g on ôD if and only if the mean curvature 
of ôD with respect to the inner normal is non- 
negative at every point. As for the uniqueness 
and regularity, the same results as above hold 
(Jenkins and Serrin). 

For n=2 and arbitrary k, namely, for the 
case of minimal surfaces in R***, exactly the 
same result of existence as in the classical case 
k=1 holds. A solution exists for an arbitrary 
continuous vector-valued function on ôD if 
and only if D is convex (Osserman). Though 
the removability of singularities holds under 
suitable restrictions, the uniqueness fails in this 
case. 

For the last case, n >2 and k>1, there are 
essentially no results on either existence or 
uniqueness for the Dirichlet problem. 

(2) Existence of minimal surfaces and top- 
ology. For a compact Riemannian manifold 
N, there are results on the existence of minimal 
surfaces related to the homotopy groups of 
N. Let f be a continuous mapping from a 
*Riemann surface 2, of tgenus g into N. If the 
tinduced mapping f,:2,(2,)->7,(N) of f on 
the tfundamental groups is injective, then there 
exists a branched minimal immersion h:2,—>N 
such thath, =f, on 2,(2,) and the induced 
area of h is the least among all mappings with 
the same action on z,(2,). If furthermore 
m>(N)=0, then A can be deformed from f 
continuously (R. Schoen and S. T. Yau). If 
m(N) #0, then there exists a generating set for 
T(N) consisting of branched minimal immer- 
sions of spheres that minimize energy and area 
in their homotopy classes (J. Sacks and K. 
Uhlenbeck). If, furthermore, dim N = 3, then 
the above minimal immersion h in the homo- 
topy class is an embedding or a two-to-one 
covering mapping. In the second case, its 
image is an embedded real projective plane. If 
h' is another such least-area mapping, then 
either h’ is equal to h up to a conformal re- 
parametrization or the images of h and h’ are 
disjoint (Meeks and Yau). 


E. Gauss Mapping of Minimal Surfaces 


On a connected, orientable surface M there 
exist local tsothermal coordinates (x, y) at each 
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point. On putting z=x+ at y, the metric 
has the form ds? = 2F|dz|?, and M is viewed as 
a Riemann surface. A conformal immersion 
f:M-—R'" is minimal if and only if Af=0, or 
equivalently 6,¢,f =0. In the case n=3, the 
*Gauss mapping G:M—S? of a surface f: 

M >R? is defined by assigning to each point 
péM the unit normal vector translated paral- 
lel to the origin of R*. A surface f: M >R? is 
minimal if and only if the Gauss mapping is 
antiholomorphic. In the general case, the 
Gauss mapping G is defined to be a mapping 
assigning to each point pe M the oriented 
tangent space f, 7,(M)<R”. Thus G is a map- 
ping from M into the tGrassmann manifold 
M,, »(R) =SO(n)/SO(2) x SO(n —2) of oriented 
planes in R”, which is naturally diffeomorphic 
to the tcomplex quadric Q,,_, in the complex 
projective space CP”~'. An immersion f: 

M >R" is minimal if and only if its Gauss 
mapping ts antiholomorphic. 

Let f: M >R” be a complete orientable 
minimal surface. Let y be the Euler character- 
istic and — nC = fẹ KdA, the total curvature of 
J(M). Assume that the total curvature is finite. 
Then the following results of Chern and Osser- 
man are fundamental: (a) M is conformally a 
compact Riemann surface M with finite num- 
ber, say r, of points deleted; (b) C is an even 
integer and satisfies C >2(r—y)=4g + 4r—4, 
where g is the genus of M (= genus of M); (c) if 
J(M) does not lie in any proper affine subspace 
of R”, then C>4g+rt+n—32>4g+n-—2>n—-2 
(F. Gackstatter); (d) if {(M) is simply connected 
and nondegenerate, i.e., its image under the 
Gauss mapping does not lie in a hyperplane 
of CP" !, then C >2n—2 and this inequality 
is sharp; (e) when n= 3, C is a multiple of 4, 
with the minimum value 4 attained only for 
Enneper’s surface and the catenoid; (f) the 
Gauss mapping G of M extends to a mapping 
of M whose image G(M) is an algebraic curve 
in CP"! lying in Q,,_,; the total curvature of 
{(M) is equal in absolute value to the area of 
G(M), counting multiplicity; (g) G(M) inter- 
sects a fixed number of times, say m (counting 
multiplicity), every hyperplane in CP”! except 
for those hyperplanes containing any of the 
finite number of points of G(M — M); the total 
curvature of f(M) equals — 2am. 

In particular, if a complete minimal surface 
in R” has k tends, then the total curvature 
never exceeds 22(y—k). Enneper’s surface and 
the catenoid are the only two complete mini- 
mal surfaces in R* whose Gauss mapping is 
one-to-one. 

If the Gauss mapping of a complete minimal 
surface of finite total curvature in R? omits 
more than 3 points of $°, then it is a plane 
(Osserman). F. Xavier proved that the Gauss 
mapping of any complete nonflat minimal 
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surface in R? can omit at most 6 points of $°. 
It is an open question whether this can be 

improved to 4 points. The Gauss mapping of 
Scherk’s surface omits exactly 4 points of S?. 


F. Minimal Submanifolds 


(1) Bernshtein’s problem. As was mentioned in 
Section A, a minimal surface in R? represented 
as the graph of a function on the entire plane 
R? is a plane. This is known as Bernshtein’s 
theorem (1915). This is, however, not true in 
R4, namely, there is a minimal surface defined 
as the graph of a vector-valued function on the 
entire plane, which is not a plane. The gen- 
eralized Bernshtein problem is stated as follows: 
Let f:R">R be a function satisfying the mini- 
mal hypersurface equation 


"a/la 
—{ —~— ]=0, 
py A(z) 
where W=,/1+|Vf|?. Is the graph of f an 


affine hyperplane? The answer ts affirma- 

tive for n<7 (E. De Giorgi, F. J. Almgren, J. 
Simons). For n> 8, it is negative (E. Bombieri, 
De Giorgi, E. Giusti). Though the problem 
turns out to be negative for n> 8, no concrete 
counterexample is known, even for n= 8. 

(2) Minimal submanifolds in the sphere. 
Minimal submanifolds in the unit sphere S” 
have some special properties. For example, 
there is no compact minimal submanifold of 
S" contained in an open hemisphere. On the 
other hand, there is no stable minimal closed 
submanifold in S” (B. Lawson, Simons). 

More generally, any closed minimal hypersur- 
face in a Riemannian manifold with positive 
Ricci curvature is unstable (Simons). As for the 
existence of minimal surfaces in a sphere, the 
following theorem of Lawson is general: Any 
closed surface of any genus, except the real 
projective plane, can be minimally immersed 
into the unit sphere S*. As an analog to Bern- 
shtein’s theorem, a sphere $°, which is mini- 
mally immersed in S°, is necessarily totally 
geodesic. On the other hand, by making use 
of higher-order second fundamental forms and 
the relations among them, a complete descrip- 
tion of the minimal immersions of R? into S” 
has been given by K. Kenmotsu. 

As a general theorem which gives a neces- 
sary and sufficient condition for a Riemann- 
ian manifold to be minimally immersed in the 
unit sphere S”, the following is fundamental: 
An isometric immersion x of an m-dimensional 
Riemannian manifold M into S”, viewed as a 
vector-valued function into R""', is minimal if 
and only if Ax =mx (T. Takahashi). As its 
immediate application, any m-dimensional 
compact homogeneous space whose linear 
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isotropy group is irreducible can be minimally 
immersed into the n-sphere of curvature 4/m, 
corresponding to any eigenvalue ( #0) of the 
Laplacian, where n+ 1 is the dimension of the 
eigenspace corresponding to 4 (W. Y. Hsiang, 
Takahashi). 

Since for the symmetric spaces of frank 1 
the eigenvalues and the corresponding eigen- 
spaces of the Laplacian are known, the mini- 
mal immersions of such spaces into the unit 
sphere have been determined as well as the 
trigidity of such minimal immersions (N. Wal- 
lach). For example, if the m-sphere of constant 
curvature c is minimally immersed into the 
unit n-sphere, but not contained in any great 
hypersphere, then for each nonnegative integer 
k,c=m/k({k+m—1)andn+1<(2k+m-—1): 
[(k +m—2)!/k!(m—1)!]. The immersion is rigid 
if and only if m=2 or k<3. Similar results 
have been known for the projective spaces 
over real numbers, complexes, quaternions, or 
Cayley numbers. 

Simons showed that the scalar curvature 
p of acompact minimal submanifold M of 
dimension m in the unit (m + p)-sphere is not 
greater than m(m— 1). Furthermore, if p> 
m(m— 1)—mp/(2p— 1), then either p = m(m— 1) 
or p=m(m—1)—mp/(2p— 1). In the former 
case, M is totally geodesic and therefore is the 
unit sphere S™. In the latter case, M is either 
a hypersurface of the unit (m + 1)-sphere 
which is isometric to the product S*(./k/m) x 


S™-*(, /(m—k)/m) of spheres of radius ,/ k/m 


and ./(m—k)/m, respectively, called the gen- 
eralized Clifford torus or the Veronese surface 
in S*. Here the Clifford torus is the torus 
S'(,/1/2) x S'(/1/2)<S3 c R4, and the Ver- 
onese surface is defined as follows: Let (x, y, z) 
be the natural coordinate system in R? and 
(u!,u?, u”, ut, u°) that in R°. The mapping 
defined by 


u! =yz/,/3, u? =zx/,/3, u? =xy/ 3, 
ut =(x? —y)/2,/3, už =(x? + y? —227)/6 


gives an isometric immersion of s?(,/3) into 
St. Two points (x, y, z) and (— x, — y, — Z) of 

S 2(,/3) are mapped into the same point of $$, 
and thus the mapping defines an embedding of 
the real projective plane RP? into S*. This 
embedded real projective plane in S* is called 
the Veronese surface. 

T. Otsuki proved the following: Let M be a 
complete minimal hypersurface immersed in 
the unit (n + 1)-sphere with two principal cur- 
vatures. If their multiplicities are m and n— 
m> 2, then M is congruent to the generalized 
Clifford torus S”(./m/n) x S"-"(./(n—m)/n)c 
S"+! c R"*?., If one of their multiplicities is 1, 
then M is a hypersurface of S"*! in R"+? = 
R” x R? whose orthogonal projection into R? 
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is a curve of which the tsupport function x(t) 
is a solution of the following nonlinear differ- 
ential equation of the second order: 


nx(1 — x°) (d? x/dt?) + (dx/dt)? + (1 — x?) (nx? — 1) 
=Q. 


Furthermore, there are countably many com- 
pact minimal hypersurfaces immersed but not 
embedded in S"*!. Only S"! (,/(n— 1)/n) x 
S+(/1/n) is minimally embedded in $"*". 
This just corresponds to the trivial solution 
x(t)=1 /./n of the above equation. 

(3) Minimal submanifolds in Riemannian 
manifolds. In general, it is difficult to deter- 
mine all the complete stable minimal sub- 
manifolds in a given Riemannian manifold. If, 
however, some curvature conditions are given 
to the Riemannian manifold, then a classifica- 
tion has been given: Let N be a complete 3- 
dimensional Riemannian manifold with non- 
negative scalar curvature p, and let M bea 
complete, stable, and orientable minimal sur- 
face in N. 

(a) If M is compact, then either M is confor- 
mally equivalent to the Riemann sphere S? 
or else it is a totally geodesic flat torus. Fur- 
thermore, if p >Q, then the latter case never 
occurs. 

(b) If M is not compact, then M is confor- 
mally equivalent to the complex plane C or a 
cylinder (D. Fischer Colbrie; R. Schoen). 

As a generalization of the procedure for 
generating periodic minimal surfaces in R3 
with octahedral or tetrahedral symmetry 
(Schwarz, 1867), T. Nagano and B. Smyth gave 
a construction procedure to generate periodic 
minimal surfaces in R” or n-tori with sym-. 
metry corresponding to any tWeyl group of 
rank n. 


G. Minimal Varieties 


Recent progress in the study of Plateau’s prob- 
lem in higher dimensions is closely related to 
the point of view of geometric measure theory 
and tthe calculus of variations. 

(1) Integral currents. Let T be a tcurrent of 
degree k, or simply a k-current, defined on an 
open set U of R”, and let ôT be the boundary 
of T, the exterior derivative of T. For simplic- 
ity, in the following only currents with com- 
pact support are considered. 

The mass M(T) of a k-current T is the dual 
norm M(T)=sup{T(9)|M()< 1} of the 
comass M(¢ọ) on k-forms, which is defined by 
M(o)=sup{l o)l |xeU} with lol = 
sup { (P(X), 0, A... A0,>|U4,-.., 0, are ortho- 
normal vectors at xe U}. It follows that M(T) 
coincides with k-dimensional volume when T 
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is a k-current defined by integration on a k- 
dimensional simplex. We say that T is normal 
if M(T) and M(@T) are finite. Normal cur- 
rents form a Banach space with norm N(T)= 
M(T)+ M(CT). 

A current T is called rectifiable if it can be 
approximated in the mass norm by currents of 
type fy, where y is a finite polyhedral chain 
with integer coefficients, f:sup py U is a 
*Lipshitz mapping of the support of y into U, 
and f, is the current defined by means of 
F,9(9)=y(F *¢@). If both T and CT are recti- 
fiable, T is called an integral current. Inte- 
gral currents give the appropriate notion of 
generalized manifolds with boundary in the 
study of higher-dimensional Plateau problems. 

One of the fundamental properties of in- 
tegral currents is the compactness, stated as 
follows: Given a compact subset K CU anda 
number c>0, the set of integral currents T 
such that supp Tc K and N(T)<c is sequen- 
tially compact in the tweak topology. Since 
the mass M(T) is flower semicontinuous in 
the weak topology, it follows that Plateau’s 
problem can always be solved in the space of 
integral currents (— Section C; 334 Plateau’s 
Problem). The question arises: To what extent 
is this a satisfactory solution to the problem? 
For codimension 1, it is known from the work 
of Federer and others that an integral (n— 1)- 
current of least mass in R” is nonsingular in 
codimension <7. In particular, in R”, n<7, 
every integral (n— 1)-current of least mass is an 
analytic manifold. In general codimensions, it 
is known that the set of regular points is dense 
(Reifenberg, Almgren, and others). 

The definition of rectiftable and integral 
currents carries over to those on a Riemann- 
ian manifold M in a natural way. Then the 
space of integral currents J,(M) on M with the 
boundary operator ô, for which 6? =0, form 
a chain complex. It is then a fundamental 
theorem, due to Federer and Fleming, in 
homological integration theory that there is a 
natural isomorphism H,(J,(M))= H,(M; Z) of 
the homology of the complex of integral cur- 
rents with the integral singular homology 
groups of M. From this it follows that if M is a 
compact Riemannian manifold M, then each 
class xe H,(,(M))=H,(M; Z) contains an 
integral current of least mass among all in- 
tegral currents in «. 

For a homology class of codimension 1, it 
has also been proved by Almgren that if M is a 
compact Riemannian manifold of dimension 
<7, then for each «e H,_,(M; Z) there exists a 
finite collection of mutually disjoint, compact 
oriented minimal hypersurfaces S,,..., S, em- 
bedded in M and integers n,,...,n, such that 
the integral current i= n;s; represents «. On 
the other hand, it has been recently proved 
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that every compact Riemannian manifold of 
dimension <6 contains a nonempty closed 
embedded minimal hypersurface (J. T. Pitts). 

(2) Varifolds. The theory of integral currents 
developed by Federer, Fleming, and others 
yields reasonable spaces for purposes of the 
calculus of variations. However, it is not en- 
tirely feasible to use these in the study of the 
actual soap films that occur in physical ex- 
periments. It turns out that we should work 
in a more set-theoretic fashion and give up 
the notion of orientability and the boundary 
operator. A convenient theory for describing 
these physical experiments is the theory of 
varifolds developed by Almgren. 

A k-dimensional varifold on a Riemann- 
ian manifold M is a *Radon measure on the 
bundle G,(M) of unoriented tangent k-planes 
of M. For simplicity, only varifolds with com- 
pact support are considered. These are re- 
garded as continuous linear functionals on the 
space C(G,(M)) of continuous functions on 
G,(M). Thus a k-dimensional submanifold S of 
finite volume embedded in M determines a 
varifold Vs by integration: 


n=] I(T, S\dH™(x), fE C(G,(M)), 
S 


where #* denotes the k-dimensional tHaus- 
dorff measure on M. More generally, to 

any rectifiable current there corresponds an 
underlying varifold obtained by neglecting 
orientations. 

Let V,(M) denote the space of k-dimensional 
varifolds on M. Given a varifold Ve V,(M), the 
mass M(V) is defined to be the V-measure of 
the total space, i.e., M(V)=V(1). Let f: M>M 
be a C'-mapping. Then f induces naturally a 
mapping f, of V,(M) into itself. In particular, if 
X is a smooth vector field on M with asso- 
ciated flow f,, then there is a smooth function 
M(t)=M(f,,V). A varifold Ve V,(M) is then 
called a k-dimensional minimal variety in 
M if the first variation (d/dt)M(t)|,-. =0 for 
all smooth vector fields on M. For example, 
minimal submanifolds, complex analytic sub- 
varieties of Kahler manifolds and integral 
currents of least mass are minimal varieties. 

An appropriate notion of rectifiable and 
integral varifolds is defined, and an analog of 
the compactness theorem can be obtained. As 
a consequence, it was proved by Almgren 
using tMorse theory methods that if M is an n- 
dimensional compact Riemannian manifold, 
then for each p, 1 <p<n—1, there exists at 
least one minimal variety of dimension p. As 
for the regularity of minimal varieties, it is 
known that if V is a k-dimensional variety, 
then in the support of V there is a relatively 
open dense subset that is a regular minimal 
submanifold of dimension k (W. K. Allard). 
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H. Maximal Hypersurfaces in Minkowski 
Space 


Let L"*! be a Minkowski space, i.e., L”! = 
{(X1, +++, Xn5t)|(X1,-..,X, ER”, te R} with the 
tLorentzian metric £j- (dx;)* —(dt)*. Let M 
be a *spacelike hypersurface of L”*! so that 
the induced metric on M is Riemannian. If the 
mean curvature vector field H of M, defined 
in the same way as in the Riemannian case, 
vanishes identically, then M is called maximal. 
In contrast to the Riemannian case, by the 
variation in the normal direction of H, the 
volume increases, provided H #0. However, 
the equation describing a maximal hyper- 
surface is similar to that for the Riemannian 
case (— Sections D (1), F (1)). Indeed, let D be 
a domain in R” and f: D-R a smooth func- 
tion. Then the graph of f in L”*? is a maximal 
hypersurface if and only if f satisfies the fol- 
lowing quasilinear partial differential equation 
of the second order: 

n A2 n 
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This is elliptic for |Vf |< 1, since M is space- 
like. E. Calabi proposed a problem similar to 
Bernshtein’s in 1968. Contrary to Bernshtein’s 
case, the answer is affirmative for any n. 
Namely, a maximal hypersurface in L"*' de- 
fined as the graph of a function on the entire 
space R" is a hyperplane in R’*'. More gener- 
ally, a maximal hypersurface, which is a closed 
set in L"*', is a hyperplane (S. Y. Cheng and 
S. T. Yau). It is also known that any maximal 
hypersurface in L"*! is stable. 
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A. Language 


Every mathematical theory has an appro- 
priate language. To determine a language for a 
theory means to determine a language for the 
related mathematical system. Such a language 
consists of the following symbols (the symbols 
given here are examples of only one notational 
system). 

(1) Symbols that express logical concepts 
(flogical symbols): V, 3, 1, A, V, >; 

(2) free variables: dg, @,, az,...5 

(3) tbound variables: xo, X1, X2,.-.3 

(4) symbols that denote individual objects 
(individual constants): Co, C1, C25 ---3Cas -3 

(5) function symbols: fo, fis fa, -es Sass 

(6) tpredicate symbols: Po, Pi, P2,...,P,,...- 

The tcardinalities of the sets of symbols in 
(4), (5), and (6) are arbitrary, except that there 
must be at least one predicate symbol. It is 
assumed that each set of symbols is twell 
ordered. Also, it is understood that to each f; 
in (5) there corresponds a positive integer i,, 
while to each P, there corresponds a nonnega- 
tive integer (these integers are called the num- 
ber of arguments of f; and P,, respectively). 

In practice, other kinds of languages are 
also dealt with. One example is a system with 
infinitely long expressions, which permits 
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*transfinite ordinals for the numbers of argu- 
ments of f, and P;, and which has the extended 
concepts /\,<p, Va<p 3X1». IXq--- for a < B, 
and Vx, Vx,...Vx,... fora<f, where a and fp 
are transfinite ordinals. Another language 
includes variables of higher ‘type as well as Y 
and 3J over those variables. Free and bound 
variables may not be distinguished typograph- 
ically. In that case a variable not bound by V 
or J in a fformula ts called free. (The notion of 
a formula is defined later.) To simplify this 
discussion, however, we restrict ourselves to 
the tfirst-order predicate language with a 
typographic distinction between free and 
bound variables. We also assume that there 
are only a countable number of variables, and 
hence we use natural numbers as subscripts. 

Set L, = {logical symbols}, Lz = {do,4a,, 
Ghee te Dig Nig 3 Matec T ey pee as 
C2, +++ $s Ls = {Jos fis Jz J Le={Po. Pi, 
Preca ba Ly, bee Labais Lez TO 
determine a language L is to specify such a list 
L. Since V, 3, 1, A, v, 2, are normally used 
for L,, do, 41, @>,... for Lz, and Xo, X4, 

X,,... for L3, we may assume that these are 
fixed, and hence to determine a language is to 
determine <L4, Ls, Le>. We take <L, La, L3) 
just described and assume an arbitrary but 
fixed L= L4, Ls, Lg). First we define the 
notions term of L (or L-term) and formula of 
L (or L-formula). 

Definition of the terms of L (L-terms): (1) 
Each free variable a; is an L-term. (2) Each 
individual constant c, of L is an L-term. (3) If f; 
is a function symbol of L, i; is the number of 
arguments of f;, and each of f),..., ti, is an L- 
term, then f,(¢,,..., ti) is also an L-term. (4) 
The L-terms are only those constructed by (1)— 
(3). 

A term that does not contain a free variable 
is called a closed term. 

Definition of the formulas of L (or L- 
formulas}: (1) Let P, be a predicate symbol of L 
and i; be the corresponding natural number. If 
each of t,, nti is an L-term, then Pi{t,,..., ti) 
is an L-formula. This type of formula is called 
a prime formula (or atomic formula). (2) If A 
and B are L-formulas, then each of (A), 

(A) A(B), (A) v (B), and (4)—>(B) is an L- 
formula. (3) Let F be an L-formula and x; be a 
bound variable that does not occur in F. Then 
an expression obtained by putting { ) around 
F, replacing some occurrences in F of a free 
variable, say a,, by x;, and prefixing Vx; or 3x; 
is an L-formula. (4) The L-formulas are only 
those constructed by (1)—(3). 

A formula that has no occurrence of a free 
variable is called a closed formula. The paren- 
theses used in the formation of a formula may 
be omitted if no ambiguity arises thereby. 
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B. Structures 


Let L be a specific language as described in the 
previous section. Then M=[M :p;a;7] defined 
by (1)—(4) below is called a structure for L (or 
L-structure). 

(1) M is a nonempty set. (M ts called the 
universe of Ji.) 

(2) p is a mapping from L, into M. 

(3) Let L5 ={ f;|the number of arguments 
of f is i}. Then L,=L5UL2U...ULSU... 
provides a partition of L;. Let %, be the set of 
all mappings from M' = Mx... x M (i times) 
into M and c; be a mapping from Li, into §;. 
We define o for an arbitrary f of L; by a(f) 
=o, f}, where i is the number of arguments of 
f. Then ø is obviously a mapping from L, into 

i=t Yi 

(4) Decompose Ls into LLU L4...U LU... 
as in (3). Let P, be the set of all subsets of M' 
and 1; be a mapping from Li, into P,, where P, 
is the set {M, Ø} (Ø is the empty set). Then t 
is defined for every i and for an arbitrary P of 
Lg by t(P)=1,(P). 

If we denote p(c) by T, a(f) by f, and t(P) by 
P, then we may understand that p is repre- 
sented by ĉo, C,,..., 6 is represented by fo, 
fi,- and t is represented by Py, P,,.... 
Therefore Mt is normally expressed as 


M=[M : to; Eis -3 fos Jis; Pos Pio -e ]. 


C. Satisfiability 


We fix not only a language L but also a struc- 
ture M for L. Then the property that an L- 
formula is satisfiable is defined by the follow- 
ing procedure: 

Let m, n, ... stand for tsequences of the 
elements of M, say (mM9,11,...), (Ng, My, +++ )s ee 
called M-sequences. We write m =n to indi- 
cate that each entry of m except the ith one ts 
equal to the corresponding entry of n. Using 
these concepts, the value of an L-term at an 
Wi-sequence m, denoted by t[m], is defined as 
follows: 

(1) If t is a free variable a,, then t[m] = m;. 

(2) If t is an individual constant c,, then 
t{m]=C,. 

(3) If tis of the form f(t,,...,t,), then t[m] = 
f(t, Om], ...,¢,[m]). If £ is an L-term, then 
evidently t[m] is an element of M. 

Based on this definition of t[m], the relation 
A is satisfiable by m in Mi, denoted by M, m = 
A, is defined for an arbitrary L-formula A 
and an arbitrary W-sequence m as follows: 

(1) M, mE Pitis st) > tm], ..., tm] > 
eP, 

(2)M@, mE IBM, m HB is false. 
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(3) M, mE BACSM, mE B and MW, 
mEC. 

(4) Mt, mE BVC<M, mE B or W, 
mC. 

(5) M, mE B>C M, mB implies W, 
mEC. 

(6) Mi, m E Yx F(x) >M, n F(a,) for an 
arbitrary n that satisfies m =n, where a; has 
the least index among the free variables that 
do not occur in F(x;). 

(7) Mt, m 3x, F(x;)<> there exists an n such 
that n=m and M, n j F(a,), where a; satisfies 
the same condition as in (6). 

Following are some consequences of this 
definition. 

(1) For an arbitrary L-formula 4 and an 
arbitrary M-sequence m, exactly one of W, 
mA and M, mE 1A holds. 

(2) Let a;,,...,@), include all free variables 
that occur in A, and let m and n be WM- 
sequences for which m; =n;,, ...,mj,=Nj.. 
Then W, mE A and WM, n F A are equivalent. 
So we may write M} A “iy ee | instead 
of M, m E A, for any formula A whose free 
variables are among ajs Gj,s and any WM- 
sequence m. 

(3) If A is a closed formula, then for an 
arbitrary pair of sequences m and n, M, mE A 
and Mi, n} A are equivalent. Therefore, for a 
closed formula A, we may express the state- 
ment “for some (or, equivalently, for all) W- 
sequence m, Jt, mE A holds” by ME A. 

(4) Let a; be an arbitrary variable that does 
not occur in Vx F(x) or 3xF (x). Then W, 

m = VxF (x) is equivalent to W, n H F(a;) for 
an arbitrary n such that n=m. Likewise, W, 
m = AxF(x) is equivalent to the statement 
that there is an n such that n=m and M, 


nE F(a). 


D. Models 


Here again we fix a language L. Let A be a 
closed L-formula and W an L-structure. If 

M H A, then M is called a model of A. 
Furthermore, if T ={40, A1, ---. } is an arbi- 
trary set of closed formulas and W A; for all 
A; in T, then the structure W is called a model 
of Fr. 

(1) Consistency. Consider a logical system 
whose language is L. If there is a model of the 
set of all provable closed formulas of the sys- 
tem, then the system is tconsistent. In partic- 
ular, the *first-ordered predicate calculus is 
consistent. 

(2) Completeness. A logical system is said to 
be complete if every closed formula that is 
satisfied in every structure ts provable in the 
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system. In particular, the first-order predicate 
calculus is complete. 

K. Gédel proved (2). Later L. Henkin gave 
an alternative proof whose essential idea con- 
tributed to proving the following proposi- 
tion: If a set I of closed L-formulas is con- 
sistent, then there is a model of F. Henkin 
also introduced a (nonstandard) second-order 
semantics, relative to which the tsecond- 
order predicate calculus is complete. This 
can be shown by extending Henkin’s tech- 
nique (for the first order) to the second-order 
language. 

(3) Here we extend the language slightly by 
adding the second-order free predicate vari- 
ables af, 05,...,a7,...(n=1,2,...) and the 
second-order bound predicate variables o}, 
~5,---,Q9!,...(n=1,2,...), where n indicates 
the number of arguments of a variable. Other- 
wise the definition of the language is the same 
as for the case of the first-order predicate 
calculus. For simplicity, however, we assume 
that there are no individual constants, function 
symbols, or predicate symbols. 

The structure is defined as follows: Put Dt= 
[M:S,,S>,...,5,,-.. ], where M is a nonempty 
set and S, is a set of subsets of M x... M (n 
times). An M-sequence m is defined as before, 
and s,, denotes (s7, s3, ...,s?,...), where each s? 
is a member of S,. The concept of satisfiability 
is defined as follows: 

M, (M, S15. Sm eee) OF, Xi 

ioeo Mi JES. 

M, (M, S15... Sm e) EVEk Alp) for 

an arbitrary s, for which s ae Sp M, (m,$,,..-, 
Sp --- ) = A (a7), where a; has the smallest index 
among the free predicate variables that do not 
occur in A(@j). 

M, (M, S15- Sm e) = For Alok) > there 
exists an s’, such that s! =s, and M, 
(m,51,...,5,,...) = Al), where g} satisfies 
the same condition as in the previous clause. 

Satisfiability for other cases is defined as for 
first-order predicate language. A structure W 
is called normal if all axioms of the second- 
order predicate calculus are true in W. 

Completeness of the second-order predicate 
calculus: Every closed formula that is satisfi- 
able in all normal structures is provable in the 
second-order predicate calculus. 

(4) Let the cardinality of L, be t, and F be 
an arbitrary set of closed L-formulas. If T 
has a model, then I has a model of cardinal- 
ity max (t, Xo). This follows from Henkin’s 
method. Historically, however, it was first 
proved by Th. Skolem and L. Löwenheim for 
a special case, and was later generalized by A. 
I. Mal’tsev and A. Robinson. 

(5) The following results are all due to A. 
Tarski and R. L. Vaught. 
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Definition 1. Two L-structures W and MN are 
said to be elementarily (arithmetically) equiva- 
lent if for an arbitrary closed L-formula A, 
ME ANE A. 


Definition 2. Let 


M=[M:q0; 41; -5 Gor 19-3 Qo. Qis] 


and 
N=[N iro, Fis... 3 hos his- Ro, Ris... | 


be two structures. Wt is an elementary exten- 
sion of N if the following two conditions are 
satisfied: (i) M >N; q;=r;(j=0, 1,...); the 
restriction of g; to N 1s identical to h; (j= 

0, 1,...); the restriction of Q, to N is identi- 

cal to R; (j=0,1,...). (If this condition holds, 
then I is said to be an extension of Jt.) (ii) For 
an arbitrary L-formula A and an arbitrary R- 
sequence n, if R, nm A then W, nH A. 


Theorem 1. Let M be an extension of N. A 
necessary and sufficient condition for Mt to be 
an elementary extension of St is that for an 
arbitrary L-formula of the form 4xF(x) and 
an arbitrary Jt-sequence n, if Mi, n HE Ax F(x), 
then there is some element n of N such that 
for the Nt-sequence m for which m +n and 
m,=n, W, m | F(a;), where a; is an arbitrary 
free variable that does not occur in F(x). 


Theorem 2. Here we place a condition on L 
that each set of symbols be at most countable 
and arranged tn the w-type (— 312 Ordinal 
Numbers). Let the cardinality of the universe 
M of M be an infinite cardinal a, M’ be a 
subset of M of cardinality c, and b be an in- 
finite cardinal that satisfies c <b <a. Then 
there exists an L-structure N whose universe 
N has cardinality b and such that M’CN and 
Wt is an elementary extension of R. 


Theorem 3. Suppose that L satisfies the same 
condition as in Theorem 2. Let the cardinality 
of the universe M of M be a (a is an infinite 
cardinal) and b be a cardinal for which a<b. 
Then there exists an L-structure ® that is a 
proper elementary extension of I and whose 
universe has cardinality b. 


E. Ultraproducts 


Assume that for a set of L-structures X and a 
set of indices J, there is a mapping 0 from I 
onto 2. If « is a member of J, W is a member 
of Z, and 6(a)=M, then Nt may be denoted 
by WM”. It should be noted that there may be 
more than one x corresponding to the same 
structure. If D is a tmaximal filter of I and WM“ 
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is expressed as 
M =[M*:2%, 0.2% ...: PY... J, 
then [I e; M* is defined by 


[] @*={¢l¢ is a mapping from 


ael 


Tinto |) M*, where o(a)eM*}. 


ael 


For any two elements ọ and w of [I e; M°, 
p 2 W is defined by 


p= W< {a pla) =W(a)}eD. 


Then o2 y is an equivalence relation between 
the elements of [Í e; M". Furthermore, the set 
Tae; M? partitioned by 2 is expressed by 
Iac, M*/D, and each element m of TI e, M*/D 
is expressed by m =[ọ], where ¢ is a repre- 
senting element of m. 

Next we define an operator that produces a 
new structure from 2. Put M = Flaer M*/D. 
For an individual constant c of L, let €c =[ọ], 
where ~(x)=c" for every a. For an n-ary 
function f of L and arbitrary elements m, = 
[¢,],....m,=[9,] of M, define f(m,,..., 
m,) = [LW], where (x) = f7(—, (a), ..., Pa(2)) for 
every x. Foran n-ary predicate P of L, define 
<m,,...,m,>€P by 


<m, ... m, EP 


< {a| <p (a), 15 Q,(%)>€P*} ED. 


According to these definitions, put 
IM=(M:c,...,f,...,P,...] 

and denote it by ĮI,- St*/D, called the ultra- 
product of {WM} <; (with respect to D). M is an 
L-structure. 


Fundamental Theorem of Ultraproducts. Let 
M = TT, Wi7/D be the ultraproduct of 
{Mher M=(mM,,m,...) be an M-sequence, 
o; be a representing element of m;, and A 

be an arbitrary formula. Then It, m = A> 
{al MR, (p; (0), pala), ...) A} ED. 

By using this fundamental theorem we have 
the following result: Suppose that I is a set of 
closed formulas in L such that every finite 
subset of I has a model. Let I be the set of all 
the finite subsets of T. For each «e7 and each 
AET, let 9% be a model of « and A the set of 
all the finite sets in J which contain A as a 
member. Let F={A|AeT}. Since F has the 
finite intersection property, there is a maximal 
filter D such that D > F. Let M be the ultra- 
product [],., IN7/D. Since A is a subset of the 
set {a| M" H A} and A belongs to D, the set 
{a| M" H A} belongs to D, for each Aer. 
Hence we have that M is a model of T, by 
the foregoing fundamental theorem of ultra- 
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products. Therefore, we have the following 
theorem. 


Compactness Theorem. A set F of closed for- 
mulas has a model if and only if every finite 
subset of it has a model. 

In the case where all the structures IN 
coincide with the single structure Jt, the ultra- 
product of {M} er (with respect to D) may be 
written Jt’/D and called the ultrapower of N 
(with respect to D). 

Let 


M= LM; qo: qis- Gos Jiss Qo, Cisx] 


and 
N=[N; Yo, Pisces fos Bice Ro, R,, tat 


be two structures. W and R are said to be 
isomorphic if there is a bijection f from M to 
N such that the following three conditions 
hold: (i) f(qo)=ro, J(q1)=r1, ---- Gi) The se- 
quences go, g;,.-. and ho, h,, ... are of the 
same type and f(g,(a,,...,4,))=h(f(a,),---, 
f(a,)) holds for every n-tuple a,,...,a, in M. 
(iii) The sequences Qo, Q,,... and Ro, Ris.. 
are of the same type and R; ={< f(q,),..., 
Flay)> [Ky 5-54.) EQ}. 

Let j be the function from N to N!/D de- 
fined by j(a)=[,] for each ae N, where Q, is 
the constant function from J to N such that 
~, (x) =a for each «eI. Let M be the substruc- 
ture of N'/D whose universe is the range of j. 
Then j is an isomorphism of N to Mt. In the 
following we identify a and j(a) for each ae N. 
Then K is an elementary substructure of M//D 
by the fundamental theorem of ultraproducts. 

If Mt and N are isomorphic, then M and N 
are elementarily equivalent. By using this fact 
and the fundamental theorem of ultraproducts, 
we have the following result. Let W and R be 
two structures. If there is a nonempty set J and 
a maximal filter D on I such that W'/D and 
M!/D are isomorphic, then Mi and N are 
elementarily equivalent. H. J. Keisler proved 
the converse of this proposition by using the 
G.C.H. (generalized continuum hypothesis), 
and later S. Shelah proved it without the 
G.C.H. Keisler-Shelah isomorphism theorem: 
Let M and N be two structures. Then WM and 
N are elementarily equivalent if and only if 
there is a nonempty set J and a maximal filter 
D on I such that M’/D and N'/D are 
isomorphic. 

The ultraproduct operation has various 
applications in number theory, algebraic geo- 
metry, and analysis. Here we give an example 
due to J. Ax and S. Kochen. Let P be the set of 
prime numbers. Let Q, and Z,((t)) be the field 
of p-adic numbers and the field of formal 
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power series over Z,={0,1,...,p—1} for each 
p in P, respectively. Ax-Kochen isomorphism 
theorem: Suppose that D is a nonprincipal 
maximal filter on P. Then JJ,,-pQ,/D and 
I1,<pZ,((t))/D are isomorphic. 

As an immediate consequence of this theo- 
rem, we have the following partial solution of 
Artin’s conjecture on Diophantine equations. 
Theorem: For each positive integer d, there 
exists a finite set Y of primes such that every 
homogeneous polynomial f(t,,...,t,) of degree 
d over Q,, with n>d?, has a nontrivial zero 
in Q, for every p¢ Y (— 118 Diophantine 
Equations). 

We give another example in nonstandard 
analysis. A. Robinson developed the general 
theory of nonstandard analysis in [10]. Here 
we explain a theorem due to A. R. Bernstein. 
Let X be a nonempty set and U(X) be the 
smallest transitive set (i.e., aeb and be U(X) 
imply ae U(X)) which has X as a member and 
is closed under the following operations: pair- 
ing, union, power set, and subset operation 
(i.e, ace U(X) and bca imply be U(X)). Let L 
be the first-order predicate logic with equality 
whose set of nonlogical constants consists of a 
binary predicate symbol e and individual 
constant symbols c, for ae U(X) (— 411 Sym- 
bolic Logic F). Then the first-order structure 
MN =[U(X); a(ae U(X)); E] is an L-structure, 
where E is the e relation on the set U(X). Let 
M=[U(X)!/D:a(ae U(X)); E'/D] be the 
ultrapower 93t//D of N with respect to a non- 
principal ultrafilter D on a set J. For each 
ae U(X), let a* be the set of all elements [ọ] in 
U(X)'/D such that fie1|g(i)ea}eD. Then a 
is a proper subset of a* if a is infinite. Since N 
and M are elementarily equivalent, these two 
sets a and a* have common first-order prop- 
erties in the following sense: for each formula 
(ap) in L, 


(vbea) (NE P[]) (vb Ea*) (ME P]. 


From this it follows that ifr is a relation on a 
set a in Mt, then r* is a relation on the set a*; 
and if f is a mapping from a to b in Ñ, then f* 
is a mapping from a* to b*. Hence a* is a 
mathematical object which greatly resembles 
a. By using this type of resemblance between a 
and a* we have the following result. 

Let H be a Hilbert space over the complex 
number field C such that dim(H)=q and let T 
be a bounded linear operator on H. Let X = 
HUC and consider the first-order structure 
M as above. Since R (the set of all real num- 
bers) and N (the set of all natural numbers) are 
infinite sets which belong to U(X), R* and N* 
have elements which do not belong to R and 
N, respectively. Such elements are called non- 
standard real numbers and nonstandard natural 
numbers, respectively. By the fundamental 
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theorem of ultraproducts we can conclude that 
there are many nonstandard real numbers « 
such that 0 <*a < *a in R* for any aeR. Such 
a nonstandard real number « is called an 
infinitesimal real number. Since the norm 
operator || || is a mapping from H to R, 

|| ||* is a mapping from H* to R*. If || x||* 
(xe H*) is infinitesimal, then x is said to be 
infinitesimal in H*. Let S be the set of all linear 
subspaces of H. For a linear subspace K of H* 
that is contained in S* let K” be the set of 
elements xe H such that x — x, is infinitesimal 
for some x, in K. Then K° is a closed linear 
subspace of H. Let e= {e;};en be an ortho- 
normal basis of the Hilbert space H; e can be 
considered as a mapping from N to H, and 
hence e* = {e;} jen: is a mapping from N* to 
H*. For each je N*, let H, be the linear sub- 
space of H* spanned by {e,|k <j}. For a given 
bounded linear operator T on H, T* isa 
linear operator on H*. We define T;=P,T* P, 
where P, is the projection from H* to H,. Since 
dim(H,)=j is a (nonstandard) natural number, 
there exists a tower Joge Jie... cJ,=H, of 
closed, T-invariant linear subspaces of H; such 
that dim(J,;)=k(k <j). Then J,;° is a closed, 
T-invariant linear subspace of H. If there is a 
polynomial p(x) such that p(T) is a compact 
operator, then we get a nonstandard natural 
number j such that ie is a proper subspace 
of H for some k <j. This gives the follow- 

ing result, which is an affirmative solution of 
a problem of K. Smith and P. R. Halmos. 
Theorem (Bernstein [4]): Let T be a bounded 
linear operator on an infinite-dimensional 
Hilbert space H over the complex numbers 
and let p(x) # 0 be a polynomial with complex 
coefficients such that p(T) is compact. Then T 
leaves invariant at least one closed linear 
subspace of H other than H or {0}. 


F. Categoricity in Powers 


Let T be a set of closed formulas in a first- 
order language L which has a designated 
binary predicate symbol P,. In the following, 
we assume that the interpretation P, of P) by 
Mi is the equality relation on the universe of 
Mi for every L-structure Mt. T is said to 

be categorical if all the models of F are iso- 
morphic. By Theorem 3 in Section D, any I 
having a model of infinite cardinality is not 
categorical. Hence, there exists no interesting 
I which is categorical. Therefore, we consider 
the weaker notion of categoricity in powers. 
Let x be an infinite cardinal and n(I’, x) be the 
number of nonisomorphic models of T of 
cardinality x. Then I is said to be categorical 
in x if n(I, x)= 1, i.e., if all the models of T of 
cardinality x are isomorphic. There exist many 
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interesting I’’s which are categorical in «x for 
some x. For example, the set of axioms of 
algebraically closed fields of characteristic 0 is 
categorical in %,, and the set of axioms of 
dense linear orderings without endpoints is 
categorical in No. With respect to this notion, 
J. Los conjectured that if I ts categorical in x 
for some «> L (the cardinality of L), then T is 
categorical in x for all «> L. This conjecture 
was solved affirmatively by M. Morley in the 
case L=Np, and later by S. Shelah in the 
general case. Theorem (1): Let I’ be a set of 
closed formulas in L. Then T is categorical in 
k for some x> L if and only if T is categorical 
in x for all x > L. We also have the following 
interesting theorem, due to J. T. Baldwin and 
A. H. Lachlan. Theorem (2): Let I" be a set of 
closed formulas in L such that L=,. If T ts 
categorical in X4, then n(I, No)=1 or No. 

As for n(T, x) there are two famous conjec- 
tures due to R. Vaught and others. A set F of 
closed formulas in L is called complete in L if 
it has a model and, for any closed formula A 
in L, either AeET or Aer. 


Conjecture 1. There is no complete set F of 
closed formulas in L such that No <n(T, Xo) < 
250, 


Conjecture 2. There is no finite set I" of closed 
formulas in L such that n(T, NX) =No and 
nT, &,)=1. 


G. Omitting Type Theorem 


By do-formulas (or a-formulas), we mean for- 
mulas which have no free variables except do. 
Suppose that W is an L-structure and 2 is a 
set of a-formulas in L. Then, W realizes X if 
there is an element m of the universe of WM such 
that M H A [f] for all A in X, and M omits £ 
if Mt does not realize X. For example, if L is a 
first-order language such that L4={c9,¢,}, 
Ls={ fo, fi}, Lo= Po. Pi}, where fo, fi, Po, 

P, are all binary, and % is the L-structure 
[9t;0, 1; +,x; =, <], where 9 is the set of 
natural numbers, and 2 = {P,(n,a)|n=0, |, 
2,...}, where 0=cy, 1=f,(0,c,),...,n+1 = 
Jfa, c,),..., then clearly N omits X. Also, if WM 
is an L-structure, m is an element of the uni- 
verse of WM, and Y,,={A] A is an a-formula 
such that M H A[%,]}, then clearly W realizes 
Xm, Where £m is called the type of m in WM. 
Suppose that T is a set of closed formulas in L. 
Then, by the completeness theorem of L, we 
can easily see that F has a model realizing » if 
and only if T“ X is consistent. On the other 
hand, it is rather difficult to obtain a necessary 
and sufficient condition for I to have a model 
omitting 2. The following is a sufficient con- 
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dition: T is said to locally omit 2 if there is no 
a-formula A such that I”’{ A} is consistent 
and, for any formula B in X, [’{A, 7B} is 
not consistent. 


Theorem. Suppose that T is a consistent set of 
closed formulas in a countable language L, 
and X is a set of ag-formulas in L. If T locally 
omits X, then I has a countable model which 
omits 2. Also, if I has a model of power 
greater than 1; omitting 2 for each €<@,, 
then T has a model omitting Z in each infinite 
power, where 2, is defined by Ip = No, Je41 = 
27, 1, =sup:<g if o is a limit ordinal. 
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A. General Remarks 


In this article, we consider mainly modules 
with operator domain (Section C), in partic- 
ular modules over a fring. Modules over a 
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field are linear spaces (= 256 Linear Spaces). 
Modules over a commutative ring are impor- 
tant in algebraic geometry (— 16 Algebraic 
Varieties, 67 Commutative Rings, 284 Noe- 
therian Rings). The theory of modules over a 
‘group ring can be identified with the theory 
of linear representations of a group (— 362 
Representations). Modules without operator 
domain may be regarded as modules over the 
ring Z of rational integers, and the theory of 
finitely generated Abelian groups can be gen- 
eralized to the theory of modules over a tprin- 
cipal ideal domain (— 52 Categories and 
Functors, 200 Homological Algebra). 


B. Modules 


A module (without operator domain) is a tcom- 
mutative group M whose law of composition 
is written additively: a +b =b +a (a, be M); the 
tidentity element is denoted by 0, and the 
inverse element of a by —a. Every subgroup H 
of M is a normal subgroup. For any ae M, the 
left and right cosets of H containing the ele- 
ment a are identical: H+a=a+H (— 190 
Groups A). 

In the set N™ of all mappings of a set M to 
a module N, we define an addition by the sums 
of values: (f+g)(x)= f(x) +g(x). Then N™ 
forms a module. The set Hom(M, N) of all 
homomorphisms of a module M to a module 
N forms a subgroup of the module N™, called 
the module of homomorphisms of M to N. The 
composite of homomorphisms is a homomor- 
phism. Hence the set Hom(M, M)=&(M) of all 
endomorphisms of M forms a fring with re- 
spect to the addition and the multiplication 
defined by composition; this is called the endo- 
morphism ring of M. The tunity element of 
&(M) is the identity mapping of M, and the 
tinvertible elements of &(M) are the automor- 
phisms of M. 

Let {x,},<, be a family of elements in a 
module M. The sum $e x, is well defined if 
x, =0 (AEA) except for a finite number of 2. 
For any family {N,},<, of subsets of M, Dae, 
N, denotes the set of all elements of the form 
Daca X, (x,€N,), where x, =0 except fora 
finite number of å. If all the N, are subgroups 
of M, then N =$ ea N, is also a subgroup, 
called the sum of {N,},.,. If every element of 
N can be written uniquely in the form ¥,.,.X, 
(x,€N,), N is called the direct sum of {N,}\j2,.- 
When the N, are subgroups, this is equivalent 
to the condition that N,N Za guea Na = {0} 

(ic A). 


C. Modules with Operator Domain 


Suppose that we are given a set A anda 
module M. If with each pair of elements ae A 
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and xe M there is associated a unique element 
axéM satisfying the condition (1) a(x + y)= 
ax +ay (ae A; x, ye M), we say that A is an 
operator domain of M and M is a module with 
operator domain A (module over A or A- 
module) (— 190 Groups E). The mapping 

Ax M-M given by (a, x) ax is called the 
‘operation of A on M. Any ae A induces an 
endomorphism ay:x—ax of M as a module 
(not as an A-module). To give the structure of 
an A-module to a module M amounts to 
giving a mapping A-6(M) (aay). 

If N is a subgroup of an A-module M such 
that axe N for any ae A and xe N, then N 
forms an A-module, called an A-submodule (or 
allowed submodule) of M. If {N,},., is a family 
of A-submodules of an A-module M, then the 
intersection (\,.,N, and the sum X,., N, are 
both A-submodules of M. 

Let R be an fequivalence relation in an A- 
module M such that if ae A and xRy, then 
axRay. Then R is said to be compatible with 
the operation of A. In this case, an operation 
of A is induced on the quotient set M/R. 
Moreover, if R is compatible with the addition, 
namely, xRx’ and yRy’ imply (x + y)R(x'+y’), 
then M/R forms an A-module, called a factor 
A-module of M. The equivalence class N con- 
taining 0 is an A-submodule of M, and M/R 
coincides with M/N. 


D. Modules over a Group or a Ring 


If a (multiplicative) group structure is given to 
the operator domain A of a module M, we 
always assume (in addition to condition (1) 

in Section C) that the following two condi- 
tions are satisfied: (2) (ab) x = a(bx); (3) 1x=x 
(a,be A,xe M). 

If a ring structure is given to A, we always 
assume (besides conditions (1) and (2)) that the 
following condition holds: (4) (a + b)x=ax+bx 
(a,be A,xe M). This means that the mapping 
A>é(M)(a>4áy) is a tring homomorphism. If 
the ring A has unity element 1, and 1,,=iden- 
tity mapping (namely, condition (3) holds), 
then the A-module M is called unitary. We 
consider only unitary A-modules. Any module 
M can be regarded as a Z-module or as an 
&(M)-module. 

When M is a module over a ring A, an 
element of A is called a scalar, A itself is called 
the ring of scalars (basic ring or ground ring), 
and the operation A x M >M is called the 
scalar multiplication. The elements ax (ae A) 
are called scalar multiples of x, and the totality 
of these elements is denoted by Ax. Let {xi} cA 
be a family of elements in M. An element of 
the form &,.,4,X,, where the a, are elements 
of A and equal to 0 except for a finite number 
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of A, is called a linear combination of {xj} j¢,. 
The set N of linear combinations of {x,},., is 
the smallest A-submodule of M containing all 
the x, (AeA) and is equal to the sum ©... AX. 
The A-module N is said to be generated by 

fX heas and {x,},., is called a system of gen- 
erators of N. A module having a finite number 
of generators is said to be finitely generated (of 
finite type or simply finite). The module Ax 
generated by a single element x is called mono- 
mial. If A is a tfield (which may be noncom- 
mutative), an A-module is a linear space over 
A (— 256 Linear Spaces). 

Let a be an element of an A-module M. If 
there exists a nonzero divisor J of A such that 
4a=0, then a is called a torsion element of M. 
We say M is a torsion A-module if every ele- 
ment of M is a torsion element, and M is 
torsion free if M has no torsion element other 
than 0. An element a of M is called divisible if 
for any nonzero divisor Aé A there exists an 
element be M such that a= Ab. M is called a 
divisible A-module if every element of M is 
divisible. 

Strictly speaking, the A-modules we have 
considered so far are called left A-modules. If 
we define the operation of A on M by xa 
(ae A,x€ M) instead of ax and modify con- 
ditions (1)—(4) appropriately (in particular, 
condition (2) becomes x(ab)=(xa)b), then M is 
called a right A-module. If A° is a group or 
ring antt-isomorphic to A, then a left A- 
module can be naturally identified with a right 
A°-module. If A is a commutative group or 
ring, we can disregard the distinction between 
left and right A-modules. 

Let A and B be groups or rings. Sometimes 
we consider an A-module structure and a B- 
module structure simultaneously on the same 
module M. If the operations of A and B com- 
mute with each other, namely, a(bx) = b(ax) 
(ae A, be B,xeM), it is convenient to put one 
of the operations to the right. If M has a left 
A-module structure and a right B-module 
structure, satisfying condition (5) (ax)b=a(xb), 
then M is called an A-B-bimodule. If G is a 
group and K is a commutative ring, the G-K- 
bimodule structure is equivalent to the left 
K[G]}-module structure, where K[G] is the 
tgroup ring. 


E. Operator Homomorphisms 


A homomorphism f:M-—-WN of A-modules M 
and N such that f(ax)=af(x) (ae A,xeM) is 
called an A-homomorphism (operator homo- 
morphism or allowed homomorphism). If A is a 
ring, f is also called an A-linear mapping. 
Regarding A as an A-module, an A-linear 
mapping M >A is called a linear form on M. 
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The composite of A-homomorphisms is an A- 
homomorphism. 

Let f:M >L be an A-homomorphism of A- 
modules. The A-submodule Im f = f(M) of L is 
called the image of f, and the A-submodule 
Ker f ={x|xeM, f(x)=0} of M is called the 
kernel of f. Coim f = M/Ker f is called the co- 
image of f, and Coker f = L/Im f the cokernel 
of f. The binary relation xRy on M defined by 
SO)=f(y) (x, ye M) coincides with the equiva- 
lence relation defined by x— ye N= Kerf 
(x, ye M), and the mapping f induces an A- 
isomorphism f:M/N—>f(M). 

A sequence of A-homomorphisms of A- 
modules M, (neZ) 

2M, aye + M,+,7-- 
is called an exact sequence if Im f,- = Ker f, 
for all n. The A-module {0} is denoted by 0. 
Exactness of 0+N3M or M SL>0 means 
that the mapping f: NM is injective or the 
mapping g: M >L is surjective, respectively. In 
an tinductive (tprojective) system {M,, Jan} of 
A-modules, where every f,,:M,7M,A<upu 
(A>) 1s an A-homomorphism, the limit M = 
lim M, (lim M,) is also an A-module. If O> 
L,>M,—-N,-0 is exact for every 4 and 


L,7M,N, 


i We. g 
Li? M,N, 


is a commutative diagram, then 0-lim L,— 
lim M,—lim N,-0 is also exact. For the 
projective limit, however, we can only state the 
exactness of 0—lim L >lim M,—>lim N,. 

The set of all A- {-homomorphisms of an A- 
module M to an A-module N, denoted by 
Hom,(M, N), is a subgroup of the module 
Hom(M, N), and is called the module of A- 
homomorphisms. The set Hom,(M,M)=6&,(M) 
of all A-endomorphisms of an A-module M 
forms a ‘subring of the ring é6(M) and coin- 
cides with the set of all elements commuting 
with any ay, (ae A). We denote by GL(M) the 
group of all tinvertible elements in &,(M). If 
Ais a commutative ring, Hom,(M, N) can be 
regarded as an A-module by defining (af)(x)= 
af (x), namely, af =a, of. In particular, &,(M) 
is an ‘associative algebra over A. If M is an 
A-B-bimodule, Hom,(M, N) forms a left B- 
module by (bf )(x) = f(xb). If N is an A-B- 
bimodule, Hom,(M, N) forms a right B- 
module by ( fb)(x)= f(x)b. 


F. Direct Products and Direct Sums 


In the Cartesian product P=J],.,M, ofa 
family {M,},-, of A-modules, we define ad- 
dition and an A-operation as follows: {x,} + 
{y,}={x,+y,}, a{x,} ={ax,}. Then P forms 
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an A-module. We call I[,-, M, the direct 
product of modules {M,},.,. The canonical 
projection assigning x, to {x,} is denoted by 
p,:P—>M,. Suppose that an A-module M and 
A-homomorphisms f,:M—M,(Ae€A) are 
given. Then there exists a unique A-homo- 
morphism f: M >P such that p,of =f, (AeA); 
f is given by fix)={ f(x}. 

In the direct product T],-, M3, the set S of 
all elements whose components x, are equal to 
0 except for a finite number of 4 is called the 
direct sum of modules {M,},-, and is denoted 
by Daca M, (or H zea My Or ® iea Ma). The 
canonical injection assigning {...,0,x,,0,...} 
ES to x€ M, is denoted by j,:M,-S. If an 
A-module M and A-homomorphisms f,:M,—> 
M are given, then there exists a unique A- 
homomorphism f:S—M such that foj,=f, 
(AeA), defined by f({x,})= Ljea Ja(x1). When 
M is an A-module and {N,},., is a family of 
A-submodules of M, the A-homomorphism 
S:Ziea Na >M defined by f({xi}) = Diea X1 is 
an A-isomorphism if and only if M is the 
direct sum of {N,}. 

If M,=M for all 2EA, Ilea M, and 
Daca M, are denoted by M^ and M“, respec- 
tively. M^ can be regarded as the set of all 
mappings of A to M. The direct product M, 
x... X M, and direct sum M,@...®M, ofa 
finite number of A-modules M,,...,M,,, can 
be identified with each other and, if M;= M 
(1 <i<n), we simply denote it by M". 


G. Free Modules 


Let A be a ring. A family {x,},., of elements 
in an A-module M is called linearly indepen- 
dent if 1 4.,4,x,=0 (a,€ A) implies a, =0 for 
all AEA. This is equivalent to saying that the 
mapping A“ M that assigns X eaa; x; E M 
to {a,} is injective. A linearly independent 
family {x,},-, generating M is called a basis of 
M. A family {x,},¢, is a basis if and only if 
every element of M can be written uniquely in 
the form 3) -,.4,X, (a,€ A). 

An A-module that has a basis is called a free 
module over A. If A is a field (which may be 
noncommutative), every A-module is a free 
module (— 256 Linear Spaces). The fcardi- 
nality of a basis of a free module M over A 
depends only on M if A is a field (which may 
be noncommutative) or a commutative ring; 
this number is called the rank (or dimension) of 
M. Any submodule of a free module over a 
principal ideal domain is a free module. 


H. Simple Modules and Semisimple Modules 


An A-module M is called simple if M #0 and 
M has no A-submodules except M and 0. 
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If M and N are simple A-modules, any 4- 
homomorphism of M to N is an isomor- 
phism or the zero homomorphism (i.e., one 
which sends every element of M to 0) (Schur’s 
lemma). If an A-module M is the sum of a 
family {M,},-, of simple submodules, M is the 
direct sum of a suitable subfamily {M,.+,..,- 
(A’ <A). In this case, M is called semisimple (or 
completely reducible). 

If an A-module M can be decomposed into 
the direct sum of A-submodules N and N’, 
then N’ is called a complementary submodule 
of N. An A-module M is semisimple if and 
only if every A-submodule of M has a comple- 
mentary submodule. Let A be a ring. Then the 
A-module A is semisimple if and only if every 
A-module is semisimple. In this case A is 
called a tsemisimple ring (— 368 Rings G). 
Every simple module over a semisimple ring A 
is A-isomorphic to a tminimal left ideal of A. 


I. Chain Conditions 


The set of all A-submodules of an A-module 
M forms an fordered set under the inclusion 
relation. An A-module is called a Noetherian 
module if the ordered set satisfies the tmaximal 
condition and an Artinian module if it satisfies 
the tminimal condition (— 311 Ordering C). 

Let N be an A-submodule of an A-module 
M. Then M is Noetherian (Artinian) if and 
only if N and M/N are both Noetherian (Ar- 
tinian). A ring A ts called a tleft Noetherian 
ring (‘left Artinian ring) if A is Noetherian 
(Artinian) as a left A-module, and similarly for 
right Noetherian and Artinian rings. Every 
finitely generated module over a Noetherian 
(Artinian) ring is Noetherian (Artinian). Over 
an arbitrary ring A, a module M is Noetherian 
if and only if every A-submodule of M is fi- 
nitely generated. 

A finite sequence {M;}o<i<, of A-submodules 
of an A-module M is called a tJordan-Hélder 
sequence if M = Mo, M;> M,,,, M,={0}, and 
the M,/M;,, (0<i<r) are simple. If such a 
sequence exists, M is said to be of finite length. 
The number r, called the length of M, depends 
only on M. The quotient modules M;/M,,, 
(0<i<r) are uniquely determined by M up 
to A-isomorphism and permutation of the 
indices (C. Jordan and O. Hélder). An 4- 
module M is of finite length if and only if M is 
Noetherian and Artinian. A semisimple A- 
module is of finite length if and only if it is 
finitely generated. 

An A-module M is called indecomposable if 
M cannot be decomposed into the direct sum 
of two A-submodules different from M and 
{0}. Any A-module of finite length can be 
decomposed into the direct sum of a finite 
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sequence N,,..., N, of indecomposable A- 
submodules different from {0}. The direct 
summands N; (1 <i<n) are unique up to A- 
isomorphism and permutation of the indices 
(W. Krull, R. Remak, and O. Schmidt). 


J. Tensor Products 


Let A be a ring. Given a right A-module M 
and a left A-module N, we construct a module 
M ®,N (called the tensor product of M and N) 
and a canonical mapping ọ:M xN>M Q, N 
as follows. Let F be a free Z-module (free 
Abelian additive group) generated by M x 

N, and R be the subgroup generated by the 
elements of the forms (x + x’, y)—(x, y)— 

(x, y) (x, y +y) —(x, y)— (x, y’), (xa, y) — (x, ay) 
(x,x' eM, y, y' €N, ae A). We define M@,N 

= F/R, and call the natural projection o. If we 
denote (x, y) by x @ y, then we have (x, + 
X2) @V=X, @V+X, @y, xQ (y1 +y) = 

x® yı +x® yz, and (xa) ® y=x @ (ay). Any 
element of M @, N is written in the form 
Xx; @ y; (x€ M, y;EN). 

The tensor product M @,N of M and N 
and the canonical mapping ọ:M x N > 
M ®,N can be characterized as follows: 

For a module L, a mapping f:M x NL is 
called biadditive if the conditions f(x +’, y) 

= f(x, y) +f, y), FOO y +y) = Sfx y) + Sx, y’) 
hold. A biadditive mapping f satisfying the 
condition f(xa, y)= f(x, ay) is called an A- 
balanced mapping. Then we have (i) the canon- 
ical mapping ọ:M x N>M®,N is A- 
balanced; and (it) for any module L and any A- 
balanced mapping f: M x N-L, there exists a 
unique homomorphism f,:M @,N—L such 
that f(x, y)= f(x © y) (xe M, yeN). 

A right (left) A-module can be regarded as a 
left (right) A°-module, where A° is the ring 
anti-isomorphic to A. In this sense, we have 
M@,N=N®@,4M. 

Let A be a commutative ring. For A- 
modules M, N, and L, a mapping f:M x N> 
L is called a bilinear mapping if f is biaddi- 
tive and satisfies f(ax, y)= f(x, ay)=af(x, y) 
(ac A,xeEM, yeEN). The set 2(M, N; L) of 
all bilinear mappings M x N >L forms A- 
submodule of the A-module L“*“. A bilinear 
mapping M x N >A is called a bilinear form 
on M x N. The tensor product M @, N be- 
comes an A-module if we define a(x @ y)= 
(ax) @ y (=x @ (ay)), and the canonical map- 
ping Mx N>M ©, N is bilinear. For any A- 
module L and bilinear mapping f: M x N> 
L, there exists a unique A-linear mapping 
f,:M @,N-L satisfying f(x, y)=f,(x ® y). 
By this correspondence fof,, we get an A- 
isomorphism £(M, N; L) = Hom,(M @,N, L). 
If A is a field, M ©, N coincides with the ten- 
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sor product M @ N as a linear space (— 256 
Linear Spaces H, I). 

In general, let M be a B-A-bimodule and N 
be a left A-module. Then M @, N becomes a 
left B-module if we define b(x © y)=(bx) @ y. 
Let N be an A-B-bimodule and M be a right 
A-module. Then M @, N becomes a right B- 
module if we define (x Q y)b =x ® (yb). In 
particular, we have A@,N2=N,M@,A=M. 

Let M, M’ be right A-modules and N, N’ be 
left A-modules. For A-homomorphisms f: 

M —M' and g: NN’, there exists a unique 
homomorphism h:M @®,N>M’ @,N’ satisfy- 
ing h(x O y)= f(x) © g(y); h is called the tensor 
product of f, g and is denoted by f Q g. We 
give here some simple examples (also — Sec- 
tion L). 

Examples. (1) Let M, N be free modules 
(linear spaces, for example) over a commuta- 
tive ring A. If {x,};., and {y;}j.) are bases 
of M and N, respectively, M ©, N is also a 
free module with a basis {x;@ Ylisen, jes. 

If the dimensions dim M, dim N are finite, 
dim M &, N =dim M dim N. 

(2) For an tideal a of a commutative ring A, 
the tfactor ring M = A/a can be regarded as an 
A-module, and we have M@®,N2N/aN. For 
instance, (Z/mZ) ® 7(Z/nZ) = Z/(m,n)Z, where 
(m,n) denotes the greatest common divisor of 
mand n. 


K. Hom and © 


We continue to consider modules over a ring 
A, Concerning the direct sum and product, we 
have 


Hom, (5 MIT.) = HI Hom,(M,, N,,) 
m sH 


and 


(z m) Oa (5 N,) =F (M,Q4N,). 


Ayu 


Concerning projective and inductive limits we 
have 


Hom, (um M,, lim n,) = lim Hom,(M,, N,) 
and 


(in m) Qa (i N,) =lim(M; ®4N,): 


An A-homomorphism {:M—M’ induces a 
homomorphism Hom,(M’, N)>Hom,(M, N) 
by the assignment g—g o f. An exact sequence 
M'’+M-—M"-W0 gives rise to the exact 
sequence 


0—-Hom,(M”, N)>Hom,(M, N) 
—+Hom,(M’, N). (1) 
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An A-homomorphism f:N—N’ induces a 
homomorphism Hom,(M, N)>Hom,(M, N’) 
by the assignments gf og, and an exact 
sequence 0 N’>N-—-N" gives rise to the exact 
sequence 


0—Hom,(M, N’)->Hom,(M, N) 
—Hom,(M, N”). (2) 


Let M be a right A-module and N’, N, N” 
be left A-modules. An A-homomorphism f: 
N>N' induces the homomorphism ly & f: 
M®,N-M @,N’, and an exact sequence 
N'+N-—-N”-0 gives rise to the exact sequence 


MQN’: >MQ, N>M Q, N">0. (3) 


Exchanging left and right, we obtain similar 
results (— 52 Categories and Functors B; 200 
Homological Algebra). 

Let Q be an A-module. If for any exact 
sequence of A-modules 


0>M'>M->M">0, (4) 
the induced sequence 
0—>Hom,(M”,Q)>Hom,(M, Q) 

>Hom,(M’, Q)>0 (5) 


is exact, then Q is called an injective A-module. 
This is equivalent to the condition that if M’ is 
an A-submodule of an A-module M, then any 
A-homomorphism M’—@Q can be extended to 
an A-homomorphism M >Q. Any A-module is 
an A-submodule of some injective A-module, 
and any injective A-module is a divisible A- 
module. If A is a Dedekind domain, any 
divisible A-module is an injective A-module. 

Let P be an A-module. If for any exact 
sequence (4), the induced sequence 


0—-Hom,(P, M’)> Hom, (P, M) 
—>Hom,(P, M”)>0 (6) 


is exact, then P is called a projective A-module. 
This is equivalent to the condition that for any 
surjective A-homomorphism g: M >M” and 
any A-homomorphism f:P— M”, there exists 
an A-homomorphism h: P>M satisfying goh 
= f. Any A-module is a factor A-module of 
some projective A-module. A projective A- 
module has no torsion element. A free A- 
module is a projective A-module. In general, 
an A-module is a projective A-module if and 
only if it is a direct summand of a free A- 
module. 

Let R be a right A-module, If for any exact 
sequence (4), the induced sequence 


0-R®O,M’>~R®,M-R@®,M’"-0 (7) 


is exact, then R is called a flat A-module. Any 
projective A-module is a flat A-module. A flat 
A-module R is called faithfully flat if R ©, M 
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= {0} implies M = {0}. A flat right A-module R 
is faithfully flat if and only if R# RY for any 
left ideal U (+ A) of A. Let A be a ‘principal 
ideal domain. Then an A-module R is flat if 
and only if R has no torsion element, and R is 
faithfully flat if and only if R has no torsion 
element and R # Rp for any ‘prime element 

p of A. We have the following important 
examples: 

(1) For a commutative ring A and its multi- 
plicatively closed subset S, the tring of quo- 
tients A, is flat as an A-module. However, As 
is not faithfully flat. For instance, the field of 
rational numbers Q is not faithfully flat as a 
Z-module. 

(2) Let A be a fsemilocal ring and A be its 
completion. Then A is faithfully flat as an A- 
module (— 284 Noetherian Rings; also [1,7]). 

In the exact sequence (4), if Im ọ = Ker y isa 
direct summand of the A-module M, we say 
that (4) splits. Then (5), (6), and (7) are exact for 
any A-modules Q, P, R. The exact sequence (4) 
splits if M’ is injective or M” is projective. 

By ,M, M,, and ,Mg,, we mean that M isa 
left A-module, a right A-module, and an A-B- 
bimodule, respectively. As already stated, ¿Mpg 
and ,N imply ,(Hom,(M, N)), and ,M and 
aNg imply (Hom,(M, N))x. Similarly pM, and 
N, imply (Hom,(M, N)),, and M, and pN; 
imply ,(Hom,(M, N)). Furthermore, „M, and 
aN imply (M @,N), and M, and ¿Ng imply 
(M@4N)z. 

With gL, aM, and gN, we have 


Hom,(M,Hom,(L, N))=Hom,(L @, M,N). 
(8) 


Similarly, for ,M,, La, and Nz, we have 


Hom, (L, Hom,(M, N))xHom,(L @, M, N). 
i (8) 


If B is a commutative ring, (8) and (8’) are B- 
isomorphisms. Furthermore, with L4, Mp, 
and N, we have 


(LOM) @gN=L@y(M gN). (9) 


We denote by M* the set Hom,(M, A) of all 
linear forms on an A-module M. Then ,M 
implies M%, and M, implies ,M*; the A- 
module M* is called the dual module of M. A 
as a left A-module is dual to A as a right A- 
module, and vice versa. For a family of A- 
modules {M,},-,, we have a canonical corre- 
spondence ($ ea M,)* =I1,-, M*. From this, 
we have a canonical isomorphism (M*)* = M 
for any finitely generated projective A-module 
M. Many facts concerning this tduality are 
similar to those valid for linear spaces (— 256 
Linear Spaces G). 

Let A be a commutative ring. Letting A= 
B=N in (8) and (8’), we have the canonical A- 
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isomorphisms 
Hom, (M, L*)=Hom,(L, M*) 
=(L®,M)*=(L, M; A), 


namely, any bilinear form on L x M is repre- 
sented by a linear mapping M >L* or L>M*. 


L. Extension and Restriction of a Basic Ring 


Fix a ring homomorphism p:A— B. We regard 
B as a B-A-bimodule by defining a right oper- 
ation of A on B by b-a=bp(a) (ae A, be B). 
This bimodule is denoted by B,. 

For every left A-module M, we construct the 
left B-module p*(M)= B, @, M, which is 
called the scalar extension of M by p. Every 
A-homomorphism of A-modules f: M— 

M' induces the B-homomorphism p*(f)= 
1, ® f:p*(M)—>p*(M’), 

For every left B-module N, we construct the 
left A-module p,(N)=Hom,(B,, N), which is 
called the scalar restriction (or scalar change) 
of N by p. By the assignment h-h(1), we have 
a module isomorphism Hom,(B,, N)=N. We 
identify p,(N) with N under this isomorphism. 
The operation of A on N is then given as a: y 
= p(a)y (ae A, ye N). If A is a subring of B and 
p is the canonical injection, then the opera- 
tion of A on p,(N) is the restriction of the 
operation of B on N. Every B-homomor- 
phism of B-modules f: NN’ induces the A- 
homomorphism p,(/):p,(N)>p,,(N’). For 
any left A-module M and left B-module N, an 
A-linear mapping f:M—>p,(N)=N is called a 
semilinear mapping with respect to p. This 
means that f is an additive homomorphism 
satisfying f(ax)=p(a) f(x) (ae A,xe M). 

The extension and the restriction of a basic 
ring are related by the canonical isomorphism 
Hom,(M, p,(N))&Hom,(p*(M), N) for an A- 
module M and a B-module N (— equation 
(8’)). An element « of the left-hand side and an 
element f of the right-hand side are associated 
by the relation a(x)=f(1 @ x) (xe M)(— 52 
Categories and Functors). 

Let A and B be commutative rings. Then for 
A-modules M and M’, we have the canonical 
B-linear mapping p*:B@,Hom,(M, M’) 
—Hom,(B@,M,B®,M’), which is a B- 
isomorphism if M is a finitely generated 
free (or more generally, projective) A- 
module. Using the notation p*, we have 
p*(Hom,(M, M’))= Hom,(p*(M), p*(M’)). 

We now give some examples where the basic 
rings are noncommutative. Let G be a group 
and H its subgroup. Let p denote the homo- 
morphism of group rings K[H]> K[G] in- 
duced by the canonical injection HG, where 
K is a commutative ring. For any K[H]- 
module M, p*(M) is called the induced module 
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of M. The representation of G associated with 
p*(M) is the tinduced representation of the 
representation of H associated with M. Next, 
we fix a group G and consider a homomor- 
phism p: K[G]—K[G] induced by a homo- 
morphism of commutative rings o: KK. If 
K =K and o is an automorphism, then the 
representation associated with the “scalar ex- 
tension” p*(M) of a K[G]-module M is the 
‘conjugate representation to the representa- 
tion associated with M. If K = K/N (Ql is an 
ideal of K) and a is the canonical projection, 
then the representation over K associated 
with the scalar extension p*(M) of a K[G]- 
module M is the reduction modulo Y of the 
representation over K associated with M, and 
p*(M) is canonically isomorphic to M/2UM. 
Furthermore, the ‘localization and the tcom- 
pletion can also be treated under the formula- 
tion of scalar extension (— 67 Commutative 
Rings G, 284 Noetherian Rings B). 


References 


[1] M. Nagata, Local rings, Interscience, 1962. 
[2] H. P. Cartan and S. Eilenberg, Homolog- 
ical algebra, Princeton Univ. Press, 1956. 

[3] D. G. Northcott, An introduction to homo- 
logical algebra, Cambridge Univ. Press, 1960. 
[4] S. MacLane, Homology, Springer, 1963. 
[5] C. Chevalley, Fundamental concepts of 
algebra, Academic Press, 1956. 

[6] N. Bourbaki, Elements de mathématique, 
Algèbre, ch. 2, Actualités Sci. Ind., 1236b, Her- 
mann, third edition, 1962. 

[7] N. Bourbaki, Eléments de mathématique, 
Algèbre commutative, ch. 1, 2, Actualités Sci. 
Ind., 1290a, Hermann, 1961. 

[8] C. W. Curtis and I. Reiner, Representation 
theory of finite groups and associative alge- 
bras, Interscience, 1962. 

[9] R. Godement, Cours d’algébre, Hermann, 
1963. 

[10] S. Lang, Algebra, Addison-Wesley, 1965. 
[11] S. T. Hu, Elements of modern algebra, 
Holden-Day, 1965. 


278 (XIII.23) 
Monge-Ampère Equations 


A. Monge-Ampère Equations 


A Monge-Ampère differential equation is a 
second-order partial differential equation of 
the form 


Hr+2Ks+Lt+M + N(rt—s?)=0, (1) 
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_where H, K, L, M, and N are functions of x, y, 
z, p, and q, and r, s, t, p, and q represent the 
partial derivatives 


8?z 8?z ĝ?z 

=-5, s= ——, =, 

=x? axdy ay? 
_ oz _ oz 
Pex’? ay 


The characteristic manifolds are integrals of 
a system of differential equations defined as 
follows: 

Case (i) N 40. 


Ndp+Ldx+4,dy=0, 
Ndqt+4,dx+Hdy=0, 
dz—pdx—qdy=0, (2) 
Ndp+Ldx+4,dy=0, 
Ndq+A,dx+Hdy=0, 
dz—pdx—qdy=0, (3) 


where 4, and å, are the two roots of the 
equation 47+2Ki4+HL—MN=0. 
Case (ii) N =0, H <0. 


dy=i,dx, Hdp+H1,dqg+Mdx=0, 
dz—pdx—qdy=0, (4) 
dy=i,dx, Hdp+Hi,dqg+Mdx=0, 
dz~pdx—qdy=0, (5) 


where 4, and å, are the two roots of the 
equation HA? —2Ki+L=0. 
Case (iii) N=0, H=0, L0. 


dx=0, Mdy+2Kdp+4+Ldq=0, 
dz—pdx—qdy=0, (6) 
2K dy—Ldx=0, Mdy+Ldq=0, 

dz— pdx —qdy=0. (7) 


Case (iv) N =0, H =0, L=0. 


dx=0, 2Kdp+Mdy=0, 
dz—pdx—qdy=0, (8) 
dy=0, 2Kdq+Mdx=0, 
dz—pdx—qdy=0. (9) 


A manifold x(A), y(A), z(2), p(A), g(A) that 
satisfies the system (2), (3) of differential equa- 
tions for case (i), (4), (5) for case (ii), (6), (7) for 
case (iii), or (8), (9) for case (iv) is a character- 
istic manifold of equation (1). 

The following result is known concerning 
Monge-Ampere equations: The union of sur- 
face elements of an integral surface of (1) is 
generated in two ways by characteristic mani- 
folds depending on one parameter. Conversely, 
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if a manifold, each element of which is com- 
posed of a point of a surface S and the tangent 
plane at that point, is generated by a family of 
characteristic manifolds depending on one 
parameter, then the surface S is an integral 
surface of (1). 


B. Intermediate Integrals 


If a relation dV (x, y, z, p, q)=0 is a consequence 
of a system of differential equations of charac- 

teristic manifolds, for example, in case (1) when 
V satisfies 


N(éV/dx + poV/dz)— LV/dp—A,6V/éq =O, 
N(6V/dy + qoV/dz)—A,0V/dp—HéeV/dq=0, 


V(x, y, Z, p,q) =c (c an arbitrary constant) is 
called an integral of the system of differential 
equations. (i) If V=c is an integral of a system 
of differential equations of characteristic mani- 
folds, the solution z(x, y) of V =c considered 
anew as a partial differential equation of the 
first order is a solution of (1). Conversely, if 
every solution of V=c (excepting tsingular 
ones) satisfies equation (1), V=c is an integral 
of a system of differential equations of char- 
acteristic manifolds. (1i) If u(x, y, z, p, q) and 
v(x, y, z, p,q) are two integrals of a system of 
differential equations of characteristic mani- 
folds, then for any arbitrary function o, (u, v) 
is also an integral. Thus, as a consequence of 
(i), every solution z(x, y) of @(u, v)=0 is also 

a solution of (1). The converse ts also true, 
namely, for every solution z of (1) we can find 
a function @ such that z(x, y) is also a solution 
of g(u, v)=0. The relation ọ{u, v) =0 is called 
an intermediate integral of (1). Sometimes an 
integral of a system of differential equations of 
characteristic manifolds is also called an inter- 
mediate integral. If each of the two systems of 
differential equations defining the character- 
istic manifolds has an intermediate integral, 
then the two intermediate integrals @(u, v)=0 
and (u,v) =0 form a tcomplete system of 
partial differential equations of the first order. 
Integrating this complete system, the tgeneral 
solution of equation (1) is obtained. 


References 


[1] E. Goursat, Cours d’analyse mathématique 
III, Gauthier-Villars, fourth edition, 1927. 

[2] E. Goursat, Leçons sur l'intégration des 
équations aux dérivées partielles du second 
ordre à deux variables indépendantes I, Her- 
mann, 1896. 


1049 


279 (VIl.16) 
Morse Theory 


A. General Remarks 


We are interested in smooth functions on 

a smooth manifold of dimension m that 

have only the simplest critical points. Here, 
“smooth” means differentiable of class C”. 
Such a function f: MR enables us to investi- 
gate the topology of M. The decomposition of 
M into level sets of f contains a lot of topo- 
logical information on M. For instance, it is 
shown that M is thomotopy equivalent to a 
*CW complex that is determined by critical 
points of f; furthermore the tEuler character- 
istic of M can be computed by means of f, 
because the critical points are related to the 
thomology groups of M. These types of inves- 
tigations, called Morse theory, were originated 
by H. Poincaré [1] and G. D. Birkhoff [2] and 
then developed into the form we see today by 
M. Morse [3]. An excellent exposition of this 
theory has been given by J. Milnor [4]. R. S. 
Palais [5] has extended the theory to Hilbert 
manifolds. Morse theory has been fruitfully 
applied to differential topology and differential 
geometry. 


B. Critical Points of Functions on Manifolds 


For a point pe M, M, is by definition the 
tangent space to M at p. Let f be a smooth 
real-valued function on M. A point pe M is 
called a critical point of f if the induced map- 
ping f,,:!M,—R fip is zero at p. For every flocal 
chart (x,,...,X,,) around a critical point p of f 
with p=(0,...,0), 

of of 


ca )=...=5—-()=0. 


m 


The value f(p) is then called a critical value of 
f If the matrix (3? {/6x,dx,(0)) is invertible, 
then the point is called a nondegenerate critical 
point, and if the matrix is not invertible, then 
the point is said to be degenerate. These no- 
tions are independent of the choice of local 
charts around p. The above matrix is called 
the Hessian of f at p. The nullity and index of 
the Hessian of f at p are called the nullity and 
index of f at p, respectively. The function f has 
a local minimum at a nondegenerate critical 
point of index 0, and a local maximum at a 
nondegenerate critical point of index n. 

A smooth function f on M is called a Morse 
function if it satisfies: (A1) Every critical point 
of f is nondegenerate. The Morse lemma states 
that if p is a nondegenerate critical point of f 
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and if the index of f at p is å, then there exists 
a local chart (y,,...,y,,) around p with p= 
(0, ...,0) such that f is expressed as 


fy) =f(P)-O? —- - (a? + Vans)? +e 


+ (Vm). 


C. The Existence of Morse Functions 


Let M be a compact smooth m-dimensional 
manifold without boundary. The existence of 
Morse functions on M is shown by tembed- 
ding M into R” for a sufficiently large n. For 
convenience, M will be here identified with its 
embedded image. At each point pe M the set 
vp of all unit normals forms an (n—m — 1)- 
dimensional unit sphere, and v(M)=\_) ,em Vp 
is a smooth (n — 1)-dimensional compact 
manifold. Let S be the unit hypersphere in R” 
around the origin. The Gauss normal! mapping 
g:v(M)~S sends each point (p, v(p))ev(M) to 
veS via a canonical parallel translation of R”. 
Since o is smooth, the tSard theorem implies 
that the set E of all critical values of ọ has 
measure zero on S. Thus 9, at every (p, v(p))e 
gy '(v) has maximal rank if v is taken in S— E. 
For every fixed ve S—E let h,: M >R be de- 
fined by h,(x)=<v,x>,xeM, where < , > de- 
notes the canonical inner product of R”. Then 
h, is a Morse function. To see this let dX, dv, 
and dM be volume elements of S, v, and M, re- 
spectively. Then dvAdM is a volume element of 
v(M), and the function G:v(M)>R is obtained 
by the relation p*d = G(x, v(x))dvadM. 

G(x, v(x)) is called the Lipschitz-Killing curva- 
ture at v(x)ev,. If A(v(x)) denotes the second 
fundamental form of M with respect to v(x), 
then G(x, v(x)) =(—1)"det(A(v(x))) (Chern and 
Lashof [6]). Every critical point pe M of h, 
for ve S—E has a normal v(p) to M at p, and 
the Hessian of h, is given by A(v(p)). Thus 
G(p, v(p)) #0 ensures the nondegeneracy at p 
[6]. 

Another approach to the construction of 
Morse functions is to find for a given embed- 
ding of M into R” an open dense set U c R” 
such that the Euclidean distance d, from any 
fixed point x on U to points on M has no 
degenerate critical points on M. In this case 
d.'((—00,a]) is compact for all ae R. It is also 
possible to construct Morse functions on 
compact manifolds with boundary as well as 
on noncompact manifolds. But a striking 
result of A. Phillips states that there exists a 
Morse function on every noncompact mani- 
fold which has no critical points [7]. 

For a pair of smooth manifolds M, N let 
C”(M, N) be the set of all smooth mappings 
from M to N. The weak (or compact-open) C“ 
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topology on C*(M, N) is generated by the sets 
defined as follows. Let feC*(M, N), and let 
(U, ọ) and (V, Y) be charts of M and N, respec- 
tively. Let K c U be a compact set such that 
f(K)cV. For e¢(0, œ), a weak tsubbasic 
neighborhood N*(f;(U, p), (V, Y) K, ejoff 

is defined to be the set of all smooth map- 
pings g¢C*(M, N) such that (1) g(K)€ V, (2) 
\Dipofow')(x)—D(pogow ") l <s, 
xeg(K), k=1,2,..., where D* denotes the kth 
derivative with respect to the coordinates. 
The weak C” topology is thus defined on 
C”(M, N), and the space with this topology is 
denoted by C%(M, N). It follows from the 
construction of the embedding in the Whitney 
embedding theorem that the set of all smooth 
embeddings of compact M into R" is dense in 
C2(M, R") ifn>2dim M. Moreover, the set of 
all Morse functions on a compact M forms 

an open dense set in C?(M, R) (Nagano [8], 
Hirsh [9], Auslander and MacKenzie [10]). 


D. Decomposition of M by a Morse Function 


To decompose M into levels of a Morse func- 
tion we now consider the process of tattaching 
a handle. Let M be a compact manifold with 
*boundary ôM. Let D° be the s-disk, and let 
g:(OD‘) x D™ 8+ 6M be an tembedding. Then a 
tmanifold X(M;g;s) with a handle attached by 
g is defined as the quotient set obtained from 
the disjoint union M U(D* x D” *) by identify- 
ing points in 6D’ x D™~* and their images 
under g and equipped with a natural differenti- 
able structure. Similarly, if g;: (0DF) x DP “*—> 
ðM (i=1,...,k) are embeddings with disjoint 
images, we can define the thandlebody X (M; 
Jise gi S1 S) LALL 

For an feC®(M, R) and for a, be f(M) with 
a<b let M@={xeM|f(x)<a} and Mi= 
{xeM|a<f(x)<b}. Mz is called a level set 
of f. For a Morse function f on a compact 
manifold M without boundary, the following 
fundamental facts are known [4]: (1) The first 
fundamental theorem. If M? contains no crit- 
ical points, then M? is diffeomorphic to M; x 
[a,b]; (2) The second fundamental theorem. 
Let ce(a, b) be a unique critical value in [a, b), 
and let p,,...,P,€M¢ be critical points of f 
with p; having index /;. Then M” is diffeo- 
morphic to X(M5; fi» as fgs Ais -3 Ad) FOT 
suitable embeddings f1, ---, Jp: It follows from 
the existence of Morse functions and from (1) 
and (2) that every compact manifold can be 
obtained by successively attaching handles to 
a disk (— 114 Differential Topology). Fur- 
thermore, if M admits a Morse function with 
only two critical points, then M is homeo- 
morphic to a sphere (Reeb [12]). 

Concerning the homotopy types of M and 
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M" we have the following: (3) If ce f(M) is a 
critical value of f with points p,,...,p, with p; 
having index A; and if M£*? is compact for ¢>0 
and contains no critical points other than 

Pis «+++ Py, then M‘** has the same homotopy 
type as M Uet U... Ue, where e4 (j= 
1,...,k) are A,-cells. (4) Assume that a Morse 
function f on a (not necessarily compact) M 
satisfies: (A2) M* is compact for all ae R. Then 
by choosing a sequence 4; <d2<... of regular 
values of f and applying (3) to each Mg’, we 
see that M has the same homotopy type as 
that of a CW complex ¢, where the number of 
-cells belonging to ¢ is the same as that of 
critical points of index 4 of f. 

Moreover, we have the following Morse 
inequalities: (5) Let f be a Morse function on a 
compact M. If M, is the number of critical 
points of f on M of index 4, and if Riisa å- 
dimensional ‘Betti number of M, then 


M, —Mo2Rı— Ro, 
M, — M3- +... +(—1) Mo 


>R;,—R;ı i+. +(—1 Ro 1<4<n-l1, 


M,—M,-; +... +(—1)"Mo 
SRR AS 1 Ry 


In particular, we have M, > R, for all k. By 
using these facts S. Smale obtained an affirma- 
tive solution of the tPoincaré conjecture in 
high dimensions [11, 13]. 

The concept of critical manifolds in the 
sense of Bott [14] is stated as follows: Let M 
be a compact manifold embedded into an 
open set U c R*. Let f: U —R be a smooth 
function. M is called a nondegenerate critical 
manifold of f on U if (1) all points of M are 
critical points of f and (2) the nullity of all 
xeM is equal to dim M. When M is such a 
nondegenerate critical manifold of f on U, fis 
constant on M, and the index A of f at x is 
well defined and is the same at all points on 
M. The *Poincaré polynomial is expressed as 
P(M;t)= È t*dim H*(M). The Morse poly- 
nomial is defined by M(f;t)= Dy t** P(N; t), 
where N runs over all critical manifolds of f 
and åy is the index of N. Under certain con- 
ditions for orientability along each nondegen- 
erate critical manifold of f, we have again the 
Morse inequality 


M( fst) > PCM; 0). 


E. Morse Theory on Hilbert Manifolds 


Let M be a tHilbert manifold, and let f: MR 
be a smooth function. The Hessian 67f, of f at 
a critical point of f is a symmetric bilinear 
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form on M, given by 
67 f,(u,v)=d?(fo@ *)(de,(u), dg,(v)), 
u, vE Mp, 


where ¢ is a coordinate mapping of a local 
chart around p. The index (coindex) of f at a 
critical point pe M of f is defined to be the 
supremum of dimensions of subspaces of 
M, on which 67 f, is negative (positive) defi- 
nite. The self-adjoint bounded operator A 
which represents 67f, is given by 07 f,(u, v)= 
<u, A(v)>,. If A is invertible, then f is said to 
be nondegenerate at p. These notions do not 
depend on the choice of local charts around p. 
The Morse lemma has been generalized to 
a Riemannian Hilbert manifold M by R. 
Palais and S. Smale [15] as follows. Let f be a 
smooth function defined on M. If xe M isa 
nondegenerate critical point of f then there 
exist a chart ọ around x and a projection P 
such that for y near x 


S= PEY)? - IU- Poy). 


The above result has been extended to critical 
manifolds [16]. A connected submanifold N of 
a Hilbert manifold M is called a nondegenerate 
critical manifold of f: M >R if (1) every point 
peM is a critical point of f and (2) for each 
peM there exists a closed subspace E, of M, 
such that M, = N, ® E, and the Hessian form 
restricted to E, is nondegenerate. Let v= 

(x, E, N) be a smooth Hilbert-space bundle 
over a compact connected manifold N, and let 
<, > be a Riemannian structure for v. Assume 
that a smooth function f: ER has the zero 
section of v as a nondegenerate critical mani- 
fold. Then there exist a tubular neighborhood 
v,={vEE| ||v|| <E} of the zero section in v and 
a fiber-preserving diffeomorphism w:v,>y(v,) 
and an orthogonal bundle projection P such 
that for vev, 


fow(v)=f(N)+ | Poll? — IU — Py]. 


Let us assume that (B) M is a complete 
Riemannian manifold, and (C) (Palais-Smale 
condition) if f is bounded on a set Sc M and if 
the norm ||Vf || of the tgradient vector of f 
has infimum 0 on S, then there is a critical 
point of f on the closure S of S. Since M is not 
locally compact, condition (C) is required. In 
order to prove the first and second funda- 
mental theorems of Morse theory it ts neces- 
sary that the integral curves of Vf exist, and 
(C) ensures their existence. Namely, under 
conditions (A1), (B), and (C), one of the follow- 
ing two facts follows for any be f(M) and for 
any regular point pe M”: (1) The integral curve 
c:[0,rJ—>M? of Vf with c(0)=p exists, and 
foc(r)=b holds for some re[0, æ); (2) the 
integral curve c:[0, 00) M? of Vf exists and 
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lim, „œ c(t) exists such that the limit point c(0o) 
=lim, „œ c(t)e M? is a critical point of f. These 
facts imply that the first fundamental theorem 
of Morse theory holds if M? does not contain 
any critical point of f, and also the second 
fundamental theorem holds if M? has only 
nondegenerate critical points on M£ for some 
ce(a, b). Furthermore, assume that f: M >R 
and M satisfies (A1), (B), and (C), and let a, 

be f(M) be regular values with a <b. If for 
each nonnegative integer 4, R, is the Ath Betti 
number of M?, and if M, is the number of 
critical points of index 4 of f in M?, then the 
Morse inequality holds: 


Mo È Ro, 
M,—My2R,—Ro, 


way 


È (DM, > È (I*R, 


È (IPM, = È (IFR. 


In particular, M, 2 R, holds for all 2. If f is 
bounded below, then the Ath Betti number R¥ 
of M” and the number M* of all critical points 
of f with index 4 in M? satisfy M} > R* for 

all 2. 


F. Morse Theory of Path Spaces 


Morse theory on Hilbert manifolds applies to 
the energy functions on Riemannian Hilbert 
manifolds which consist of all H,-curves on a 
compact Riemannian manifold M, and the 
theory is useful for proving the existence of 
closed geodesics on M. 

Let M be a smooth manifold, and let I = 
[0,1]. Let H, (I, M) be the set of all continuous 
curves g: I—M such that for each local chart 
(V, p) of M, goa is absolutely continuous and 
\\(g oa)'|| is locally square integrable. In par- 
ticular, if M =R”, then H, (I, R”) is a Hilbert 
space with the inner product 


(o, p)1=(0(0), p(0)) + | (a(t), p(t) dt, 


i 
c, p€ H, (1, R”). 


For each oe H (I, M), set Hi (I, M) ={X € 

H, (1, TM)| X(t)€ Mam, teI}, where TM is by 
definition the ttangent bundle over M. For any 
fixed pair of points p, qe M, let Q(M; p, q)= 
{oe H,U, M)|o(0)=p, o(1)=q}, and for each 
o€Q(M; p,q), let Q(M; p, q) ={X € Ay (I, M),| 
X(0)=0eM,, X(1)=0e M,}. Then H, (I, M), 
forms a vector space, and Q(M; p, q), 1s a sub- 
space of H (I, M),. M can be embedded into 
R” for sufficiently large n by the Whitney 
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embedding theorem [17]. Then we have 

the following facts [17]: (1) H U, M)={ce 

A, (1, R")|o(1)< M}, (2) H, (I, M) is a closed 

submanifold of H, (7, R”), (3) Q(M; p,q) is a 

closed submanifold of H, (I, M), (4) the tangent 

space to H, (J, M) at each point oe H, (I, M) is 

H, (1, M),, (5) the tangent space to Q(M; p,q) at 

each point ¢€Q(M; p,q) is Q(M: p, g),. Here 

M is identified with the embedded image in 

R”. Thus H, (I, M) and Q(M; p, q) carry the 

structure of a Hilbert manıfold, and they are 

independent of the choice of embeddings [17]. 
The energy functions on H, (J, M) and 

Q(M; p,q) play important roles in the develop- 

ment of Morse theory. M is now assumed to 

be a complete Riemannian manifold. It follows 

from the Nash isometric embedding theorem 

(— 365 Riemannian Submanifolds B) that 

M is isometrically embedded into R” for a 

sufficiently large n’. H, (I, R") carries a com- 

plete Riemannian metric in a natural way, 

and H, (I, M) becomes a complete Riemann- 

ian manifold with the metric induced from 

H, (I, RY). Similarly Q(M; p, q) admits a com- 

plete Riemannian metric. Then the energy 

function E: H, (1, M)>R is defined to be 


E(a):= 5l <o’,a'> dt, 


where < , > is the inner product induced from 
the Riemannian structure of M. Then the 
Palais-Smale condition (C) is satisfied for E 
on H, (1, M). That is, if a sequence {0,} on 
H, (I, M) satisfies: (1) {E(o,)} is bounded above; 
and (2) VE(o,)-0 as k— œ, then there is a 
subsequence {o;,-} of {op} such that {0x} con- 
verges to a critical point oe H, (I, M) of E. 
Also, condition (C) is fulfilled for the energy 
function on Q(M; p,q). 

For a fixed pair of points p, qe M, let Q = 
Q(M, p,q). Let c€Q be a critical point of E. 
A tangent vector W to Q at o:!1>M with a(0) 
=p, o(1)=q is a tpiecewise differentiable vec- 
tor field along o such that W(0)=OeM, and 
W(1)=OeM,,. A proper variation &:(— e, £) x I 
>M along o which is associated with W satis- 
fies the following: (1) «(0, t)= a(t), te I; (2) there 
exists a finite partition 0=t9<t,<...<t,=1 
of I such that «|(—«,¢) x [t;_,,¢,] is differenti- 
able for all i=1,...,«; (3) a(u,0)=p and a(u, 1) 
=q hold for all ue(—e, £); and (4) éa(0, t)/ðu 
= W(t), tel. It follows from the first variation 
formula (— 178 Geodesics A) that ceQ is a 
critical point of E if and only if ø is a geodesic 
on M. Let W,, W,€Q,, and let U be an open 
set in R* around the origin. Then a proper 
variation x: U x IM along ø that is asso- 
ciated with W, and W, satisfies the follow- 
ing: (1) «(0,0,}=o(t), te J; (2) there exists a 
finite partition 0O=t)<t,<...<t,=1 of I such 
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that «| U x [t;_,,t,] is differentiable for all i= 
1, ...,k; (3) a(u,,u,, 0 =p, a(u,,u,, 1)=q for 
all (u,,u,)eU; and (4) da(0, 0, t)/eu, = W, (t), 
6a(0, 0, t)/ðu = W,(t), te 1. Then the Hessian 
E,, of E at o is given by 

1 07 E(a(u,,u>)) 


aad, W,)=< 


E 
2 ĝu ĝu, 


> 


(0,0) 


where &(u,,u,)E€Q is by definition the curve 
O(Uy,U)(t)=a(u,, U5, t). The second variation 
formula (— 178 Geodesics A) then gives 


Ega(Mi, Ws) = -Y W0, A Wi (D) 
- | w We + ROM, 


where A, W/(t) = Wy (t + 0)— Wy(t —0). We have 
the Morse index theorem, which states: (1) The 
tnull space of E at a critical point ø is the 
linear space spanned by all Jacobi fields (— 
178 Geodesics A) along o that vanish at 0 

and 1; (2) if {a(s,), (s2), ...,0(s))} (0<s,<s, 
<... <Sı< 1) is the set of all points conjugate 
to a(0) along o and if 4; is the multiplicity of 
the conjugate point o(s,), then the index of E at 
g is equal to A, +... +å. It follows from the 
Sard theorem together with the differentia- 
bility of the exponential mapping on M that 
except for a set of measure zero in M x M, p, q 
can be chosen so that p, q is not a conjugate 
pair along any geodesic in Q. Then all critical 
points of E are nondegenerate, and for any 
c>0, OF = {weEQ| E(w)=c} contains at most 
finitely many nondegenerate critical points 
with finite indices. 


G. Existence of Closed Geodesics 


Let M be a compact Riemannian manifold. By 
replacing I with the circle S+, we consider a 
Hilbert manifold A(M)= H,(S', M). A(M) 
carries the structure of a complete Riemannian 
manifold. Every point pe M is naturally em- 
bedded in A(M) as a point curve, and M isa 
totally geodesic submanifold of A(M). A point 
ge A(M) is a critical point of the energy func- 
tion E:A(M)->R if and only if ø is a closed 
geodesic of M. It is known that E satisfies the 
Palais-Smale condition (C). The index and 
nullity of E at a critical point ø is finite. Since 
M is compact, the fundamental length Je of 
M is positive and A‘= {aE A(M)|E(a)<c} isa 
tdeformation retract of M c A(M) by means of 
the deformation along the integral curves of 
—VE. 

If M is not simply connected, then each 
nontrivial element of the tfundamental group 
of M represents a class of homotopic curves in 
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which there is a closed geodesic whose length 
realizes the infimum of all these curves. If 
M is simply connected, then there is a mini- 
mum integer k for which z,(M)#0. Clearly, 
7™,-,({A(M)) x 2,(M) 40. Suppose that E has no 
positive critical value. Then A(M) is a defor- 
mation retract of M, a contradiction. Therefore 
there exists at least one closed geodesic on 
every compact Riemannian manifold (Lyuster- 
nik and Fet [18]). 

Proof of the existence of many closed geo- 
desics on M is difficult due to the follow- 
ing: (1) There is a continuous O(2) action on 
A(M) that assigns oe A(M) and we O(2) to the 
curve t>o(t+a), teS'; (2) for each integer 
k and for each critical point o e A(M), the 
curve t>o(kt) is a critical point with energy 
k? E(a). 

A remarkable result has been obtained 
by Lyusternik and Shnirel’man [19], which 
states that there are at least three closed geo- 
desics on every simply connected compact 
manifold of dimension 2. Fet then proved that 
there exist at least two closed geodesics on a 
compact manifold if all critical points of E on 
A(M) are nondegenerate [20]. By developing 
a precise argument concerning the Morse 
lemma around an isolated degenerate critical 
point ce A(M) of E, Gromoll and Meyer have 
proved that there exist infinitely many closed 
geodesics if the sequence of Betti numbers 
{b{A(M))} with respect to any field is un- 
bounded [21]. If 4: MM is a certain isome- 
try, then there are also infinitely many A- 
invariant closed geodesics if the Betti numbers 
of the space of A-invariant H,-curves are not 
bounded [22, 23]. By investigating the Z,- 
cohomology of A(M) of compact symmetric 
spaces, Ziller has proved that if M has the 
same homotopy type as that of a symmetric 
space of rank >2, then M has infinitely many 
closed geodesics [24]. l 

Many attempts have been made by W. 
Klingenberg and others to prove the existence 
of infinitely many geometrically distinct closed 
geodesics on every compact Riemannian mani- 
fold [25]. 
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A. General Remarks 


Multivariate analysis consists of methods of 
statistical analysis of multivariate observations 
represented by a collection of points in a finite- 
dimensional Euclidean space R”. With the 
development of powerful computers, multi- 
variate techniques are beginning to be utilized 
in many fields of science and technology. 


B. The Multivariate Linear Model 


The multivariate linear model is an immediate 
extension of the univariate linear model. Sup- 
pose that X=(X"...X) denotes the p x n 
matrix of n observations of p-dimensional 
data. Suppose that it can be expressed as 


X=BZ+U, (1) 


where B is a p x m matrix of unknown para- 
meters, Z is a known m x n matrix of inde- 
pendent variables, and U is a p x n matrix of 
errors. We assume that (i) the texpectations of 
the elements of U are zero, that is, the p xn 
matrix E(U)=0, and call the relation (1) a 
multivariate linear model. We usually assume 
further that (ii) the column vectors U®, i= 
1,...,n, of U are independent and identically 
distributed, and that (iii) U is distributed 
according to a multivariate fnormal distri- 
bution with ‘covariance matrix X. Analogous 
to the univariate case, the *least squares es- 
timator B of B is defined to be the p x m ma- 
trix that minimizes 


tr(X — BZ)(X — BZ)’ 
and is given explicitly by 


B=XZ(ZZ’)"! when |ZZ'|#0. 


Here the symbol ' means the transpose of a 
matrix. Also, an unbiased estimator of X is 
given by 


F=Q,/(n—m), 


B is an unbiased estimator of B under the 
assumption (i) above and is the *best linear 
unbiased estimator, under (i) and (ii), while £ is 
unbiased when (i) and (it) are assumed. Under 
the assumptions (i)—(iii), Ê and Ê form a set of 
complete ‘sufficient statistics; hence they are 
tuniformly minimum variance unbiased es- 
timators. Also under (i)—(iii), elements of B are 
normally distributed, and their covariance can 
be expressed by 


Q.=XxX’—Bzx’. 


S&M (Q denotes the ‘Kronecker product), 
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where M =(ZZ')"'. Applying tCochran’s 
theorem for the multivariate case, Q, is shown 
to be distributed according to a fWishart 
distribution with n— m degrees of freedom. 

To test the hypothesis B= By under (i)—(iii), 
we put Q,=(B—B,)ZZ'(B— B,)' and have (X 
— By Z)(X — By ZY =Q,4+Q,, where Q; and Q. 
are independently distributed. The distribution 
of Q; is a Wishart distribution with m degrees 
of freedom when the hypothesis is true, and a 
*‘noncentral Wishart distribution when B# Bg. 
Based on this fact, several procedures have 
been proposed. If we require the invariance of 
procedures with respect to linear transforma- 
tions of the coordinates of p-dimensional 
vectors, the roots 4,,...,4, of the tcharacter- 
istic equation |Q,—2Q,|=0 form a tmaximal 
invariant statistic; hence the testing proce- 
dures should be defined in terms of these roots 
(— 396 Statistic I). Also, the consideration of 
‘power leads to procedures that reject the 
hypothesis when these roots are large. Com- 
monly used test statistics are (1) the flikelihood 
ratio test W=|Q,//1Q,+Q./=TH(1 +4) 

(S. S. Wilks); (2) trQ.'Q,=>/; (D. N. Lawley 
and H. Hotelling); (3) max 4; (S. N. Roy); (4) 
trQ,(Q2+Q,)-'=D4(1+4) 7 (K. C. S. 
Pillai). Pillai’s trace test (4) is locally the most 
powerful invariant; Wilks’s likelihood ratio 
test (1) has the maximum Bahadur efficiency. 
The power functions of the tests (1)—(3) have 
the monotonicity property, namely, they are 
monotonically nondecreasing with respect to 
each eigenvalue of Q= 5 7'(B—B,)ZZ'(B— 
Boy, the matrix of noncentrality parameters 
for Qx. The monotonicity for Pillai’s test (4) is 
known to hold only for restricted cases where 
the critical value c for the acceptance region 

tr Q(B +Q) 7! <c should satisfy O0<c <1. All 
the tests (1)—(4) are unbiased. With respect to 
0-1 loss, the tests (1) and (4) are tadmissible 
Bayes and the tests (2) and (3) are ‘admissible. 
Tests based on min å; are inadmissible. 

Small-sample distributions of these statis- 
tics are complicated but when n> œ, nlog W, 
ntrQ,Q,', and ntrQ,(Q,+Q,) | are as- 
ymptotically distributed according to a chi- 
square distribution with pm degrees of free- 
dom under the null hypothesis. Even under 
the alternative hypothesis that the matrix 
of noncentrality parameters Q = o(1) for large 
n, the asymptotic distributions remain the 
same. When Q= 0(1), they are noncentral chi- 
square distributions of pm degrees of freedom 
and noncentrality parameter 6 =trQ. If Q = 
O(n), they are normal distributions, namely, 
—./n(log W+log|I + 0|), /n(trQ,Q;! —trd) 
and \/n(trQ,(Q,+Q,)~!—trO(+0)~) 
for 0=limQ/n have asymptotically normal 
distributions of zero means and variances 
given by 2tr(I—(1+4) 7), 2tr(20+ 67), and 
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2tr((1+0) 7—(14+6)~%), respectively. As a 
special case, if m= 1 there exists only one non- 
zero A, and the procedures in this paragraph 
all coincide and are equivalent to one based 
on T? = M ~!(Ê-— By £~ (Ê -— Bp). It is known 
that under the hypothesis, (n — p) T? n — 1)p is 
distributed according to an tF-distribution 
with (p, n— p) degrees of freedom. When p= 

2 (resp. m=2), (n=m—1)(1—,/ W/(m /W) 
(resp. (n—p— 1)(1— JW P/W) is distributed 
according to an F-distribution with degrees of 
freedom (2m, 2({n—m — 1)) (resp. (2p, 2(n — p 
—1))). Simultaneous ‘confidence regions of B 
can be derived from the testing procedures in 
this paragraph, that is, 


trQ, '(B—B)ZZ'(B—B) <c. 


Moreover, when the matrix B is decom- 
posed as B=(B, : B,), where B, and B, are a p 
xq matrix and a p x (m— q) matrix, respec- 
tively, and the hypothesis to be tested is of the 
form B; =0, the test procedures can be ob- 
tained as follows: Decompose Z as 


() 
ey BaF 
Z, 


where Z, is a q x n matrix and Z, is an (m—q) 
xn matrix, and put 


B3=XZ}(Z,Z;)"', 


Q* = XX’ — BSZ,X’, Qs, =Q*-Q.. 


Then Q,, and Q, are independent, Qg, is 
distributed according to a Wishart distribution 
with q degrees of freedom when the hypothesis 
is true, and we can apply the procedures in 

the previous paragraph, simply replacing Q, 
by Qs, 

Such a procedure is called multivariate 
analysis of variance (or MANOVA, for short). 
Various standard situations can be treated in 
this way (after some linear transformation of 
variables, if necessary). Some examples are (1) 
X =(X")...X), where the X (i= 1, ...,n) 
are distributed independently according to a 
p-dimensional! normal distribution N (u, X). 
We can express X as X =al’ + U, and the 
estimators are given by #=X=X1/n, $= 
(X —X1’)(X —X1’)(n—1). In this case, we 
obtain a test for the hypothesis z= 4o based 
on Hotelling’s T? statistic, i.e., the test with a 
teritical region of the form 


T? =n(X — poy Z7\(X— py) >. 


(2) Suppose that p x n; matrices X;,i=1,...,k, 
are samples of size n; from p-dimensional 
normal distributions N(#;, 2) with common 
covariance matrix X. The tests for the hypoth- 
esis #, =... =, are obtained from the follow- 
ing observation: Let Q, = ¥(X;—X,1)(X;— 

X; 1y, where X;=X,1/n,, Q, =X n{X;—X) 
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(X,—X)’, X= Dn,X,/Dn,;. Then D(X, — XI’) 
(X;—X1Y=Q, +Q.. We call Q, the matrix of 
the sum of squares within classes, and Q, the 
matrix of the sum of squares between classes. 
The latter is distributed according to a Wish- 
art distribution when the hypothesis is true. 

(3) Suppose that X;; are p-dimensional vectors, 
and that 

X,=H+0;+ 8 +U;;, i=1,...,m, 


where y, a;, $; are p-dimensional constant 
vectors such that $} «;=0 and È f,=0, and the 
U; are independently distributed according to 
a p-dimensional normal distribution N (0, 2). 
We set 


Q,= ny (X; -X) (X; —Xy, 
Q,=m)> (X -XXX ;—X), 


X=) Y X;;/mn. 

Then we have © £ (X; —X)(X;—XY=Q,+ 
Q; +Q., and Q,, Qs, Q. are distributed inde- 
pendently according to (noncentral) Wishart 
distributions with degrees of freedom m— 1, 
n— 1, and (n—1)(m—1), respectively. The tests 
for the hypothesis «;=0 (i= 1,...,m) or B,=0 
(j=1,...,n) are obtained from these matrices. 


C. Tests for Covariance Matrices 


Let X;(p x 1), j=1,...,N;, be a random sample 
from p-variate normal distribution N(u;, 2;) 
fori=1,...,k. For testing the hypothesis Ho: 
X,=...=2;, with unknown mean vectors 4; 
against all alternatives, the likelihood ratio 
Statistic is given by 


NN? I] ISa 
BESA ESP’ 


where S= £X, (X,,—X,)(X,,—X))' for X;= 
£w X,;/N;. If we replace the sample size N, 
by the degrees of freedom n;= N;— 1 and N 
by n= N—k in (2), the modified likelihood 
ratio test is unbiased for general p and k [16]. 
For k=1, the hypothesis specifies Hy: 2, = 

Xo (a given positive definite matrix) and 

the likelihood ratio statistic is given by 

(S| etr(— Xi! 8/2) (etr(x)=exp(tr«)). Again 
replacing N by n=N-—1 yields an unbiased 
test. Moreover the power function of this 
modified likelihood ratio test depends only 
on the eigenvalues of 3,5‘ and is increasing 
with respect to the absolute deviation of each 


N=N,+...+N,, (2) 
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eigenvalue from 1, that is, |ch;,X, £5! — 1]. For 
k = 2, it is conjectured that the modified likeli- 
hood ratio test has a power function monoton- 
ically increasing with respect to |ch,2,27'—1|. 
This conjecture has not yet been proved; we 
know only that the power is increasing if the 
maximum eigenvalue ch, Z, 2; increases 
from 1 or the minimum eigenvalue ch, 2, 2; ' 
decreases from 1. 


D. Estimation of Mean Vector and Covariance 
Matrix 


Let X(p x 1) have p-variate normal distri- 
bution N (æ, J). For estimating the unknown 
mean, take the sum of squared errors S as a 
loss L(x, d)= ||z —d|\*. Since each component 
X, of X is distributed independently according 
to N(u,;, 1) for w=(p,,..., u,), it is natural to 
suppose that X is a good estimator. In fact X 
is a minimax estimator for u. However, Stein 
showed that X is admissible for p=1 and 2 but 
inadmissible for p> 3. James and Stein showed 
that the estimator 


p—2 
1 —_—~ ]X 
( aE a 


dominates X when p > 3. This estimator is 
further dominated by Stein’s positive part 
estimator (1 —(p —2)/||X |7) X, where (a), 
means a if a is nonnegative and zero if a is 
negative. The class of estimators X + V log f(X), 
where V =(0/0x,,...,0/0x,) for X'=(x;,-.-, Xp) 
are all minimax for # dominating X, if f(X) 

>0 and ./f(X) is superharmonic (V?,/ f(X) 
<0), satisfying E[||Vlog f(X)||7] < œ and 

E[ X67 f(X)/0x? |/f(X)] < co. Putting f(X)= 
|X || “°" yields the James-Stein estimator 

(3). For this problem a class of monotone 
estimators is essentially complete, where an 
estimator d(X) is called monotone if d(X)< 
d(Y) whenever X < Y (defined componentwise). 
Stein’s positive-part estimator is not mono- 
tone and is still inadmissible [8]. 

Let X,(p x 1), i=1,...,n, be a random sam- 
ple taken from the p-variate normal distri- 
bution N(0, X). Then the maximum likeli- 
hood estimator for X is given by S/n, where 
S= Xi- X;X;. S has Wishart distribution 
W,(n, X). To study the estimation problem of 
X, take two loss functions L,(2,d)=trdX™! 
—log|dd~'|—p and L,(5,d)=(1/2)tr(d>! 
—1)?. Under L,-loss, the best scalar multiple 
of S is given by S/n, and under L,-loss it is 
given by S/(n+p+1). The James-Stein mini- 
max estimator for L,-loss is given by d,(S) 
= KAK’, where K is the lower triangular ma- 
trix with positive diagonal element for which 
S=KK’ and A=diag(A,,...,A,) with A;= 
1/(n+ p+1-—2i). A minimax estimator for 
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L,-loss is given by d,(S)= KAK’ with the same 
lower triangular matrix K, but A is the solu- 
tion of the linear equation AA =b, where A is 
given by 


(n+ p—1)(n+p+1) n+p—3 


A n+p+3 (n+ p—3)(n+ p—1) 
n—p+1 n—p+1 
n—p+l 
n—p+l1 


.. (n—pt1)(n—p+t+3) 
and b'=(n+p—1,n+p—3,...,n—p+l1). The 
minimax estimators d,(S) and d,(S) dominate 
the best scalar multiples of S. However, they 
are inadmissible. Let P be a permutation ma- 
trix, then the estimator £, P’d,(PSP’)P/p! dom- 
inates d,(S) because of the convexity of the loss 
function L; for i=1 and 2. The estimator h,(S) 
=(S + ub(u)C)/n, where u=1/tr CS“! and C is 
a fixed positive-definite matrix, dominates S/n 
under L,-loss if 0< b(u) <2(p—1)/n and b(-) is 
nonincreasing. Under L,-loss (S + bC/tr CS “')/ 
(n+ p+ 1) dominates S/(n+p+1), if0<b< 
2(p—1)(n—p+3) [7]. The Haff estimator 
h,(S) is dominated by the James-Stein esti- 
mator d,(S) if p>6. The estimators (S + 
b(tr CS“! /tr CS~2)C)/n for 0< b<2(p—1)/n 
under L,-loss also dominate S/n and are not 
dominated by d,(S). Similar results hold under 
L,-loss. 


E. Correlation among Variables 


In order to represent the interrelationships 
among the p variates, the population corre- 
lation matrix P= {p,;} is used. Also, we can 
calculate the population tmultiple correlation 
coefficient of the ith component X; and all the 
rest of the variables by 


Pi-1..)...p 7 1 — P/P; 


and the tpartial correlation coefficient of X; 
and X,, given all others, by 


Pij-1...@...G)...p 7 —Pi/V PP, 


where P, are the cofactors of P (— 397 Statis- 
tical Data Analysis). 

The sample correlation matrix is calculated 
from the data, and the sample multiple correla- 
tion and the sample partial correlation coeffi- 
cients are calculated from the sample correla- 
tion matrix in exactly the same way as the 
population coefficients are calculated from 
the population correlation matrix. When X = 
(X...X) is a sample of size n from a 
multivariate normal population, the sampling 
distributions of R;-1...¢)...p and R; are 


a0) p 1.) Gap 


known (— 374 Sampling Distributions). 
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The determinant of the covariance matrix 
|X| (or |S]), called the (sample) generalized var- 
iance, is a measure of the dispersion of a p- 
dimensional distribution. The distance of two 
distributions with mean vectors #\ and #3, 
respectively, and with common variance is 
often expressed by 


O=(", — py) E "(py — ft), 


which is called the Mahalanobis generalized 
distance. 
When the data consists of (p+ q)- 


X 
dimensional vectors (*) with q <p, the inter- 


relation of X and Y as a whole can be ex- 
pressed in the following way: Let the covar- 
lance matrix be partitioned as 


5- Gs F 

2xy Avy 
and the nonzero roots of the equation |pZyy — 
Jy 2xx Zxy| =0 be p,,...,p,. Then p}”,..., 
pł”, called the canonical correlation coeffi- 
cients, are the maximal invariant parameters 
with respect to linear transformation of X 
and Y. Also, if we denote the eigenvector cor- 
responding to a root p; by 9, i.e., 


(p;2yy) I= (Lyx 2xx Lyy)Nis 


the linear function y; Y and y; yx XxX are 
called the canonical variates. 


F. Principal Components 


An important problem in multivariate analysis 
is to express the variations of many variables 
by a small number of indices. Principal compo- 
nent analysis is a technique of dealing with this 
problem. 

Let X be a p xn matrix with n column vec- 
tors of p-dimensional data. A linear transform 
of X, T= AX (A anr x p matrix, r<p, T an 
r x n matrix) ts called the principal component 
if A is chosen so as to maximize the sum of 
the squares of the sample multiple correlation 
coefficients of each of the row vectors of X to 
those of T, namely, if A is an r x p matrix 
formed by the r eigenvectors of the sample 
correlation matrix of X corresponding to the r 
largest eigenvalues or any nonsingular linear 
transformation of them. This is a characteriza- 
tion of principal components in terms of cor- 
relation optimality. 

The principal component T = AX is also 
characterized by the information-loss op- 
timality in that all eigenvalues of (X — CY — 
b1) (X — CYb1}) are simultaneously minimized 
subject to the condition: C is a p x r matrix, 
bis a px 1 vector and Y is an r xn matrix. 

The solution is given by CY + b1; = A'T +X1;, 
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where X= X1,,/n. The variation optimality of 
the principal component is given by maximiz- 
ing simultaneously all the eigenvalues of the 
matrix C’(X — X1): (X —X11)'C, subject to 

the condition that C is a px r matrix such 
that C’C=I,. The solution is given by C= A’, 
namely, C’X =T [15]. If all the correlations 
between the components of X are positive, the 
largest eigenvalue of the covariance matrix of 
X is simple and positive. All the coefficients of 
the first principal component (components of 
the eigenvector) can be taken to be positive 
(Perron-Frobenius theorem). 

When we assume normality, the eigenvalues 
of the sample correlation matrix R are the 
*maximum likelihood estimators of the eigen- 
values of the population correlation matrix, 
and their sampling distributions can be ob- 
tained. A hypothesis relevant to principal 
component analysis is, for example, that the 
smallest p—r eigenvalues of the correlation 
matrix are equal, which can be tested by the 
statistic 


Rp- 
=[RI/(4 -å ((p—4, — + — A/p — r) ~”), 
where 4,,...,4, are the r largest eigenvalues of 


R. Under the hypothesis, — clog Rp-, (c a con- 
stant) is asymptotically distributed according 
to a chi-square distribution when n> œ. 

Variations of principal component analysis 
can be obtained by taking the eigenvectors of 
the covariance matrix of the raw data or of a 
multiple of it by some weight matrix. 


G. Factor Analysis 


Factor analysis is closely related to principal 
component analysis. We assume a model 


X=BF+U, 


where B and F are unknown p xr andr xn 
matrices of constants (p>r)and Uisapxn 
matrix of independent errors. F is called the 
matrix of factor scores and B the matrix of 
factor loadings. We assume F F’ =nlI. If E(UU’) 
=n® is known, then by applying the least 
squares principle, we can determine B and F 
so as to minimize the trace of (X — BFY® ~! (X 
— BF). Then B is obtained by taking the r 
eigenvectors of ®~' XX’ corresponding to the r 
largest eigenvalues. When ® is diagonal but 
unknown, we can solve the simultaneous equa- 
tion for B and ®, whose solutions are the 
matrix B with columns equal to eigenvectors 
of ®"! XX’ and the diagonal matrix Ê with 
elements equal to the diagonal elements of 
XX’/n— BB’. 

If we assume that U is normal, the proce- 
dure in the preceding paragraph for the case 
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when ® is known is equivalent to the maxi- 
mum likelihood method. When ® is unknown, 
we can assume further that the columns in F 
are also multivariate normal vectors distri- 
buted independently of U, which implies that 
the columns of X are also normal vectors with 
the covariance matrix X = BB’ +. B and ® are 
estimated from the sample covariance matrix, 
and the solutions of the simultaneous equation 
for B and ® gives the maximum likelihood 
estimators. However, there is the so-called 
identification problem of determining whether 
for given Z and r the decomposition X = BB’ + 
® is unique. This problem has not yet been 
completely settled. If the solution Ê is posi- 
tive definite and BB’ is positive semidefinite, it 
is called proper. When 9 is not positive defi- 
nite, it is called a Haywood case, and when 
BB’ is not positive semidefinite it is called a 
complex case. Sometimes iterative procedures 
lead to improper solutions. 


H. Canonical Correlation Analysis 


Canonical correlation analysis can also be 
used for descriptive purposes. Sample canon- 
ical variates have various descriptive implica- 
tions. Suppose that y; Y and č; X are the first 
canonical variates corresponding to the largest 
canonical correlation p}. Then p?! is the 
largest possible correlation between a linear 
function of X and a linear function of Y and is 
actually equal to the correlation of y, Y and 
či X. Similarly, the second canonical correla- 
tion is equal to the largest possible correla- 
tion between linear functions in X and in Y 
which are orthogonal to €, X and y, Y, respec- 
tively, and so forth. 

As a second interpretation of the canonical 
variates, we consider the linear regression 
model 


Y=BX+U, 


where Y, B, U are q xn, q x p, q x n matrices, 
respectively, and rank B =r <q. Then there 
exist an r x p matrix A and a q xr matrix C 
satisfying C'S 'C=I such that B= CA, where 
E(UU’)=n2. Putting T= AX, we get a re- 
gression model Y =CT + U, regarding T as a 
matrix of regressor variables. If X is known, 
least squares considerations lead to minimiz- 
ing tr(Y — BXY £! (Y — BX) with the condition 
rank B =r. The resulting row vectors of A 
consist of the r eigenvectors corresponding 

to the r largest eigenvalues of the matrix 
S.S,,27'S,, and column vectors of C consist 
of the r eigenvectors corresponding to the r 
largest eigenvalues of the matrix S,,S.3S,,27'. 
If X is replaced by S,,, T= AX and Z=C'S, Y 
are equal to the matrices of canonical variates. 
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If we assume that U is normal, this proce- 
dure is equivalent to the maximum likelihood 
method. It should be remarked that although 
the model here is not symmetric in X and Y, 
the results are symmetric in X and Y, and 
therefore they will be the same if the roles of X 
and Y are interchanged in this model. 


I. Linear Discriminants and Problems of 
Classification 


Let p x n; matrices X; (i=1,...,k) be the set of 
observations for k distinct populations with a 
common covariance matrix. We determine a 
vector a such that T;=a’X; reveals the dif- 
ferences of the k populations as much as pos- 
sible, or, more precisely, so that the ratio of 
the sum of squares between classes of T to the 
sum of squares within classes is maximized. If 
the matrices of the sums of squares between 
and within classes are Q, and Q,,, respectively, 
the ratio is equal to 


[=a’Q,a/a’Q,,a, 


which is maximized when a is equal to the 
eigenvector of Q,,'Q, corresponding to the 
largest eigenvalue. The linear function t = 
a’X is called the linear discriminant function. 
When k=2, a is given by a=Q>'(X, —X,), 
where X, and X, are sample mean vectors. 
When k> 2, we let A be the matrix formed 
by the r eigenvectors corresponding to the 
first r largest eigenvalues of Q;,'Q,, and set 
T;= AX;. From this we can construct the r- 
dimensional discriminant function. These 
functions can be used to locate the k popu- 
lations in r-dimensional space, and also to 
decide to which population a new observa- 
tion belongs. For the latter problem we can 
also construct k quadratic functions s; = 
(X—X,)'Q,,1(K —X,), where X; is the sample 
mean vector of the ith population, and X a 
new observation. Then we can decide whether 
X belongs to the population corresponding to 
the minimum s;. Such a method is called a 
classification procedure. 


J. Discrete Multivariate Analysis 


Let X,,, be an observed frequency on three 
characteristics, each belonging to ijkth class 
for | <i<I,1<j<J and 1<k<K. Assume 
that X; is a sample from multinomial distri- 
bution having p; as a probability of occur- 
ence for an (i,j,k) cell, where p..= 2X pj,,=1. 
The multinomial observation X, with proba- 
bility p;x is called an I x J x K contingency 
table. If we further assume that 


log Pij =u+%; + b; Yet ayer Bu FYri t Orn 
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with the restrictions on parameters Ya; = 

2 %j= Lj; %;=9, similarly on f’s and y’s and 
Dj Ôi = oj Ôi = Dee Oj, =O, we Say that a 
saturated log-linear model is given. Here the 
number of observations and the number of 
parameters are equal, and no errors can be 
estimated. The parameter ĝ;; is called the 
three-factor effect, or equivalently, the second- 
order interaction. Similarly, œ; Bjk Yri are 
called the two-factor effects or the first-order 
interactions. Finally %;, f;, Yx are called the 
main effects. A simple model is obtained when 
all the first- and second-order interactions 
vanish. This is equivalent to the independence 
model: p; = pj..p.;-P.., and the maximum like- 
lihood estimators for u, &;, Pj, y, are obtained 
from Pj = X;..X.;.X.4/n°, where n=X.... A 
nontrivial model ts obtained by putting all the 
second-order interactions equal to zero. The 
likelihood equations for this model are given 
by 


npi,=X NDin= ins MP. j= Xj: (4) 


ij? 
Bartlett first described a solution of (4) when I 
=J = K =2. For a 2 x 2 x 2 table, the solution 
can be expressed by Pir =(Xijx + 0)/n because 
of the constraints for X;;., X;.,, and X.,. Putting 
Ôi 11 =9 for Pi; yields a cubic equation for 0: 


(Xiii FOX p29 + O)(X 22, + OX 21. +0) 
=(Xi91 —O)(X142—-P(X2141 —0)(X 222 — 0). 


For a general I x J x K table, the equations (4) 
have a unique solution within the no-three- 
factor effect model if there exists q; >Q satisfy- 
ing (4). The unique solution maximizes the 
likelihood. To solve the likelihood equa- 

tions (4), standard iterative procedures, such 

as the Newton-Raphson method, can be ap- 
plied. However, the following iterative scaling 
method of Deming and Stephan is more useful: 


npomt)) — jam Xj. 
Pijk = Pijx (3m)? 
uy 
npiomt 2) p3m+1) Xix 
Pijk 5 Pijk (3m+1) 
Pik 
X.. 
(3m+3)__ „(3m+2) “Jk 
NP ijx = Pij (5) 


ijk poe?) i 
The first iteration adjusts pi”? by fitting pro- 
portionally with respect to k so that np{}"*") is 
equal to X;,., and similarly for the second and 
third iterations. Starting with any initial values 
satisfying the first-order interaction model, the 
iterative scaling method (5) converges to the 


unique solution of (4).as m> œ. 


K. Other Problems 


The sampling distributions associated with the 
procedures discussed here are usually very 
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complicated, and often only asymptotic prop- 
erties are known (— 374 Sampling Distri- 
butions). Nonparametric rank analogs of 
many multivariate techniques for normal 
distribution are found in [18]. Robustness of 
the distributions of test statistics or of latent 
roots is investigated in [3, 10, 14]. Multivariate 
data analysis is found in [5]. 
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281 (XIX.5) 
Network Flow Problems 


A. Introduction 


The network flow problem is a special kind of 
mathematical-programming problem (— 255 
Linear Programming, 264 Mathematical Pro- 
gramming, 292 Nonlinear Programming), 
where the variables of the objective function 
and the constraints are all defined in terms of a 
graph (— 186 Graph Theory). Owing to its 
special structure, the mathematical properties 
of the network flow problem as well as the 
solution algorithms have been investigated in 
detail. Network flow problems have a variety 
of useful applications in fields such as trans- 
portation, scheduling, and resource alloca- 
tion, and in operations research in general, so 
that they now constitute an important class 
of mathematical programming. By the term 
“network” (Netzwerk in German, réseau in 
French, set’ in Russian) we usually mean a 
graph with some physical attributes attached 
to its edges and vertices. 


B. Basic Form of the Problem 


Let G=(V,E,6*,6 ) be a graph with vertex 
set V, edge set E, and incidence relations 0*, 
0 :E-V, and let R be the field of real num- 
bers. (Most of the statements in the following 
are valid if we take for R an ordered field or 
ordered additive group more general than the 
field of real numbers.) Furthermore, we regard 
the collection £ of all functions €:E>R asa 
vector space of dimension |E], denoting é(e,) 
by é“ if E={e,,...,e,$. Similarly, the collec- 
tion Q of all functions w: V>R is a vector 
space of dimension |V|. If we define the map- 
ping 6*:V>2* by d*v={e|dte=v' (— 186 
Graph Theory), a linear mapping 0:2-Q is 
naturally introduced through the relation 
(0EM) = Yeest » S(€) — Becs- éle). A vector 65 
is called a flow on G if 6& =0. The dual space 
=* of = and the dual Q* of Q are identified 
with the collection of functions y: ER and 
that of ¢: V>R, respectively, under the obvi- 
ous correspondence. Then the linear mapping 
6:Q* +&*, which is contragredient to 0, is 
defined by means of the relation (6£)(e) = 
C(e*e)—€(é e). A vector ye=*, which is the 
image of ¢ under 4, i.e., which satisfies 7 = 6¢, 
is called a tension on G, and ¢ is called the 
potential on G corresponding to the tension 
n. (Sometimes, €(e) is called a flow in edge e, 
n(e) a tension across e, and ¢(v) a potential at 
vertex v.) 

A continuous curve C (cR x R) on the 
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Euclidean plane R x R is said to be mono- 
tone if (x; —X)(y; —y2)20 for any (x1, y,), 
(X2,¥2)€C. A monotone curve C is called a 
characteristic curve if its projection to each 
coordinate axis, i.e., C, = {x|(x, y)e C} and 
C,={yl(x, ye C}, is a closed interval. For a 
characteristic curve C, two convex functions, 
g(x) =[E(y—yo)dx and W(y)= [2 —xo)dy, 
are defined, where (xo, Vo) is a point fixed on 
C and the integrals are taken along (x, y)EC. 
(It is understood that g(x)= œ if xC, and 
w(y)= co if yéC,.) These two functions defined 
for a fixed C are conjugate to each other in 
Fenchel’s sense, and they satisfy @(x)+(y)> 
(x —Xo)(y— Yo) for any (x, yjeR x R, where the 
inequality reduced to an equality if and only if 
(x, y)e C (— 88 Convex Analysis, 292 Non- 
linear Programming). 

A network N in network flow theory is a 
graph G to each edge e, whose edge set E = 
{@1,-+-.€,} is given a characteristic curve C*. 
On a network N, the following three problems, 
PI, PII, and PIII, are defined. PI: Find a flow 
€ that minimizes O(€)= >" -| o, (€*) under 
the constraints €* = &(e, Je C* (k=1,...,n). 

PII: Find a tension y which minimizes ¥(y)= 
Zr -1 9*(y,) under the constraints n, =n(e,)€ 
Cy (k=1,...,). PII: Find a pair (€,7) of a 
flow č and a tension y such that (E*,4,.)e€C* for 
all e=1,...,”. (With respect to PI or PII, a 
flow or a tension satisfying the “constraints” 
is ordinarily called a feasible flow or tension, 
respectively.) Then, as a special case of the 
(Karush-)Kuhn-Tucker theorem and the dual- 
ity theorem in nonlinear programming, we 
have the following theorems (A) and (B). 

(A) For a given network N, one and only 
one of the following four alternatives is the 
case: 

(i) There is a feasible flow in PI, and at the 
same time, there is a feasible tension in PII. In 
this case, both PI and PII have a solution, 
and, for any solution £ of PI and any solution 
4 of PII, the pair (ê, 4) is a solution of PIII; 
and, conversely, for any solution (ê, 4) of PHI, 
the flow £ is a solution of PI and the tension 
is a solution of PII. 

(ii) There is a feasible tension in PII, whereas 
there is no feasible flow in PI. In this case, 
®(¢) of PII is not bounded below for feasible 
tensions č. 

(iii) There is a feasible tension in PII, whereas 
there is no feasible flow in PI. In this case, 
‘Y(y) of PH is not bounded below for feasible 
tensions y. 

(iv) There is neither a feasible flow in PI nora 
feasible tension in PII. 

(B) Let CS =[b*, c*] (b* <c*; b" can be —o, 
and c*, 00) and C; =[d,, Jk] (dẹ can be —o0, 
and f,, 00). Then a necessary and sufficient 
condition for PI to have a feasible flow is that 
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E, c“— $b“ >0 for every cutset (cocycle or 
cocircuit) w of G, where the summation È; 
(resp. È) is taken over all the edges e, (€ E) 
that lie in w in the positive (resp. negative) 
direction. Similarly, a necessary and sufficient 
condition for PII to have a feasible tension ts 
that ©, fk- Ład, 20 for every tieset (cycle or 
circuit) 0 of G, where $, (resp. È) is taken 
over all the edges e, that lie in @ in the posi- 
tive (resp. negative) direction. 


C. Shortest Paths, Maximum Flows, and 
Minimum-Cost Flows 


We choose one of the edges of G, say e,, as the 
reference edge and assign it the parametric 
characteristic curve C! <a) = {(x, y| y=x+a}. 
Then if there is a feasible flow on the network 
obtained from G by contracting (i.e., short- 
circuiting) e,, and if at the same time there is a 
feasible tension on the network obtained from 
G by deleting (i.e., open-circuiting) e,, then 
problem PHI has a solution (êca), <a>) for 
every real a, and <a) and 4, <a> are uniquely 
determined for each «. The problem of deter- 
mining these parametric solutions ts the two- 
terminal problem for the two-terminal network 
N, (which is obtained from G by deleting e,) 
with the vertex 0” e, as the entrance (or source) 
and the vertex 6*e, as the exit (or sink). The 
curve C={(E!a), 4, <x>)|aeR} enjoys the 
properties of a characteristic curve, and is 
called the two-terminal characteristic of N, 
with respect to the entrance 0” e, and exit 
ôte. A two-terminal network for which only 
the projections to the x-axis Ck =[b*,c*] of 
the edge characteristics are specified (x = 2, 
...,n) is called a capacitated network, and the 
maximum-flow problem for a capacitated 
network N, can be mathematically formulated 
as the problem of determining the projection 
to the x-axis C,=[b,c] of the two-terminal 
characteristics of N,. For the maximum-flow 
problem, the relation c=min{>c*—Xb*} 
holds, where the minimum is taken over all 
the cutsets that contain e, in the negative di- 
rection and where X; (resp. £3) denotes the 
summation over all the edges, except e,, lying 
in a cutset in the positive (resp. negative) direc- 
tion. This relation is called the maximum-flow 
minimum-cut theorem. (A similar relation 
holds also for b.) 

Similarly, or dually, the problem of deter- 
mining the y-projection C,=[d, f] of the two- 
terminal characteristic of a two-terminal net- 
work N; for which the y-projections Cf = 
[du fa] of the characteristics are specified to 
the nonreference edges e, (k=2,...,n)isa 
network flow formulation of the shortest-path 
problem. For the shortest-path problem, the 
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relation f=min{’, f,-—25d,}, called the 
maximum-separation minimum-distance theo- 
rem, holds, where the minimum is taken over 
all the tiesets that contain e, in the positive 
direction and where $; (resp. £3) denotes 
the summation over all the edges, except e,, 
lying in a tieset in the positive (resp. nega- 
tive) direction. 

The minimum-cost flow problem is to deter- 
mine the two-terminal characteristic of N, 
when all the C* (x #1) are of staircase form. 

A number of algorithms exist for time com- 
plexity (— 71 Complexity of Computations) 
O(|V|3) for the shortest-path problem [4]; the 
algorithm proposed by E. W. Dijkstra [7] for 
the case d, <0 <f, for all «(4 1) is of complex- 
ity O(|V]?). The algorithm of A. V. Karzanov 
[8,9] for the maximum-flow problem is of 
complexity O({V|°*). The minimum-cost flow 
problem can be solved by alternately solving 
the subproblems of the shortest-path type and 
those of the maximum-flow type, but no al- 
gorithm of time complexity polynomial in | E| 
and |V| has been found. 


D. Transportation and Scheduling 


Let us make the edges of a graph G=(V, E, 
é*,0~) correspond to the transportation 
routes and the flow to the stream of a com- 
modity; impose a capacity constraint of the 
form 0<€*<c* on the flow č“ in each edge e,, 
and assume a cost function of the form ¢,(é") 
= f,.¢" for each edge. (c* is a constant called 
the capacity of edge e,, and f, is a constant 
called the unit cost of edge e,..) Furthermore, 
let us specify a subset V, (c V) of vertices as 
the set of entrances and another subset V, 
(cV) as the set of exits, where V, N V =Ø, 
and prescribe the amount of inflow q(v) to 
each entrance ve V; and the amount of outflow 
q(v) from each exit ve V,, where we must 
have Liev, (0) = Lev, qv). Planning a trans- 
portation plan that satisfies all the above- 
prescribed conditions and that minimizes the 
total cost Eer 9,(€") can be reduced to finding 
a minimum-cost flow on the extended network 
G=(V.E,6*,67), defined as follows, such that 
the flow in the reference edge is to be maxi- 
mum: V=VU {st U{t} (s,t¢ V), E=EUE, U 
E-U feo} (E,NE=E,NE=@, e,¢ EVE, UE); 
eo is the reference edge, de, =t, 0- eg =s; 
E, ={e|éte=s, 6-e=p, ve Vi}, Ce)={0,y)| 
y<0}U{{(x, 0)|0<x<q(de)} U {(4(v), y)| 
0< y}, where 0” e=veV,; E,={e|d*e=p, 
6 e=t,veV}, Cle)={(0, y)|y<O}U {(x,0)]0< 
x <q(v)} U{(q(v), y)]0 <y}, where Ot e=ve V5), 
§* | p=0%, Ce) ={O,WlV< fe} ULC fol 
O<x<c*}U{(c*, y)|y>f.} for ee E. 

For the project-scheduling problem with an 
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acyclic graph G=(V,E,d*,0 ) as the arrow 
diagram for which the start node se V and the 
completion node are specified, we make an 
edge e(e E) correspond to a job (or activity), 
and a vertex correspond to the event that 

all the jobs corresponding to 6” v have been 
finished, whereas those corresponding to 6*v 
are ready to commence. Then we interpret 
the negative of the tension of an edge as the 
time (i.e., duration) spent on the correspond- 
ing job and the potential at a vertex as the 
time (i.e., instant) of the corresponding event 
taking place. Furthermore, we assume that 

to each edge e, (or the corresponding job) are 
given the normal duration —d,(> 0), the crash 
duration — f,(>0, < —d,), and the unit cost 
c*(>0) we have to pay in order to decrease 
the job time by one unit, where the job time 
necessarily lies between the normal and the 
crash duration. Under the above-listed speci- 
fications, we consider the relation between the 
total project time (i.e., the duration from the 
event corresponding to the start node to that 
corresponding to the completion node) and 
the extra cost to be paid for decreasing the 
project time below the normal one. This prob- 
lem can be reduced to the two-terminal char- 
acteristic problem for the following network 
G: G is obtained from G by adding the refer- 
ence edge e, (ft eo =t, dey = s); C* = {(0, y)| 
yde} U {(x, d 0x sc} UL, y lde SyS 
S U fale" <x}. 


E. Applications to Combinatorial Optimization 


Many problems in combinatorial optimization 
can be reduced to network flow problems. The 
problem of finding a maximum matching on a 
bipartite graph G (— 186 Graph Theory) is 
reduced to the maximum-flow problem for the 
graph representing the transportation prob- 
lem on G with one of the two vertex sets as 
the entrance set and the other as the exit set, 
where all the edges have a unit capacity and 
the amount of inflow/outflow to/from each 
entrance/exit vertex is equal to unity (the cost 
being irrelevant). (The existence of an integer 
solution to this kind of maximum-flow prob- 
lem is proved constructively on the basis of 
the solution algorithm.) The maximum-flow 
minimum-cut theorem in this particular case 
can be stated as follows: The maximum car- 
dinality of matchings on a bipartite graph 

is equal to the minimum cardinality of ver- 
tex subsets which cover all the edges (this is 
known as the König-Egerváry theorem). The 
Dilworth theorem for a partially ordered set, 
the criterion for the existence of a graph with 
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prescribed vertex degrees, etc., are known to 
be reducible to network flow problems [2]. 


F. Generalizations 


The network flow problem may be general- 
ized in various directions. Replacing a graph 
by a matroid (— 66 Combinatorics) or con- 
sidering stronger conditions on the feasibility 
of flows and tensions are natural extensions 
[4, 6, 10, 11]. Another extension is to con- 
sider several kinds of flow, instead of a single 
kind, that simultaneously affect the capacities 
of edges. This latter problem is called the 
multicommodity flow problem in contrast to 
the single-commodity flow problem that was 
treated above [5]. It.can be said that any ex- 
tension of the network flow problem aims at 
a mathematical model that has wider appli- 
cation without losing the advantage of having 
simple effective solution algorithms. 
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282 (XX.17) 
Networks 


A. Linear Graphs (— 186 Graph Theory) 


A linear graph (or simply graph) ts an object 
composed of (i) a finite set {B,} (k= 1,...,n) 
of elements called branches (or edges), (ii) a 
finite set {N,} (a=1,...,m) of elements called 
nodes (or vertices), and (iti) an incidence rela- 
tion between branches and nodes represented 
by a function [B,: N,] from {B,.} x {N,} to 
{0, 1, —1} such that for every x, there exist 
exactly one N, with [B,:N,]=1 and exactly 
one N, with [B,:.N,] = —1, with all the other 
[B,:N,] equal to 0. (Intuitively, a branch B, 
starts from the node N, with [B,.:N,]=1 and 
ends at the node N, with [B,:N,]=—1.) In 
terms of topology, a (linear) graph is a 1- 
dimensional finite tsimplicial complex (— 70 
Complexes, 186 Graph Theory). A network 
in the wide sense is a linear graph whose 
branches and nodes are endowed with some 
physical properties. 


B. Networks 


A contact network, one of the simplest kinds of 
networks, is an abstraction of a circuit whose 
branches correspond to contact points of 
relays and switches that are allowed to take a 
finite number of physical states, e.g., the two 
states “on” and “off.” The theory of contact 
networks is developed by means of Boolean 
algebra and is applied to switching networks, 
such as telephone exchange networks and the 
logical networks of digital computers. 

In most cases, to the branches B, of a net- 
work two kinds of real quantities i, and E, are 
assigned (which may be variables or functions 
of time) satisfying the conditions: 


© F [B:N] =0 (a=1,...,m) 
K=İ 


(ii) there exist E, such that 


mM: 


[B,:N,JE,=E, (k=1...,n). 


ll 
m 


a 


In the case of electric networks, where i,, Ex, 
and £E, are the current in branch B,,, the vol- 
tage across branch B,.,, and the potential at 
node N,, respectively, these conditions are 
known as Kirchhoff’s laws. 

The network flow problem, which has a 
number of practically important applications 
in toperations research (e.g., transportation 
problems, project-scheduling problems) and is 
a special case of tmathematical programming, 
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can be formulated as the problem of minimiz- 
ing >”, f.(i,) under condition (i) (or minimiz- 
ing Xt- f(E) under condition (ii)), where for 
each branch B,, fx is a given fconvex function 
defined on a given interval [a,, bx]. 


C. Electric Networks 


Since there has been a great deal of research 
on electric networks, “network” often means 
“electric network.” We call a branch in which 
the current is a given function of time a cur- 
rent source, and one across which the voltage 
is a given function of time a voltage source. 
A branch that is either a current source or a 
voltage source is called a source branch. A 
network with M source branches is called an 
M-port network. If the currents i, in and the 
voltages E, across the non-source branches 
(n’ in number) are related by 


where the z,, or y,, are linear tintegrodifferen- 
tial operators, the network is said to be linear. 
If Z k= Zka OF Yea = Y2 it is said to be recipro- 
cal or bilateral; if the z,., or y,, are invariant 
under the change of the origin of time, it is 
said to be time-invariant; and if a linear time- 
invariant network satisfies the condition 


t 
| X i(TE,(t)dt <0 
-œ ByeS 
for every t and for every choice of functions 

of time for i, or E, associated with the source 
branches B, in S, provided that the current- 
voltage relations for non-source branches are 
satisfied, then the network is said to be passive. 
Under certain nonsingularity conditions, for 
the currents in and the voltages across the 
source branches (denoted by 1, and e,) of a 
linear M-port network, we have the relations 


e= X Zelo h= £ Yea (B.S), 
BES B,eS 


where the matrices Z,, and Y,, of linear inte- 
grodifferential operators are called the port- 
impedance matrix and the port-admittance 
matrix of the network, respectively. Analysis 
determines Z,,,, Y,, from a given linear graph 
and given z,,, Yxa, while synthesis finds a net- 
work (i.e., Z,; OF V,,, aS well as a linear graph) 
when part of Zka, Y¥,.,, or some relations to be 
satisfied by them are given. (In synthesis, the 
Z,, OF y,, are usually confined to some special 
class.) In analysis as well as synthesis we usu- 
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ally deal with the *Laplace transforms Z,,,(s), 
VcalS)s ZealS), Yea(s) instead of Zeis Yeas Zua Yes 
themselves, where the characteristics Z,. ,(s), 
Y,,(s) of a network are determined by the 
topological properties of its linear graph and 
the properties of Z,,(s), ¥,,(s) as functions of 
the complex variable s. (If œ is the angular 
frequency, s = ia.) 

The following fundamental facts are known 
in regard to analysis and synthesis: 

(1) If Z,.,(s), ¥,,(s) are rational functions of s 
holomorphic on the open right half-plane, a 
necessary and sufficient condition for a net- 
work to be passive is that for arbitrary real 
numbers č% 

X. Ee, Z,4(S) or X Caba Y,a(s) 

Bp, BES B,, B ES 

is a positive real function of s, i.e., a function 
whose value is real when s is real and lies on 
the right half-plane when s lies on the right 
half-plane. 

(2) Every passive one-port network can 
be synthesized by using a finite number of 
three kinds of branches, 1.e., positive resistors 
(Ex =R,,i,,R,>0), positive capacitors (i, = 
C,dE,./dt, C, > 0), and positive inductors 
(E. = L,,.di,,/dt, L> 0). 

(3) Every passive M-port network (M 22) 
can be synthesized by using, in addition to the 
three kinds of branches mentioned in (2), ideal 
transformers (an ideal transformer is a pair of 
branches (B,, B,) such that i, =ni}, E,=nE,,, 
and n=real number) and ideal gyrators (an 
ideal gyrator is a pair of branches (B,,, B,) such 
that i, = E}, i, = —E,); ideal gyrators are not 
needed, however, to synthesize a reciprocal 
network. 

However, very little is known about the 
synthesis of passive M-port networks without 
using ideal transformers and gyrators. Topo- 
logical methods are expected to be powerful 
for such synthesis problems. 

The linear graph structure of a network 
loses its significance if ideal transformers are 
admitted. 
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283 (XXI.37) 
Newton, Isaac 


Sir Isaac Newton (December 25, 1642—March 
20, 1727), the English mathematician and 
physicist, was born into a family of farmers in 
Woolsthorpe, Lincolnshire. In 1661, he entered 
Cambridge University, where he was greatly 
influenced by the professor who was teaching 
geometry, I. Barrow, and where he began 
research in Kepler’s optics and Descartes’s 
geometry. 

In 1665, he discovered the tbinomial theo- 
rem, and in the same year, during a stay at 
his birthplace to escape the plague, he began 
work on his three great discovertes—the spec- 
tral decomposition of light, the universal law 
of gravity, and differential and integral cal- 
culus. He returned to Cambridge University in 
1667, and in the following year invented the 
reflecting telescope and proposed his theory 
of light particles. During this period, he suc- 
ceeded Barrow as professor and lectured on 
optics. At the same time, he probed deeper 
into the calculus. Guided by Barrow’s insight 
that differentiation and integration were in- 
verse operations and also by his own research 
on infinite series, Newton obtained the tfunda- 
mental theorem of calculus. Leibniz obtained 
the same theorem a little later, and a struggle 
resulted between the two over priority. The 
two discoveries were independent, but because 
Leibniz’s notation was superior, the later 
development of calculus owes more to him. 
The dynamic elucidation of the heliocentric 
theory was accomplished in Newton’s main 
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work, Principia mathematica philosophiae 
naturalis (1686-1687), in which Kepler’s law 
on the movement of the planets, Galileo’s 
theory of movement, and Huygens’s theory 

of oscillation were unified into the three laws 
of Newtonian dynamics. These natural laws, 
which deal with all dynamic phenomena in the 
universe, are the most superlative realization 
of Descartes’s concept of exploring the mathe- 
matical structure of nature; they had an es- 
sential influence on the later development of 
the natural sciences. The style of writing is 
similar to that in Euclid’s Stoicheia. In the 
Principia, Newton also sets forth his philo- 
sophical position. 

In 1695, Newton moved to London and 
became engrossed in theology. He was ap- 
pointed Master of the Mint and was presi- 
dent of the Royal Society from 1703 until his 
death. While he is sometimes said to have 
divorced himself from science, many of his 
notes on geometry date from this time. 
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284 (I11.12) 
Noetherian Rings 


A. General Remarks 


In this article, we mean by ring a commutative 
ring with unity element. Thus a Noetherian 
ring is a commutative ring with unity element 
that satisfies the tmaximum condition for its 
ideals; if it is also an tintegral domain, then it 
is called a Noetherian integral domain or 
Noetherian domain (for right and left Noe- 
therian rings — 368 Rings F). 

A ring is Noetherian if and only if every 
prime ideal of the ring has a ‘finite basis 
(Cohen’s theorem). A ring is Noetherian if it is 
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generated by a finite number of elements over 
a Noetherian ring (Hilbert’s basis theorem). 
The following three conditions for a ring R 
are equivalent: (i) R is an Artinian ring, that 
is, it satisfies the tminimum condition for its 
ideals (for right and left Artinian rings — 368 
Rings F). (ii) R is a Noetherian ring and every 
prime ideal of R is a maximal ideal. (iit) There 
exist a finite number of Noetherian rings 
R,i=1,...,n) whose fmaximal ideals are 
tnilpotent such that R is the direct sum of 

the R,{(i=1,...,n). We say that the restricted 
minimum condition holds in a ring R if R/a is 
an Artinian ring for every nonzero ideal a of 
R; the latter condition is satisfied if and only if 
R is either an Artinian ring or a Noetherian 
domain of tKrull dimension 1. Every ideal of 
a Noetherian ring R can be expressed as the 
intersection of a finite number of tprimary 
ideals. Given a ring R and an R-module M,a 
submodule P of M is said to be a primary 
submodule of M if every element a of R that is 
a zero divisor with respect to M/P (i.e., there 
exists an me M/P such that m40 and am=() 
is nilpotent with respect to M/P (i.e., there 
exists a natural number n such that a"(M/P) 
=). 

The property of ideals in Noetherian rings 
stated above can be generalized to the case of 
Noetherian modules: If an R-module M is a 
*Noetherian module, then every submodule of 
M can be expressed as the intersection of a 
finite number of primary submodules. 

Let R be a Noetherian ring, a an ideal of R, 
M a finite R-module, and N and N’ submod- 
ules of M. Then we have (1) the Artin-Rees 
lemma: There exists a natural number r such 
that for all n>r, NA N' =a" "(a NAN’). 
(ii) Krull’s intersection theorem: { \°., a"M = 
{me M | Jaca such that (1—a)m=0} (hence, 
in particular, if m is the Jacobson radical or 
R, then (),2., m"M = {0}. (iii) KrulPs altitude 
theorem: If a is generated by s elements and 
p is a ‘minimal prime divisor of a, then the 
height of p<s. 


B. Topology Defined by an Ideal 


Let R be a ring, a an ideal of R, and M an R- 
module. Then the a-adic topology of M is 
defined to be the topology on M such that 
{a"M |n=1,2,...} is a tbase for the neighbor- 
hood system of zero. In particular, let R be 
Noetherian, M a finite R-module, and Na 
submodule of M. Then by the Artin-Rees 
lemma, the a-adic topology of N coincides 
with the topology on N as a subspace of M 
with the a-adic topology. Returning to the 
general case, M is a tT)-space (under the a- 
adic topology) if and only if (\2, a"M = {0}, 
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or, in other words, if and only if M is a tmetric 
space, where the tdistance d(a, b) between 
points a, b in M is defined to be inf{2™"|a— 
bea"M}. Moreover, if N is a submodule of 

M, then M/N is a T,-space if and only if N 

is a closed subset of M, that is, if and only 

if (e (N +a"M)=N. A sequence {a,} = 
(a,,4,...,4,,..-) is called a Cauchy sequence 
(under the a-adic topology) if Yn IN Yr Ys 

(with each of them a natural number) ay,, 
—dy.,€a"M; for this it is sufficient that 

Vn dN Vr ayi,41 —An+,€0"M. If this sequence 
converges to zero (i.e., Vn IN Vr ay,,€a"M), 

it is called a null sequence. The set W of all 
Cauchy sequences in M becomes an R-module 
if we define their sum and multiplication by an 
element of R by {a,}+ {b,} ={a,+,}, cay} = 
{ca,}. Then the set N of all null sequences is 

a submodule of M. An element m of M is 
identified with the sequence (m,m, ...,m,...), 
and we regard M as a submodule of Wt. Then 
the R-module M =Wt/M is the tcompletion of 
MMC), a"M) (as a metric space under the a- 
adic topology). M is called the a-adic com- 
pletion of M. If a has a finite basis, then the 
topology of M (as the completion) coincides 
with its a-adic topology. If M=R, we define a 
multiplication in W by {a,} {b,} ={a,b,}. In 
this case, Vt is a ring in which M is an ideal, 
and hence the completion R= M is a ring. If 

a= ©; a,;R, then considering the tring of 
formal power series R = R[[x,,...,x,]] and its 
ideal fi=(\2,(X4- (a,x) R+a"R), we have 
Ræ Rà. 


C. Zariski Rings 


For a Noetherian ring R and an ideal a of R, 
every element b of R such that 1—bea has an 
inverse in R if and only if every ideal of R is a 
closed subset of R under the a-adic topology. 
When this condition is satisfied, the ring R 
with the a-adic topology is called a Zariski 
ring; we often express this by saying that (R, a) 
is a Zariski ring. A Zariski ring is called com- 
plete if it is a complete topological space. The 
completion R of the Zariski ring (R, a) has the 
aR-adic topology, and (R, aR) is a Zariski 
ring. Furthermore, (i) R is a tfaithfully flat R- 
module; and (ii) when N is a submodule of a 
finite R-module M, then N is a closed sub- 
space of M (under their a-adic topologies), and 
their completions are identified with N ® kR, 
M QRR. 


D. Local Rings 


Suppose that R is a Noetherian ring having 
only a finite number of maximal ideals and J is 
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the Jacobson radical of R. Then the Zariski 
ring (R, J) is called a semilocal ring. Further- 
more, if R has only one maximal ideal, then 
(R, J) is called a local ring. A ring that has only 
a finite number of maximal ideals is called a 
quasisemilocal ring; if it has only one maximal 
ideal, it is called a quasilocal ring. (In some 
literature, the terms local ring and semilocal 
ring are used under weaker conditions; in the 
weakest case, quasisemilocal rings and quasi- 
local rings are simply called semilocal rings 
and local rings, respectively, and local rings 
and semilocal rings in our sense are called 
Noetherian local rings and Noetherian semi- 
local rings, respectively.) 

Assume that R is a semilocal ring with max- 
imal ideals m,,...,m,, and the Jacobson rad- 
ical J=m; N... Nm,. For every finite R- 
module M, we introduce the J-adic topology 
as its natural topology. The completion R of R 
is a semilocal ring with maximal ideals m, R, 
...,m1,R and is naturally isomorphic to the 
direct sum of the completions of the local rings 
Rm; (i= 1, ...,). Since R is a Zariski ring, (1) R 
is faithfully flat; (2) submodules of M are 
closed subsets of M; and (3) the completion of 
M is identified with M @ gR. If (R, m).is a 
complete local ring (i.e., a local ring and a 
complete Zariski ring at the same time), then 
R contains a subring J with the following 
properties: (i) J is a complete local ring, and 
Im 1) = R/m; and (ii) for the tcharacteristic p 
of R/m (p is either zero or a prime number), 
mNZ=pI. Therefore, if m is generated by n 
elements, then R is a homomorphic image of 
the ring of formal power series in n variables 
over I. This theorem is called the structure 
theorem of complete local rings, and 7 is called 
a coefficient ring of R. If R contains a field, 
then 7 is a field, called a coefficient field of R. 
A complete local ring is a *Hensel ring. 

When (R, m) is a local ring and 37_, x;R 
is m-primary, then we have an inequality 
r>(*Krull dimension of R); if the equality 
holds, then we say that x,,...,x, form a sys- 
tem of parameters of R. Furthermore, if m= 
Di-1x;R, then we say that x,,...,x, forma 
regular system of parameters of R. A local ring 
that has a regular system of parameters is 
called a regular local ring (cf. ‘Jacobian crite- 
rion). A regular local ring is a tunique factori- 
zation domain. Let d be the Krull dimension 
of a local ring (R, m). Then R is a regular local 
ring if and only if one of the following holds: 
(1) every R-module has finite thomological 
dimension, (2) every R-module has homolog- 
ical dimension of at most d, or (3) the homo- 
logical dimension of R/m (as an R-module) is 
finite (and actually coincides with d). A Noe- 
therian ring R’ is called a regular ring if Riis 
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a regular local ring for every prime ideal p’. A 
regular local ring is a regular ring. 

Consider a local ring (R, m) and an m- 
primary ideal q. Then the length /(n) of R/q” 
(as an R-module) is a function of n. For a 
sufficiently large n, the length /(n) can be ex- 
pressed as a polynomial f(n) in n with rational 
coefficients. The degree of f(n) coincides with 
the Krull dimension d of R. The multiplicity of 
q is (coefficient of nf in f(n)) x (d!). If x4, ..-, Xa 
form a system of parameters, then (multi- 
plicity of D4, x;R)< (length of R/È x;R); if 
the equality holds, then we call x}, ..., x4 a 
distinct system of parameters. A local ring 
that has a distinct system of parameters is 
called a Macaulay local ring. A local ring is a 
Macaulay local ring if and only if one of the 
following holds: (1) every system of parameters 
is a distinct system of parameters, or (2) if an 
ideal a of height s is generated by s elements, 
then every fprime divisor of a is of height s. A 
regular local ring is a Macaulay local ring. 

The notion of multiplicity can be also de- 
fined in general Noetherian rings [4]. Let R be 
a Noetherian ring. If R,, is a Macaulay local 
ring for every maximal ideal m, then R is 
called a locally Macaulay ring. Furthermore, if 
height m= Krull dim R for every maximal 
ideal m, then R is called a Macaulay ring. If R 
is a locally Macaulay ring, then the poly- 
nomial ring tn a finite number of variables 
over R is also a locally Macaulay ring. In 
general, an ideal a is called an unmixed ideal 
(or pure ideal) if the height of every prime 
divisor of a coincides with height a; otherwise, 
a is called a mixed ideal. Thus if R is a locally 
Macaulay ring, an ideal a of the polynomial 
ring R[X,,...,x,] over R is generated by r 
elements, and height a=r, then a is unmixed 
(unmixedness theorem). 

If the completion of a local ring R is a tnor- 
mal ring, then we say that R is analytically 
normal. If the completion of a semilocal ring R 
has no nilpotent element except zero, then we 
say that R is analytically unramified. A semi- 
local integral domain R which is a ring of 
quotients of a finitely generated ring over a 
field is analytically unramified. If R is a normal 
local ring, then R is analytically normal (O. 
Zariski). 

Let (R, m) be a local ring, and let q be an m- 
primary ideal. Set F,=q'/qit! (i=0,1,2,..., 
q° = R). Let a=a’ (mod qi*!)eF, and b=b’ 
(mod q/*')eF,. We put ab=a'b' (modq'**')e 
F;,;. Then the direct sum of modules F = 
Lic F; becomes a graded ring generated by 
F, over Fy, in which F; is the module of homo- 
geneous elements of degree i. F, called the 
form ring (or associated graded ring) of R with 
respect to q, plays an important role in the 
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theory of local rings, particularly in the theory 
of multiplicity. 


E. Chains of Prime Ideals 


Let R be a Noetherian ring with prime ideals 
p, q such that pcq. Consider the length n of a 
chain of prime ideals p=po Spi G... FP, = 

q which cannot be refined any more. It is 

not true in general that n is uniquely deter- 
mined by p and q (M. Nagata). However, n is 
uniquely determined for a rather large class of 
Noetherian rings, for instance the rings that 
are homomorphic images of locally Macaulay 
rings and, in particular, finitely generated rings 
over a ‘Dedekind domain. 


F. Integral Closures 


Let R be a Noetherian integral domain with 
the field of quotients k, let K be a finite al- 
gebraic extension of k, and let R be the tin- 
tegral closure of R in K. Then (i) If R is of 
Krull dimension 1, then for an arbitrary ring 
R’ such that RC R'c K and for every nonzero 
ideal a’ of R’, the quotient R’/a’ is a finite 
R/(a' N R)-module. In particular, R’ is a Noe- 
therian domain satisfying the restricted mini- 
mum condition. (it) If R is of Krull dimension 
2, then R is Noetherian. (iii) In general, R is a 
‘Krull ring, and for an arbitrary prime ideal p 
of R there are only a finite number of prime 
ideals p of R such that p=HNR. For any 

of such , the field of quotients of R/p is a 
finite algebraic extension of that of R/p. Result 
(i) is called the Krull-Akizuki theorem. We say 
that R satisfies the finiteness condition for 
integral extensions if R is a finite R-module for 
any choice of K. A Noetherian ring R is called 
a pseudogeometric ring, or a universally Japa- 
nese ring, if R/p satisfies the finiteness con- 
dition for integral extensions for every prime 
ideal p. A ring is pseudogeometric if it is gen- 
erated by a finite number of elements over a 
pseudogeometric ring. 


G. History 


J. W. R. Dedekind first introduced the concept 
of ideals in the theory of integers. The main 
objects studied in ring theory were subrings of 
number fields or function fields until M. Sono 
(Mem. Coll. Sci. Univ. Kyoto, 2 (1917), 3 (1918— 
1919)) originated an abstract study of Dede- 
kind domains, which was followed by E. 
Noether (Math. Ann., 83 (1921), 96 (1926)), 
who originated the theory of Noetherian rings. 
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W. Krull made further important contri- 
butions to the development of the theory of 
Noetherian rings and general commutative 
rings [1]. Many other authors, including E. 
Artin, Y. Akizuki, and S. Mori, also contri- 
buted to the theory. The theory of local rings 
was originated by Krull (J. Reine Angew. 
Math., 179 (1938)) and developed by C. Che- 
valley (Ann. Math., 44 (1943)), I. S. Cohen 
(Trans. Amer. Math. Soc., 59 (1946)), and Zari- 
ski (Ann. Inst. Fourier, 2 (1950)), and later by 
many authors, including P. Samuel, Nagata, 
M. Auslander, D. A. Buchsbaum, and J.-P. 
Serre [4]. The theory of Noetherian rings is 
applied to algebraic geometry. 
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A. History 


The validity of the fifth postulate of Euclid’s 
Elements, the taxiom of parallels, has been a 
subject of argument ever since it was for- 
mulated (— 139 Euclidean Geometry). At the 
beginning of the 18th century, G. Saccheri 
tried to prove the postulate by assuming the 
validity of other axioms. Under the hypothesis 
that the axiom does not hold, he deduced 
various extraordinary results. Although he 
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was mistaken in thinking that he had ob- 
tained a contradiction, his work is regarded 
as a forerunner to the study of non-Euclidean 
geometry. 

At the beginning of the 19th century, N. I. 
Lobachevskii and J. Bolyai opened up the 
impasse by establishing a geometry based on 
postulates that contradict the fifth postulate. 
This geometry is called hyperbolic geometry or 
Lobachevskii’s non-Euclidean geometry. Actu- 
ally, a similar idea had been conceived by C. F. 
Gauss, but he refrained from publishing it 
because of likely misunderstanding by a public 
still strongly influenced by I. Kant’s philoso- 
phy. On the other hand, B. Riemann con- 
structed so-called elliptic geometry (or Rie- 
mann’s non-Euclidean geometry), which is 
different from both Euclidean and hyperbolic 
geometry. Euclidean geometry (including the 
theory of similarity) is sometimes called para- 
bolic geometry. In general, a space satisfying 
axioms that contradict Euclid’s postulates is 
called a non-Euclidean space. 

Around the turn of the 20th century, A. 
Cayley, F. Klein, and H. Poincaré constructed 
models of non-Euclidean spaces that are sub- 
sets of Euclidean spaces, and E. Beltrami con- 
structed a differential geometric model. By 
means of these models, it was established that 
non-Euclidean geometries are consistent as 
long as there is no inconsistency in the under- 
lying Euclidean geometries. On the other 
hand, D. Hilbert established a complete system 
of axioms for Euclidean geometry and showed, 
by constructing non-Euclidean models, that 
the axiom of parallels is independent of the 
other axioms (— 155 Foundations of Geom- 
etry). The logical foundation of non-Euclidean 
geometries was thus clarified. Moreover, A. 
Einstein showed in his ‘theory of relativity 
that actual space-time does not satisfy Eu- 
clidean axioms. Together with Euclidean 
spaces, non-Euclidean spaces are often used 
as fundamental models both in the problem of 
tspace forms and in the theory of tsymmetric 
spaces. 


B. Axiomatic Considerations 


Hilbert’s system of axioms of plane Euclidean 
geometry consists of axioms of incidence, 
order, congruence, parallels, and continuity (— 
155 Foundations of Geometry). Specifically, 
the axiom of parallels is stated as follows: 
Suppose we are given a straight line and a 
point in a plane. If the straight line does not 
contain the point, then there exists only one 
line through the point that does not intersect 
the line. 
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The system of axioms of hyperbolic geome- 
try is obtained by replacing the axiom of par- 
allels by the following: Suppose we are given a 
straight line and a point in a plane. If the line 
does not contain the point, then there exist at 
least two lines that pass through the point 
without intersecting the line. (The other four 
groups of axioms are unaltered.) In this case, if 
lis a given line that does not contain a given 
point C, then there exist exactly two lines 
parallel to / that pass through C. We denote 
them by X Yand X’Y’, and they are character- 
ized as follows (Fig. 1): Any line that passes 
through C and lies in 4 X’CY necessarily 
intersects /; by contrast, neither the two lines 
XY, X’Y' nor any line in L X CX’ intersects l. 
Euclidean geometry can be considered as a 
“limit” of this geometry where the lines X Y 
and X’Y’ coincide. 


Xx’ Y 
l 
Fig. 1 


In elliptic geometry, the axiom of parallels is 
replaced by the following: Suppose we are 
given a straight line and a point in a plane. If 
the line does not contain the point, then any 
line passing through the point intersects the 
line. In this case, the lines are closed curves, 
and the axioms of order must be modified. 
Specifically, in Euclidean geometry, the axioms 
of order are based on the notion of a point A 
lying between points B and C, where A, B, C 
are distinct points on a line. In elliptic geome- 
try, however, to define order we utilize the 
notion of a pair A, C of points separating 
another pair B, D (and vice versa), where A, B, 
C, D are distinct points on a line. The axioms 
of order are modified accordingly. 

The sum of inner angles of a triangle is 
smaller or greater than two right angles ac- 
cording as we use the axioms of hyperbolic or 
elliptic geometry. 


C. The Projective-Geometric Point of View 


We take tprojective coordinates in an n- 
dimensional, real tprojective space P” and 
consider a quadric hypersurface defined by 
Q:axp+xi+...4+x2=0, a40(— 90 Coordi- 
nates; 343 Projective Geometry). We denote by 
G the group consisting of the totality of pro- 
jective transformations of P" that leave Q 
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invariant. We call Q the absolute and call G 
the group of congruent transformations. When 
a<0, then this Q is a real quadric hypersur- 
face. In this case, there exists a domain H” (the 
totality of points inside Q) whose boundary 
coincides with Q, and the group G acts ttransi- 
tively on H". The pair {G, H"} provides a 
model of hyperbolic geometry, and the n- 
dimensional hyperbolic space H” is homeo- 
morphic to an n-dimensional open cell. Points 
of H”, points on Q, and points outside Q are 
called ordinary points, points at infinity, and 
ultrainfinite points (or ideal points), respec- 
tively. Two lines on H” are said to be parallel if 
they intersect on the absolute Q. Next, when 
a>O0, the absolute Q is an imaginary quadric 
hypersurface, and the group G acts transitively 
on P”. The pair {G, P"} provides a model of 
elliptic geometry. The n-dimensional elliptic 
space P” is homeomorphic to n-dimensional 
real projective space; hence it is compact. In 
elliptic geometry, any two distinct lines in a 
plane necessarily intersect at a point. The 
above models {G, P"} of non-Euclidean geo- 
metries are called Klein’s models. 

Let {G, H"} be a Klein’s model, A, B distinct 
points in P”, / the line containing A, B, and I, J 
be two points where the line / meets Q (Fig. 

2). If we denote by (A, B, I, J) the tanhar- 
monic ratio of these four points, then the non- 
Euclidean distance p between the points A and 
B is given by p=alog(A, B, I, J), a constant. 
Next, let J, g be lines in H” intersecting at a 
point D. In the plane determined by / and g, 
we draw two imaginary tangents u, v to the 
absolute through D. Denoting by (l, g, u, v) the 
anharmonic ratio of these four lines, the non- 
Euclidean angle @ between the lines / and g is 
given by 0=(1/2i)log(I, g, u, v), i= Ji. Gen- 
erally, let O be a point in the projective space 
P”, and denote by p the tpolar of O with re- 
spect to the absolute. By counting the polar 

p doubly, it can be regarded as a quadric 
hypersurface, which will be denoted by Sy. A 
quadric hypersurface S of H” is called a non- 
Euclidean hypersphere if it belongs to the tpen- 
cil of quadric hypersurfaces determined by Q 
and Sọ. S is called a proper hypersphere, a 
limiting hypersphere, or an equidistant hyper- 
surface according as the center O is an ordi- 
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nary point, a point at infinity, or an ultrain- 
finite point, respectively. In the case of ellip- 
tic geometry {G, P"}, the distance p between 
two points A and B is given by p=(a/i)- 
log(A, B, 1, J) as I, J are imaginary points. 
Moreover, we may get parabolic geometry as 
the “limit” (a> oo) of the geometry given by 
Klein’s models. 


D. The Conformal-Geometric Point of View 


Let S” be an n-dimensional tconformal space. 
If we take suitable (n + 2)-hyperspherical co- 
ordinates in S", the space S” can be realized as 
a quadric hypersurface x7+x}+...+x?— 
2X9X =0 in (n+ 1)-dimensional real projec- 
tive space P"*'. A point in P"*! represents a 
hypersphere in S” (— 76 Conformal Geometry; 
90 Coordinates). We denote by G the group 
consisting of the totality of *MObius trans- 
formations leaving invariant a (real, point, or 
imaginary) hypersphere Q. When Q is a real 
hypersphere, the space S” is divided into Q and 
the two open cells H”, Hy. If we denote by G 
the totality of transformations of the group G 
that do not interchange H” and H}, then G is a 
subgroup of index 2 of G. In this case, each of 
the pairs {G, H"} and {G, Hy} provides a model 
of hyperbolic geometry. When Q is a point 
hypersphere, the space E” obtained from $" by 
omitting the point Q is homeomorphic to an 
open cell, and the pair {G, E"} provides a 
model of parabolic geometry. On the other 


hand, when Q is an imaginary hypersphere, G ` 


is isomorphic to the forthogonal group O(n+ 
1). We call the pair {G, S"\ a spherical geom- 
etry and S” an n-dimensional spherical space. 
In this case two points x and x’ are called 
equivalent if x’ is the image of x by symmetry 
with respect to Q. The space P” obtained 
from S” by identifying equivalent points is 
homeomorphic to the n-dimensional real 
projective space. If we denote by G the group 
obtained from G by making its actions effec- 
tive on P", then G is the factor group of G by a 
cyclic group Z, = Z/2Z. The pair {G, P"} pro- 
vides a model of elliptic geometry. These 
models are called Poincaré’s models. They 
were introduced as a result of research on 
tautomorphic functions in the case n=2. 

In Poincaré’s model, every straight line is 
represented either by a circle orthogonal to Q 
or by a circle passing through Q according as 
Q is a (real or imaginary) hypersphere or a 
point hypersphere. In spherical geometry, 
however, straight lines are usually called great 
circles, and two distinct great circles lying on a 
2-dimensional sphere necessarily intersect at 
two points that are symmetric with respect to 
Q. Also, in Poincaré’s model the distance 
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between two points A and B is defined as 
before, by making use of the anharmonic ratio 
of four points A, B, I, J on a circle (Fig. 3) (— 
74 Complex Numbers G). 


E. The Differential-Geometric Point of View 


An n-dimensional space M of tconstant curva- 
ture is by definition a Riemannian manifold 
whose line element ds is given by 


dx? +dx3+...+dx? 


ds*= 
K 2 2 2 
are (xi +x +... +x) 


with respect to appropriate local coordinates, 
where K is a constant called the tsectional 
curvature (— 364 Riemannian Manifolds). 
According as K is positive, zero, or negative, 
M can be considered locally as an elliptic 
space, Euclidean space, or hyperbolic space, 
respectively. In this case, lines are tgeodesics 
of M, and the non-Euclidean distance and 
non-Euclidean angle are those defined in the 
Riemannian manifold. When n=2, a tsim- 

ply connected and *complete space of posi- 
tive constant curvature is tembedded in 3- 
dimensional Euclidean space as a sphere, and a 
space of negative constant curvature is tlocally 
isometric to a pseudosphere (Fig. 4), which is a 
surface of revolution obtained by rotating a 
ttractrix around its asymptote. tComplete n- 
dimensional spaces of constant curvature 
(n> 2) are called space forms. A simply con- 
nected space form is necessarily one of spher- 
ical space, Euclidean space, or hyperbolic 
space. Each of these is a tuniversal covering 
manifold of a general connected space form 
with a curvature of the same sign, and the 
group of fcovering transformations is isomor- 


Fig. 4 
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phic to a tdiscontinuous subgroup of the 
group of congruent transformations. 

Each of the spaces, Euclidean, non- 
Euclidean, and spherical, is a thomogeneous 
space on which the corresponding group of 
congruent transformations acts transitively. 
Actually, each of these spaces has the structure 
of a tsymmetric Riemannian homogeneous 
space of rank 1. 
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A. General Remarks 


At present, the theory of nonlinear problems is 
still not unified, and many individual results 
obtained for specific classes of problems are 
stated in the languages of the corresponding 
fields of study. However, there are some funda- 
mental facts and methods of a general nature 
concerning nonlinear problems, which may be 
referred to as the subject matter of nonlinear 
functional analysis. 


B. Iterative Methods 


Let X be a ‘Banach space. Consider a non- 
linear mapping G: X ~X and the equation 
Gx=0 (xex). (1) 
Set F =] — G. Then (1) can be written as 

x= Fx. (2) 


F satisfies the Lipschitz condition if there exists 
a constant « such that 


| Fx — Fy|] <allx— yll (3) 
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for all x, yin X. In particular, the mapping F 
is said to be nonexpansive if 0<a<1.If0<a< 
1, then F is called a contraction. (Sometimes 

a nonexpansive mapping is also called a con- 
traction.) A contraction F satisfies the contrac- 
tion principle: F has a unique fixed point Xo, 
and the iteration 


(n=1,2,...) 


Xn+1 = Fx, 


with an arbitrary initial element x, always 
converges to xo [1-3]. Similar results hold for 
a contraction that is defined only on a tball 
and leaves the ball invariant. This leads to 
*Newton’s iterative process and the timplicit 
function theorem. 


C. Methods of Monotonicity 


By definition, a nonlinear mapping G from a 
tHilbert space H to H is a monotone or accre- 
tive operator if 


Re(Gx—Gy,x—y)20 (x, yeR). 


G. Minty [4] proved that if G: H>H is mono- 
tone and continuous, then AI +G is a mapping 
onto H for any A>0, and its inverse (AI + G) ' 
is nonexpansive. He has also shown that in the 
hypothesis of the theorem, we can replace the 
continuity requirement for G by maximality of 
G within the class of accretive operators that 
are possibly multivalued. Various develop- 
ments of Minty’s ideas, including generaliza- 
tion of his results to Banach spaces and appli- 
cations to partial differential equations, have 
been obtained by F. Browder (Amer. Math. 
Soc. Proc. Symposia in Appl. Math., 17 (1965)), 
J. Leray and J. L. Lions (Bull. Soc. Math. 
France, 93 (1965)), J. L. Lions [5], W. Strauss, 
H. Brézis (Amer. Math. Soc. Proc. Symposia in 
Pure Math., 18 (1970)), and others. 

A mapping A is said to be dissipative if — A 
is accretive. Dissipative mappings play a cen- 
tral role in the theory of nonexpansive semi- 
groups (— Section X). 


D. Topological Methods 


In the geometric study of ordinary differen- 
tial equations [6] some familiar theorems of 
topology and tdifferential topology have been 
strong tools, e.g., tBrouwer’s fixed-point theo- 
rem is utilized to establish the existence of 
periodic solutions. However, in order to deal 
with nonlinear partial differential equations we 
have to generalize these theorems to infinite- 
dimensional cases. For example, a fixed-point 
theorem in an infinite-dimensional space was 
first obtained by G. D. Birkhoff and O. D. 
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Kellogg (Trans. Amer. Math. Soc., 23 (1922), 
95-115). The theory of the degree of mappings 
was generalized to the case of Banach spaces 
by J. Leray and J. Schauder [33] for the class 
of mappings of the form J — F, where F is a 
compact continuous mapping, i.e., the image 
by F of any bounded set is relatively compact. 
Let D be an open bounded set in a Banach 
space X and ôD be its boundary. Let F:D>X 
be a continuous compact mapping and ® 
denote I — F. If a point pin X does not be- 
long to ®(éD), then we can define the Leray- 
Schauder degree d(®, p, D} of ® relative to p 
[1-3]. The Leray-Schauder degree d(®, p, D) 
is an integer with the following properties: (i) 
d(I, p, D)=1 if peD. If pé (D), then d(Q, p, D) 
=Q. (ii) (Homotopy invariance) d(®, p, D) de- 
pends only on the compact homotopy class of 
®:6D—X ~{p}. More precisely, let K : [0,1] x 
éD— xX be a continuous compact mapping 
such that x + K(t,x)#p for any te[0, 1] and 
any xe OD. Let ®)(x)=x+ K (0, x) and ®, (x)= 
x+ K(1, x). Then d(®p, p, D)=d(®,, p, D). (iii) 
If p and p’ are in the same component of 

X (0D), then d(®, p, D) = d(®, p’, D). Gv) (Con- 
tinuity) d(®, p, D) is a continuous locally con- 
stant function of ® (with respect to uniform 
convergence) and of pe X ~.®(6D). (v) (Domain 
decomposition) If D is the union of finite num- 
ber of open disjoint sets D; (j= 1,2, ..., N) 

with ôD;c ôD and ®(x)#p on | }®, éD,, then 
d(®, p, D)= E}, d(®, p, D,). (vi) (Excision) If A 
is a closed subset of D on which ®(x)¥# p, then 
d(®, p, D) = d(®, p, DA). (vii) (Cartesian prod- 
uct formula) If X = X, ® X, with D;c X, B= 
(®,,®,) with ®,;:D,> X, (i= 1,2), D=D, x Dy, 
and p=(p,, p2), then d(®, p, D)=d(®,, p,, D,) x 
d(®,, p2, D,), provided that the right-hand 
side is well defined. 

The degree of mapping of ® is also defined 
for some proper Fredholm mapping ® (— 
Section E; K. D. Elworthy and A. J. Tromba 
[38]; [32]). 

Brouwer’s fixed-point theorem (— 153 
Fixed-Point Theorems) is generalized to 
Schauder’s fixed-point theorem: A compact 
mapping F of a closed bounded convex set K 
in a Banach space X into itself has a fixed 
point. Using the Leray-Schauder degree 
theory, one has the Leray-Schauder fixed-point 
theorem: Let D be a bounded open set of a 
Banach space X containing the origin O. Let 
F (x,t): D x [0,1]—>X be a compact mapping 
such that F(x,0)=0. Suppose that F(x, 1) 4x 
for any xeðD and te[0, 1]. Then the compact 


mapping F(x, 1) has a fixed point x in D [1-3]. 


Other homotopy invariants, such as thomo- 
topy groups and *‘cohomotopy groups, are 
also used in nonlinear functional analysis 
[1,3]. For instance we have the following 
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theorem of Shvarts (1964) [3]: Let X and Y be 


two Banach spaces. Let D= {xe X | ||x|| <1} be 
the unit ball and ôD = {x€ X | |x||=1} be the 
unit sphere of X. Suppose L is a fixed continu- 
ous linear Fredholm operator from X to Y 
of index p>0. Let PL be the set of compact 
perturbation of L mapping ôD into Y~.0, i.e., 
PL={®=L+K|K is a continuous compact 
mapping of 6D to Y such that ®(x)=Lx+ 
K(x) #0 for xceĉD}. Two mappings ®, and 

@, in PL are said to belong to the same com- 
pact homotopy class on 6D if there exists a 
continuous compact mapping h:[0, 1] x 6éD> 
Y such that Lx + A(t, x) 40 for x in 6D, Bo(x) = 
Lx+h(0,x), and ®,(x)= Lx +-A(1, x). 

Shvarts’s theorem: Let L be a fixed continu- 
ous linear Fredholm mapping from X to Y 
of index p>0. Then the compact homotopy 
classes on D of PL are in one-to-one corre- 
spondence with the elements of the pth stable 
homotopy group 7,,,,(S") (n= p+ 1)(— 202 
Homotopy Theory H). 

Warning: The topological structure of an 
infinite-dimensional Hilbert or Banach space is 
quite different from that of a finite-dimensional 
Euclidean space. For instance, let X be a Hil- 
bert space of infinite dimension, and let D= 
{xe X | ||x|| <1} be its unit ball and êD = 
{xe X | ||x|| =1} be the unit sphere of X. S. 
Kakutani (Proc. Imp. Acad. Tokyo, 19 (1943)) 
gave a fixed-point free continuous mapping 
of D into itself if X is separable [1-3]. Thus 
naive generalization of the Brouwer fixed- 
point theorem is no longer true in infinite- 
dimensional spaces. V. Klee and C. Bessaga 
[17] proved that the unit sphere ôD is C*- 
diffeomorphic to X for an arbitrary Hilbert 
space X of infinite dimension. N. H. Kuiper 
proved that the group of invertible continuous 
linear operators on X is contractible if X is 
separable [3]. All this is in striking contrast 
to the well-known facts for finite-dimensional 
spaces (— 202 Homotopy Theory, 427 Topol- 
ogy of Lie Groups and Homogeneous Spaces). 
This is the reason why compactness assump- 
tions are made in the theorems mentioned 
above. 


E. Calculus in Banach (or Locally Convex) 
Spaces 


When one considers a nonlinear operator, 

it often happens that the domain and the 
range are neither linear spaces nor their open 
subsets. The domain might be a space of all 
smooth mappings of a compact manifold into 
another, and so might be the range. Such 
spaces have no linear structure, and hence 
linearity or semilinearity do not make sense in 
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general. The concept of infinite-dimensional 
manifolds is therefore introduced of necessity 
in nonlinear functional analysis. 

Definition of differentiable mappings. Let E 
and F be real Banach spaces and let L(E, F) 
(= L(E) if E= F) be the Banach space of all 
bounded linear operators with uniform oper- 
ator norm. Let U be an open subset of E and 
x a point of U. A mapping (= nonlinear oper- 
ator) f of U into F ts called Gateaux differ- 
entiable at x if lim, „ot '(/(x+ty)—f(x)) = 
df(x, y) exists for any ye E. df(x, y) is called the 
GAteaux derivative of f at x. f is called Fréchet 
differentiable at x if there exists a linear oper- 
ator Ae L(E, F) such that lim,_,9 || f(x + y)— 
f(x)—Ay||/|| yi] =0. A is called the Fréchet 
derivative of f at x and is denoted by df(x) or 
f'(x). f is Fréchet differentiable in U if and 
only if it is Gateaux differentiable, df(x, y) 
is linear in y, and sup, 4o |ldf(x, y)II/Ily|| is 
bounded [10]. 

Let U be an open subset of E. A mapping 
f of U into F is said to be of class C° if it 
is continuous and to be of class C! if it is 
Fréchet-differentiable at each point xe U and 
the differential df (x)e L(E, F) is continuous as 
a mapping of U into L(E, F). The differential 
df(x) is also called the linearized operator. If 
the mapping df: U > L(E, F) is of class C’"', 
then f is said to be of class C”. d(d’"' f)(x) is 
written as d’ f(x), and called the rth differential 
at x. d" f(x) is an r-linear, bounded, symmetric 
operator of Ex... x E (r times) into F. f is said 
to be of class C” if f is of class C” for every r. 
For an open subset V of F, f: U-V is called a 
C diffeomorphism if f is a bijection and both f 
and f~! are of class C”. 

A C” mapping f of U into F is called a Fred- 
holm mapping (Fredholm map) if df (x)e L(E, F) 
is a linear Fredholm operator for every xe U. 
Since Ind df(x) is constant if U is connected, 
that integer Ind df(x) is called the index of f: 


F. Taylor’s Theorem and Its Converse 


Let f: UF be of class C’ (r2 1). A general- 
ized Taylor theorem claims that f can be ap- 
proximated by a polynomial mapping: Let 

xeU and yeE be sufficiently close to 0 so that 
x+tyeU for 0<t<1. Then 


=t 1 
GDE 
x (d flx + ty) d f), “i , y)dt. 


Let L*,m(E, F) be the Banach space of all k- 
linear, bounded, symmetric operators of E x 
..x E into F with the uniform topology. If for 
every k, 0< k <r, there exists a continuous 


flx+y)= Eaton] S = 
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mapping p: ca F) such that 


fat) TE 


at every xe U and y sufficiently close to 0, then 
f:U >F is of class C", and d* f(x) =@, (x). 


G. The Implicit Function Theorem 


Using the notation above, let f: U— V be of 
class C”, r>1, and assume that Oe U, Oe V, and 
{(0)=0, where 0 is the origin of E or F. Sup- 
pose there is an AE L(F, E), called the right 
inverse of df(0), such that df(0)A = 1, (the iden- 
tity). Then the following assertions hold: (i) 
The image of F under A, AF, is a closed sub- 
space of E, and E = Kerdf(0)@ AF. (it) There 
are neighborhoods U,, U;, V’ of the zeros 

of Kerdf(0), AF, F, respectively, such that 

U, Ð U cU and such that the mapping g: 

U, ®U,>U, @ V defined by g(u, v) =(u, f(u, v)) 
is a C’-diffeomorphism. Therefore, denoting the 
inverse of g by h=(h,,h,), we have h, (u, w)=u 
and f(u, hlu, w))=w. The latter means that the 
nonlinear equation f(u,v)=w can be solved 
with respect to v. 


H. Existence and Uniqueness of Integral 
Curves 


Using the notation above, let f be a C” map- 
ping (r > 1) of U into E. Since U x E is the ‘tan- 
gent bundle of U, (x, f(x)) can be regarded as a 
C tangent vector field on U. The equation of 
integral curves is (d/dt)x(t) = f(x(t)). A local 
existence and uniqueness of solutions is stated 
as follows: For an arbitrarily fixed xe U, there 
are €>0 and an open neighborhood W of x 
such that there exists uniquely a C’ mapping h 
of Wx{—é,¢) into U satisfying (d/dt) h(w, ) = 
f(Atw, dD) and h(w, 0) = 

Using this fact, one can prove the Frobenius 
theorem: Let EF’ be a closed linear subspace of 
E with a direct summand £E”, and let f: U > 
L(E', E”) be a C” mapping (r > 1) such that 
f(0)=0. To each xe U one associates a closed 
linear subspace D, = {(v, f(x)v)|ve E'}. The 
disjoint union D={ } yey Dy can be regarded as 
a subbundle of the tangent bundle U x E. A 
mapping ŭ of U into E is called a cross sec- 
tion of D if ù(x}e D, for every xe U. D is called 
involutive if for any two C' cross sections ñ, Ò 
of D, the Lie bracket product [à, č] defined by 
[ ii, ò] (x) = dii(x)(6(x)) — dd(x)(a(x)) is again a 
cross section of D. Now suppose D ts an in- 
volutive subbundle of U x E. Then for an arbi- 
trarily fixed xe U, there are a neighborhood W 
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of x and a C’ diffeomorphism h of W ontoa 
neighborhood of 0 of E such that dh(x)(D,)= 
E’. This fact shows that an involutive sub- 
bundle can be trivialized by a suitable change 
of local coordinate systems. 


I. Local Theories on Locally Convex Spaces 


All local theories mentioned in Sections E-H 
are constructed on Banach spaces. However, it 
is important in concrete applications to con- 
struct these theories on a wider class of locally 
convex topological linear spaces. 

Let E, F be tlocally convex topological 


linear spaces, and let U be an open subset of E. 


A mapping f: U-F is said to be of class C? if 
it is continuous. f is said to be of class C’ if f 
is of class C"! and the following is fulfilled: 
For every xe U, there is an r-linear continuous 
symmetric mapping d’ f(x) of Ex... x E into F 


such that d'f :U x Ex... x EF is continuous. 


If we put 
F(y)=f(x+ y)—f(x)—df(x)(y)— ... 


1 
SAPs d) 
r: 


for every y sufficiently close to Oe E, then the 
mapping G defined by 


cena ffion 170 


teR 
0, ey ER 


is continuous at (0, 0)eR x E. The definitions 
of C” mappings and C” diffeomorphisms are 
given as in Section E. 


J. Implicit Function Theorems in Locally 
Convex Spaces 


The implicit function theorem does not hold 
in general locally convex spaces. However, 
since it is useful for nonlinear problems, sev- 
eral sufficient conditions are presently being 
studied. The following are some of them. We 
assume that f: U—F is a C” mapping (r > 1) 
such that f(0)=0 and that df(0):E>F has a 
continuous right inverse. 

(i) If dim F < œ, the implicit function theo- 
rem holds just as in Section G. 

(ii) An implicit function theorem that can 
be applied to nonlinear elliptic differential 
operators can be restated as follows. Suppose 
E, F are tprojective limit of families of Banach 
spaces {E*,k>d}, {F*,k>d} and U=EN U4, 
where U’ is an open subset of E’. If f: U >F 
can be extended to a C” mapping (r 2 2) of 
E*N U? into F* for every k >d, if f satisfies the 
following inequalities, called a linear estimate 
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with respect to | |,: 

ldf(x+ Vo) <C(lylelolat lylalel) 
+PV-Dlth-r, k>d, 

(d? f(x + y) (u, D)e < CU yh lulalola+lylalulelela 


+|ylalulalvld + PCy lulk- lvl- k>d, 
(4) 


where | |, is the norm in E* or F*, C is a posi- 
tive constant independent of k, and P, is a 
polynomial with positive coefficients, and if a 
right inverse A of df(0) satisfies the inequality 
of Garding type, 


|Aul,<C]ul,+Dyluh-1, k>d, (5) 


where C’>0 independent of k and D, >0, then 
the implicit function theorem holds just as in 
Section G, and the obtained mapping h satis- 
fies the same inequalities as (4) [11]. 

(iii) Nash-Moser implicit function theorem. 
Though linear estimates such as (4) hold for 
many differential operators, the second in- 
equality (5) is sometimes out of order, espe- 
cially if f is a nonlinear hyperbolic operator. 
However, one can often obtain instead of (5) 
a weaker inequality: 


|Aul,<C'|ulpest Dilulk+s-13 s>0. 


J. Nash [37] and J. Moser [12] approximated 
such an operator A by some smoothing oper- 
ators and proved an implicit function theo- 
rem under a certain additional condition. 

The Nash-Moser implicit function theorem 
was successfully applied to many difficult 
problems, e.g., the embedding problem of 
Riemannian manifolds [37], the small divi- 
sor problem of celestial mechanics [36], free 
boundary problems (e.g., L. Hörmander Arch. 
Rational Mech. Anal., 62 (1976), 1-52), and 
other problems (e.g., S. Klainerman, Comm. 
Pure Appl. Math., 33 (1980), 43—101; M. Kura- 
nishi, Amer. Math. Soc. Proc. Symposia in Pure 
Math., 30 (1977), 97-105). 

(iv) Analytic implicit function theorem. In 
cases (ii) and (iii), the spaces E, F were given 
as projective limits of Banach spaces. On the 
contrary, H. Jacobowitz considered the case 
where E£, F are tinductive limits of Banach 
spaces. For instance, the space E of the 
smooth functions can be approximated by a 
family of Banach spaces {E,|e>0} of all real 
analytic functions with e as the radius of con- 
vergence. Under this circumstance, certain 
conditions for f and the right inverse of df(0) 
yield an implicit function theorem [13]. 

(v) Mather’s implicit function theorem [14]. 
The difficulty of implicit function theorems in 
Fréchet spaces is concentrated in the following 
fact: Even if df(0) has a right inverse and even 
if x is sufficiently close to 0, df(x) may not have 
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a right inverse. If the implicit function theorem 
holds, such a phenomenon should not happen. 
In cases (ii)—{(iv), several functional-analytic 
conditions exclude this pathological phenom- 
enon. In some special cases, these conditions 
can be replaced by algebraic ones. J. Mather, 
using his division theorem, proved an im- 
plicit function theorem that was applied to the 
theory of singularities. 


K. Infinite-Dimensional Manifolds 


A Hausdorff space M is called a C” Banach 
manifold modeled on a Banach space E if the 
following conditions are satisfied: (a) M is 
covered by a family of open subsets {U,},.4. 
For each U, there are an open subset V, of E 
and a homeomorphism y, of U, onto V,. Such 
a pair is called a local coordinate system or a 
local chart of M. (b) If U, IU, 4, WW," 

is a C’-diffeomorphism of Y(U, N U,) onto 
w,(U, N Up). (c) The index set A is maximal 
among those that satisfy (a) and (b). 

If M (resp. M’) is a C" Banach manifold 
modeled on E (resp. FE’), then so is M x M’ 
modeled on E @ E’. A mapping f: MM’ is 
said to be of class C” if f expressed through 
local coordinate systems is of class C”. 

Suppose M is covered by a family of local 
charts {(U,, Ya) taca- On the disjoint union 
eA U, x E, define an equivalence rela- 
tion ~ as follows: For (x, u)e U, x E, (y, ve 
U; x E, (x,u)~(y, v) if and only if x= y and 
A(t ')(x)v =u. The set of equivalence classes 
is called the tangent bundle of M, which will 
be denoted by Ty. There is another definition 
of the tangent bundle which uses the ring of 
C” functions on M. However, since E is not 
reflexive in general, the latter gives us a differ- 
ent vector bundle. Ty is a C”! Banach mani- 
fold modeled on E @ E. The correspondence 
which sends (x, u)e | Jea U, x E to x induces a 
C"! mapping of Ty onto M, called the pro- 
jection of Ty, denoted by z. 

A topological group G is called a Banach- 
Lie group if G is a C” Banach manifold and 
the group operation (g,h)>g 'hisaC” 
mapping of G x G into G. If E is a Banach 
space, then GL(E), the group of all invertible 
bounded linear operators, is a Banach-Lie 
group under the uniform topology. If E is a 
Hilbert space, then the group of all unitary 
operators is also a Banach-Lie group under 
the same topology. 

The concepts of manifolds and Lie groups 
are similarly defined when the model space 
is a Hilbert space or a Fréchet space. These 
are called a Hilbert manifold and a Fréchet 
manifold, respectively. For some differential 
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topologies on separable Hilbert manifolds — 
279 Morse Theory E. 


L. Structures on Infinite-Dimensional 
Manifolds 


Suppose M is a C’*! Hilbert manifold 
modeled on E. At each xe M, the tangent 
space T.M =n! (x) is a Hilbert space, linear- 
homeomorphic to E. M is called a C” Riemann- 
ian manifold if there is defined an inner prod- 
uct <u, vò, on each T.M such that <-, +>, is of 
class C’ with respect to x. Existence of such a 
structure is ensured by using a partition of 
unity if M is paracompact. 

Let M be a C" Banach manifold (r > 1) 
modeled on a Banach space E. Each tangent 
space T, M is linear homeomorphic to E. M is 
called a C” Finsler manifold if there is a norm 
|u|,. defined on each T.M such that | |, is 
continuous with respect to x. A paracompact 
C! Banach manifold can have a C! Finsler 
structure. 

Let M be a C+? Fréchet manifold (r >0) 
and C’*1(M), T"*! (Ty) the spaces of all C’*! 
functions and of all C’*! vector fields on M, 
respectively. A bilinear mapping V of T" (Ty) 
x I’*1(T,,) into I(T) is called an affine 
connection on M if V satisfies V að = f Vað, 

V, ft =(af i+ f Vað for every a, ğe "+! (Ty), 
f¢C'*'(M). For an affine connection V, T(i, ò) 
= V,0—V,;u—[t, 0] is called the torsion tensor, 
and R(ù, č) =V; Vs —VVa— Via, 5 1s called the 
curvature tensor of V. If M is a Riemannian 
manifold, then there exists a unique affine 
connection without torsion which leaves the 
Riemannian inner product parallel. 


M. Local Linearization Theorems 


Let M be a C’*! Banach manifold (r > 1) 
modeled on E. Let & be a C’ vector field on M 
such that f(x) 40 at xe M. Then there are a 
neighborhood U, of x and a C’ diffeomor- 
phism y of U, onto an open subset V of E 
such that dWa(w ~*(y))=(y, v), ve E, for every 
yéV, where v does not depend on y. 


N. Morse Lemma 


Let M be a C"*? Hilbert manifold and f be an 
R-valued C’*? function. Suppose x is a critical 
point of f, i.e., df(x)=0. x is called a nondegen- 
erate critical point if d? f(x) is a nondegenerate 
bilinear form. For such x there are a neighbor- 
hood U, of x and a C” diffeomorphism w of U, 
onto an open neighborhood of 0 of the model 
space E such that w(x)=0, and f(y '(y))= 
|Py|?—|(1—P)y|?, where P is an orthogonal 
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projection in E. i,=dim(1— P)E (0 <i, < œ) is 
called the index of the critical point x of f. 


O. Submanifolds 


A subset N of a C” Banach manifold M mod- 
eled on E is called a C" submanifold if at each 
point xe N there are a neighborhood U, of 

x and a C’-diffeomorphism y of U, onto an 
open neighborhood V of 0 of E such that w(x) 
=0 and Y(U,N N)= VNF, where F is a closed 
linear subspace of E. There are some other 
definitions of submanifolds. One of them re- 
quires in addition that F be a direct summand 
of E, and another uses instead of ULN N its 
connected component containing x. In the 
latter definition, a submanifold is not neces- 
sarily locally closed. 


P. Sard-Smale Theorem 


Although it is not easy to define nontrivial 
measures on infinite-dimenstonal manifolds, 
the concept “almost everywhere” can some- 
times be replaced by that of residual sets. 

A subset of a topological space is called a 
tresidual set if it contains an intersection of 
countably many open dense subsets. A re- 
sidual set in a complete metric space or in 

a tBaire space is dense. S. Smale [15] ex- 
tended tSard’s theorem to inftnite-dimensional 
manifolds as follows: Let M, N be C" Ba- 
nach manifolds and f:M >N bea C” Fred- 
holm mapping. If M is separable and r> 
max {0, Ind(df(x))} for each xe M, then R, = 
N—f(C) is a residual subset of N, where C 
is the set of critical points of f, i.e., a point x 
where df(x) is not surjective. 


Q. Calculus of Variations and Infinite-. 
Dimensional Manifolds 


Many problems in the calculus of variations 
can be understood as problems seeking crit- 
ical points of functions defined on infinite- 
dimensional manifolds. R. Palais and S. Smale 
set up the following Condition C and fixed a 
category of functions where the critical points 
can be chased through gradient-like vector 
fields [6]. 

Palais-Smale Condition C. Let f be a C! 
function on a C! Finsler manifold M. If S is 
any subset of M on which f ts bounded but 
on which |df(x)| is not bounded away from 0, 
then there is a critical point of f adherent to S. 

In general, it is not easy to examine Con- 
dition C for a concrete f. However, many 
concrete problems, where the Euler equations 
are nonlinear elliptic, satisfy Condition C. 
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Morse theory. Let M be a C” complete 
Riemannian manifold and f a C” function 
bounded below satisfying Condition C and 
having only nondegenerate critical points. 
Then, using the Morse lemma, one can make a 
thandlebody decomposition of M by the same 
method as in the case of finite-dimensional 
manifolds (= 279 Morse Theory). 

Lyusternik-Shnirel’man theory. This theory, 
constructed on finite-dimensional manifolds, 
can be extended naturally to Finsler mani- 
folds. Let M be a complete C? Finsler mani- 
fold and f a C? function satisfying Condition 
C and bounded below. Then f has at least 
cat(M) critical points, where cat(M) =m means 
that M can be covered by m closed contrac- 
tible subsets of M but not by m—1 ones. If 
there is no such integer, then we set cat(M)= 
oon 

Both Morse theory and Lyusternik- 
Shnirel’man theory have been successfully 
employed in the global theory of the calculus 
of variations. 


R. Bifurcation Theory 


Bifurcation theory concerns itself with the 
structure of the zeros of the functional equa- 
tion of w with a parameter /: 


G(i, w)=0. (6) 


In general, the state w satisfying (6) represents 
the equilibrium (time-independent or station- 
ary) solution of the tevolution equation 


w,=G(A,w) and w(0)=wọ. (7) 


Here the evolution equation itself stems from a 
mathematical model describing natural phe- 
nomena, w= w(t) stands for the state at time t, 
and / is the set of parameters representing the 
physical environment. For example, in the 
*Navier-Stokes equation appearing in fluid 
dynamics, w(t) represents the unknown veloc- 
ity field at time t and / is the *Reynolds num- 
ber. It is important to study bifurcation phe- 
nomena because they typically accompany 
the transition to instability of the state when 
some characteristic parameter passes through 
a certain value, called a critical value. 

Let X, Y, and A be real Banach spaces, and 
let G{A, w) be a mapping from A x X to Y. 
Suppose that there exists a mapping w(A): A> 
X satisfying G(A, w(A))=0. One calls (A, W(A)) 
a trivial solution of (6). 

(Ao, W(Ap)) is called a bifurcation point of 
G(A, w) (with respect to the trivial solution) if 
in any neighborhood of (Ao, W(A,)) there exists 
a nontrivial solution of (6). (In general, there 
may appear another type of solution that is 
not connected with the trivial solution [20]). 
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Assume that G(A, w) is of class C! in some 
neighborhood of (åo, W(A)) in A x X. Then it 
follows from the timplicit function theorem 
that (29, w(4,)) is not a bifurcation point if 
G,,(A9, W(A)), the tFréchet derivative of G with 
respect to w, is nonsingular. 


S. The Principle of Linearized Stability 


Closely tied to the phenomenon of bifurcation 
is the property of stability. Suppose that the 
dynamics of a physical system are governed 
by (7). Let w(t; wọ) denote the solution of (7). 
An equilibrium solution w= wW(A) is called 
stable if for any £>0 there exists a ô >0 such 
that || w(t; wọ)— w|| <e for all t>0 whenever 

|| Wo — WI] <ô. Furthermore, w is said to be 
asymptotically stable if, in addition, w(t; wọ)—> 
Was foo. 

By the principle of linearized stability we 
mean that the stability of an equilibrium solu- 
tion Ŵ is determined formally by the tspec- 
trum of the linearized operator G„(4, w(A)). As- 
sume that G,,(A, W(A)) has only a tpoint spec- 
trum. Then (i) if W(A) is stable, the spectrum of 
G,,(A, W(A)) is contained in {ze C|Rez <0}, and 
(ii) if the spectrum of G„(4, W(A)) is contained 
in {zeC|Rez<0}, w is asymptotically stable. 

Suppose that as / crosses a certain value Ap, 
one or more eigenvalues of G„(4, w(4)) cross 
the imaginary axis from the left to the right 
half-plane, where w(A) is the known equilib- 
rium solution. This is precisely the situation 
when wW(A) becomes unstable. For notational 
simplicity, we put u= w — wW(A) and F(/,u)= 
G(A, ut w(A)). 


T. Bifurcation from Simple Eigenvalues 


Take A=R. Let FeC?(R x X, Y) be such that 
F(A, 0)=0 for any real å. Set Ly = F,{(Ao, 0), 
Lı =F, (Ao, 9), and suppose that (i) Ker(Lo) 
is spanned by ug £0; (it) codim R(Ly) = 1; 
and (iii) L uo R(Lo), where R(L,) denotes the 
trange of Ly. Then there exists a C'-curve 
(A, W):(— 6,6) >R x X defined on some inter- 
val (— ô, ô) such that 4(0)= 4p, y(0)=0, and 
F(A(s), (up + W(s)))=0 for any se(— ô, ô). 
Moreover, in a neighborhood of (4), 0) any 
zero of F either lies on this curve or is a trivial 
solution (M. G. Crandall and P. H. Rabino- 
witz, J. Functional Anal., 8 (1971)). 

In the above case, there exist only three 
possible situations of the curves (A, o) and 
(A, 0), called subcritical, supercritical, and 
transcritical bifurcations. In the third case, 
there occurs the so-called exchange of stability 
[19-21]. 
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U. Bifurcation of Periodic Solutions (Hopf 
Bifurcation Theorem) 


If u(å) loses stability by virtue of a pair of 
complex conjugate eigenvalues crossing the 
imaginary axis, then under suitable conditions 
one can prove the existence of bifurcating time- 
periodic solutions of (7). Rewrite equation (7) 
as 


u, + Lou+g(A, u)=0. (8) 


Suppose (i) Ly: D(Lo) € X >X is a densely de- 
fined linear operator on X such that — Lo 
generates a strongly continuous semigroup on 
X, which is holomorphic on X© (=the com- 
plexification of X). Lọ has compact resolvent; 
iis a simple eigenvalue of Ly, and ni¢a(Lo), 
the spectrum of Lo, for n=0, 2, 3,.... As a con- 
sequence of (i), if r> —ReA for any A€ (Lo), 
then the fractional power (rI + Lo) for «>0 is 
well defined. Because their domains are in- 
dependent of r, one can set X, = D{(rI + LY), 
which are Banach spaces under the norm 

lulla = llr + Loull. Suppose (ii) there exist 

an «e[0, 1) and a neighborhood @ of (0,0) in 
R x X such that ge C?(@, X,), where C*(O, X,) 
denotes the space of all X,-valued C* functions 
defined on ©. Moreover g(/, 0)=0 if (4, 0)E@, 
and g,(0,0)=0. (iii) Let 8 = B(A) be a continu- 
ously differentiable function defined in a neigh- 
borhood of 0 such that B(A)eo(Ly +g,(A, 0)) 
and £(0)=i. Suppose that Re f’(0) 40. If as- 
sumptions (3), (ii) and (tii) are satisfied, then 
there exist a positive 6 and continuously dif- 
ferentiable functions (p, 2, u):(— ô, 6) >R? x 
C°(R, X,) such that (a) for 0<|s}<6, u(s) is a 
2mp(s)-periodic solution of period 27p(s) of (8) 
corresponding to 1= A(s); (b) (0) = 1, 2(0)=0, 
u(0)=0, and u(<)40 if s #0; and (c) any 2zp- 
periodic solution of (8) in C2,,(R, X,) (=the 
space of 2zp-periodic continuous functions 
with value in X,) with |o—1|, |A| and ||ul| suffi- 
ciently small is of the above form for some 
|s|<6 up to a translation of the real line. 
Moreover, if ge C**!(0, X,), then the functions 
p, 4, u are of class C*. 


V. The Lyapunov-Schmidt Procedure 


Suppose that L(A)= F, (4,0) has at A=0 an n- 
fold eigenvalue at the origin, i.e., dim Ker L(0) 
=n, and assume that X c Y. Let Ker L(0) be 
spanned by 9,,...,@,, and let P be the projec- 
tion onto this linear subspace that commutes 
with L(0). Then P must take the form Pu= 
Li=1 <u, O*>@;, where the p*e Y* c X* are 
null functions of the adjoint operator L(0)* 
and <o; pf > =ò. P is a linear operator from 
Y to Y; hence it can be regarded as a mapping 
from X to X as well, and Q =I —P is a projec- 
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tion onto the range of L(0) in Y. Using P and 
Q, the equation F(A, u)=0 can be decomposed 
into the system of equations 


OF(A,v+W)=0 and PF(A,v+y)=0, (9) 


where v= Pu and w= Qu. One solves the first 
equation for wy =wW(/, v) by the implicit function 
theorem, and then, substituting this into the 
second, one has the bifurcation equation 


F(A, v)=PF(A,v+ WA, v))=0. (10) 


Solutions of the bifurcation equation are in 
one-to-one correspondence with solutions of 
the original system sufficiently close to the 
bifurcation point. 

There exists another method of reducing 
an infinite-dimensional problem to a finite- 
dimensional one, called the center manifold 
theorem [21, 39]. 


W. A Global Result 


The following global result is due to P. H. 
Rabinowitz (J. Functional Anal., 7 (1971)). 
Assume that F(A, u)=u—ALu+ H(A, u), where 
L is a compact linear operator and H:R x X > 
X is a compact mapping with H(A, u)=o(||ull) 
at 0 uniformly on bounded 4/-intervals. Then, 
if u~! is an eigenvalue of L of odd multiplic- 
ity, (4,0) is a bifurcation point for F with 
respect to the trivial solution. Moreover, the 
closure of the set of nontrivial zeros of F con- 
tains a component that meets (u, 0) and either 
is unbounded in R x X or meets (Å, 0), where 
u#fiand fi“ is an eigenvalue of L. 

The beginning of bifurcation theory seems 
to be in the celebrated work of H. Poincaré 
(Acta Math., 7 (1885)). 


X. Abstract Cauchy Problems 


Suppose that we are given an abstract Cauchy 
problem 


a Au (t >0), (11) 
u(+0)=a (12) 


in a Banach space X, where ae X and the non- 
linear operator A is assumed, for simplicity, to 
be independent of t. If the domain D(A) of A 
coincides with X and A is Lipschitz continu- 
ous, we can reduce the abstract Cauchy prob- 
lem to the tintegral equation of Volterra type 


t 
u(t)=at+ | Au(s) ds, 

0 
and by applying the iteration procedure we 
can easily show that the abstract Cauchy 
problem has a uniquely determined solution. 
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A similar treatment works for a singular but 
mildly nonlinear A of the form A=L +N if the 
linear operator L is the generator of a fstrong- 
ly continuous semigroup e™ in the sense of 
Hille-Yosida theory (— 378 Semigroups of 
Operators and Evolution Equations) and the 
nonlinear mapping N is Lipschitz continuous. 
In this case, we reduce the problem to the 
integral equation 


t 
u(t)=ea+ | el!“ IL Nu(s) ds. 
0 
Here, if we merely have to assure ourselves of 
the local existence of the solution, then N can 
merely be locally Lipschitz continuous; and 
in this case, N can even be singular to some 
extent if e'™ is tholomorphic in t [22, 23]. A 
typical application of this procedure was made 
by P. Sobolevskti, T. Kato, and H. Fujita to 
the Navier-Stokes equation (— 204 Hydro- 
dynamical Equations; 205 Hydrodynamics) 
to construct regular solutions [22]. 
tGalerkin’s method (— 304 Numerical Solu- 
tion of Partial Differential Equations) is some- 
times quite convenient for obtaining (tweak) 
solutions of (11) and (12) [5]. Again, typical 
applications were made to the Navier-Stokes 
equations by E. Hopf and others [5, 24, 25]. 
To prove the convergence of approximate solu- 
tions constructed by Galerkin’s method and 
subject to so-called energy estimates, we often 
make use of Aubin’s compactness theorem 
concerning vector-valued functions [5]. 
Remarkable developments have taken place 
since 1964 for the case where A is dissipative, 
i.e., — A is accretive. F. Browder [26] proved 
that if X is a Hilbert space and A is dissipative, 
then the mere continuity of A is sufficient for 
the twell-posedness of (11) and (12). Then Y. 
Komura [27] brought about a crucial advance 
by showing that if A is a (possibly multivalued 
and) maximal dissipative operator with D(A) 
dense in the Hilbert space X, then (11) and 
(12) are uniquely solvable for any ae D(A). 
Furthermore, he founded the theory of non- 
linear semigroups of operators by establishing 
a ‘nonlinear version of the Hille-Yosida the- 
ory for semigroups of ‘nonexpansive opera- 
tors in Hilbert spaces [28]. Subsequent de- 
velopments and applications were made in 
various directions by T. Kato (J. Math. Soc. 
Japan, 19 (1967)), M. Crandall and A. Pazy 
(J. Functional Anal., 3 (1969)), H. Brézis and 
A. Pazy (J. Functional Anal., 6 (1970)), S. 
Oharu (J. Math. Soc. Japan, 22 (1970)), M. 
Crandall and T. Liggett [29], Y. Konishi (Proc. 
Japan Acad., 47 (1971)), and others. While 
Komura’s original proof was based on the 
Yosida approximation A (I — £4) '(£—>0) for 
A, it is proved in [29] that in any Banach 
space a dissipative operator A that is maxi- 
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mal in a certain sense generates a nonlinear 
semigroup T, by the exponential formula 


n> O 


t =n 
Tix = lim (1-4) xX. (13) 
n 


The scope of applications of the generating 
theorem (13) can be seen, e.g., in B. K. Quinn 
(Comm. Pure Appl. Math., 24 (1971)), M. G. 
Crandall (Israel J. Math., 12 (1972)), Y. Koni- 
shi (Proc. Japan Acad., 48 (1972) and J. Math. 
Soc. Japan, 25 (1973)), and S. Aizawa (Hiro- 
shima Math. J., 6 (1976)). 


Y. Nonlinear Semigroups in Banach Lattices 


Let X be a Banach lattice (— 310 Ordered 
Linear Spaces). An operator A: X > D(A)>X 
is said to be dispersive if for all x, ye D(A), 


Ixy) * | <||(&-y—A(Ax—Ay))* || 4>0). 


When a dispersive operator A satisfies the 
range condition R(I —4A)> D(A) for any 2>0, 
it generates an order-preserving semigroup 7; = 
e'4 on D(A): I(x- Ty)" || <(x—y)* || for t> 
0. We have therefore the preservation of order: 
x< y implies T,x < T,y. We can prove in par- 
ticular that the order of initial data is inherited 
by the solutions of a nonlinear heat equation 
(Y. Konishi). 

Remark. Various pathological phenomena 
arise when we do not restrict the form of A 
in the abstract Cauchy problem (11), (12). We 
cite merely the “blowing up” of solutions of 
Cauchy problems for tnonlinear heat equa- 
tions (— 291 Nonlinear Problems) and the 
nonlinear wave propagations described by 
nonlinear Schrodinger equations (J. B. Bail- 
lon, T. Cazenave, and M. Figueira). 


Z. Abstract Cauchy-Kovaleyskaya Theorem in 
a Scale of Banach Spaces 


T. Yamanaka (Comment. Math. Univ. St. Paul, 
9 (1960)), L. V. Obsyannikov (Soviet Math. 
Doklady, 6 (1965) and 12 (1971)), F. Treves 
(Trans. Amer. Math. Soc., 150 (1970)), L. Niren- 
berg (J. Differential Geometry, 6 (1972)), and T. 
Nishida [30] discussed abstract treatments 

of classical Cauchy-K ovalevskaya theorem for 
partial differential equations: Let S= {B,},>0 
be a collection of Banach spaces depending on 
the real parameter p>0. Let ||u/|, denote the 
norm of an element ue B,. The collection S is 
called a scale of Banach spaces if, for any p and 
pP <p, B,<B,, and |jull y < llull, for any u in 
B,. Consider in S the initial value problem of 
the form 


(u(t), £), 


dt 


and u(0)=0. (14) 
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Assume the following conditions on F: (i) For 
some numbers R>0, n >0, pọ >0 and every 
pair of numbers p, p’ such that 0<p’<p< 

Po, (u, t)> F(u, t) is a continuous mapping of 
{ue B,| lul <R} x {t||tl<n} into Bp. Gi) For 
any p' <p < po and all u, ve B, with |ul|,< 

R, ||vl|p < R, and for any t, |t| <y, F satisfies 

|F (u, t)— F (v, tllo < Clu — vll p/p — p’), where C 
is a constant independent of t, u, v, p, or p’. (iii) 
F(0,t) is a continuous function of t, |t| <n, with 


values in B, for every p < py and satisfies, with 


a fixed constant K, || F(0,0)||,<K/(po—p), 0< 
P<Po- 

Abstract Cauchy-Kovalevskaya theorem. 
Under the preceding hypotheses there is a 
positive constant M such that there exists a 
unique function u(t) which for every positive 
p < po and |t|< M(po— p) is a continuously 
differentiable function of t with values in B,, 
||u(t)||,<R, and satisfies (14) (— also [31]). 

These results cover a theorem of M. Na- 
gumo (Japan. J. Math., 18 (1948)), which gen- 
eralizes the classical Cauchy-Kovalevskaya 
theorem. 
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A. Lattice Dynamics 


In order to elucidate certain characteristic 
features of nonlinear waves, one-dimensional 
lattice models have been studied. Around 
1953, E. Fermi et al. performed computer 
experiments on nonlinear lattices to verify a 
generally accepted belief that nonlinear cou- 
pling between the tnormal modes of harmonic 
oscillators would lead to complete energy 
sharing between these modes. To their sur- 
prise, their nonlinear lattices yielded very little 
energy sharing at all; on the contrary, the 
interactions resulted in the recurrence of the 
initial state. These results were later inter- 
preted in terms of solitons (— 387 Solitons), 
i.e., nonlinear waves that preserve identity 
despite mutual interaction. 

The equations of motion for a uniform 1- 
dimensional chain of particles of mass m with 
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nearest-neighbor interaction can be written as 


do, 


m 
dt? 


—p'(Qna—Qn-1) t PQr ~ Qn) 


mS 2y): 


It was shown that a lattice with a nonlinear 
interaction of the form 


olr)=e "+r+const. 


admits solutions in closed form. Later, it was 
shown that this lattice (the exponential lattice 
or the Toda lattice) is a completely integrable 
system. 

It is convenient to introduce s,, which is the 
generalized momentum canonically conjugate 
to the mutual displacement r,=Q,,4, —Q,. For 
a lattice with exponential interaction, 


e ™—l=ds,/dt, 


and if we introduce 


S, -| s, at, 


then the equations of motion can be written as 
log(1+d?S,/dt?)=S,.,+5S,-; —2S,, 
and the displacements are given by 
0,=5,— Shta- 
We have a solution 
S =log {1 + e2nt Fray 


where « and ô are arbitrary constants and f = 
+sinha. The associated wave 


e-'»—1=f* sech (an+ Bt +ô) 


represents a solitary wave or soliton. 
The multisoliton (N-soliton) solution is 
given by 


S, =logdet ¥,, 


where Y, is an N x N matrix whose elements 
are 


(Z,Z,)""" 
(Pix = Orn + ee 
J J dJ 1 — ZZ, 
with 
Z= te”, 


Asymptotically the wave reduces as t-> Foo to 
an assembly of solitons 


N 
e kE 1 = DA p? sech? (a;n + Bt + 6;*). 
j=1 


B. Conserved Quantities 


The equations of motion for a periodic ex- 
ponential lattice of N particles can be written 
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in a Lax representation dL/dt = BL — LB, 
where L and B are N x N matrices with the 
elements 


Lian = Dy, 


| ogee = Lint n= An» 
Li n= Ly, =4y, 


Bans = — Bati n= Tâp 


B, y= — By. =ay 


(the other elements of L and B are all zero), 
with 


Ay =te One ~Qn)/2 


b,=3P,  (P,=dQ,/dt). 


The eigenvalues 4 of L can be shown to be 
independent of time, and so the motion of the 
lattice is a spectrum-preserving deformation. 
Now, if we define {J;} by 


det(ål — L)=4 +4! +... tAly_y tIy, 


the n{1,} are polynomials of a, and b,. These 
are constants of motion that were discovered 
independently by M. Hénon and H. Flaschka. 
Thus the lattice has N conserved quantities; 
I, is related to the total momentum and J, to 
the total energy, but higher-index conserved 
quantities have no physical interpretation. 


C. Method of Integration 


Let L and B be the infinite matrices obtained 
from the foregoing ones in the limit N >œ. 
The eigenvalues 4 of the equation 


Le=i9 


are independent of time, and the time evolu- 
tion of ọ is given by the equation 


do/dt = Bg. 


If the motion in the lattice is restricted to a 
finite region, we can clearly speak of the scat- 
tering of the wave ¢ due to the deformation in 
the lattice. For a given initial motion Q,(0) and 
P,(0), or L(0), we calculate the initial scattering 
data of asymptotic form ọ ~z” (n- œ). The 
scattering data consist of the reflection coeffi- 
cient R(z), the bound state eigenvalue j;= 
—(z; +z; ')/2 (4;< —1 or A,> 1), and the co- 
efficient c; of the normalized bound state eigen- 
function of asymptotic form c,z? for n> co. 

From the initial data and the equations of 
motion for n—> +00, we get the scattering data 
at a later time t. In effect, we construct the 
kernel 


1 a _ 
Fem = Ree ml dz 
2ni 


FEG Oe era tap 
j 
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of the discrete integral equation (Gel’fand- 
Levitan-Morchenko equation) 


K(n,m)+ F(n+m)+ 5 k(n, n') F(n' +m)=0, 


n’=nt+1 


mon+1. 


After solving this equation for x(n, m), we 
calculate K (n,n), given by 


1 00 
—— =] +F(2n)+ k(n, n) F(w +n). 
kugi tE + Skin) F +n) 
Then the initial value problem is solved in the 
form 


e ~nr- d) — o Knn i 
K(n—1,n—1) | ` 


The solution can be given as dQ„/dt =S, — Sn+1> 
with 


S,=kK(n—1,n). 


The simplest case R(z)=0 yields the multi- 
soliton solution. 

For the periodic case also, eigenvalues of 
the equations Lo = ne and de/dt = Bọ, under 
suitable boundary conditions and for certain 
initial data, give sufficient information to 
construct a solution to the initial value prob- 
lem. Such a method of obtaining a general 
solution for the periodic lattice was developed 
by E. Date and S. Tanaka, and independently 
by M. Kac and P. Van Moerbeke. Following 
Date and Tanaka, the solution can be written 
in terms of the multivariable theta function, or 
the Riemann theta function 3, as 

d ant pt +ô) 


Z= t. 
ae Sane E R 


where a, #, and ô are certain vector constants. 
There has been much activity recently to- 

ward interpreting the integrability of the Toda 

lattice in terms of Lie algebras (B. Kostant 


[9]). 
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Nonlinear Ordinary 
Differential Equations 
(Global Theory) 


A. General Remarks 


Many well-known functions (with the notable 
exception of the tl’-function), such as the ex- 
ponential, trigonometric, telliptic, and tauto- 
morphic functions, satisfy ordinary differential 
equations of simple forms. For the purpose 

of finding new transcendental functions, P. 
Painlevé initiated the systematic study of the 
equation 


,y™®)=0 (1) 


in the complex domain. To investigate the 
solution in its whole domain of definition, he 
assumed that F is a polynomial in y, y’,..., y® 
whose coefficients are analytic in x. Such an 
equation is called an algebraic differential 
equation. If F is linear in y”, then (1) is written 
as 


F(x, y, ys... 


yr) 
poy 


where P and Q are polynomials in y, y’,..., 
y") with coefficients that are analytic func- 
tions of x. Equation (2) is called a rational 
differential equation. 

If F is linear in y, y’,..., y™®, i.e., (1) is a linear 
differential equation, and if the coefficient of 
y™ is 1, then singular points of solutions are 
situated at the singular points of the coeffi- 
cients (— 254 Linear Ordinary Differential 
Equations (Local Theory)). If F is not linear, 
then singular points of solutions of (1) are 
divided into two categories, one consisting of 
those points whose positions are determined 
by the equation itself and are independent of 
individual solutions, and the other consisting 
of those points whose positions depend on the 


TE P(x, y, y, ... 
Q(X, y, ys. 


(2) 
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choice of particular solutions. In other words, 
the singularities of the first category appear 
independently of the choice of arbitrary con- 
stants involved in the general solution, while 
those of the second category depend on the 
choice of the arbitrary constants. The former 
are called fixed singularities and the latter 
movable singularities. The linear differential 
equation (1) has fixed singularities only, which 
are situated at the singularities of the coeffi- 
cients. In the same way, fbranch points of 
solutions can be classified into two kinds, fixed 
branch points and movable branch points. 


B. Algebraic Differential Equations of the First 
Order 


Consider the equation 


y POs) 
Q(x. 9)" 


where P and Ọ are relatively prime polyno- 
mials in x, y. The fixed singular points of (3) 
are defined to be points č, č with the following 
properties: (i) Q(¢, y) =0. (i1) O(¢', y) #0, P(C', y) 
=Q(&’, y)=0 have a root y=’. (iii) If, in (ii), 
we substitute 1/z for y and if the same relation 
(ii) holds for x= € and z=0, we count such 
a value €, as a fixed singular point. (iv) We 
transform equation (3) by setting x = 1/t. If 
the value t=0 satisfies (i) or (1i) for this trans- 
formed equation, we count œ as a singular 
point € or é’ accordingly. The points €, ¢' are, 
in general, ‘transcendental singularities of solu- 
tions, and the points ¢' cannot be tessential 
singularities of solutions but may be tordinary 
transcendental singularities. A singular point 
of a solution different from č and č’ is an falge- 
braic singular point, and for any point dis- 
tinct from č and č’, equation (3) admits a 
solution with an algebraic singularity at this 
point. A necessary and sufficient condition 
that (3) has no movable branch point is that 
(3) be a *Riccati equation. 

Consider the algebraic differential equation 
of the first order 


F(x, y,y)=0. (4) 


After defining the fixed singular points € and 

E' of (4), where the algebraic function of x, y 
defined by (4) has bad singularities, Painleve 
proved that movable singularities of solutions 
are algebraic. Before Painlevé’s work, L. Fuchs 
gave a necessary and sufficient condition that 
(4) has no movable branch points, and then H. 
Poincaré showed that if this condition is satis- 
fied, then (4) is either reducible to the Riccati 
equation if g=0, integrable by the use of ellip- 
tic functions if g = 1, or algebraically integrable 
if g>1, where, with x the independent vari- 


(3) 
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able, g denotes the tgenus of the algebraic 
curve defined by (4). Painlevé found that there 
were gaps in the proofs of Fuchs and Poincaré 
and completed these by proving his theorem 
and the following one: Let @(x, yo, Xo) be the 
solution of (4) satisfying the initial condition 
y(X9) = Yo. Let X, Xo be points different from ¢ 
and č’, and let L be a curve connecting Xọ to 
x and not passing through any č or č’. If we 
denote by ,(X, Yo, Xo) the value at x =X of the 
branch obtained by continuing (x, Vo, Xo) 
analytically in a neighborhood of L and re- 
gard P(X, Yo» Xo) as a function of yo, then 

(1 (X, Vo, Xo) coincides, in a neighborhood of 
every point yọ =b, with several branches of an 
talgebroid function of yọ. Painleve studied the 
case when the general solution is finitely many- 
valued and gave a condition for (x, Yo, Xo) to 
be an algebraic function of yp. 

When equation (4) does not contain x ex- 
plicitly, no movable branch points appear if 
and only if all solutions are single-valued, and 
then the solutions are expressible in terms of 
rational, exponential, and elliptic functions. 
Such an equation is called a Briot-Bouquet 
differential equation. 

J. Malmquist proved, by using P. Bou- 
troux’s method of studying the behavior of 
solutions in the neighborhood of a fixed sin- 
gularity, that if equation (4) admits at least one 
solution that has an essential singularity and is 
finitely many-valued and free from movable 
branch points around this singularity, then (4) 
is an equation without movable branch points. 
If (4) admits a solution that is a finitely many- 
valued transcendental function, then an alge- 
braic transformation may be applied to (4) 
so that it will become an equation without 
movable branch points. It is an immediate 
consequence of the first assertion that if (3) 
admits a solution that has an essential sin- 
gularity and is finitely many-valued and free 
from movable branch points around the sin- 
gularity, then (3) is a Riccati equation. 

Later, equations (3) and (4) were studied by 
M. Hukuhara, K. Yosida, T. Sato, T. Kimura, 
and T. Matuda. The following results are due 
to Kimura. If a solution @(x) of (3) has an 
essential singularity at x =, then, in an arbi- 
trary neighborhood of č, g(x) assumes every 
value with the exception of the roots of P(é, y) 
=Q. If (3) is not a Riccati equation, it is deter- 
mined by a finite number of algebraic pro- 
cesses whether or not (3) admits a solution 
that has an essential singularity at x= č and 
has no movable branch point around é. If (3) 
admits such a solution, then the singularity is 
a ‘logarithmic branch point. For an essential 
singularity of a solution there exists, in gen- 
eral, a direction similar to a tJulia’s direction, 
which was investigated by Hukuhara and 
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Kimura. Matuda studied in detail the behavior 
of solutions as x tends to č along a half-line 
and concluded that, except for some special 
cases, any solution tends to a certain value 

as x tends to č along a half-line. To obtain 
algebraic solutions C. Briot and J. C. Bou- 
quet devised a method similar to *Puiseux 
expansion in the theory of algebraic functions. 
Hukuhara improved their method and suc- 
ceeded in reducing (3) to several differential 
equations of standard forms in a neighbor- 
hood of x = č. This enables us to apply the 
local theory to the global study. Hukuhara’s 
method was used by Kimura and Matuda to 
obtain the results discussed in this paragraph. 


C. Algebraic Differential Equations of the 
Second Order 


For second-order algebraic differential equa- 
tions we can pose the same problems as for 
first-order equations: When do these equations 
have single-valued or finitely many-valued 
general solutions? What new transcendental 
functions are needed to integrate such equa- 
tions? These problems, studied by E. Picard 
and Painlevé, are difficult because of the exis- 
tence of movable transcendental singularities. 
However, Painlevé succeeded in determining 
rational differential equations of the second 
order without movable branch points. Such 
equations, with the exception of those that 
are integrated by the use of solutions of the 
first-order and linear differential equations, 
can be transformed by rational transforma- 
tions into one of the following six differential 
equations: 


(I) y"=6y?+x, 
(I) y’=2y?+xy+a, 


MD y’=————+——— + yy? +— 
y x x 
42 3 3 

(IV) y=} pany? + 2(x? eye 
2y 2 y 


l 1 y 
V "y2 + 
(V) y"=y (= =) 5 


-17 yy ôy(y+l 
Po (14E) 2+ y(y+ 1) 
y x y-l 


bd 


(VI) y= 


i 1 1 ) 
—y'| -+ + + 
x x-l y-x 
Bx y(x—1) dx(x—1) 
i i y p- p-a | 


where «, ß, y, and 6 are constants. These equa- 


y(y—1)(y—x) 
x2(x—1) 
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tions and their solutions are called Painlevé 
equations and Painlevé transcendental func- 
tions, respectively. Equation (VI) was dis- 
covered by B. Gambier, who found an omis- 
sion in Painlevé’s calculations. All solutions of 
(I) are single-valued, and their properties were 
investigated by Boutroux. The solutions of 
(VI) have, in general, logarithmic branch 
points at x =0, 1, œ and were studied by R. 
Garnier. 

The case when the equation is of degree 2 
with respect to y” was studied by Malmquist 
and F. Tricomi. 

The following facts are known concerning 
movable transcendental singularities of the 
rational equation y” = P(x, y, y Q(x, y, Y), 
where P and Q are relatively prime poly- 
nomials (Kimura). Let p, q be the degrees of P 
and Q with respect to y’. If p>q +2, then any 
solution g(x) admits no movable essential 
singularities, but its derivative ~'(x) may admit 
such singularities. If p>q+2 and Q is not 
decomposable as Q, (x, y)Q3(x, y, y), then 
neither o nor g’ admits movable essential 
singularities. If p<q+2, then both p and ¢’ 
may have movable essential singularities. If 
p<q+2 and Q is not decomposable as above, 
then g’ has no movable essential singularities. 
If, for a solution (x), x =a is a movable essen- 
tial singularity of g(x) or g’(x), the (x) or 
(x) assumes all values other than a finite 
number of exceptional values in an arbitrary 
neighborhood of x =a. If p>q+2, then every 
solution possesses tIversen’s property, and 
hence the set of movable singularities is not a 
continuum. 


D. Higher-Order Equations and Other 
Equations 


Painlevé’s method of obtaining the second- 
order equations without movable branch 
points is applicable to higher-order equations. 
The determination of third-order equations 
without movable branch points was attempted 
by Painlevé, J. Chazy, and Garnier by the use 
of this method, but is not yet complete. Chazy 
studied in detatl an equation of the form 


t (1—1/n)y”? t t" + 
y Aa a Le +e(y)y? 


and showed that when n= —2 and b(y)= 
0, tFuchsian and *Kletnian functions are 
obtained as solutions (— 32 Automorphic 
Functions). 

R. Fuchs, a son of L. Fuchs, derived equa- 
tion (VI), at almost the same time as Gam- 
bier, from the study of monodromy groups. 
He showed that the monodromy group of the 
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—— = | — + —_~ HM 
z dt? (i (t—x) 
ô 3 a 


PUD S EEN 


ay 
=DE) 


remains invariant as the singularity x varies if 
and only if x, 8, y, and 6 remain constant; the 
singularity y, considered as a function of x, 
satisfies equation (VI); and a and b are rational 
functions of x and y and y’. Investigating a 
second-order linear differential equation of 
Fuchsian type with tregular singularities 0, 1, 
©, Xisor- Xans Y1 -++ Yn Where y4, ..., Yp are ap- 
parent fixed singularities, Garnier was led, 
under the hypothesis that the monodromy 
group of this equation remains invariant as x,, 
<- Xp Vary, to a tcompletely integrable system 
of partial differential equations and showed 
that a symmetric function of y,,..., Ya, CON- 
sidered as a function of any one of x,,...,x,; 
satisfies an equation without movable branch 
points (— 253 Linear Ordinary Differential 
Equations (Global Theory)). 

For nonalgebraic equations, movable 
branch points appear even in the first-order 
case. Kimura obtained a sufficient condition 
that for an equation F(x, y, y’)=0, where F is a 
polynomial of y’ with meromorphic coeffi- 
cients in x, y, every solution has Iversen’s 
property in a domain of the complex plane. 

It was shown by O. Holder that the T- 
function satisfies no algebraic differential 
equation. Also, if a function meromorphic in 
the unit circle is of torder œ, then it satisfies 
no algebraic differential equation. 
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289 (XIII.9) 

Nonlinear Ordinary 
Differential Equations 
(Local Theory) 


A. General Remarks 


Consider a system of n differential equations 


j=1,...,n, (1) 


where the f, are analytic functions of x, y4, 
<- Yn: TO simplify the notation, we use vector 
notation y instead of (y,,..., y,). If all the 

f; are holomorphic at a point (x, y)=(a, b), 
there exists one and only one solution y(x) for 
(1) such that y>b as xa and y(x) is holo- 
morphic at x=a. We say that a point (a, b) is 
a singular point of the system (1) if it is a sin- 
gular point of f; for some j. The well-known 
*Cauchy’s existence theorem can no longer be 
applied to the case when (a,b) is a singular 
point of system (1). In this case, the following 
three problems arise naturally: (i) to determine 
whether solutions y(x) such that y(x)—>b as 
x—a exist, and if they exist, to determine the 
number of independent solutions; (ii) to con- 
struct analytic expressions for solutions y(x) 
such that y(x)—>b as xa or, ina slightly more 
general way, analytic expressions for bounded 
solutions y(x) such that the values of (x, y(x)) 
stay in a neighborhood of the singular point 
(a, b); (iii) to investigate the properties of these 
solutions. These three problems are called 
local problems, since only those solutions in 

a neighborhood of the singular point (a, b) 

are considered. However, even when n= 1, - 
the study of local problems is very difficult 
except for the case of singular points of par- 
ticular types at which the functions f; are 
meromorphic. 

When n> 1, the problem becomes even 
harder; research on this case lags that for n=1. 
In the subsequent discussion we assume with- 
out loss of generality that a=0, b=0. 


dy;/dx = f(X, Yis ---5Vn)s 


B. The Case of a Single Equation 


Consider the equation 
dy/dx = Y(x, y)/X (x, y), (2) 


where X and Y are holomorphic functions of 
(x, y) at (0,0) x, yeC. When X(0,0)=0 and 
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Y(0,0) 40, we can rewrite equation (2) in the 
form dx/dy = X (x, y)/ Y (x, y), and we see that (i) 
if X (0, y) #0, equation (2) has one and only 
one solution y(x) which is algebraic at x =0 
and tends to 0 as x0; (ii) if X (0, y) =0, there 
is no solution of equation (2) such that y0 as 
x0. 

The case where X(0,0)=0 and Y(0,0)=0 
was first studied by C. A. A. Briot and J. C. 
Bouquet. In order to obtain algebraic solu- 
tions they introduced a method similar to 
*Puiseux expansion in the theory of algebraic 
functions. A. R. Forsyth and J. Malmquist 
studied a problem of reduction for equation 
(2) by using the Briot-Bouquet method. The 
theory of reduction was completed by M. 
Hukuhara, who divided a neighborhood of 
(0,0) into a finite number of subdomains in 
such a way that (i) the union of these sub- 
domains covers the given neighborhood of 
(0,0) completely; (ii) in each of these sub- 
domains equation (2) takes one of eight canon- 
ical forms. Hukuhara investigated the prop- 
erties of the solutions for these canonical 
forms. Among them, the following two are 
well known as the Briot-Bouquet differential 
equations: 


xdy/dx= f(x,y),  f(0,0)=0, (3) 
x°*! dy/dx = f(x, y), 
f(0,0)=0, o21 is an integer. (4) 


C. Systems of Differential Equations 

When y is a vector with components (y,), equa- 

tions (3) and (4) can be written as 

Jehat (5) 
j=l,...,n. (6) 


x dy,/dx = f(X, Yis -<> Yn) 
x7"! dy, /dx = f(x, Yis -3 Vas 


After Briot and Bouquet, the singular points of 
these types were studied by many authors, 
including Poincaré, E. Picard, H. Dulac, 
Malmquist, W. J. Trjitzinski, and Hukuhara. A 
method of constructing solutions of these 
equations consists of two parts: (i) formally 
transforming the given equations into simpli- 
fied or reduced equations of the simplest pos- 
sible form by applying a formal transforma- 
tion of the type 


y=} paux*z' (7) 
or 


= ko ok k 
V=} Prok, kX OZI eee ZH" 


j=1,2,...,n; (8) 


(ii) verifying convergence or the validity of 
tasymptotic expansions for the formal solu- 
tions of the given equations, which are ob- 
tained by substituting bounded solutions of 
the equations satisfied by z or z; into (7) or (8). 
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By studying these analytic expressions, the 
properties of the solutions can be clarified. 


D. Properties of Solutions of Briot-Bouquet 
Differential Equations 


For equation (3), the character of the simpli- 
fied equation depends on the value of å = 
f,(0, 0). We have the following four cases: (i) 
4 is neither 0 nor negative. A suitable formal 
transformation (7) changes (3) to 


xdz/dx = Az+ bx’. 


In particular, if 2 is not equal to a positive 
integer, then b=0. The double power series (7) 
is uniformly convergent. There exists a func- 
tion @(x, z) of (x, z) holomorphic at (0,0) such 
that y=¢(x, x*(blogx + C)) is a general solu- 
tion of (3) with an integration constant C. (ii) 
4=0. The equation satisfied by z takes the 
form 


xdz/dx=2z""'(b+b'z™, mæl. 


If b=0, b’ necessarily vanishes and x=O is a 
holomorphic point of (3). A general solution is 
given by 


Z(x)=(C—mblogx)"’", b#0, b'=0, 
and 
b’ (mb? 1 zam 
Z(x)= į —a{ ——log—+C , bb’ #0. 
b b' x 


Here, €=a(t) is the branch (of the inverse 
function of č —log¢ =t) such that a(t)—tlogt 
—~t—Qas t>o. Then there exists a holo- 
morphic function (x, z) of (x, z) for |x| <ô, 
|margz+argb|<3r/2—e, |z|<A such that 
(x, Z(x)) is a general solution of (3). (iii) A is a 
negative rational number — z/v. The equation 
satisfied by z is written as 


vx dz/dx = 2z(— u+ b(x"z°)" + b’(x"z")?”). 
A general solution has the form 


Z(x)=x*(C—mblogx) /",  b’=0, 


and 
b b2 1 —1/my 
Z(x)=x*{—a it log—+C , 
b b x 


In this case, there exists a holomorphic func- 
tion ~(x, z) of (x, z) for |mpargx+mvargz+ 
argb+2/2|<a—g, |x|<6, |z|<A such that 
(x, Z(x)) is a general solution of (3). M. 
Iwano expressed this solution in the form 

Ww ((x*Z(x)’)", x, Z(x)), where (w, x, z) is 
holomorphic for jargw +3|<2—«¢, 0<|w}<ô, 
|x|<A, |z|<A. In particular, if b=b’ =0, p(x, z) 
is holomorphic at (0, 0). (iv) 4 is a negative 


bb’ #0. 
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irrational number. The equation in z has the 
form 


xdz/dx = 4z. 


The formal transformation (7) may either 
diverge or converge. Dulac proved that if (7) 
diverges and if there exists a solution y(x) such 
that y(x)>0 as x0 along a suitable path L, 
then |x*(x)’ argx|> œ and |x*y(x}f arg y(x)| 
=œ as x0, xe L for any « and f. However, 
the existence of such a solution is not yet 
verified. C. L. Siegel proved that if A satisfies 
certain inequalities, the formal transformation 
(7) is divergent. An example such that (7) is 
divergent was first given by Dulac. A very 
simple example for such a case was given by 
Y. Sibuya. 

In the case f(0,0)=0 and A= f,(0,0) 40 for 
(4), the most complete result was obtained by 
Hukuhara, namely: (i) A suitable formal trans- 
formation (7) reduces equation (4) to 


x°*) dz/dx =2(Gyot%X+4+...+4 9X7), %=A. 


A general solution is given by Z(x)=C: 
x@e4), where A(x) is a polynomial in 1/x of 
degree ø. (ii) A general solution of (4) is ex- 
pressed by a uniformly convergent power 
series of the form È ~,(x)Z(x)*, where the ~,(x) 
are holomorphic functions of x for a certain 
sectorial neighborhood of x =0 and have 
‘asymptotic expansions È p,x* as x0. 
Assume f;(0, 0, ...,0)=0 for equations (5). 
Put 2 = 6f;/0y,(0,0, ...,0). Denote by 4,,..., 
Àn the eigenvalues of an n x n matrix with 
elements {/,,}. Then (i) if an angle w can be 
chosen so that for some m <n, all of |cl, 
ljargA, — wl, ...,Jarg4,,—«@| are less than 7/2, 
equations (5) possess solutions that are 
expressed as uniformly convergent (m + 1)- 
tuple power series of x, Z,(X), ..., Zm(X): 


ko Fk k 
Y Piz ky wk X ZP sAn, 
o%1 m 


Here the Z,(x) are general solutions of the 
simplified equations and have the expression 


Z,(x)=x** (C, +a polynomial of 
Cis., G-;, log x), 


(ii) Moreover, Iwano extended the result of (i) 
as follows: When there exists one and only one 
zero among the other n— m eigenvalues, equa- 
tions (5) have solutions, depending on m+ 1 
arbitrary constants, that are expressed as (m+ 
1)-tuple uniformly convergent power series 


2 Pika, kn (Zm OXZ (x)... Zig). 


Here the coefficients py. k,...k„(Zm+1) are holo- 
morphic functions of z,,,, in a sectorial neigh- 
borhood of z,,,,=0 and admit asymptotic 
expansions in powers of Zm+1 aS Zm+; 70. In 
this case, the functions Z, (x), -.., Zm+1(x) can- 


k=1,2,...,m. 
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not generally be integrated by quadratures, 
but as Z,,.;(x)0, the power series expansions 
coincide with the expressions obtained in (i). 
(iii) In the case when the Jacobian matrix (A;,) 
is the zero matrix, Iwano constructed, under 
additional assumptions, a convergent analytic 
expression of a general solution for equations 
(5). 

Let the right-hand side of (6) be holomor- 
phic at (0,0, ...,0). In 1939 Trjitzinski proved 
the existence of solutions that admit asymp- 
totic expansions in powers of n arbitrary con- 
stants. In 1940 and 1941, Malmquist proved, 
under strong conditions, the existence of 
solutions that are expressed as uniformly 
convergent power series of Z,(x), ..., Z,(x): 

È Pik, kg Z, (x)... Zg(x)**. Here the Z,(x) are 
polynomials of x and log x of the form 


Z, (x) =e x4 (C, +a polynomial of 


Cy-1,---> Ca, log x), k= 0p 03 8, 


where the coefficients admit asymptotic expan- 
sions in powers of x. Trjitzinski’s result is 
contained in Malmquist’s result as a special 
case. On the other hand, under much weaker 
assumptions than Malmquist’s, Hukuhara 
solved a problem on the formal simplification 
of equations (6) and formal solutions. Iwano 
improved Hukuhara’s result on formal solu- 
tions and discussed the convergence of formal 
solutions under weaker conditions than Malm- 
quist’s (Ann. Mat. Pura Appl., 1957, 1959). 
The pj,..x,(X) are holomorphic functions of 
x in a sectorial neighborhood of x =0 and 
admit asymptotic expansions in powers of x as 
x—0. The angle of the sector in which the 
asymptotic expansions are valid is largest for 
Iwano’s method. 

If Èc <n, equations of the form 


x%idy;/dx=f(X,y1,---,Vn) July... 


possess at least n- È a; solutions that are 
holomorphic for |x|<6 (R. W. Bass, Amer. J. 
Math., 77 (1955)). This result is analogous to 
those obtained by O. Perron, F. Lettenmyer, 
and Hukuhara and Iwano in the linear case 
(— 254 Linear Ordinary Differential Equa- 
tions (Local Theory)). 


E. Singular Perturbations 


The terms on the right-hand side of the non- 
linear differential equations 


e%Gdy;/dx = f(x,y gees , Yn» E) 


(9) 


are holomorphic functions of (x, y, €) for |x| <a, 
lyi <b, 0<ļeļ<c, |arge|<d, and admit uni- 
formly convergent expansions in powers of y 
with coefficients asymptotically developable in 
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powers of e. The g; are nonnegative integers. 
W. R. Wasow, W. A. Harris, Stbuya, and 
Iwano and T. Saito discussed problems on 
constructing asymptotic or convergent expan- 
sions for bounded solutions that are depen- 
dent on several arbitrary constants. 

In equations (9), the f; and Of;/Cy, are con- 
tinuous functions of (x, y, £) for —00<x< +o, 
lvl <b, |e| <c, and periodic functions of period 
T with respect to x. Moreover, assume that a 
system of degenerate algebraic equations 


0= f(x,y, s+ Ym 0), 


has a periodic solution y;= p;(x) of period T 
for —œ <x < +o. Then if equations (9) have 
periodic solutions y;= p;(x, £) of period T such 
that y;>p;(x) as e—0, the p,(x, £) are called 
singular perturbations of p,(x) for equations (9). 
Concerning this problem, see I. M. Volk (Prikl. 
Mat. Mekh. SSSR, 10 (1946)). The work of 
Wasow (1950) on a single equation 


j=: (10) 


o>0, n>m20, 


(11) 


yO = f(x, y, y’, Y, 8), 
is remarkable. 
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290 (Xill.11) 
Nonlinear Oscillation 


A. General Remarks 


By nonlinear oscillation we usually mean oscil- 
lation described by periodic or talmost peri- 
odic solutions of nonlinear ordinary differential 
equations. The theory of nonlinear oscillation 
is sometimes called nonlinear mechanics. In 
connection with oscillations in tdynamical 
systems and electrical circuits, the theory of 
nonlinear oscillation has been studied inten- 
sively in the Soviet Union under the direction 
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of N. M. Krylov and N. N. Bogolyubov since 
around 1930, and after World War II research 
in this field became active in Western countries 
also. 

Written as first-order systems, the differen- 
tial equations in this theory take one of the 
forms 


dx/dt = X(x) l (1) 
or 
dx/dt = X(x, t), (2) 


where x is a vector and t is a scalar. A dif- 
ferential equation of the form (1) ts said to 

be autonomous. In a differential equation of 
the form (2), X(x, t) is usually assumed to be 
periodic or almost periodic in t. In the former 
case, the differential equation (2) is said to be 
periodic, and in the latter case, almost periodic. 
Oscillations in physical systems are described 
mostly by periodic or almost periodic dif- 
ferential equations; therefore it is a principal 
problem in the theory of nonlinear oscillation 
to find a periodic or almost periodic solution 
of these differential equations. However, an 
oscillation described by a solution of a dif- 
ferential equation can be actually realized only 
when the solution is tstable (— 394 Stability) 
under a small variation of the initial value. 
Therefore it is important to investigate the 
stability of periodic or almost periodic solu- 
tions. In view of the fact that an actual phe- 
nomenon may be only approximately de- 
scribed by mathematical equations, sometimes 
it is necessary to require a certain stability 

of the system so that solutions of a peri- 

odic or almost periodic equation stay stable 
under a small variation of the equation itself. 
Such stability is called structural stability, 

the investigation of which is also important. 

It may happen that a differential equation 
possesses neither a periodic solution nor an 
almost periodic solution, but that it has an 
almost periodic tintegral manifold (i.e., a mani- 
fold x =f(t, 8) in tx-space, where 0 is a para- 
meter, such that f(t, 0) is periodic or almost 
periodic in t and periodic in 0) containing the 
‘trajectory of the differential equation passing 
through an arbitrary point of the manifold. In 
this case, we can consider that a solution 
corresponding to a trajectory lying on the 
manifold describes an oscillation. Therefore 
it is important to find a periodic or almost 
periodic integral manifold of a differential 
equation and to investigate the stability of 
such a manifold. 

The methods used most frequently in re- 
search on nonlinear oscillation are: (i) geo- 
metric methods, (ii) analytic methods, and (iii) 
numerical methods. 
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B. Linear Oscillations 


Let S, and S¥ be the spaces of all -periodic 
solutions of the w-periodic linear system 


dx/dt=A(t)x, A(t+@)=A(2), (3) 


and its adjoint system, respectively. Then the 
space P, of continuous w-periodic functions 
RR’ has direct sum decompositions P „= 

Si +S, and P,,.=S*+S*¥ so that (x, y)=0 for 
every xE S], yES, or xeS¥, ye S¥, where (x,y) 
=(1/w){$'x(s)y(s)ds. Let {€',...,€"} and 
{y',...,9™\ (m may be 0) be bases of S| and S* 
orthonormal with respect to (-,-). Then for 
peP,, there is a unique w-periodic solution x 
of 


dx/dt = A(t)x + p(t) (4) 


belonging to S, under the condition (n*, p)=0 
(k=1,...,m), which is represented in the form 
x =G[p] by a bounded linear operator G: P,,> 
P.,. Also, if (3) and hence its adjoint system 
have no nontrivial w-periodic solution, then 
(4) always has a unique w-periodic solution 
given by x = G[p], and this is true even if the 
co-periodicity is replaced by almost periodicity. 
The almost periodic system (3) is said to be 
regular if (4) has almost periodic solution for 
an arbitrary almost periodic function p(t). A 
necessary and sufficient condition for (3) to be 
regular is that (3) induce an exponential di- 
chotomy, that is, the solution space S of (3) 
have a direct sum decomposition S=S_ +S, 
such that |x(t)|< Me™" |x(s)| holds for —co 
<s<t<o ifxeS, and for -o<t<s<o if 
xeS_, where M and y are positive constants. 
An autonomous (resp. periodic) system (3) is 
regular if and only if no *characteristic roots 
(resp. characteristic exponents) have zero real 
parts. 


C. Geometric Methods 


Geometric methods are used frequently for 
finding a periodic solution in an autonomous 
or periodic case. In the autonomous case (1), a 
periodic solution describes a closed orbit (t is 
a parameter of a curve) in x-space, which is 
usually called a phase space. The geometric 
method is used to show the existence and the 
stability of a closed orbit by investigating 
geometrically the behavior of orbits in the 
phase space. In such an approach, the prop- 
erties of tcritical points and tlimit sets (— 126 
Dynamical Systems) are utilized frequently. 
This method is effective especially for 2- 
dimensional cases on the basis of the tPoincaré- 
Bendixson theorem, and various results are 
given for a generalized Liénard’s (or Duffing’s) 
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differential equation ¥ + f(x)ž + g(x)=0, which 
includes the van der Pol differential equation 
X—A(1—x?)X+x=0[]1,2]. 

In the periodic case (2), let w(>0) be a 
period of X(x, t) with respect to t and x= 
e(t, a) be a solution of (2) such that (0, x)= 
a. Then a periodic solution of (2) is given by 
X= P(t, %), where &o satisfies p(w, %o) = ap. 
Thus the existence of a periodic solution can 
be shown by investigating geometrically the 
existence of a ‘fixed point of the mapping x> 
x’ = ~(w, x) in the phase space. In such an 
approach, *Brouwer’s fixed-point theorem is 
utilized frequently. In the mapping xx’, it 
may happen that there is no fixed point but 
that there exists an tinvariant manifold. In this 
case, we get a periodic integral manifold. 

Geometric methods give information on 
qualitative properties, but usually not on 
quantitative properties like the shape of an 
oscillation. Thus these methods are in general 
not sufficient for the analysis of the phenom- 
ena met in practice. 


D. Analytic Methods 


At present analytic methods are used most 
frequently in the study of nonlinear oscilla- 
tions because, in comparison with geometric 
methods, they enable us to get many quantita- 
tive results in addition to the qualitative ones. 
However, these methods are usually efficient 
only for weakly nonlinear differential equations, 
that ts, differential equations differing only 
slightly from linear differential equations (gen- 
eral nonlinear differential equations are called 
sometimes strongly nonlinear differential equa- 
tions). In this sense, analytic methods are all 
tperturbation methods in the wider sense [3], 
and the variety of the methods lies in the form 
of perturbation and the method of calculation. 
To make use of analytic methods, we always 
reduce the given differential equation to a 
differential equation of the form 


X= Ax +éX(x, t, 8). (5) 


Here £ is a parameter with small absolute 
value, A is a matrix of the form A= 
diag(O,, B), where O, is a p x p zero matrix 
and B is a matrix whose eigenvalues have all 
nonzero real parts, and X(x, t, £) is periodic or 
almost periodic in t. 

(i) When X(x, t, £) is periodic in t, Poincaré’s 
perturbation method is used frequently. 

(ii) In practical problems, we frequently meet 
the case A =0, that is, the case where (5) is of 
the form 


x = &X(x, t, 8). (6) 
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In this case, the average X, (x, ¢)= 

lim ,o(1/T) {3 X(x, t, e)dt exists. If dx/dt = 
Xo(x, 0) has a periodic solution &(t) and the 
related variational linear system is regular, 
then (6) has an (almost) periodic integral mani- 
fold x =f,(t, 0) such that f,(¢, 0) €(0) uniformly 
as 0. The method of averaging based on 

this fact was devised by Bogolyubov and 
Mitropol’skii [4] and is used frequently. The 
equation 


X+ wx =eéf(x, X, t,£) (7) 


is one of the examples that can be reduced to 
an equation of the form (6). Equation (7) ap- 
pears frequently in practical problems, and 
hence various convenient techniques are de- 
vised, such as the method of linearization, the 
asymptotic method, the method of harmonic 
balance [4], etc., through which we can apply 
the method of averaging directly to the given 
equation (7). 

(iii) Consider an w-periodic system 


dx/dt = A(t)x + f(x, t). (8) 


For any x(t)e P,,, dx/dt = A(t)x + f(x(t), t) is of 
the form (4) and it has an w-periodic solution 
Er- a" + GLNx], or 


m 


Tx=) ača] nx- X ih. Noon | (9) 
k=1 k=1 
under the condition (n*, Nx)=0 (k=1,...,m) as 
in Section B, where N : P „>P, is defined by 
Nx=f(x(-),-), and the w-periodic solutions of 
(8) correspond to the fixed points of the trans- 
formation T: P,, P, induced by (9), where the 
a, in (9) are given by a,=(é*,x), as is required 
when x is a fixed point. This is the alternative 
or bifurcation method [6—9], which is extend- 
able to the case of almost periodic systems 
under the regularity of (3) [10]. In order to 
obtain a fixed point of T, various kinds of 
fixed point theorems can be utilized. 

(iv) In Section C and (iii) above, the search 
for a fixed point of an appropriate mapping is 
a principal device. However, the choice of a 
suitable domain for the mapping is a crucial 
problem, and usually an a priori bound for 
solutions is looked for. The concept of stability 
and hence Lyapunov’s second method are 
effective in such situations [11]. 


E. Numerical Methods 


Numerical methods are used for obtaining 
explicit forms of the oscillations. They are 
convenient in practical applications, since they 
can be used efficiently whether or not the 
nonlinearity of the system is weak. For an 
autonomous or periodic system, one can uti- 
lize the following methods as efficient means 
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of computing periodic solutions: Newton’s 
iterative method, the finite-element method, the 
Lindstedt-Poincaré method, the method of 
multiple scales, the method of harmonic bal- 
ance, the Galerkin method, etc. [12-15]. 

For weakly nonlinear cases, analytic meth- 
ods (that is, perturbation methods) are very 
efficient. However, when a parameter is fixed 
beforehand, it is not easy to know whether 
the conclusion obtained by the perturba- 
tion method is valid for the given value. For 
strongly nonlinear cases, it is very difficult 
to analyze the problems by analytic methods, 
and at present hardly any efficient methods 
exist. Numerical methods will therefore be- 
come exceedingly important in research on 
nonlinear oscillations. These are now of toler- 
able efficiency and reliability, due to progress 
in high-speed machine computation. 


F. Nonstationary Oscillations 


Physically speaking, an oscillation is a station- 
ary state. However, when a system contains a 
parameter varying slowly with time, the oscil- 
lation also varies slowly with time (for exam- 
ple, when the length of a pendulum varies 
slowly, the amplitude of the pendulum also 
varies slowly). Such variation of oscillations in 
the course of time is represented by dx/dt = 
ef(x, t, et, €), where f(x, t, s, £) is (almost) peri- 
odic in t, s. Such cases provide important 
problems in the theory of nonlinear oscilla- 
tions; one such problem has been posed by 
Mitropol’skii [16] under the name nonstation- 
ary oscillations and has been investigated by 
means of the method of multiple scales [15]. 


References 


[1] G. Sansone and R. Conti, Nonlinear dif- 
ferential equations, Pergamon, 1964. (Original 
in Italian, 1956.) 

[2] S. Lefschetz, Differential equations: Geo- 
metric theory, Interscience, 1959. 

[3] G. E. O. Giacaglia, Perturbation methods 
in non-linear systems, Springer, 1972. 

[4] N. N. Bogolyubov and Yu. A. Mitro- 
pol’skti, Asymptotic methods in the theory 

of nonlinear oscillations, Gordon & Breach, 
1961. (Original in Russian, 1958.) 

[5] N. Minorsky, Nonlinear oscillations, Van 
Nostrand, 1962. 

[6] J. K. Hale, Ordinary differential equations, 
Wiley, 1969. 

[7] M. A. Krasnosel’skii, Translation along 
trajectories of differential equations, Amer. 
Math. Soc., 1968. (Original in Russian, 1966.) 


1093 


[8] L. Cesari, R. Kannan, and J. D. Schuur, 
Nonlinear functional analysis and differential 
equations, Dekker, 1976. 

[9] N. Rouche and J. Mawhin, Equations 
differentielles ordinaires II, Masson, 1973. 
[10] M. A. Krasnosel’skii, V. Sh. Burd, and 
Yu. S. Kolesov, Nonlinear almost periodic 
oscillations, Wiley, 1973. (Original in Russian, 
1970.) 

[11] T. Yoshizawa, Stability theory and the 
existence of periodic solutions and almost 
periodic solutions, Springer, 1975. 

[12] C. Hayashi, Nonlinear oscillations in 
physical systems, McGraw-Hill, 1964. 

[13] A. Andronov, A. Vitt, and S. Khaikin, 
Theory of oscillations, Addison-Wesley, 1966. 
(Original in Russian, 1959.) 

[14] M. Urabe, Nonlinear autonomous oscil- 
lations, Academic Press, 1967. 

[15] A. H. Nayfeh and D. T. Mook, Nonlinear 
oscillations, Wiley, 1979. 

[16] Yu. A. Mitropol’skii, Problems of the 
asymptotic theory of nonstationary vibrations, 
Daniel, 1965. (Original in Russian, 1964.) 


291 (XIII.12) 
Nonlinear Problems 


A. General Remarks 


Nonlinear problems deal with nonlinear map- 
pings or operators and the related equations. 
Until recently it was customary to consider 
nonlinear problems as belonging to applied 
mathematics and the physical sciences. How- 
ever, nonlinear problems now belong to mod- 
ern mathematics. Many phenomena in math- 
ematical physics are essentially described 

by nonlinear equations, e.g., the motions of 
several particles or of viscous or compressible 
fluids (— 420 Three-Body Problem, 204 Hy- 
drodynamical Equations). Some of these equa- 
tions are approximated by linear equations 
only when the variables appearing in the equa- 
tions are restricted to very small domains; 
they are treated by perturbation methods 
when the variables stay in comparatively small 
domains. If these requirements cannot be met 
and we have to deal with equations in which 
the variation of the variables are not negli- 
gible, nonlinear problems certainly arise. 

The methods of solution of nonlinear prob- 
lems are not as powerful or general as those 
for linear differential equations. For instance, 
the tprinciple of superposition of solutions 
does not hold for nonlinear problems, and 
therefore Fourier methods are no longer ap- 
plicable. Indeed, some nonlinear problems can 
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be dealt with only by means of very particular 
or ad hoc devices. 


B. Methods Used in Nonlinear Problems 


Consider a nonlinear equation 
G(x)=0, (1) 


where G is a nonlinear mapping or operator of 
a subset S of a linear space X into itself. If we 
put G=I-—F (I is the identity), equation (1) 
becomes 


x = F(x). (2) 


Then a solution of (2) is a fixed point of F. 
Therefore fixed-point theorems of various kinds 
are useful for solving (2) (— 286 Nonlinear 
Functional Analysis). 

Let x” eS, and suppose that we can define 
x, k=1,2,..., by 


x= F(XAD), k=1,2,.... 


If F is continuous and the sequence x con- 
verges to a point xéS, then x is a fixed point 
of (2). Such a method of constructing an ap- 
proximate sequence by iteration is called the 
iterative method. ‘Newton’s iterative process, 
given by 


x) e x1) eat [G(x*" »] -1 G(x"), 


is one such method, where G’ denotes the 
*Fréchet derivative of G. 

If X is an infinite-dimensional space, many 
concepts and methods of the theory of func- 
tional analysis can be used for a number of 
nonlinear problems (— 286 Nonlinear Func- 
tional Analysis). 

We note that there exist nonlinear trans- 
formations that change nonlinear equations 
into linear ones. For example, the thodograph 
method, which is often applied in hydrody- 
namics, consists of reducing a system of fquasi- 
linear partial differential equations of the form 


A,(u, v)u, + Blu, v)u, + C,(u, v)v, + Du, v)v, =0 


to linear differential equations 
Aiy, — Byx,—Cy,+ DiX = 0 (i= 1,2) 


by means of the thodograph transformation, 
which changes the independent variables from 
x, y to u, v (— 205 Hydrodynamics). 


C. Nonlinear Algebraic and Transcendental 
Equations 
Consider a system of equations 


f%q....5%,)=0 (i=1,...,n). (3) 
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Newton’s iterative process can be applied as 
follows. Starting with a point x =(x, ..., 
x) lying near the desired solution, we define 
xP =(x®,..., x), k=1, 2,..., by solving the 
system of equations 


oi 
1 Ox; 


t 


(xD) (xP =x% D) + f(x H) =O 


TA: 


t 


Under some conditions the iteration {x} 
converges to the solution (— 301 Numerical 
Solution of Algebraic Equations). 

If the system (3) is a real one, (3) is equiva- 
lent to È f; =0. It is clear that a method of 
obtaining a minimum of a function f(x,,..., 
x,) is applicable to solving È f; =0. Taking a 
point x, we define 


x* =x® D — Apa Vf), 


where Vf =(6f/0x,,.. 
satisfy 


FP — Ay VAP) SE — AVP) 
(A>0). 


hee TF view 
., Of /0x,), and let A,_, 


Under suitable assumptions, a subsequence 
x) converges to x such that Vf(x)=0 and 
f(x) decreases monotonically to f(x) [1]. 

Let f be a continuous mapping from a 
domain D of R” or C” into itself. Then for any 
xeD, the iteration x™ can always be defined 
by x® = f(x® P). The problem of the behavior 
of x is an interesting and important one in 
pure mathematics; recent work in the physical 
sciences is yielding many new concepts related 
to this problem (— 126 Dynamical Systems, 
433 Turbulence and Chaos). 


D. Nonlinear Differential Equations 


Consider a nonlinear system of ordinary dif- 
ferential equations 


dx/dt=f{(t,x) (xER"). 


The initial value problem with initial condi- 
tion x(t) = € is equivalent to the problem of 
solving the nonlinear integral equation 


t 
x(t)=č+ | f(s, x(s))ds. (4) 
The solution of (4) is a fixed point of the 
operator T: p(t) ë+ fi f(s, p(s))ds defined 
for a suitable function space. Therefore fixed- 
point theorems are applicable to T, and the 
iterative method for T is called the tmethod of 
successive approximation. Also the method of 
the tCauchy polygon is useful (= 316 Ordi- 
nary Differential Equations (Initial Value 
Problems)). These ideas are extended to an 
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abstract Cauchy problem, 


dujdt=A(tu (t>0),  u(+0)=a, 


in a Banach space X, where ae X and A(t) is a 
nonlinear operator (— 286 Nonlinear Func- 
tional Analysis). 

For extensive studies of nonlinear ordinary 
and partial differential equations — 314 Ordi- 
nary Differential Equations (Asymptotic 
Behavior of Solutions), 290 Nonlinear Oscil- 
lation, 394 Stability, 321 Partial Differential 
Equations (Initial Value Problems), 323 Par- 
tial Differential Equations of Elliptic Type, 
325 Partial Differential Equations of Hyper- 
bolic Type. 

Nonlinear differential equations of special 
types appear in many fields of pure and ap- 
plied mathematics, e.g., the Monge-Ampére 
equation and the equation for minimal sur- 
faces in differential geometry (— 183 Global 
Analysis, 275 Minimal Submanifolds), and 
the Toda lattice equation and the Korteweg- 
de Vries equation in mathematical physics 
(— 287 Nonlinear Lattice Dynamics, 387 
Solitons). 


E. Nonlinear Problems of Control Systems 


The basic equation for a tcontrol system in 
which the state of the controlled object can be 
represented by an n-vector x is given by the 
following system of differential equations [3]: 


E=e(0), a=c'x—yt, (5) 


where X and Ë stand for dx/dt and dé/dt, re- 
spectively, č is a scalar function representing 
the control, b, c are constant n-vectors, y is a 
constant number, c’ is the transpose of c, and 
A is a constant n x n matrix whose character- 
istic roots have negative real parts. Further- 
more, we assume that when the control has 
no effect upon the system, x is determined by 
x= Ax. The quantities under consideration 
are all real. Finally, the function ọ = (ø) is a 
scalar function characteristic of the control 
mechanism. Generally, ọ is nonlinear in ø, and 
hence equation (5) is nonlinear. Normally we 
assume that ọ has the following properties: (i) 
(co) is a real-valued continuous function on 
(—oo, œ) with g(0)=0 and co(o)>0 for c # 
0; (ii) [F ” e(a)da = +00. We say that (5) is 
absolutely stable if 


č(t)>0 as 


x=Ax—(a)b, 


x(t) 0, t-> +00 


for any choice of ọ subject to (i) and (ii) and 
for every solution of (5). In the study of control 
systems an important problem is to obtain a 
necessary and sufficient condition for the 
system to be absolutely stable. In this connec- 
tion, we have the following result, due to M. V. 
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Popov: (5) is absolutely stable if there exists a 
nonnegative q such that 


Re{(1+imgq)(c(iwl — A) 1b) +qy>0 (6) 
for any real w, where I is the n x n identity 
matrix. Conversely, if the absolute stability of 
(5) is given by means of the tLyapunov func- 
tion (— 394 Stability) 


V(x, o)=x'Bx +a0* +af’x4+ s| o(o)do, 
0 


where B is a constant matrix and f is a con- 
stant vector, then there exists a nonnegative 
q for which (6) is satisfied for all real œ. 


F. Nonlinear Equations in Applied 
Mathematics 


Some examples of nonlinear problems are 
given here (— also 205 Hydrodynamics; 318 
Oscillations). 

(1) The nonlinear tdifferential-difference 
equation 


du(t)/dt =(a—u(t — 1))u(t) 


is called the Cherwell-Wright differential equa- 
tion [4]. Given the initial condition u(t) = g(t) 
(0<t<1, g(t) is a given continuous function), 
its solution is uniquely determined for 0<t< 
oo. If a<O and g(1)>0, then u(t)—>0 as t> 

oo; if a=0 and g(1)<0, then u(t) —œ as t> 
æ. For a>0, u(t) —œ as t>% if g(1) <0, 
while u(t) either approaches a monotonically 
(0<a<1/e) or oscillates (boundedly) around 
a {a> 1/e) if g(1)>0. In particular, we have 
damping oscillations for a<3/2, while oscilla- 
tory solutions without damping appear for 
a>n/2. 

(2) If we regard a star as a gas sphere and 
assume the polytropic relation p= Kp’ (K and 
y are constants) between the pressure p and the 
density p at each point inside the star, then 
we have a differential equation of the second 
order that determines the density distribu- 
tion. This equation constitutes the basis of the 
classical theory of the internal structure of the 
stars and is called Emden’s differential equa- 
tion or the polytropic differential equation. It 
reads: 


(1/¢°)a(? d0/dč)/dč = —0", 


where y= 1+ 1/n, p=/0", r=aé, a=((n+ 
1)KA”~?/(4nG))'?, with r the distance from 
the center, G the universal gravitational con- 
stant, and å an arbitrary constant. The solu- 
tion that satisfies the conditions 0=1 and 
d0/d€ =1 at €=0 is called the Lane-Emden 
function of index n. Emden’s equation is in- 
variant under the transformation ¢— Aé, 
0+A~7/""D0 (with A an arbitrary constant). 
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Although Emden’s equation can be reduced to 
an equation of the first order, it has not been 
possible to solve it analytically except for the 
cases n=O, 1, and 5. For certain values of n 
between 0.5 and 6, Emden gave numerical 
solutions, which were refined later by Green, 
D. H. Sadler, and D. C. Miller [5]. 

(3) Caianiello’s differential equations, which 
describe the state of a network of neurons [6], 
are 


x;(t+t)= ap >J af) x(t —rt) — J i 

jor 

where the function x;(t) represents the state of 
the ith neuron at the time ¢ and takes only the 
values 0 and 1. The state of the system is to be 
considered at the discrete times t =0, T, 21,.... 
The (real) coefficient a? represents the weight 
of the effect of hysteresis in the relay process 
from the cell j to the cell i. The nonnegative 
integer 0; is the threshold value of the cell i. 
Y[x] ts the unit step function that is equal to 1 
for x >0 and vanishes for x <0. 

(4) The following Hodgkin-Huxley differen- 
tial equation arises in the study of conduction 
and excitation in nerve systems [7]: 
0V 2ro ( ôV 


D Cozy t9m h(V— V) 


+g2n*(V— V2)+g3(V— w). 


™ = —(0,(V) + B,(V))m+0z,(V), 


ôt 


A 


h 
== —(x(V)+ B,(V))h+a,(V), 


oe —(a3(V) + B3(V))n+ «3 (V), 

where a;, f; (1 <i<3) are given functions of V, 
and ro, Ro, Co, gi, V; (1 <i< 3) are constants. 
The unknown function V is sought in the 
domain 0<x < 00, 0<t< œ, while the initial 
values of V, m, h, n, and the boundary value of 
V are given at t=0 and x=0, respectively. 
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A. Problems 


A nonlinear programming problem is a type of 
mathematical programming problem where it 
is required to minimize or maximize a non- 
linear function @(x) of n-vector variable x 
defined in a closed connected set X° with a set 
of linear or nonlinear constraints. Minimiza- 
tion or maximization of a continuously dif- 
ferentiable function (x) under the equality 
condition expressed by a set of continuously 
differentiable functions has been traditionally 
dealt with by the method of Lagrange multi- 
pliers. Hence a typical nonlinear programming 
problem is usually formulated as follows. 

(NLP) Minimize 6(x) under the condition 
xe X° CR" and g(x) <0 for i=1,2,...,m. 

Or, equivalently, determine the set of all 
x such that 0(X)=min,.-0(x), where C= 
{x|xeX° and g(x) <0}. 

Here we need only consider minimization, 
since maximization problems can be converted 
to minimization problems by virtue of the 
obvious relation max # = —min({—0). 

The set C is known as the feasible region or 
the constraint set, and X is called an optimal 
solution or simply a solution. In many non- 
linear programming problems X° is R”. If X° 
=R” and 0 and g are linear functions on R”, 
then the problem becomes a linear program- 
ming problem (— 255 Linear Programming). 
The problem of minimizing a quadratic func- 
tion subject to linear constraints is called the 
quadratic programming problem (— 349 Qua- 
dratic Programming). If X° is a convex set, 0 
is convex (or concave), and the g; are convex 
on X°, then the minimization (or maximiza- 
tion) problem is known as a convex (or con- 
cave) programming problem. Convex and con- 
cave functions are important in nonlinear 
programming, because they admit reason- 
ably straightforward sufficient conditions for 
optimality and also because they constitute the 
only important class of functions for which 
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necessary optimality conditions can be given 
without the differentiability condition. Subject 
to suitable modifications, the method of La- 
grange multipliers can also be applied to the 
solution of nonlinear programming problems. 
The Lagrangian function y associated with 
the minimization problem (NLP) is defined by 


W(x, u) = O(x) + u'g(x), 


where u=(u,,...,U,,) and w denotes the vector 
of Lagrange multipliers. 

A pair (x,u) is called a saddle point of y(x, u), 
provided xe X°, uc R”, u>0, and W(x, u)< 
W(x, u) <(x, u) for all xe X° and all ue R” 
such that u>0. It follows easily that: 

(1) (H. Uzawa [18]) If (x, u) is a saddle point 
of w(x, u), then X is an optimal solution. 

(2) (Kuhn and Tucker [11]) Assume that X° 
is open and convex, and that 0 and g are dif- 
ferentiable and convex. If there exists a pair 
(x,u) such that 


VO(x)+i’Vg(x)=0, xeC, i’g(x)=0, Z>0 


(where VO(x) and Vg(x) denote, respectively, 
the gradient vector of 8 at x and the Jacobian 
matrix of g at x), then X is an optimal solution. 


B. Necessary Conditions for Optimality 


In Section A sufficient conditions for opti- 
mality were given. Some necessary conditions 
are also known. 

(3) When 0(x) is continuously differentiable 
and all the g; are linear, that is, when the con- 
dition is expressed as Ax <b and x >0, where 
A is an m x n matrix and b an m-vector, X is 
an optimal solution only if X is an optimal 
solution of the following linear programming 
problem. 

(LP?) Minimize v’x under the condition that 
Ax <b, x >0, where v= VA(x). 

By applying the duality theorem of linear 
programming, it can be proved that there 
exists a vector 7 >0 such that 


Voe(x, 4) = VO(x)+0’A =0. 


(4) (F. John [9]) Assume that X° is open, 
and that 0 and g are differentiable. Then, if X is 
a solution of the problem (NLP), there exist 
uo > 0 and we R™ such that 


i, VO(X) + W’Vg(x) =0, 


g(x)<0, u’g(X)=0, m20. 


Now we consider the following: The vector- 
valued function g is said to satisfy Guignard’s 
constraint qualification at an inner point x of 
X° if any vector y satisfying the linear inequal- 
ities Vgi(x)* y <0 for ie I= {i|g,(x) =0} is in 

the convex hull spanned by the vectors tangent 
to the set C at x. 
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(5) (Guignard [7]) Let X° be an open set, let 
X be an optimal solution of (NLP), let 0 and g 
be differentiable at x, and assume that g satis- 
fies the Guignard constraint qualification. 
Then there exists me R” such that V@(x)+ 
u'Vg(x)=0, g(x) <0, and a’g(x)=0, 7>0. 

Guignard’s constraint qualification is satis- 
fied if neither of the following conditions 
hold: (L) Vectors g,(x) for ie/ are linearly 
independent. 

(S) (Slater’s constraint qualification) The g; 
are convex and X° is a convex set; and there 
exists a vector x such that g(x) <0 for all i. 
With convexity of 6 and g,, differentiability is 
not required: 

(6) (Kuhn and Tucker [11]) Let X°? be a 
convex set, let 0 and g be convex on X°, and 
assume that g satisftes Slater’s constraint 
qualification. If x is an optimal solution, then 
there exists ue R”, u20, such that u’g(x)=0 
and (x, i) is a saddle point of w(x, v) = @(x)+ 
u’g{x). 


C. Sensitivity Analysis 


Now consider the following class of problems. 

(Pc) Minimize the function @(x) under the 
condition that g(x)<c, where c is a real m- 
vector. We denote by X, the set of the solutions 
of the problem (Pc) and denote 0* = 0(x,), X.€ 
X.. Suppose that Guignard’s condition is 
satisfied for each c and that the set of La- 
grange multipliers A, is nonempty. Then for 
any real vector a, we have 


EA 


infa'u, < lim L oz, 1, ~ 9)<supa’u,, u,eA,. 
Therefore, if the Lagrange multiplier is unique- 
ly determined for some c, then u, represents 
the vector of the rates of increments of the 
objective function to the small increments of 
the components of the constraints vector. 
Hence the components of the Lagrange- 
multiplier vector are called the imputed 

prices or shadow costs of the constraints; these 
have important economic !mpiications, espe- 
cially when the objective function is expressed 
in terms of money or profits. 

More generally, we can consider the follow- 
ing class of problems. 

(GPc): Minimize the function @(x, c) under 
the condition that g(x,c)<0, where c is a real 
parameter. 

Denote by x(c) and u({c) the solution and the 
Lagrange multiplier of (GPc) corresponding to 
the parameter c, respectively, and let 0*(c)= 
6(x(c)). Then under a set of regularity condi- 
tions we have 


VO*(c)=V_O(x(c), c) + ule) V.g(x(C), c), 
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where V, denotes the gradient of 0 or g with 
respect to the parameter. 


D. Duality 


A duality theorem in mathematical program- 
ming is the statement of a certain relationship 
between two problems. This relationship has 
the following two aspects: (i) one problem is a 
constrained minimization problem and the 
other is a constrained maximization problem; 
(ti) the existence of a solution to one of these 
problems ensures the existence of a solution 
to the other, and in this case their respective 
values are equal. 

Let w(x, u) be the Lagrangian form of 
(NLP), and define œ(u) =inf eyo W(x, u). Then 
we can state the following two problems. 

(P) (primary problem) Miminize 6(x) under 
the condition that xe X° and g(x) <0 

(D) (dual problem) Maximize w(u) under the 
condition u >Q. 

If (x,u) gives a saddle point of w(x, u), then 


inf 0(x) = 0(X) = sup w(u) = w(u) = W(X, u). 


The dual problem can be formulated alterna- 
tively as follows. 

(D) Maximize (x, u)= (x) +u’g(x) sub- 
ject to (x,uje Y= {(x,u)|xeX°, ue R”, u>0, 
V,.W(x, u) =0} (where Vy (x, u) denotes the 
vector whose components are the parttal de- 
rivatives ôy (x, u)/0x; for i=1,...,n). 

There are a number of duality theorems 
related to problems (P) and (D); two such 
theorems are as follows: 

(1) (P. Wolfe [20]) Suppose that X° is open 
and convex, that 0 and g are differentiable and 
convex, and that g satisfies the Kuhn-Tucker 
constraint qualification. Then, if X is a solution 
of (P), there exists a me R” such that (x, u) is a 
solution of (D) and 6(x) = (Xx, i). 

(2) (O. L. Mangasarian and J. Ponstein [13]) 
Suppose that X° is open and convex, that 0 
and g are differentiable and convex, and that 
(%, û) is a solution of (D). If W(x, ô) is strictly 
convex in some neighborhood of Å, then £ is a 
solution of (P) and 6(%)= w(x, a). 

The above two problems (P) and (D) are not 
symmetric. The notion of symmetric duality 
was introduced by G. B. Dantzig, E. Eisen- 
berg, and R. W. Cottle: 


Primary: Minimize 

F(x, u)= K(x, u)—u'V,, K(x, u), 
subject to the constraints 

V, K(x, u) <0, x >0, and u>0; 
Maximize 

G(x, u}= K(x, u)— xV, K(x, u), 
subject to the constraints 
V.K(x,u)20, x20, and u>0, 


Dual: 
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where K is continuously differentiable in 
(x, u)e R” x R”. 

(3) Dantzig et al. proved [6] the existence of 
a common optimal solution (X, ït) to both the 
primary and dual problems, provided (i) an 
optimal solution (X, u) to the primary problem 
exists, (ii) K is convex in x for each u and 
concave in u for each x, and (itt) K is twice 
differentiable and the matrix of second partials 
(67K /éu' du’) is negative definite at (x, 1). 

Rockafellar [15] gave another expression of 
the duality relation: Define F(x, y)= (x) if 
g(x)<y and = œ otherwise, and denote (y) 
= inf... yo F(x, y). Then 0(x)= ¢(0). For any 
nonlinear function ¢(y), the conjugate ~*(x) is 
defined by 


p*(n)=sup (n'y — p(y). 


(4) (Rockafellar [15]) sup w(u) = p**(0)= 
clco (0), where clco (y) denotes the closed 
convex hull or maximum convex minorant 
of p(y) defined by clco y(y)=sup, l’y{1'z< 
~(z) for all z}. It follows from the above that 
inf (x)= sup w(u) if and only if g(0)=clco —(0), 
which holds true if (y) is convex. 

Further forms of the duality theorem hold 
for linear or quadratic programming problems 
(— 255 Linear Programming, 349 Quadratic 
Programming). 


E. Algorithms 


In a limited class of problems, i.e., when the 
objective function is quadratic and the con- 
straints are linear (— 349 Quadratic Program- 
ming) the optimal solution can be obtained by 
solving a system of linear equations by the 
simplex method or other algorithms; but in 
most nonlinear programming the solution is 
calculated by some kind of iterative procedure. 
Note that even when the constraints are given 
as equalities and all the functions involved are 
continuously differentiable, so that the optimal 
solution is explicitly given as a solution of a set 
of simultaneous equations, we usually require 
some iteration procedure, such as the Newton- 
Raphson algorithm, to obtain numerically the 
solution with preassigned accuracy; and the 
iteration is not always easy if the functions are 
sufficiently complex. 

Several iterative procedures for solving non- 
linear programming problems have been pro- 
posed. Since the simplex method is a powerful 
tool in linear programming, one type of ap- 
proach is to obtain an approximately optimal 
solution by approximating the objective and 
the constraint functions by piecewise linear 
functions and then applying linear program- 
ming techniques to get the approximately 
optimal solution within each region. Another 
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is to change the constrained problem to a 
nonconstrained one by introducing a suffi- 
ciently large number M, called the penalty, 
and then maximizing 


Dag (X) = O(x) + MY max(g;(x), 0) 


without the constraint. This is called the pen- 
alty method. 

The third and most generally applicable 
technique is the gradient method, of which 
several variations are known: 

(i) Arrow-Hurwicz-Uzawa gradient method 
[4]. Concave or convex programming prob- 
lems can be solved by finding a saddle point 
of the Lagrangian function w(x, u). Let (x, u) 
be strictly concave and of class C? in n-vector 
x >0 and convex and of class C? in m-vector 
u>0 and possess a saddle point (x, i). To 
approach a saddle point of g(x, u) it is natural 
to devise a gradient process of the form 


J 
= rn = 5 1 
dt 0x; dt Ou; 0) 
To keep the variables in the positive orthant, 


we need to modify (1), and we consider the 
following system of differential equations: 


ô 
0 if x,=0 and <0 


dx; ; 
dt lô 

E otherwise 

Ox; 

(i=1,...,n), 
; 6p 

du; 0 ifu;=0 and a” 
dt a l 

—-— otherwise 

Ôu; 


Under certain regularity hypotheses, there 
exists a unique solution (x(t), u(t)) of the sys- 
tem with any initial point (x°, u?), and the x 
component x(t) of the solution converges to x 
as t> 0. 

Applying the above results to the Lagran- 
gian function w(x, u) we can solve the concave 
or convex programming problem. 

(ii) Rosen’s gradient projection method [16]. 
If a point x° of the feasible region does not 
give a solution for the minimization problem 
(P), then we look for a feasible point with a 
lower function value by proceeding from x° in 
the direction of the gradient of the function 
— (x). The method fails if x° is a boundary 
point and if the gradient vector points toward 
the exterior of the feasible region. Rosen’s 
method [16] is to project the gradient onto the 
boundary of the feasible region and then pro- 
ceed in the direction of this projection. In this 
manner, we remain on the boundary of the 
feasible region. 
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(iii) Methods of feasible directions. These 
were first described by G. Zoutendijk [21]. 

Consider the problem of minimizing 0(x) 
subject to the constraint xe Sc R”, where S is 
a closed, connected set satisfying certain regu- 
larity conditions and @ is a continuously dif- 
ferentiable function of the n-vector x, such 
that, for some a, the set {xeS|0(x)<a} is 
bounded and nonempty. 

A method of feasible directions is any recipe 
for solving this problem by proceeding along 
the following lines: (1) Start with some x° eS 
such that 0(x°)<sup a. (2) Pass from the kth 
iteration point x* to x**! by first determining 
a direction s* in x* such that the ray x* + As* 
lies in S for all sufficiently small 14> 0. (3) Then 
determine the step length 2,, thus obtaining 
the (k+ 1)st iterate x**!=x*+1,5*. (4) Repeat 
this procedure until some prescribed stopping 
condition is satisfied. 

There are many methods of determining the 
s*, and in most cases the 4, are then deter- 
mined by solving a one-dimensional mini- 
mum problem along the direction so obtained. 
Zoutendijk has unified the various possible 
methods of feasible directions and the relevant 
normalization rules that yield the optimal 
directions s*, and has investigated this subject 
in detail from the viewpoint of computational 
technique [21, 22]. 


F. Generalizations 


The extension of the Kuhn-Tucker theory to 
linear topological spaces is due to L. Hurwicz 
[8]. Let X be a linear space, Y, Z be linear 
topological spaces, P,, Pz the nonnegativity 
cones of Y, X, respectively, which are closed 
convex cones containing inner points, Da 
convex set in X, and F, G concave mappings 
(— 88 Convex Analysis A) from D into Y, Z, 
respectively, such that G(D) contains an inner 
point of P,. If F(X) attains its maximal point 
when X = Xo and Xo satisfies G(X,)>0, Xo€D, 
then there exist ¥j* >0, Z* 20 such that 

D(X, Z*) = YŽ(F(X))+ Z%(G(X)) has a saddle 
point at (Xo, Z%*). The condition that P, and 

P, have inner points can be weakened to cover 
the cases of (l), (Lp), (s), (S) (Hurwicz and 
Uzawa). P. P. Varaiya [19] considered the 
following nonlinear programming problem in 
Banach space. 

(B) Maximize f(x) subject to xe A, g(x)E Ay, 
where X, Y are real Banach spaces, xe X, g: 
X—Y is a Fréchet differentiable mapping, f is 
a real-valued differentiable function, A is a 
subset of X, and Ay is a convex set in Y. 

The main results are similar to the Kuhn- 
Tucker necessary conditions. Varaiya also 
exhibited a saddle value problem related to 
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(B), when Ay is a closed convex cone. L. W. 
Neustadt [14] investigated nonlinear pro- 
gramming problems in linear vector spaces 
and gave an application to the theory of 
optimal control. The main results are: (i) 
Kuhn-Tucker type conditions which are both 
necessary and sufficient for optimality, (ii) a 
duality theory for obtaining multipliers in the 
generalized Kuhn-Tucker conditions, and (iii) 
an application to optimal control theory. 
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A. General Remarks 


Nonstandard analysis is a new field of research 
that has branched off from model theory and 
that provides a powerful method applicable in 
almost all fields of mathematical science. 

About 1960, A. Robinson successfully used a 
nonstandard model of the real number field R 
to justify Leibniz-type infinitesimal calculus. 
After this, using higher-order logic, he devel- 
oped a stronger theory of nonstandard analy- 
sis, and applied it to other mathematical fields 
[1]. 

In this article, we adopt a first-order logic 
over a universe. In the final section we present 
a theory called nonstandard set theory; this is 
a conservative extension of Zermelo-Fraenkel 
set theory with the axiom of choice. 


B. Axioms for Nonstandard Analysis 

A nonempty set U is called a universe if the 
following four conditions are satisfied: 

(a) xeU, yex=>yeu; 

(b) x, ye U = {x, yheU; 

(c) xeU =| )xeU; 

(d) xe U = P(x)eU. 


In what follows, U will be a universe contain- 
ing the field R of real numbers. 

We construct a language Z describing 
mathematics in U. The alphabet consists of the 
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following symbols: (1) countably many vari- 
ables. (2) constants corresponding to all ele- 
ments in U. (3) predicate symbols = and e. (4) 
logical symbols 4, A, v, >, 3, V. (5) auxiliary 
symbols [ , ]. The last two symbols will be 
omitted where there is no danger of confusion. 

Definition of formulas. (1) If t and s are 
terms (variables or constants), then t =s and 
tes are formulas (atomic formulas). (2) If ¢ 
and y are formulas, then 1¢, daw, ġy y, 
g@—yw are formulas. (3) If ¢ is a formula and x 
is a variable, then 3x[¢ġ] and Vx[@] are for- 
mulas. (4) The formulas are those that can be 
constructed by the above procedure. 

A formulas ¢ containing at most n free 
variables is called an n-ary formula. In this 
case, we often write #(x,,...,x,,) for ¢. A sen- 
tence is a 0-ary formula. We write F ¢ if a 
sentence ¢ is true under the usual inter- 
pretation. 

Consider a quadruple (U, *U, *e, +) where 
*U is a set, *e is a binary relation on *U, and 
x iS a mapping: at *a from U to *U. An 
element x of *U is called standard if x = *a for 
some ac U, and nonstandard if not. Adjoin to 
¥ constants representing all nonstandard 
elements of *U. Then we have a language * Z 
for *U. A sentence in ¥ is a sentence in * Z. 
Taking *U as the scope of quantifiers, we can 
interpret every *Z -sentence in *U. We write 
*E ġ ifan *#-sentence ¢ is true under this 
interpretation. 

Axiom 1 (transfer): For every sentence ¢ in 
L, we have 


Fos*k¢. 


Definition: Let (x, y) be a binary formula in 
£ (resp. in *#). We say that (x, y) is concur- 
rent in U (resp. in *U), if, for every finite num- 
ber of elements a,,...,a@, in U (resp. in *U), 
there exists an element b in U (resp. in *U) 
such that - ¢(a;, b) (resp. *F ¢(a;, b)) for 
I<icn. 

Axiom 2 (enlargement): If a binary formula 
#(x, y) in Z is concurrent in U, there exists an 
element f in *U such that * (a, f) for all a in 
U. 

A quadruple (U, *U, *e, *) or simply *U 
is called an enlargement of U if it satisfies 
Axioms 1 and 2. 

These axioms are strong enough to develop 
the basic theory of nonstandard analysis, but 
sometimes we require a stronger axiom. Let x 
be an infinite cardinal. 

Axiom 3 (x-saturation): Let @(x, y) be a 
binary *Z -formula concurrent in *U. Then, 
for every subset A of *U with cardinality at 
most x, there exists an element f of *U such 
that *H (a, P) for all xe A. 

*U is called a x-saturated model if it satisfies 
Axioms | and 3, and a x-saturated enlarge- 


1101 


ment if it satisfies Axioms 1, 2, and 3. If x is 
not less than the cardinality of U, then a xK- 
saturated model is necessarily an enlargement. 

In the remainder of this article, *U will be 
an enlargement of U, if the contrary is not 
explicitly mentioned. 

For «in *U, & is the set of €€*U such that 
E*ea:d@={Ee*U|*ea}. For simplicity, we 
write € for *e and identify å with «. Under this 
identification, a subset A of *U is called inter- 
nal if it belongs to *U and external if not. 

Axiom | and 2 imply the following results. 

(1) There exist infinite hypernatural num- 
bers, namely, elements of *N bigger than all 
*n,neN. 

(2) There are positive infinitesimal hyperreal 
numbers. 

(3) For every set A in U, there exists a 
hyperfinite internal subset of *A containing all 
*a, ae A. Here, I is called hyperfinite if we 
have *— (I), where ¢(x) is a formula that says 
“x is a finite set.” 

(4) Let A be a set in U. Then *A has non- 
standard elements if and only if A is an infinite 
set. 

(5) If a family F of sets in U has the finite 
intersection property, then the family {*A| Ae 
F} in *U has a nonempty intersection. 

(6) Let «a: *N—*R be an internal hyper- 
sequence. If «(*n) is infinitesimal for every 
neN, then there exists an infinite hypernatural 
number A such that «(v) is infinitesimal for 
every hypernatural number v less than å. 

If we moreover assume Axiom 3, we have 
the following results. 

(7) The cardinality of an internal set is either 
finite or more than x. 

(8) Ifa family F of internal sets in *U has 
the finite intersection property and cardinality 
k at most, F has a nonempty intersection. 

(9) Introduce the order topology in *R. 
Then every subset of *R with cardinality at 
most x is bounded and discrete. 

(10) Let «, $ be internal sets and C a subset 
of a with cardinality at most x. Then every 
mapping from C to f can be prolonged to an 
internal mapping from « to $. 

(10°) In particular, every external sequence 
N-f can be prolonged to an internal se- 
quence *N > ß. This follows from the assump- 
tion of countable saturation. This fact plays an 
essential role in nonstandard probability 
theory, which is now in the process of rapid 
development (— Section D (3)). 


C. Construction of Ultrapower Models 


Let I be an infinite set. A mapping « from J to 


U is a family of elements in U with indices in J. 


So we write <a(i) Xie; or simply <a(i)> for «. 


293 D 
Nonstandard Analysis 


Let F be a nonprincipal ultrafilter on J and 
define an equivalence relation ~, on the set 
U' of all mappings from 7 to U: 


<x)? ~s LPO <> {ie T | (i) = BO} EF. 


Denote by *U the quotient set of U under the 
relation ~,. The class of <a(i)>;., will be 
denoted by [a(i)];., or simply [a(i)]. 

Define a binary relation *e on *U: 


[a(i] *e [BW] {ielai Epi EF. 


Let « be the diagonal mapping from U to *U 
and consider the quadruple (U, *U, *e, +). 

Theorem (Los): The foregoing quadruple 
satisfies Axiom 1. Moreover, *U is a countably 
saturated model. 

In particular, let J be the set of all finite 
subsets of U. For iel, put w(i)={ jel |icj}. 
Then the family of sets Z = { u(i)ļie I} has the 
finite intersection property, and therefore there 
exists an ultrafilter F on I including 2 (use 
*Zorn’s lemma). If we construct *U from the 
ultrafilter F, then *U is a countably saturated 
enlargement of U [1]. 

It is not easy to construct a K-saturated 
enlargement for an arbitrary cardinal x. We 
have two ways: to use a x-good ultrafilter (the 
proof of its existence is difficult) [2,3] or to use 
an ultralimit (iteration of ultrapowers) [4, 16]. 


D. Applications 


(1) Infinitesimal Calculus. A hyperreal number 
a (element of *R) is called infinitesimal if |«| is 
smaller than every positive real number. If 

a — B is infinitesimal, « and f are said to be 
infinitely close to each other; this is written 

a p. If 1/x is not infinitesimal, æ is called 
finite. Every finite hyperreal number « is in- 
finitely close to a real number a (completeness 
of R). We call a the standard part of « and 
write st(x). Every finite hyperinteger is an 
integer. 

Let f be a real-valued function on a real 
interval J. Then *f is a hyperreal-valued func- 
tion on the hyperreal interval *I. The function 
f is continuous if and only if f(x) = *f(y) for 
every xel, ne*! with x xy. The function f is 
uniformly continuous if *f(č) = *f(y) for every 
é,ne*l with xn. 

Let 6x be a variable ranging over nonzero 
infinitesimals. The function 6f=*f(a+6x)— 
f(a) of ôx will be called the infinitesimal 
increment of f at a. f is differentiable at a if 
and only if the quotient 6//6x is of infinitesimal 
variation. The common standard part of f/x 
is the derivative f'(a). 

The higher-order differential 6"f is also 
justified as a higher-order infinitesimal dif- 
ference, with the standard part of 6"f/6x" being 
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f(a). We can define the Riemann integral as 
the standard part of a Riemann hypersum with 
respect to a hyperfinite partition of infini- 
tesimal width. 

This type of reformulation permits us to 
rewrite the whole of calculus, and many 
teachers are trying to adapt this theory to 
elementary calculus [5,6]. 


(2) Topological Spaces. Let X be a topological 
space in U and let a be a point of X. The 
intersection of *A, A varying over the neigh- 
borhoods of a is called the monad of a and is 
denoted by Mon(a). There exist hyperneigh- 
borhoods of a contained in Mon(a) (infini- 
tesimal neighborhoods). The topology is deter- 
mined by the system of monads <Mon(a)>, <x. 
We can thus rewrite the theory of topological 
spaces. For example, X is Hausdorff if and 
only if Mon(a) () Mon(b)= Ø for every a#b 
in X. X is compact if and only if every point of 
*X belongs to the monad of some element in 
X. This characterization is very useful; using it, 
Robinson and Bernstein were able to solve the 
invariant subspace problem for a special class 
of operators on a Hilbert space [7]. We can 
also construct a Haar measure very naturally 
and simply [2]. 


(3) Measures and Probability Theory. Let 
(X,B,m) be a measure space. Then there exist 
a hyperfinite subset I of *X and a positive 
hyperreal-valued internal function @ such that 
we have 


| fdm=¥ Eel) 
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for every integrable function f [8]. The right- 

hand side is a hyperfinite sum. It follows that 

the measure m can be extended to a finitely 

additive measure defined over all subsets of X. 
If in particular every measurable finite set is 

of measure 0, then there exist a hyperfinite 

subset I of *X including X and a hyper- 

natural number p such that we have 
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for every integrable function f [8]. If m(X)=1, 
then p can be taken as the hypercardinality of 
T. Here, the right-hand side is nothing but the 
mean value of *f on a hyperfinite set. This idea 
leads us to a simple description of probability 
theory. 

The above method is rather formal; but P. 
Loeb [9] has pioneered a new approach in 
probability theory by constructing an external 
measure space from a hyperfinitely additive 
internal measure space (X, £, v) in *U. In the 
following, *U is supposed to be countably 
saturated. 
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Let a(x) be the external countably additive 
algebra of subsets of X generated by £. Then 
the mapping .o/ +> st(v(.2)) from æ to R can 
be extended to a measure on o(./). The com- 
pletion of this measure space is denoted 
(X, L(V), L(v)). In cases where X is hyper- 
finite, æ is the totality of internal subsets of X, 
and v(X)= 1, then the space (X, L(.o/), L(v)) ts 
called a Loeb space and L(v) a Loeb measure. 

Every Radon probability space can be re- 
presented as the image of a Loeb space by a 
measure-preserving mapping. Therefore the 
probability theory on a Radon probability 
space can be reduced to that on a Loeb space. 

For example, for ‘Lebesgue measure on the 
interval [0,1], take an infinite hypernatural 
number 2 and put X ={u/åļ|ue*N, 0< u< 
A—1}. If we assign 1/4 to every point of X, we 
have an internal probability measure v on X. 
Then the mapping st: x—st(x) from X to [0,1] 
serves as a measure-preserving mapping from 
the Loeb space (X, L(.</), L(v)) onto the Lebes- 
gue measure space on [0,1]. 

The notion of lifting plays a key role in 
probability theory on Loeb spaces. Let f be a 
real-valued function on X, and F an internal 
hyperreal-valued function on X. F is called a 
lifting of f if we have f(x) =st(F(x)) almost 
everywhere with respect to L(v). Hence f is 
measurable if and only if it has a lifting. 

Shuttling between an internal probability 
space and its Loeb space by lifting and 
standard-part mapping, we can develop, sim- 
ply and partially hyperfinitely, probability 
theory on the Loeb space. Among others, 
Anderson [10, 11] and Keisler [12] applied 
this method quite successfully to Brownian 
motions, Itô integrals, stochastic differential 
equations, etc. [17]. 


E. Nonstandard Set Theory 


In our formulation in B, we must construct *U 
from a fixed universe U. Nelson [13], Hrbacek 
[14] and Cuda [15] independently invented 
theories that nonstandardize the whole of set 
theory. In these theories, there is only one real 
number field R, and R already contains in- 
finitesimal numbers. Compare this to 
Robinson-type infinitesimal analysis, where 
infinitesimal numbers are introduced as an ad 
hoc tool. The new theories may demand a 
reflection on mathematical description in the 
natural sciences. 

In the following, we outline (intuitively) 
Hrbacek’s theory, as strengthened and im- 
proved by Kawai [16]. 

We start from ZFC, the Zermelo-Fraenkel] 
set theory plus the axiom of choice. The lan- 
guage of Kawai’s theory NST is that of ZFC 
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plus two constants S and 7. We understand S 
as the totality of standard sets and / as the 
totality of internal sets. 

Let ¢ be a formula of ZFC, that is, a for- 
mula of NST without S and I. We write *¢ 
(resp. ‘¢), the formula of NST obtained by 
restricting the scope of variables in ¢ to S 
(resp. I). 

The mathematical axioms and axiom 
schemes of NST are substantially as follows. 
(1) If ¢ is an axiom of ZFC, then °¢ is an 
axiom of NST. In other words, all the axioms 
of ZFC are valid in the universe of standard 

sets. 

(2) In the universe of all sets, all the axioms 
of ZFC are valid except the regularity axiom. 

(3) Every standard set is internal. Every 
element of an internal set is internal. 

(4) (transfer) For every n-ary formula 
O(X1,--.,X,) of ZFC, we have 


Vx ES... Vx e SEx,- Xp) 
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(5) (saturation) For every set of size at most 
that of S, the scheme corresponding to Axiom 
3 (x-saturation) in Section B holds. Therefore 
the scheme corresponding to Axiom 2 (enlarge- 
ment) holds also. 

(6) (standardization) For every set a in- 
cluded in a standard set, there exists a stan- 
dard set b such that we have VxeS[xeav 
xeb]. 

This completes the description of NST. 

Theorem (Nelson-Hrbacek-Kawai): NST is 
a conservative extension of ZFC. That is, let ø 
be a sentence of ZFC. If °¢ can be proved in 
NST, then ¢ can be proved in ZFC. 

Corollary: If ZFC is consistent, so is NST. 
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A. General Remarks 


From counting, a primitive mental activity, 
came the natural numbers (N) (— Section B), 
which serve to denote the number of items or 
the order in which these items are arranged. 
We can extend this concept to define, step by 
step, the tintegers (Z), trational numbers (Q), 
treal numbers (R), and tcomplex numbers (C) 
(— 355 Real Numbers; 74 Complex Numbers). 
The extensions up to the rationals are 
carried out to attain a domain within which 
the operations of addition, subtraction, multi- 
plication, and division, namely, the four arith- 
metic operations (or rational operations) can 
be performed indefinitely, with division by 
0 the only exception. To develop the theory of 
natural numbers there are two well-known 
methods, one being G. Peano’s system of 
axioms [4], which will be stated below, and 
the other being R. Dedekind’s set-theoretic 
treatment [5]. The domain of rational num- 
bers can be extended, taking continuity into 
consideration, to that of real numbers by 
several methods of which the best known are 
those of the Dedekind cut (1872) [8], which 
will be described below, and of G. Cantor’s 
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fundamental sequences (1872) [9]. There ts 
also a way based on infinite series, given by 
K. Weierstrass in his lectures (1859-1860). 
Though in the domain of real numbers (1) 

the four arithmetic operations can be per- 
formed indefinitely and (2) an order relation for 
magnitude is defined, the equation x? + 1=0 
has no root. Introducing numbers expressible 
as a +ib (i= <f=1); it is possible to solve 
every equation of the second order. Such 
numbers, once called imaginary, have been 
used since the days of G. Cardano [6] in the 
16th century. L. Euler also made good use of 
complex numbers as convenient tools in many 
calculations and obtained among other things 
his formula expi0 =cos 8 + isin 0. Indeed, the 
notation i=./—1 was used for the first time 
by Euler (1777). Furthermore, C. F. Gauss, 
giving imaginary numbers the name complex 
numbers, showed that any algebraic equa- 
tion with numerical coefficients always has 
roots in the domain of complex numbers. The 
discovery of the geometric representation of 
complex numbers by several mathematicians 
in the late 18th and early 19th centuries and 
their use in many applications have made 


complex numbers indispensable in mathematics. 


Though there are extensions of complex 
numbers, such as *Hamilton’s quaternions or 
tCayley numbers, it is generally accepted that 
when we speak of a number we usually mean a 
complex number. 


B. Natural Numbers 


Peano, basing his system on a specific natural 
number, 1, and a function such that to each 
natural number x corresponds a natural num- 
ber x + 1 (hereafter denoted by x’ and called 
the successor of x), formulated the funda- 
mental properties of the set N of natural num- 
bers in the following five axioms, called the 
Peano postulates: (1) 1 € N; (2) if xe N, then 
x’ EN; (3) if xEN, then x’ #1; (4) if x =y 
(x, ve N), then x= y; and (5) if a set M satisfies 
the two conditions: 1 € M, and xe M implies 
x'e M, then Nc M. Since these postulates 
determine N uniquely up to isomorphism, they 
can be regarded as a definition of N. Elements 
of N are called natural numbers. 

Owing to Peano’s fifth postulate, regarding 
a certain property P(n) for natural numbers n 
we can deduce that P(n) is true for every n if 
we prove both of the following conditions: (i) 
P(1) is true; and (ii) for any natural number k, 
if P(k) is true, then P(k + 1) is true. Such rea- 
soning is called mathematical induction (or 
complete induction). Accordingly, Peano’s fifth 
postulate is called the axiom of mathematical 
induction. Double mathematical induction, a 
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generalization of mathematical induction, is as 
follows: to show that the property P(m, n) is 
true for every pair of natural numbers m and n, 
we have only to show that (iii) P(m, 1) and 
P(1,n) are true for every m and every n; and 
(iv) for every pair of natural numbers k and 1, if 
P(k+1,!) and P(k,/+ 1) are true, then P(k+ 1, 
{+ 1) is true. This axiom can be formulated in 
several other ways and can be generalized 
further to n-tuple mathematical inductions 
(n= 2, 3,4,...), generically called multiple 
mathematical inductions. 

Assume for a set M that a mapping f from 
the *Cartesian product N x M into M is given. 
Then a mapping ọ from N into M such that 
(v) p(1)=a; and (vi) p(x’) = f(x, p(x) (xEN) 
exists and is unique. Defining ¢ by (v) and (vi) 
is called the definition of o by mathematical 
induction. 

In particular, given a natural number a, the 
mapping @: N-N defined by (vii) g(l)=a; 
and (viii) @(x’)= g(x) is called addition by a. 
We shall write ọ(b)=a +b, whence x =x+ 1. 
Addition thus defined obeys the following 
laws: a+ b =b + a (commutative law); (a + b) 
+c=a+(b+c) (associative law). Peano’s 
postulates are thus equivalent to the following: 
(1) LEN; (2) for each pair a, be N, a+ DEN is 
defined so that addition obeys the commuta- 
tive and associative laws; (3’) for each pair of 
natural numbers a and b, one and only one of 
the following three relations holds: a=b+c 
(ceN); a=b; a+c=b (cEN); (4°) the same as 
(5) (mathematical induction). From (1’)-(4’) 
follows the cancellation law: a+ c=b+c= 
a=b. Define a> b if and only if a=b or a= 
b+c (a,b,ce N). Then, from (3’), N becomes a 
ttotally ordered set and a=b<ea+c>bt+e. 

For each aeN, the mapping 9: N->N 
defined by y(1)=a and ¢(x’)= g(x) +a is 
called multiplication by a, and we write ~(b) 
=ab (or a'b). Multiplication obeys the follow- 
ing laws: ab = ba (commutative law); (ab)c = 
a(bc) (associative law); a(b + c)=ab + ac, 

(a+ b)c=ac+ bc (distributive laws); and 
ac = bc <> a= b (cancellation law). The state- 
ment a‘! =1-a=a also holds. 

Natural numbers, which have been intro- 
duced thus far as tordinal numbers, also have 
the properties of tcardinal numbers. Denoting 
{1,2,...,n}=M,, we have M,=M,,<>m=n, 
M, + M, = Mons Mn x M, = Moy (— 49 


mtn 


Cardinal Numbers). 


C. Integers 


Introducing new numbers which are not in 
N=/1,2,...}, represented by the notations 
0, —1, —2,..., —n,..., we write Z={..., 
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—n,..., —2, — 1,0, 1, 2,...,n,...}. An element 
of Z is called an integer (or rational integer). 
Algebraically we can construct Z from N as 
follows: Let the set of all tordered pairs (k, l) 
of natural numbers k, I be M =N x N, and 
define in M an tequivalence relation (k, I) ~ 
(m,n) by k+n=m+!l. Let the equivalence class 
of (k, 1) be K(k, 1), and construct the tquotient 
space, M/~ = M*. Then the mapping ¢: 
Z— M* defined by y(n)= K(k +n, k), p(O)= 
K(k, k), p(—n)= K(k, k +n) is bijective. Setting 
K(k, 1)+ K(m,n)= K(k+m,/+n), addition 
can be defined in M* (and accordingly in Z) 
which is an extension of that in N. Since 
K(k, 1)— K(m,n)= K(k +n,1+ m), subtrac- 
tion can be defined in Z. This makes Z an 
*Abelian group with respect to addition. An 
order relation in Z is defined by K(k, 1) > 
K(m,n)<>k+n>m-+4I, which makes Za 
totally ordered set. This order relation is an 
extension of that in N. In particular, N= 
{aeZ|a>0}. Furthermore, setting K(k, 1) x 
K(m,n)= K(km+ In, kn + Im), multiplication 
in Z can be defined. It is an extension of 
that in N and obeys commutative, associative, 
and distributive laws. Also, for each a, be Z, 
we have ab=0<(a=0 or b=0). Thus, Z 
becomes an tintegral domain. 


D. Rational Numbers 


Let P be the set of all ordered pairs (a, b) of 
integers a, b with b #0, and define in P an 
equivalence relation (a, b)~ (c,d) by ad =bc. 
Each equivalence class determined by this 
relation is called a rational number. Denoting 
by L(a, b) the equivalence class to which (a, b) 
belongs, we can define the sum x + y, the 
difference x — y, the product xy, and the quo- 
tient x/y of rational numbers x = L(a, b}, y= 
L(c, d), as in the cases of addition and multi- 
plication of integers, in the following way: 
x+y=L(ad + be, bd), x — y= L(ad — bc, bd), 
xy = L(ac, bd), x/y = L(ad, bc), where the quo- 
tient is defined only when c #0. Thus the set 
Q of all rational numbers becomes a tfield. 

For the same reason that we have identified 
integers of a special type with natural num- 
bers, we now identify a rational number ex- 
pressible as L(a, |) with an integer a. Hence- 
forth, any rational number L(a, b) (b #0) can 
be expressed in the form of a quotient a/b 
(b £0) of integers a and b. 

From L(a, b)= L(ca, cb) (c £9), it is always 
possible to assume b> 0 in the representation 
L(a, b) of a rational number x. For any two 
rational numbers x = L(a, b), y= L(c, d) with 
b>0, d>0, define an ordering in Q by x>y 
<> ad > bc, which is an extension of the order- 
ing of integers. Thus (i) Q becomes totally 
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ordered, and we have (ii) x >y=>x+z>y+z 
and (iii) x > y and z > 0 = xz È yz. The ration- 
al x is called positive if x >0 and negative if 
x<0. 


E. Real Numbers 


Two typical methods of constructing real 
numbers from rational numbers are those of 
Dedekind and of Cantor. 


Dedekind’s Theory of Real Numbers. We call a 
pair (A,, A,) of subsets A,, A, of the set Q of 
all rational numbers a cut of Q if they satisfy 
the following conditions: (1) A; #@, A, # Ø; 
(ti) Q = A, UA); (ili) a, EA), G2 € Ay >, <ay. 
Then the following three cases are distin- 
guished: there is (i) a maximum in A, with no 
minimum in A,; (ii) no maximum in A, witha 
minimum in A,; or (iii) no maximum in A, 
with no minimum in A,. A cut with either 
condition (i) or (iti) is called a real number (in 
the sense of Dedekind); condition (ii) can be 
converted to (i). The set of all real numbers is 
denoted hereafter by R and cach real number 
by a or $ or .... A real number with property 
(i) is called a rational real number, and a real 
number with property (iii) an irrational real 
number. Any rational real number ts uniquely 
determined by the maximum a* of A,, and the 
mapping: a>a* =(A,, A,) from the set Q of 
rational numbers onto the set Q* of rational 
real numbers ts bijective. 

I. For real numbers 2=(A,, A,) and B= 
(B,, B,) we define a <£ if and only if A, € B,. 
By this ordering <, R becomes a totally 
ordered set. 

H. For real numbers «=(A,, A2) and B= 
(B,, B,), put C,={a+blaeA,,beB,} and 
C,=R-—C,; then (C,,C,)=7 is a real num- 
ber. Define the sum a+ $ by setting «+ B=. 
Addition thus defined obeys commutative 
and associative laws, and R becomes an 
tAbelian group with 0* as its zero element. 
Furthermore, for real numbers «=(A,, A2) 
and S=(B,, B,) with 0* <a, 0* < $, put D, = 
{ab|ae A, be B,}, D, =R—D,; then (D,, D,)= 
ô becomes a real number. Define the product «f 
by setting af =ô. According as 0* >a, 0* <p; 
0* <a, 0* > p; or 0* >a, 0* > Bf, define af = 
—((—2)B); xB = —(a(—f)); and xp =(—a)(— p), 
respectively. Multiplication thus defined obeys 
the commutative, associative, and distributive 
laws, and R becomes a field with 1* as its 
unity element. 

III. For ordering and arithmetic operations, 
we have (1) a> B =>a+y>f +y; and (2) 
a> p, y >0* => ay > By. 

By letting each rational number a corre- 
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spond to a rationally real number a*, we can 
set up a bijection between the set Q of all 
rational numbers and Q*. Furthermore, in this 
correspondence, the sum, product, 0, and 1 of 
Q are mapped to the sum, product, zero ele- 
ment, and unity element of Q*, respectively; in 
addition, the ordering is preserved. Thus Q 
and Q* are isomorphic with respect to both 
arithmetic operations and ordering. We shall 
hereafter identify the element a* with a. Ac- 
cordingly we call rational real numbers simply 
rational numbers and similarly irrational real 
numbers simply irrational numbers. 

IV. Similarly as for Q, we can define a cut 
(A,, A2) of the set of all real numbers R (pre- 
cisely, by the conditions A; # Ø, A, # Ø; 
R=A,UA,;d,€A,, d€ A, =d; <d,). For 
each cut (A,, A,) of R, either there is a max- 
imum in A, and no minimum in A, or no 
maximum in A, and a minimum in A, (Dede- 
kind). This property is called the connectedness 
(or continuity) of real numbers. 

By using definitions (1)—(IV), any real num- 
ber « can be represented (i) as the tleast upper 
bound of some set A of rational numbers: «= 
sup A; and also (ii) as the limit of a sequence 
{a,} of rational numbers: «=lima,. 


Cantor’s Theory of Real Numbers. A sequence 
{a,} of rational numbers is called a funda- 
mental sequence (or Cauchy sequence) if it 
satisfies the following condition: for each posi- 
tive rational number e, there exists a natural 
number ny such that —e<a,—a,,<é for 
every a,, and a, with m> nọ and n>np. For 
two Cauchy sequences {a,} and {b,}, we can 
write {a,}~{b,} if {a1,b,, 42,5, ...,4_,D_,---} 
is again a Cauchy sequence. The relation ~ is 
an equivalence relation. Let R’ be the set of all 
equivalence classes obtained by classifying, 
with respect to ~, the set of all Cauchy se- 
quences, and call an element of R’ (an equiva- 
lence class) a real number (in the sense of Can- 
tor). We shall write [{a,}] hereafter for the 
equivalence class of {a,}. In particular, {a,} 
with a, =a (n= 1,2,...) being a Cauchy se- 
quence, we denote a real number [{a,}] by 
a**, An element of R’ of this type is called a 
rational real number (in the sense of Cantor), 
while one not of this type is called an irra- 
tional real number (in the sense of Cantor). 
For each pair of real numbers a=[{a,}] 
and f =[{b„}], their sum and product are 
uniquely defined by «+ B=[{a,+5,'] and 
afb=[{a,b,}], where both {a,+b,} and {a, b,,} 
are shown to be Cauchy sequences. Further, 
for « and f, define «< $ if a, <b, hold for all 
n larger than some number ny. As regards 
arithmetic operations and ordering, R’ has the 
properties (I)-(IV) of Dedekind’s theory. Fur- 
ther, any Cauchy sequence of which the terms 
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all are real numbers has a limit in R’ (com- 
pleteness of real numbers). 

The sets of real numbers obtained by the 
above two methods are isomorphic with re- 
spect to arithmetic operations and ordering 
(— 355 Real Numbers). 


F. Complex Numbers 


To construct complex numbers from real ones, 
several methods have been devised, among 
which the one stated below is due to W. R. 
Hamilton. 

An ordered pair (a, b) of real numbers a and 
b will be called a complex number. Arithmetic 
operations among complex numbers are de- 
fined as follows: (a, b)+ (c,d)=(a+c,b+d), 
(a, b)—(c, d)=(a—c, b—4), (a, b)-(c, d)=(ac — 
bd, bc + ad), and 


(a, b) -( bd bc—ad 


= ; : , 4) #(0, 0). 
(c,d) \c?+d? ae (c,d) # (0,0) 


According to these definitions, the set C of all 
complex numbers becomes a tfield with (0, 0) 
and (1,0) as its tzero element and tunity ele- 
ment, respectively. R* = {(a,0)|aeER} is a 
tsubfield of C, and the mapping gy: R—>R* 
defined by ~(a)=(a, 0) proves to be a field 
‘isomorphism. Thus, identifying the element 
(a, 0) of R* with the element a of R, C can be 
regarded as an foverfield of R (R c C). Hence, 
the zero element of C is the real number 0 and 
the unity element is the real number 1. The 
complex number (0, 1) is called the imaginary 
unit and denoted by i. Thus we can write 

a+ bias usual for a complex number (a, b) 
(— 74 Complex Numbers). 
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A. Recurrent Sequences 


A (complex-valued) function that has the 

set of nonnegative integers (or the set N of 
natural numbers) as its tdomain is called a 
number-theoretic (or arithmetic) function. 
Thus it can be regarded as a tsequence of 
numbers. We first consider recurrent se- 
quences. Let f(xo,X,,..-,X,-,) be a complex- 
valued function of r variables. Put up =a, 
,Uu,_,=a,_,, and successively de- 
fine u;,,=f (Uj, Uj4 1, --->Uj+,—-1) (E=0, 1, 2,...). 
The sequence {u,} thus defined is called a re- 
current sequence of order r determined by the 
initial values ay),a,,...,a,-, and the function 
f. In particular, when the defining function f 
is given by D/-$5;x;, {u,} is called a linear re- 
current sequence. The Fibonacci sequence is a 
special linear recurrent sequence with initial 
values ao, a; and the defining function xo + x. 
Let «=(1+./5)/2, B=(1—,/5)/2 be two 
roots of 1+x=x?, and let c, and c, be deter- 
mined by c, +¢,=dy and c,a+c,fh=a,. Then 
the Fibonacci sequence with the initial values 
do, 4, is given by putting u, =c, 8" + cp" (n= 
0,1,2,...). If we put ag =1, a, =1, then we 
obtain Binet’s formula: 


wala) CY} 


B. Multiplicative Functions 


uy = 4]; 


A number-theoretic function f with domain N 
is said to be multiplicative if f(1)=1, f(mn) 

= f(m) f(n) for (m,n) = 1, and to be completely 
multiplicative if f(1)=1, f(mn)= f(m)f(n) for 
any m and n (EN). Similarly, f is said to be 
additive if f(1)=0, f(mn) = f(m) + f(n) for (m, n) 
= 1, and to be completely additive if f(1)=0 
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and f(mn)= f(m)+ f(n) for any m and n (EN). If 
f is multiplicative and £X, f(n) is absolutely 
convergent, then 1.9 f(p”) is absolutely con- 
vergent for any prime p. Moreover, we have 


Š f(n)= Il ( ¥ so"), 


where the infinite product on the right-hand 
side is also absolutely convergent. Further- 
more, if f(n) is completely multiplicative, then 
we have 


È sT — ftp). 


In particular, for f(n)=n~*, which is com- 
pletely multiplicative (with s=o+it a complex 
variable), we get the Euler product formula 
for the tRiemann zeta function: 


g= È n= [py 


p 


for g> 1. 


C. Convolutions 


If f and g are number-theoretic functions, the 
convolution f +g is defined by 


(Fe a(r)=p_ SA) glad), 


with the summation carried over all divisors d 
ofn. For any number-theoretic functions f, g, 
h, we have fxg=g*f and (f*g)*h= f*(g*h). 
If f and g are multiplicative number-theoretic 
functions, f*g is again a multiplicative 
number-theoretic function. 

The Möbius function u(n) is defined as fol- 
lows: u(1)=1, n(n) =0 if n is divisible by the 
square of a prime, and yu(n)=(—1)’ ifn is the 
product of r distinct primes. It can easily be 
proved that yz is multiplicative and that 


1 (n=1), 
zudi (n>1). 
Let e and p be functions defined by e(1)=1, 
e(n)=0 (n> 1), and p(n)=1 for every n. Then e 
is the identity element for the convolution 
x, and p*xp=p*p=e. It follows that f*p=g 
is equivalent to f =g * y, that is, 


g(n)=) f(d) 


dln 


implies that 
fin) => u(d)g(n/d), 


and vice versa. We call the latter the Möbius 
inversion formula. Similarly, for complex- 
valued functions F, G defined on [1, +00), 


G(x)= $, F(x/n) 


nsx 
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is equivalent to 


F(x)= F, pln)G(x/n). 
ngx 
Let ~(n) be the number of integers m not 
greater than n and such that (m, n)= 1. The 
function ọ(n), called the Euler function, is 
multiplicative, and we have 


otn)=nPI(1 -') =n me 


pir d\n 


and ~(p")=p"—p" ' for every prime p. Let v 
be the function defined by v(n)=n for every n. 
Then we have u*v=ọ and v=p* q, and hence 
Lain P(d) =n. 

The generalized divisor function d,(n) is the 
number of ways of expressing n as a product of 
k factors. Thus we have 
d,(n) = X l =È d-i (m) 

niha. nEn min 
and d, =p*...* p (k factors). Therefore, d, is 
multiplicative. For simplicity, we write d(n) 
instead of d,(n). Then d(n) is the number of 
divisors of n, and we call it the divisor function. 
For example, d(12)=6. We denote by o,(n) 
the sum of ath powers of divisors of n. Ifn= 
II, p’, then we have 


pur 4 


ti 


a (n)=} d*=[ | 


dn pin Pi —| 


and a, is also multiplicative. We write o,(n)= 
a(n). A number n satisfying o(n)=2n is said 

to be a tperfect number. Let n=T1,), p”. The 
functions œ(n)= È, 1 (the number of distinct 
prime factors of n) and Q(n)= È pnl, (the num- 
ber of prime factors of n) are often used, the 
former being additive and the latter com- 
pletely additive. 


D. Residue Characters and Gaussian Sums 


Let k be a positive integer and y be a com- 
pletely multiplicative function such that y(n) 
=0 for (n,k)>1 and 7(n,)=z(n,) for n; =n, 
(mod k). The function y is called a Dirichlet 
character (or residue character) with modulus k 
(or modulo k). There exist (k) distinct charac- 
ters modulo k for a given k. The character 
satisfying y(n)= 1 for every n coprime to k is 
called the principal character modulo k, and 
we denote it by yo. It ts easily proved that 


Sk) (X= Xo), 
PEOR 


0 (XZ Xo) 
_ folk) (n=1(modk)), 
2 m=] 0  (n#1(modk)). 


Suppose that k=k,k,...k, ((k;,k))=1, iF). 
Then there exist r characters y; modulo k; 
permitting the unique decomposition y = 
Liha Xr 
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Let x be a character modulo k and p= 
exp(2zi/k). The Gaussian sum modulo k is 
defined by 
Gag= © x(n)p™, 

n(mod k} 
where n runs over a complete system modulo 
m(— 297 Number Theory, Elementary, G). 
Hence if a=b (mod k), then G(a, x) = G(b, x). 
Suppose that k=k,k,...k, ((k;,k))=1, i#j) 
and a;=a (mod k;). Then we have 


G(a, x) =€G(a,, 7) G(az, X2) --- G(a,, %,), 
e= 7 (k/ky)ya(k/k 2)... %,(k/k,), 


where ¥=7%17%2---7, ts the decomposition of y 
mentioned above. 

Let ky be a divisor of k. If y(n)= 1 whenever 
(n,k)=1 and n=1 (mod kọ), then we say that 7 
is defined mod ky. The least positive integer 


f= f(x) modulo to which y is defined ts called 


the conductor of x. If the conductor of x is k 
itself, then y is said to be a primitive character 
modulo k. When x has the decomposition 7 = 
%1 -- Xr the conductor of x is also decom- 
posed as f(y)=/(%1)/(%2)--- f(%). If D is a 
square-free integer, then the tdiscriminant d of 
the quadratic number field Q(./D) (where Q is 
the rational number field) is either D (D=1 
(mod 4)) or 4D (D =2, 3 (mod 4)). The integers 
d so represented are called the fundamental 
discriminants. The ‘Kronecker symbol (d/n) 
(— 347 Quadratic Fields) is defined only for 
such fundamental discriminants. Z. Suetuna 
and A. Z. Walfisz (1936) proved that if y(n) is 
a real primitive character modulo k, then we 
necessarily have one of the following cases: 

(i) k =p P2 --- Py; Gi) K= 4p, pa... pr; oF (iii) k = 
8p; p2 .-- p, (with the p; distinct odd primes). In 
case (i), 


k/n) (k=1 (mod 4)), 
ANOS ine (ieee niga ayy 


in case (ii), 


(—k/n) (k/4=1 (mod 4)), 
m=} a ; 
(k/n) (k/4= —1 (mod 4)); 


and in case (iti), y(n) ts either (k/n) or (—k/n). 
If y is a primitive character modulo k, then 

G(a, 7) =7a)G(1, x), GO, WGC, %)=k. In par- 

ticular, if y is a real primitive character, then 


sü poli (x(-1)=1), 
lk (-D=—D. 


Sometimes S(a, k)= Xn (moan P” is called the 
Gaussian sum, where p=exp(2zi/k). If (a, k;) 
=] (i=1,2) and (k,,k,)=1, then S(a, k k2) 
= S(ak,,k,)S(ak,,k). If (a,2)=1, then 


0 (r=1), 
(1 +92 


Qriala s Dirt 1/2 


S(a, 2")= (r even), 


(r odd > 1). 
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Let p be an odd prime and (a, p)= 1. Then 
we have 


(a/p)S(1,p)  (r=1), 
S(a, p')=< p? (r even), 
p“? S(a,p) (r odd> 1), 


where (a/p) is the *Legendre symbol (— 297 
Number Theory, Elementary, H). The well- 
known Gauss formula is stated as follows: 


Vp (p=1 (mod 4)), 
S(1, p)=4_ a 
i,/p (p=3 (mod 4)). 
Various proofs of this formula have been given 


by many authors [5]. 
Let 


G(4,%)= } exp(2niah/k), 


himod k} 


called the Ramanujan sum, be denoted by c,(a). 


The sum Ehmoa y Means that h runs through a 
reduced residue system modulo m (— 297 
Number Theory, Elementary, G). It follows 
that c,(1) = p(k) and 


k 
glaj= > u(s)a 
dýka) \d 


E. Analytic Methods 


Let f be a number-theoretic function. One of 
the problems in analytic number theory is to 
estimate È< f(n) or to expand it into series. 
We assume that f(x) and g(x) are real-valued 
functions defined for x >1 and g(x) is of class 
Ct. Then we have 


2, finan =F)- | F(t)g'(t)dt, 
ngx 1 


where F(x)= 2%, <,f(n). If we take f(x) = 1, g(x) 
=log x or 1/x in this formula, then we obtain 


$ logn=xlogx—x + O(log x) 


nEx 


or 


X nt =logx+C+0(x ') 

nsx 

(where C is the tEuler constant), respectively. 
We now construct from the function f the 

Dirichlet series 


or the power series 


G 


P(x)= ), f(n)x’. 

There are called the generating functions of f. 
The consideration of these functions makes 
possible the application of function-theoretic 
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methods to the problem in this section. Some- 
times more complicated functions are used. 

We mainly consider the generating function 
represented by the Dirichlet series. The gen- 
erating functions of p, u, d, y (— Sections C, D) 
are ((s)= Xpan“, Broi lnn, Vee, dinn S, 
L(s, Y= Łza y(n)n *, respectively. The tab- 
scissa of absolute convergence of each of these 
Dirichlet series is 1. The function L(s, y) is 
called the *Dirichlet L-function (— 123 Distri- 
bution of Prime Numbers; 450 Zeta Func- 
tions). When k=1 and y=7p, L(s, x) is pre- 
cisely ¢{s). 

Let F(s), G(s), and H(s) be the generating 
functions of f, g, and h= f *g. Let f and g 
be multiplicative, so that h is also multiplica- 
tive. Moreover, if F(s) and G(s) are absolutely 
convergent for o > cp, then H(s) is also ab- 
solutely convergent for o 2 6g, and F(s)G(s)= 
H(s) for c > co. It follows from this that 
Dra (myn =g (s), Ere w(n)y(n)n 
=L(s, X) *, Lye d,(n)n* = C(s)*, and 
EL, d?(n)n $= C4(s)/C(2s). The last result 
was given by S. Ramanujan. More generally, 
Lz, "(n\n =C7"(s) p(s), where o(s) is an 
analytic function for o> 1/2. By utilizing ana- 
lytic methods, we can deduce from this that 
Enc d'(n ~ xte (log x)?! +... +c) (B. M. 
Wilson, 1923) (— 123 Distribution of Prime 
Numbers; 242 Lattice-Point Problems). 

There are many formulas known as the 
Euler summation formula. Among them the 
following one is convenient to use (E. Landau, 
L. J. Mordell, H. Davenport): Let fo(x)=x 
—[x]— 1/2 (when x is not an integer), =0 
(when x is an integer). We successively con- 
struct continuous functions fi (x), f>(x), ... of 
period 1 such that f; (x)= f,- (x) (rè 1) if x is 
not an integer, and {9 f(x)dx =0. If F(x) is of 
class C* on [a,b], then using these auxiliary 
functions, we have 


b 
Y F(m)= | F(x)dx 
agm<b a 


h-1 


+) (-I(L@F%C) 


r=0 


— f(b) F(b)) 
sf Sas) FO (x) dx, 


where >’ means that the term corresponding 
to m=a or m=b in the sum is to be replaced 
by F(a)/2 or F(b)/2 whenever a or b is an in- 
teger. Since the tFourier series of fọ(x) is 


* sin(27mnx) 
n=1 AN 


(which is convergent), f,(x) can be expressed by 


F 


2 exp(2zinx) 


geo riny” 


295 Ref. 
Number-Theoretic Functions 


where >’ means that the term with n=0 is 
omitted and this sum actually means 


> (eee | 


(2niny*! (—2niny*? 


Hence if h=1, we have 


b 
y ra= | F(x)dx 


agngb 
1 I? ; 
—— } -| F'(x)sin(2nnx)dx. 
For instance, if we put h=1,a=1, b=N, 


F(x)=x “and let N tend to oo, then we have 
the formula 


1 1 
-hel Solx OD ay for o>1. 
pore 


Utilizing the following integration by parts, 
the integral on the right-hand side can be 
extended so as to become holomorphic im s in 
the whole complex plane: 


[a x= funtion | AO file Tix 
1x 


Probabilistic considerations are also used 
for the study of various number-theoretic 
functions. If f(n) is œ(n) or Q(n), then we have 
Ln<x/(n=xloglogx +cx + o(x). Therefore 
the average order of «(n) or Q(n) is estimated 
as loglogn. Let A(x; a, p) be the number of n 


satisfying n <x and loglogn+,/loglogn < 


f(n)<loglogn+ B,/loglogn. Then 
. A(x; a, p) -u2 

l “$12 du. 
r ee 


For f(n)=a(n), the result was proved by P. 
Erdés and M. Kac (1940) by using the tcentral 
limit theorem and V. Brun’s tsieve method. 
Further general formulas were obtained by M. 
Tanaka (1955). Further development and the 
present state of probabilistic number theory 
can be seen in [7, 8]. 

Finally, we mention the well-known estima- 
tion formulas. Let € be an arbitrary positive 
number and n be a positive integer. Then d(n) 
= O(n"), where O (*Landau’s symbol) depends 
on £. We know that limsup,,,,, (log d(n))- 
(loglogn)/logn=log 2 (S. Wigert, 1907), and 
lim inf, „~ (n): (loglogn)/n=e~©. The result 
w(n)= O(log n/loglogn) is often used. Let y be 
a primitive character modulo k, and let S,,= 


=", x(n). We can prove that |S,,|<./k logk for 


all m (I. Schur, 1918) and S,,=Q(./k log log k) 
(R. Paley, 1932). This formula, with the symbol 
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Q, means that there exist infinitely many k 
satisfying |S,,| >c /k loglogk by taking m 
and y suitably, with c any positive constant. 
Let y be a character modulo k, and 7° be a 
primitive character associated with y. Then 
we have L(s, 4)= L(s,x°) p1 —x°(p)p ™>). 
It follows that 


S n= Y add) Y xi) 
d;d\k,d f 


3 
il 
mn 


m<n/d 


if the conductor of x° is f. A. G. Postnikov 
investigated (1956) the sum of characters. The 
least integer that is not a quadratic residue 
modulo p does not exceed p'?V¢ (log p)?, where 
p is a sufficiently large prime (I. M. Vinogra- 
dov, 1926). The least integer that is a fprimi- 
tive root modulo p does not exceed 2”*',/p, 
where m is the number of distinct prime divi- 
sors of p—1, with p a prime (L. K. Hua, 1942). 
D. A. Burgess (1962) deals with the latter re- 
sults. We conclude with Artin’s conjecture: If w 
is a given square-free integer, then there exist 
infinitely many primes p such that w is a primi- 
tive root modulo p. In 1967, C. Hooley proved 
this conjecture subject to the assumption that 
the general tRiemann hypothesis holds for 
*Dedekind zeta functions over certain *Kum- 
mer extensions of Q. He also obtained the 
asymptotic formula for the number of such 
primes not exceeding x. Recently the tsieve 
method has been widely applied to the various 
investigations in the theory of number theore- 
tic functions (— 123 Distribution of Prime 
Numbers E). 
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A. History 


Simple and curious relations among integers 
were discovered and admired from antiquity. 
For example, we have the relation 37+ 47 =5?, 
which has geometric meaning concerning right 
triangles. The Pythagoreans (— 187 Greek 
Mathematics) sought similar relations. They 
were also interested in tperfect numbers (num- 
bers equal to the sum of their divisors, such 

as 28=1+2+4+7+ 14). Modern arithmetic 
inherits from the Greeks the proof of the 
existence of an infinite number of primes, the 
tEuclidean algorithm to obtain the greatest 
common divisor of two integers (both given in 
Euclid’s Elements), and Eratosthenes’ tsieve for 
finding primes. In the 3rd century A.p., Dio- 
phantus of Alexandria discovered a method of 
solving indeterminate equations of degrees 1 
and 2; this marked the origin of Diophantine 
analysis. The ancient Chinese also knew how 
to solve equations of the first degree in some 
special cases. Arithmetic also developed in 
India from an early period; its relation to 
Greek mathematics is not yet entirely clear. In 
the 12th century, the Indian mathematician 
Bhaskara knew how to solve Pell equations 
using a method much like Lagrange’s. 

Interest in integers was reborn in Europe in 
the 17th century. During this period, Bachet 
de Meéziriac (1581-1638) rediscovered the 
solution of the tDiophantine equation of the 
first degree and published it in his famous 
book on mathematical recreations [1]. Primes 
of the form 2” — 1, closely related to perfect 
numbers and called tMersenne numbers, at- 
tracted considerable interest. Fermat, some- 
times called the father of number theory, 
announced numerous results without giving 
proofs; the most famous among them is the so- 
called Fermat’s last theorem (— 145 Fermat’s 
Problem). Another famous conjecture of his is 
that every integer is expressible as the sum of 
at most n n-gonal numbers, i.e., numbers of the 
form k(k — 1)n/2—k(k—2), keN. This was 
proved by Gauss (for the case n= 3), Jacobi 
(n=4), and Cauchy (for the general case). 

In the 18th century, remarkable progress 
was made by Euler and Lagrange. The second 
part of Euler’s Algebra [2] contains rich re- 
sults of miscellaneous sorts in the field. La- 
grange developed the theory of tcontinued 
fractions and applied it to arithmetic. Toward 
the end of the 18th century, Legendre com- 
piled his comprehensive book [3], from whose 
title originates the term number theory. 
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Gauss’s Disquisitiones [4] appeared at about 
the same time as Legendre’s book. The theo- 
retical arithmetic of today originates from this 
work of Gauss. The book includes the theories 
of tquadratic residues, tquadratic forms, and 
cyclotomy (i.e., arithmetic theory of the roots 
of unity in the field of complex numbers), all 
of which appeared as well-developed theories 
of a remarkably high standard. The work was 
received with more respect than comprehen- 
sion. Dirichlet made a lifelong effort to pop- 
ularize the Disquisitiones; he also applied 
analytical methods to compute the tclass 
number of quadratic forms, thus giving num- 
ber theory a new direction, the analytic theory 
of numbers. Gauss treated only binary qua- 
dratic forms; Eisenstein, Minkowski, and 
Siegel generalized the theory to the case of n 
variables. The algebraic theory of numbers has 
its origin in Gauss’s paper on biquadratic 
residues. (— 14 Algebraic Number Fields; 59 
Class Field Theory; 118 Diophantine Equa- 
tions; 182 Geometry of Numbers; 297 Num- 
ber Theory, Elementary; 347 Quadratic Fields; 
430 Transcendental Numbers.) 


B. Analytic Methods 


Analytic methods are sometimes used to solve 
arithmetic problems. For example, Legendre 
conjectured that any arithmetic progression of 
integers a, a+d, a+ 2d, ... contains an infinite 
number of primes if a and d are relatively 
prime. The conjecture was first proved by 
Dirichlet in 1837; in the proof he used an tL- 
function. Recently, proofs that do not use L- 
functions have been obtained, although these 
proofs are still not purely arithmetical. Analy- 
sis is indispensable in the formulation of some 
arithmetic problems. For example, let xeR 
and 2(x) be the number of primes not exceed- 
ing x. Euclid’s Elements already give the re- 
sult z(x)— 00 as x00, but to describe the 
behavior of z(x) as x— œ, notions of analysis 
are needed. Gauss conjectured that 

_ a(x) 

lim =; 
x0 X/log x 


This tprime number theorem was first proved 
in 1896, using the results of the theory of func- 
tions of a complex variable; more elementary 
proofs have been obtained in recent years. 

Analysis is sometimes needed to solve or 
simplify certain problems in number theory. 
The branch of mathematics treating such 
problems is called analytic number theory. The 
question of the extent to which analysis is 
really needed in dealing with such problems is 
itself an interesting one. 
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In the 20th century, analytic number theory 
has made rapid progress. The problem of 
distribution of primes has been generalized to 
the case of algebraic number fields. tAdditive 
number theory dealing with fWaring’s prob- 
lem, tGoldbach’s problem, and other prob- 
lems has developed and formed a new field. 
We also have geometric number theory, which 
deals with tlattice-point problems. (— 4 Addi- 
tive Number Theory; 123 Distribution of 
Prime Numbers; 242 Lattice-Point Problems; 
295 Number-Theoretic Functions; 328 Par- 
titions of Numbers; 450 Zeta Functions.) 
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A. The Euclidean Algorithm 


We denote the set of natural numbers 1, 2, 
3,... by N and the set of rational integers 

0, +1, +2, ... by Z. Evidently Z is an ordered 
tcommutative ring and also an tintegral do- 
main with respect to ordinary addition and 
multiplication. For any ae Z and beN, there 
exists a unique pair of integers q and r such 
that a=qb +r (0<r <b) (division algorithm); g 
is called the quotient and r the remainder of the 
division of a by b. When the remainder r is 
zero, we say that a is a multiple of b, b is a 
divisor of a, and a is divisible by b. We denote 
this relation by b{a. After a finite number of 
divisions a=q,b+1r,, b=qor, +r2, Fi =43l2 + 
r3,...(b>1r, >r, ...>0), we reach an equa- 
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tion of the form r, =qp+2fņk+1. The remainder 
r=r, +1, Uniquely determined in this manner by 
a and b, is the greatest common divisor (G. C. 
D.) of a and b. It can be expressed as r=ax+ 
by with integers x and y. This method of 
obtaining the G. C. D. is called the Euclidean 
algorithm. The greatest common divisor of 

a and b is denoted by (a, b). If an integer c 
divides both a and b, then c| (a, b). When 
(a,b)=1, we say that a and b are relatively 
prime, and there are integral solutions x, y of 
the equation ax + by=1. Given a pair of posi- 
tive integers a and b, there exists a unique 
positive integer c that divides any common 
multiple of a and b, called the least common 
multiple (L. C. M.) of a and b. If d and | denote 
the G. C. D. and L. C. M. of a and b, respec- 
tively, we have ab=dl. 


B. Prime Numbers 


An integer p is called a prime number if p is 
larger than 1 and has no positive divisors 
other than 1 and itself. A positive integer is 
called a composite number if it has positive 
divisors other than 1 and itself. The follow- 
ing method of selecting the prime numbers 
from the sequence 2, 3, 4, 5,..., known to the 
Greeks, is called Eratosthenes’ sieve: First we 
discard multiples of two, thus reaching the 
second prime number, 3. Then we discard 
multiples of three and reach the next prime, 5. 
Continuing this process, we find the sequence 
of primes 2, 3,5, 7, 11,... by sifting out all 
multiples. 


C. Decomposition into Primes Factors 


Every integer n> 1 can be uniquely expressed 
as the product of primes; the resulting decom- 
position can be written as n=p%q?r’... with 
distinct prime factors p,q,r,... and corre- 
sponding exponents «, $, y, ... uniquely de- 
termined by n. This theorem is called the 
fundamental theorem of elementary number 
theory. 


D. Perfect Numbers 


We denote by a{n) the sum of positive divisors 
of n (including 1 and n itself). According as 
o(n) is greater than, equal to, or less than 2n, 
we call n an abundant number, perfect number, 
or deficient number. An even number is perfect 
if and only if it can be represented as n=2°"'- 
(2°— 1) with prime 2°—1 (L. Euler). The exis- 
tence of an odd perfect number still constitutes 
an open question. It is well known that, if n 

is odd and perfect, then n must be of the 
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form n= ppi! ..-pft, where py =a, = 1 

(mod 4), and a; is even for i>0. Recently it has 
been proved that t must be >7 (P. H. Hagis, 
Jr., Math. Comp., 35 (1980)). Many necessary 
conditions for an odd number to be perfect 
have been given and repeatedly improved. It 
has been recently proved that there exists no 
odd perfect number less than 10°° (Hagis, 
Math. Comp., 27 (1973)). 

Two numbers m and n are said to form a 
pair of amicable numbers if a(n) —n=m and 
o(m)—m=n (e.g., m=220 and n= 284). Euler 
listed 61 such pairs. The following two num- 
bers m, n, both having 152 digits, constitute the 
largest amicable pair currently known: 


m=3*-5-11-5281'°-29-89(2- 1291 - 528119 
— 1), 
n=3*-5-11-528119(23- 33-52-1291 - 528119 
—1) 


(H. J. J. te Riele, Math. Comp., 28 (1974)). 

E. Lionnet considered the numbers n such 
that the product IT,,,d is equal to n? and 
called such numbers perfect numbers of the 
second kind, e.g., n= p°, n= pp’, where p and p’ 
are unequal prime numbers. It is also known 
that there are numbers n such that o(n)/n is an 
integer. For example, 2°- 3-7, 215-35- 52.72- 
11-13-17-19-31-43-257 have this property. 


E. Mersenne Numbers 


A number of the form 2°—1, where e is a 
prime, is called a Mersenne number. For the 
number 2°—1 to be prime, it is necessary but 
not sufficient that e be prime. If a Mersenne 
number is a prime, it is called a Mersenne 
prime. It is not known whether there are in- 
finitely many Mersenne primes; it has been 
verified, however, that for e < 44500, there are 
27 cases of e such that 2°— 1 is a Mersenne 
prime; e= 2, 3, 5, 7, 13, 17, 19, 31, 61, 89, 107, 
127, 521, 607, 1279, 2203, 2281, 3217, 4253, 
4423, 9689, 9941, 11213, 21701, 23209, and 
44497. Until the 18th century, the verifications 
for e<31 were done by direct calculation. For 
61 <e<127, the “Lucas test” was utilized to 
execute the computation. The remaining cases 
were calculated by means of electronic com- 
puters. The number 244497 — 1, which has 
13395 digits, is at present the largest known 
prime. 


F. Fermat Numbers 


Numbers of the form 2?” +1 are called Fer- 
mat numbers. For a number p= 2°+ 1 to bea 
prime, it is necessary that e be a power of 2. 
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Fermat conjectured that numbers of the form 
2?" +1 are all primes. In fact, for v=0, 1, 2, 3, 
4, the corresponding p= 3, 5, 17, 257, 65537 
are primes; however, 2?°+ 1 is divisible by 641. 
It is not yet known whether there exist Fermat 
primes other than these five primes. For fur- 
ther details — [10]. Fermat numbers are 
closely connected with the problem of tgeo- 
metric construction of regular polygons. 


G. Congruence 


Let m be a positive integer. Two integers a and 
b are said to be congruent modulo m if their 
difference is divisible by m; we denote this 
relation by the congruence a =b (mod m) (or 
simply by a=b (m)) and call m the modulus of 
this congruence. Congruence modulo m is an 
tequivalence relation, which is compatible 
with the ring operations and classifies Z into 
m Classes. We thus obtain the tresidue class 
ring Z/mZ with m elements. If p is a prime, 
then Z/pZ is a field which is isomorphic to the 
*prime field with tcharacteristic p. A complete 
system of representatives of the quotient set 
Z/mZ is called a complete residue system 
modulo m. On the other hand, a set of —(m) 
elements n; (g is tEuler’s function) such that 
n, =n; (m) (i4j) and (n,,m)= 1 is called a re- 
duced residue system modulo m. The set of all 
residue classes represented by a reduced re- 
sidue system modulo m forms a multiplicative 
Abelian group of order (m), we denote this 
group by (Z/mZ)*. If (a, m)=1, then a?” =1 
(mod). If p is a prime and (a, p)= 1, then 
a’~' =1 (mod p) (Fermat’s theorem), since 
~(p)=p—1. When m is 2, 4, p* or 2p* (p¥2; 
k=1, 2,...), (Z/mZ)* is a tcyclic group, whose 
tgenerator g is called a primitive root modulo 
m (— Appendix B, Table 1). For any a prime 
to m and generator g of (Z/mZ)*, there exists 
a unique number u (1 <u < (m)) such that 
a=g" (modm). We call u the index of a with 
respect to the basis g, and write w= Ind, a (— 
Appendix B, Table 2). The group (Z/2*Z)* 

(k > 3) is Abelian of type (2, 2" 7), a basis of 
which is formed by the residue classes repre- 
sented by —1 and 5. From this follows (p — 1)! 
= —1 (mod p) (Wilson’s theorem). Generally, 
if m= [[;-; m,, (m;,m,)=1 for i#j, then we 
have Z/mZ = Z/m,Z+...+ Z/m,Z (direct sum) 
and (Z/mZ)* =(Z/m, Z)* x ... x (Z/m,Z)* (di- 
rect product). 

The congruence ax =b (mod m) with (a, m) 
=d is solvable if and only if d|b, and when it is 
solvable, the solution is unique modulo m/d. 
The simultaneous congruences x =a; (mod m,) 
(i=1,2,...,k) are solvable if and only if a,=a, 
(mod(m,,m,)) (i,j/=1,...,k), and when they are 
solvable, the solution is unique modulo the 
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greatest common multiple of m,,...,m,. In 
particular, when (m;,m,)= 1 (i#/), the solution 
is unique modulo m; m, ... m, (Chinese remain- 
der theorem). If m= pf: ... pk is the factoriza- 
tion of m, solving the congruence f(x)=0 
(mod m), where f(x) is a polynomial with in- 
tegral coefficients, can be reduced to solving 
f(x)=0 (mod při) (i= 1, ...,k). Also, solving a 
quadratic congruence can be reduced to solv- 
ing a linear congruence and a congruence of 
the form x? =a (mod m). 


H. Quadratic Residues 


When the congruence x? =a (mod m), where 
({a,m)= 1, is solvable, a is said to be a quadratic 
residue modulo m; otherwise, a is said to be a 
quadratic nonresidue modulo m. The following 
two conditions are necessary and sufficient for 
a to be a quadratic residue modulo m: (i) ais a 
quadratic residue with respect to each of the 
prime factors p (#2); and (ii) a= 1 (mod 4) or 
a=1 (mod 8) according as 4|m or 8|m. 

Given a prime number p and integer a 
prime to p, the Legendre symbol (a/p) is by 
definition 1 or —1 according as a is a qua- 
dratic residue modulo p or not. The value of 
this symbol is determined by a (mod p), and 
we have (ab/p)=(a/p)(b/p). Hence the symbol 
determines a tcharacter of (Z/pZ)* of order 2. 
Furthermore, the congruence (a/p)=a? 1”? 
(mod p) holds (Euler’s criterion). 


I. Reciprocity Law 


For odd primes p and q (p #q), the formulas 
(p/q)(q/p)=(— 1 PTD a- 
(—1p)=(-"-2, (2p)=(-1)P 8 


are called the law of quadratic reciprocity of 
the Legendre symbol, the first complementary 
law, and the second complementary law, re- 
spectively. These laws were conjectured by 
Euler and first proved by Gauss, who gave 
seven different proofs. We now have more 
than fifty different proofs of these laws. P. 
Bachmann (Niedere Zahlentheorie I (1902)) 
lists the 47 different proofs of the laws known 
at the time. Gauss’s first proof was elementary; 
his second proof used quadratic forms. The 
latter has been reformulated utilizing the 
theory of quadratic fields [4]. The fourth proof 
used tGaussian sums, and the sixth proof used 
algebraic congruences with integral coeffi- 
cients. His seventh proof, contained in his 
posthumous works, used congruences of higher 
degree [4]. His fourth, sixth, and seventh 
proofs are related to the arithmetic of tcyclo- 
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tomic fields [4]. The third and fifth proofs 
are the most elementary and simple. They are 
based on Gauss’s lemma: Let r,,7r, -..,1%p—1)2 
be the residues of divisions of la, 2a, ...,(p— 
1)a/2 by an odd prime p, and let n be the 
number of these residues that are greater than 
p/2. Then we have (a/p)=(—1)". T. Takagi gave 
a simplified exposition of the third proof using 
geometric figures (1904). The same method 
was rediscovered by G. Frobenius (1914). 

When m is an odd integer such that m= 
+11; p%, (m,n) =1, we call (n/m) =T1(n/p, 
Jacobi’s symbol. If m has no square factor, it is 
a character of (Z/mZ)*. If we put sgnm= + I 
or — 1 according as m>0 or m<O, we have 
the following law of quadratic reciprocity of 
Jacobi’s symbol and its complementary laws: 

If m and n are relatively prime odd numbers, 
then 


(m/n)(n/m) 


= ( aes {= Diener W2)+((sgnm—1)/2)((sgnn—1)/2) 
($1/n) =(—1) De tee de, 
(2/n)=(—1)" "D4, 

where 

n*=(—1)"™ Pn, 


Furthermore, tKronecker’s symbol, another 
generalization of the Legendre symbol, is used 
in the theory of quadratic number fields (— 
347 Quadratic Fields). 
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Numerical Computation of 
Eigenvalues 


A. General Remarks 


Numerical computation of teigenvalues and 
feigenvectors of a matrix provides a basic 
technique for the numerical solution of various 
eigenvalue problems. Roughly speaking, there 
are two kinds of method. In methods of the 
first kind, one determines the tcharacteristic 
polynomial p(A)=det(Al — A) (where I is the 
unit matrix) of A (or to give an algorithm to 
calculate the value of p(A) for an arbitrary å), 
then solves the algebraic equation p(4)=0 
numerically to obtain the eigenvalues 4, (u= 
1,2,...)(— 301 Numerical Solution of Alge- 
braic Equations), and finally to one deter- 
mines the eigenvectors x, by means of the 
equations (A, — A)x, =0. In methods of the 
second kind, one obtains eigenvalues and 
eigenvectors directly without resorting to the 
solution of an algebraic equation, relying in- 
stead upon repeated application of similarity 
transformations. (The power method does not 
fit into this classification, being a different 
approach altogether; — Section C.) In partic- 
ular, a real symmetric or complex Hermitian 
matrix is reduced approximately to a diagonal 
matrix. In this article we denote a given square 
matrix of order n by A=(ay) (j= 1, ..., n). 


B. The Jacobi Method 


The Jacobi method is an iterative technique 
for determining all the eigenvalues and eigen- 
vectors of a real symmetric matrix [7]. Before 
the advent of high-speed computers it was 

not considered practical, but at present it is 
one of the most compact and elegant methods. 
The following algorithm can be extended 

to Hermitian matrices by replacing tortho- 
gona} transformations by suitable tunitary 
transformations. 

Roughly, the method transforms a given 
matrix A =(a,) (a, =aj;i,j=1,...,n) into a 
diagonal one by repeated application of 2- 
dimensional rotations of the reference axes. 
We first put A = A, U =], and compute 
A%, A®,...,U%, UP, ... successively as 
follows: 

(1) Select an off-diagonal element of A® = 
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(a?) with the maximum absolute value and 
denote it by a$}. 
(2) Compute 


2a) sgn(a®), — a’) 
a’), — a®|+ Ja®,— 


any +4(a ar 
(where sgn x = 1, 0, or —1 according as x >0, 


tand= 


=0, or <0), 
cosĝ=(1+tan? 0)", and 
sin ĝ = tan 0- cos 0; 
and form T® =(t9), where t9, = 10 =cos0, t9 = 
1 for i#p, q, — t$} =10 =sin9, aad t) =0 for 
all other (i, j). 

(3) Determine ATD and UCHD by ACY = 
TAT (' denotes the transpose) and UY 
= UT”. In this process, T® represents an 
orthogonal transformation (rotation) in the 
plane spanned by the pth and qth coordinate 
axes such that i a‘) is nullified. If we 
put N(B)=),, ,bj and M(B)= Eb, then 
N(B) is PEE SA an orthogonal trans- 
formation, so that N(A®)= N(A). Further- 
more, since af*!) =a‘) (i+ p,q) and (a¥3"))? + 
(al)? = (a)? + (a)? + 2(a)?, we have 
M(A“')) = M(A®)—2(a)?. Since a‘) has the 
maximum absolute value among all the off- 
diagonal elements, we have (a9)? > M(A)/ 
(n? —n). Therefore 


M(A“™”) <(1—2/(n* —n))M(A®) 


<(1—2/(n’ —n))'*'M(A) 


<M(A)exp(—2(1+ 1)/(n? —n)). 


It has been proved a fortiori [13] that, after 
M(A®) comes down to below a certain thresh- 
old value, the convergence of the iteration 
process becomes quadratic, i.e., there is a num- 
ber c determined by the order n of A and the 
arrangement of the eigenvalues of A such that 
M(AG*"®-D?2)) < e(M(A®)). Since the set of 
eigenvalues of A® coincides with that of A 
and the eigenvalues of an arbitrary symmetric 
matrix B can be made to correspond in a one- 
to-one way to its diagonal elements in such a 
way that the difference between an eigenvalue 
and the corresponding diagonal element is not 
greater than M(B)", a tends to 4; (i=1, 

,n) as 1 tends to infinity. Moreover, as z 
tends to infinity, each column vector of U® = 
(u\?) tends to the CoE pondini eigenvector, 
in the sense that Eg- a;,u{) — Aju? 0. 

The number of arithmetic computations 
required to obtain A“*”) and UCHD from A® 
and U” is at most proportional to n, so that 
for a given e>0, the arithmetic required to 
reduce max; la‘) — A;| below e(M(A))’” is at 
most proportional to n? (because z is at most 
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proportional to n°). On the other hand, the 
search for an off-diagonal element of A® with 
the maximum absolute value, if it is done by 
simply comparing all the elements, will require 
effort proportional to nê, so that the amount 
of work required by the searching process is 
proportional to n*. To bypass this searching 
process, the cyclic Jacobi method and the 
threshold Jacobi method are often used. The 
former method adopts as a$}, the off-diagonal 
element for which q> p and :=(p—1)(n—p/2) 
+(q =p), that is, ah, a¥3, a am aS’, aya, 
... are adopted in this sequence. The latter 
method adopts as a") off-diagonal elements 

in a sequence similar to the one above as long 
as they exceed a given threshold value; but if 
an element is less than that threshold value, 
then the next element in the sequence is a can- 
didate for adoption as a‘), where the thresh- 
old value is made to decrease gradually as 

the iteration process proceeds. However, the 
search for an element with the maximum 
absolute value can be done more effectively by 
taking account of the fact that only the matrix 
elements lying in rows p and q and in columns 
p and q change their values when we transform 
A® into A“*”), In fact, we can record for each 
row the value as well as the position of the 
(off-diagonal) element with the maximum 
absolute value in that row. By so doing, the 
effort of searching for an off-diagonal element 
with the maximum absolute value can be 
reduced to something proportional to n on the 
average. 


C. The Power Method 


The power method is suitable for obtaining 
only the eigenvalue of maximum absolute 
value [6]. Let us assume that the eigenvalues 
A,,+.-,4, Of A are arranged so that |A,|> 
|A,|2]A,|/>...>|4,|, with 2, real, and de- 
note by y, the left eigenvector corresponding 
to 4, (which means y,(A,/—A)=0). Starting 
from an arbitrary (real) vector x such that 
(y,,x)#0 and 7{°=1 for a prescribed ig, 
we compute 0, 9%, ... and x™, x)... by 
Ax = 69x") (1=0, 1,2, ...; x8) = 1). Then 
we have lim, ,,, 0° =, and lim, „p xP? =x; 
(the eigenvector corresponding to the eigen- 
value 4,). The rate of convergence depends on 
|A,/A,| if the telementary divisor of A cor- 
responding to 4, is linear, but in the case of a 
nonlinear elementary divisor, the convergence 
is too slow for practical purposes. If A, #42 
and |4,|=[4,|>|43|2...2|4,|, then the se- 
quences of 0 and x computed by the for- 
mulas above do not converge but in general 
oscillate. However, from 0, 9&4), x®, x@*), 
x"*?) for a sufficiently large 1, we can obtain 


i> 0 
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approximate eigenvalues 4, and 4, as the two 
roots of the quadratic equation in A: 


xP zP g getD 

ey) fr gO) =0 
+2 +2 2 

gage 


(i and j arbitrary, i Æj). 


The corresponding eigenvectors are given by 
KSA Ma and x, =A xO — xEtD, 
Complex conjugate pairs of eigenvalues can be 
dealt with in this manner. This is useful also 
when |/,|=|A,|. The extension to the case of 
more than two eigenvalues with the same 
maximum absolute value is obvious. 

In order to determine the remaining eigen- 
values by the power method we have to com- 
bine it with the deflation or transformation of 
matrices, as discussed later in this section and 
in the following section. The amount of com- 
putation depends on the arrangement of the 
eigenvalues of A and the required accuracy. 
We note that the multiplication of a matrix by 
a vector requires an amount of computation 
proportional to n°. 

Improvement in the approximations can be 
incorporated into the power method. If v=x, 
+ O(e) is an approximation to the eigenvector 
X, corresponding to an eigenvalue 4, of a real 
symmetric matrix A, then the Rayleigh quo- 
tient Ap =(v, Av)/(v, v} affords a good approxi- 
mation to /,. In fact, we have |4, —Apg|=O(¢’). 

If A; (i=1,...,n) is an eigenvalue of A and 
x, the corresponding eigenvector, then P(A;) 
(i=1,...,n) is an eigenvalue of P(A) and x; 
the corresponding eigenvector, where P(¢) is 
a polynomial in č and &~?. This fact can be 
utilized to transform the magnitudes of eigen- 
values to accelerate the convergence of the 
power method, to separate eigenvalues with 
the same absolute value, or to obtain inter- 
mediate eigenvalues. The choice P(A)=A—c 
or (A—c)~' is particularly useful, where c is 
an appropriate constant. In fact, an efficient 
algorithm known as inverse iteration exists 
for computing an approximate eigenvector 
when a good approximation to an eigenvalue 
is known. Given a trial eigenvector x cor- 
responding to a computed eigenvalue c, one 
computes an improved approximate eigen- 
vector y by solving (A —cl)y=x or y=(A— 
chx. 

Aitken’s 6?-method is also efficient in accel- 
erating the convergence of the power method. 

When an eigenvalue-eigenvector pair is 
known, another eigenvalue-eigenvector pair 
can be computed using the process known as 
deflation. If an eigenvalue 4, and the corre- 
sponding eigenvector x, (and also the corre- 
sponding left eigenvector 'y, if necessary) of A 
are known, it is possible to “subtract” them 
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from A to get a problem containing only the 
remaining eigenvalues. Such deflations are 
often used in combination with the power 
method. The following are two examples of 
deflation methods. 

(1) Assuming that x, and y, are normalized 
in such a way that (y,,x,)=1, form B= A— 
A,X,Y,- Then B has the same set of eigen- 
values and eigenvectors as A except for A,. The 
eigenvalue and the eigenvector of B corre- 
sponding to 4, of A are 0 and x,, respectively. 
This kind of deflation process can be gen- 
eralized to the case of nonlinear elementary 
divisors, but that becomes somewhat more 
complicated. 

(2) After normalizing x, so that its nth com- 
ponent x,,, is equal to 1, form B=(b;):b; =a; 
—X,:4,j (i,j=1,...,n—1). Then B has the same 
set of eigenvalues as A except for 4,. If w, is 
the eigenvector corresponding to the eigen- 
value 4, of B, then the corresponding eigen- 
vector x, of A is given by Xp; = Wki + dp Xp (i= 
1,...,n2—1), x,,=d,, where d, is determined 
from EFS] NW =(A,—A,)d, if 4, #A,. If A, =A, 
and r= DF... a,;W,;=90, then we can put d, =0. 
If 4,=A, and r40, A has a nonlinear elemen- 
tary divisor for 4, =4A,, and the x, defined by 
X= W,;/r and x,,=0 is a generalized eigen- 
vector of A in the sense that Ax, =4,Xk + Xp- 


D. Transformation of Matrices 


There are a number of methods of transform- 
ing a given matrix A by means of a suitable 
similarity transformation A> B=S ‘AS into 
another matrix B for which it is easier to solve 
the eigenvalue problem. The Givens method 
[9] transforms a symmetric matrix A into a 
tridiagonal matrix B (i.c., a matrix such that b, 
=0 for |i—j]>2) by means of a matrix S which 
is the product of 2-dimensional rotation ma- 
trices. The Householder method [10] also trans- 
forms a symmetric A into a tridiagonal B by 
means of an orthogonal matrix S of special 
type, i.e., a reflection matrix [-2uu* (u*u=1, 

u* =conjugate transpose of u), and the Lanc- 
zos method [8] transforms a general A into a 
tridiagonal B. To general matrices the follow- 
ing methods are also applicable: (1) The Dani- 
levskii method [1] transforms A into its com- 
panion matrix B by repeated application of 
elimination operations. Here, row interchanges 
can be combined to increase numerical sta- 
bility [14]. (2) The Hessenberg method [3] 
transforms A into a B such that b,, =0 for 
i—j>2 with a triangular S. (3) The Givens 
method [9] transforms A into a B of the same 
form as in (2) by repeated application of 2- 
dimensional rotations. This method now tends 
to give way to the Householder method or to 
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the elimination method with row interchange. 
All these methods require an amount of com- 
putation proportional to n°. In general, special 
treatment is necessary for the case of multiple 
eigenvalues. As an example, we explain the 
Givens method for general matrices. Let N = 
(n—1)(n—2)/2. For :=0, 1,..., N—1, choose 
(p, 4) = (3, 2), (4, 2), ...,(n, 2); (4, 3), (5, 3), ---, 

(n, 3); ...; (a—1,n—2), (n, n—2); (n,n— 1) in this 
order. Using T” of the same form as in the 
Jacobi method, and setting tan 0 =a"), _;/a?,-, 
in this case, calculate AY = A, UO =], AY = 
TY’ APT, UD = UOT® (1 =0,1,...,N—1), 
and put B= A“. Then we have b,,=0 for i— 
j22. We can solve the eigenvalue problem for 
this simplified B and then retransform the 
eigenvectors of B thus obtained into those of 
A by means of U™. It should be noted that 
the method of bisection based on *Sturm’s 
theorem is effectively used to solve the char- 
acteristic equation of a real tridiagonal ma- 
trix [14]. This method is remarkably stable 
numerically. It is used when the number of 
eigenvalues to be computed is small relative to 
the order of the given matrix. If all eigenvalues 
are desired, an alternative method such as the 
QR method (— Section F) is recommended. 


E. The Lanczos Method 


A more detailed exposition of the Lanczos 
method is now given. Let A be a given real 
matrix of order n. Pick two vectors c, and ¢,. 
Determine recursively the vectors c;,, and 
€:4,,1=1,...,n, that satisfy the following con- 
ditions: (i) Y;+1 Ci+1 = Ac; — HC; — Bici = bist, 
Jii Čia = AC, — GC; — Biči- =), 41, i= 2, ...,0; 
(ii) c;,, is orthogonal to é;_, and ĉ; (iii) ¢;,, is 
orthogonal to c;_, and c;, where «;, Bi, i, and 
B: are scalars, and where y; and 7; are nor- 
malizing scalars. Actually, a; =¢;Ac;/€,c;=4,, 
Pi =V: /Či-1 Ci-1, Pi = Y:čiCi/Ci-1Či-1, where 
c;¢;,i=1,...,n, are assumed nonzero. It can 
be shown that c;,, is orthogonal to every č, 
1 <j <i, and that ¢;,, is orthogonal to every c;, 
1<j<i(c,4,=€,4, =9). The conclusion is that 
the given matrix A is similar to the tridiagonal 
matrix H =(h,,), where h;,=a; ((=1,...,1), hiis 
= Bisio hirii = Yiri (= 2, -.., n). In fact, if C 
denotes the matrix whose jth column equals c; 
(j=1,...,n), one obtains C7! AC =H. Thus the 
eigenvalues of A are identical to those of H. 

In principle, the Lanczos method transforms 
a given matrix to a tridiagonal matrix similar 
to it if every cé; is nonzero (i= 1, ...,n). If c;ĉ; 
does vanish for a certain i, one selects c, and 
č again and restarts. On the other hand, if 
b,,, =0 one chooses an arbitrary vector c;,, 
orthogonal to ¢,,...,é, and if b.,, =0 one 
selects an arbitrary vector ¢,,, orthogonal to 
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C,,-.-,¢;. In the actual numerical computation, 
the distinction between zero vectors and non- 
zero vectors is usually blurred by rounding 
errors. 

In the application of the Lanczos method 
one often observes the loss of orthogonality 
c;¢;=0 (i#j). This usually results from cancel- 
lation errors in the computation of b,,, and 
b.. If the orthogonality is lost, C“'AC may 
significantly deviate from a tridiagonal matrix. 
As a practical means for preserving the in- 
dicated orthogonality one can reorthogonalize 
the vectors c; and é;. Indeed, one can add 
to the computed c;,, a linear combination 
of c,,...,¢; sO that the sum is orthogonal to 
¢,,...,¢;, then take the sum as c;,, after pro- 
perly normalizing. A similar process applies to 
Či+1- The Lanczos method is often applied 
in double precision. It has been suggested 
that one first reduces the given matrix to an 
upper Hessenberg form by using the Hessen- 
berg method with row interchange before 
applying the Lanczos method [15]. 

A further remark on the Lanczos method is 
in order. Recall that y; is determined from 
Aci, C1, %-1, and B;_,; B; from Ci, ¢;, ¢)-1, E;-1, 
and ¥,; and «; from Ac,, c;, and ¢;. This shows 
that the Lanczos method applied to a sparse 
matrix (a matrix whose elements are almost all 
zero) requires only a memory proportional to 
n. The Householder method, on the other 
hand, requires memory proportional to n? 
even for a sparse matrix. 

For a real symmetric matrix one can modify 
the Lanczos method for a general matrix so 
that C is orthogonal and H is real, symmetric, 
and tridiagonal (the details are omitted). 


F. The QR Method 


The QR method was discovered independently 
by J. G. F. Francis and by V. N. Kublanov- 
skaya in 1961 [15]. The method has been 
improved and extended considerably since 
then. In the usual application to matrix eigen- 
value problems, the QR method provides the 
most efficient iterative process for finding all 
eigenvalues of a given matrix. The matrix is 
reduced by means of a similarity transforma- 
tion to Hessenberg form or to tridiagonal form 
before application of the QR method; the re- 
duction process may be effected by the House- 
holder method or by the elimination method 
with row interchange. (If the given matrix 

is a complex matrix, the latter is preferable.) 
The reason is that one step of the QR process 
applied to a full matrix is prohibitively expen- 
sive, requiring a number of operations propor- 
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tional to n>, while one step of the QR process 
applied to a Hessenberg matrix requires a 
number of operations proportional to n?. 

We now describe one of the most useful 
versions of the QR method. Let A= A, bea 
given matrix, and define a sequence of matrices 
Ay, Az,... obtained from Ay as follows. At the 
ith (i=0, 1,...) step, choose an appropriate 
constant s;, called an origin shift, according to 
the process described below, and decompose 
A;— 8,1 into the product of a unitary matrix 
Q; and an upper Hessenberg matrix R;:A;— 
si:l =Q;R;. The matrix A;,, is then defined by 
Aj; =R,Q;4+ 8,1 = QF A,Q,. Hence A;,, is simi- 
lar to A;. The QR method is closely related to 
the power method and to the inverse power 
method. We describe this relationship in order 
to obtain an insight into the nature of the OR 
method. To this end, we first state a well- 
known theorem: Let A be diagonalizable, and 
let its eigenvalues 4; (i=1,...,n) have distinct 
moduli, say |A,|>|4,|>...>|A,|. Let X 1AX 
=diag{A,,4,,...,4,}, and let X have an LU- 
decomposition X = LU, where L and U are, 
respectively, lower triangular matrix and an 
upper triangular matrix. Then the QR al- 
gorithm without an origin shift (s;=0, i= 
0, 1,2,...) behaves as follows: a“) +0 (p >q); 

a >1,; a") oscillates (p <q), p, q=1,2,....n, 
i— cœ, where a‘) denotes the (p, q)th element 
of A;. In other words, A; approaches an upper 
triangular matrix as i œ so that the diagonal 
elements of A; converge to the eigenvalues of 
A. 

The relationship between the QR algorithm 
and the power method is given by (QQ, ... Q;) 
(R,R;-, ... Ro) =(A—Sol)(A—S,1)...(A—5;,]). 
If so=s, =... =s;=0, then the right-hand side 
reduces to A‘*?, and since the product R;... Ro 
is upper triangular, the first column of 0,;= 
QoQ, ...Q; equals a scalar multiple of A‘*'e,, 
where e, =(1,0,0,...,0)’. Therefore, by the 
power method, the first column of 0; converges 
to an eigenvector corresponding to the eigen- 
value 4, of A that has the largest modulus, 
under a certain fairly mild condition. Since 
Ai+ı = QF AQ;, Aie, = OF Aie, © A, e, for 
large i. In other words, the first column of A; 
converges to the vector (4,,0,0,...,0)' as i> œ. 

The relationship of the QR method to the 
inverse power method and to the Rayleigh 
quotient is now explained. From the definition 
of the QR algorithm with an origin shift, we 
have Q;=[(A—s,I)']*R*. Since R* is a lower 
triangular matrix, the nth column of Q; equals 
Q:¢,=L(A;— 5,1) *]*R*e,=(A—5,1) (a 
scalar multiple of e,), where e, =(0,0,...,0, LY. 
This last process of obtaining the last column 
of Q; represents a process known as the inverse 
power method. If s; is close to an eigenvalue of 
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A, (and hence to an eigenvalue of A), the last 
column of A; gives a good approximate eigen- 
vector of A corresponding to the eigenvalue 
under consideration. Now, if x is a given col- 
umn vector such that x*x =1, the value of 4 
which minimizes (Ax — Ax)*(Ax — Ax) is given 
by the Rayleigh quotient x*Ax. If x(x*x=1) 
happens to be an eigenvector of A, the Ray- 
leigh quotient equals an eigenvalue of A. If 
we take x =e,, the corresponding Rayleigh 
quotient equals a,,,. Therefore if we take the 
(n,n)th element of A; as the origin shift s;, then 
s; can be regarded as the best approximation 
to an eigenvalue of A; (hence of A) when e, is 
taken as an approximate eigenvector of Aj, in 
the sense that A=s,; minimizes the functional 
(Aje, —A€,)*(A jen — Aen). 

Under the same condition as in the preceed- 
ing theorem, the rate of convergence of the 
QR method with an origin shift is given as fol- 
lows: a‘) (n> p> q> 1) is asymptotically pro- 
portional to (4,/A,)' for s;=0 (i=0, 1,...) (no 
origin shift); and with the origin shift s,, the 
behavior of a‘) at the ith step is determined by 
(A,—s)/(Aq—S;). If 5; =A, (=the eigenvalue of A 
with the least modulus), each element in the 
nth row of A;,, except a@*” exhibits rapid 
decrease in modulus. A natural and practical 
choice of the origin shift s; is given as follows: 
(i) for a real tridiagonal matrix, s; is taken to 
be that eigenvalue of the 2 x 2 matrix situated 
at the lower right corner of A; that is closer 
to a‘ [14]; (ii) for a real upper Hessenberg 
matrix, the two eigenvalues of the 2 x 2 matrix 
situated at the lower right corner of A; as 
s; and s;,, [14]; (iii) for a complex upper 
Hessenberg matrix, s; is chosen as in (i) [17, 
COMQR]. 

The QR method with origin shift would 
eventually make each element in the nth row, 
except a, smaller in modulus than a pre- 
scribed positive number. At this stage of itera- 
tion, a? is taken as an approximate eigenvalue 
of A. The QR method is then applied anew to 
the (n— 1) x (n— 1) matrix obtained from A; by 
deleting the nth row and the nth column of A;, 
and another approximate eigenvalue of A is 
obtained. The method proceeds similarly until 
all the eigenvalues of A are computed. For 
maximum accuracy the eigenvalues of the 
given matrix should be computed in the order 
of increasing modulus. A word of caution is in 
order. When the given matrix A has elements 
of greatly varing modulus, rearrangement of 
elements of A may be necessary before apply- 
ing the QR method with an explicit origin 
shift, where 4;— s;I (i=0,1,...) is explicitly 
computed. 

The reader is referred to [14-16] for details 
of the OR method. 
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G. Generalized Eigenvalue Problem Ax = ABx 


An eigenvalue problem of the type Ax = ABx ts 
called a generalized eigenvalue problem and is 
often encountered in applied mathematics. A 
necessary and sufficient condition for A to be 
an eigenvalue is det(A —AB)=0. If B~' exists, 
then Ax =ABx is equivalent to B™'Ax =Ax and 
hence has n eigenvalues, where n is the order of 
A. If B~! does not exist, the eigenvalue prob- 
lem Ax =ABx has at most n—1 eigenvalues. 
We restrict ourselves to one of the most im- 
portant cases: that where A is real and sym- 
metric and B is real, symmetric, and positive 
definite. In this case one could solve the prob- 
lem by reformulating it as B-'Ax =Ax, where 
B'A is explicitly computed. However, B™'A 
is not in general symmetric. Moreover, when 
B has eigenvalues of widely different moduli, 
the elements of B™! may have widely different 
moduli as well, which would in turn make the 
computation of those eigenvalues of B~'A of 
smaller moduli difficult. An efficient and nu- 
merically stable method is known which obvi- 
ates the aforementioned difficulty by exploit- 
ing the symmetry of A and B. An outline of 
this procedure is now given. Since B is positive 
definite, a lower triangular matrix L exists 
such that LL’ =B. This is called the Cholesky 
decomposition of B. The elements of L can be 
computed by equating the corresponding ele- 
ments in LL’ = B. The eigenvalue problem Ax 
= ÀBx is now equivalent to L'A(L') t y = 
Ay with y=L’x, where the matrix L~1A(L’)™! 
=P is computed in two stages by solving 
LX =A and PL'= X. In this last equation it is 
enough to compute the upper right half of P, 
because P is symmetric while X is not gener- 
ally symmetric. The upper right half of P can 
be computed by equating the corresponding 
elements in PL’ = X. The eigenvalues / of 
Ax =ABx are given by the eigenvalues of the 
matrix P. Other types of eigenvalue problems, 
such as y AB=Ay’, BAy=Ay, and x'BA=Ax’, 
often appear in practice, where A is real and 
symmetric and B is real, symmetric, and posi- 
tive definite. By using the Cholesky decomposi- 
tion of B, one can reduce any one of these 
eigenvalue problems to the ordinary eigen- 
value problem for a real symmetric matrix [4]. 
If A and B are general matrices in Ax = ABx, 
the following method is known to be effective 
[18]. First, reduce B to an upper triangular 
matrix by applying n— 1 Householder trans- 
formations from the left. This reduces the 
eigenvalue problem Ax = ABx to the case 
where B is upper triangular. Next, apply to A 
a sequence of plane rotations of Givens type 
from the left, thereby reducing the eigenvalue 
problem Ax =ABx to the case where A is an 
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upper Hessenberg matrix and B is an upper 
triangular matrix. Then apply the QR method 
to B™'Ax =Ax without explicitly computing 
B~'A to reduce the eigenvalue problem Ax 

= /Bx to the case where A and B are both 
approximately upper triangular. The eigen- 
values of Ax = ABx are then easily computed 
as ratios of corresponding diagonal elements. 
This method is called the QZ method [18]. 

A collection of about 50 excellent FOR- 
TRAN subroutines for various types of ma- 
trix eigenvalue problems is contained in 
[18]. These subroutines are in most part trans- 
lations from ALGOL procedures given in 
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A. Interpolatory Integration Formulas 


Numerical integration is a method of finding 
an approximate numerical value of a definite 
integral of a given function f(x). Usually the 
integral f? f(x)dx or f? f(x)w(x)dx (w(x) is the 
weight function) is approximated by a linear 
combination £t, W,f(x,) of the values of the 
integrand at the points x1, X3, .--, Xp. Integra- 
tion formulas are divided into two groups, 
the interpolatory formulas and the formulas 
based on variable transformation. 

In order to obtain an interpolatory formula, 
we interpolate over the integrand f(x) at n 
points x,,X,,...,x, by means of the fLagrange 
interpolation polynomial of degree n—1, and 
then integrate the polynomial over [a, b]. 
Depending on the selection of the points x; 
and the weights W,, we have several kinds of 
formulas. 


(1) Newton-Cotes Formulas. We assume w(x) 
=constant and x;= xo +ih (i=0, 1,...,). The 
weights W, are so determined that the value of 
the integral can be calculated accurately if the 
integrand f(x) is a polynomial whose degree 
does not exceed n. There formulas are called 
the Newton-Cotes formulas: {*: f(x) dx =(fo + 
f,)h/2 for n= 1 (trapezoidal rule), fž2 f(x)dx = 
(fo +4fı + fa)h/3 for n=2 (Simpson’s 1/3 rule), 
and fž f(x)dx =(fo+ 3f, +3f + f3)3h/8 for 
n= 3 (Simpson’s 3/8 rule). The truncation 
errors of these three formulas are given by 
h? f(E)/12, hf (G/90, 3h5 f(2)/80, respec- 
tively, where ¢ is a number in the interval of 
integration and f denotes the ith derivative 
of f (differentiability is assumed). For an even 
n, the polynomial of degree n+ 1 can also be 
integrated accurately by these formulas. 

In Table 1 the coefficients A and B; of the 
Newton-Cotes formulas hA Xo Bif (x), h= 
(x, —Xo)/n, and the coefficients C of the error 
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Table 1 

n A B B B, B; B, B, Bg B, B C 

1 1/2 1 1 —1/12 

2 17 1 4 1 — 1/90 

3 3/8 1 3 3 —3/80 

4 445 7 32 {2° 32 7 — 8/945 

5 5/288 19 75 50 50 75 19 —275/12096 
6 1/140 41 216 377 37 27 216 41 —9/1400 

7 7/17280 751 3577 1323 2989 2989 1323 3577 751 — 8183/518400 
8 4/14175 989 5888 —928 10496 —4540 10496 —928 5888 989 —2368/467775 


term Ch?+! f PXE), where p=n+2 if n is even 
and p=n+ 1 ifn is odd, are given. 

When the interval [a, b] of integration is 
large, we usually divide it into small subinter- 
vals and apply formulas for small n for each 
part rather than formulas for large n for the 
whole interval. For example, if the interval is 
divided into m equal subintervals, we get the 
following formula by applying the trapezoidal 
rule for each subinterval: 


| f(x)dx=h((fo +i wl 2th the tet fea), 


where xX) =4, X,, =), h=(b—a)/m, with trun- 
cation error (b—a)? f'?)/(12m7). (Here, as in the 
rest of the article, f stands for f(E), with 
differentiability assumed as before.) By apply- 
ing Simpson’s 1/3 rule we obtain the formula 


[ioa 


2 
=F MUo Sam) 2 +AA Hf +.. + fam) 


+h +fa+ Sr + f2m-2)), 


where xo =4, Xam = b, h=(b—a)/2m, with 


the correction term h( fi —f_,)/24+h(f,,— 


Jm+1)/24=h(Afo + Af—1)/24 — hb Afn—1 + Mfn)/24, 


where Af; means f,,, — fj. 


(2) Chebyshev Formulas. The Chebyshev for- 
mulas are a family of integration formulas in 
which all the weights W, of Xt- W,f(x,) are 
equal, while the abscissas x; are chosen so that 
the integral can be evaluated exactly when f(x) 
is an arbitrary polynomial whose degree does 
not exceed n. When J! , f(x)dx = WE f(x), it 
is easy to see that W=2/n since the right-hand 
side must be equal to the left-hand side when 
J (x)= 1. It is known that the abscissas x; for 
n<7 and n=9 are real, while for n=8 and 

n= 10 at least one of the abscissas becomes 
complex. It is easy to see that Chebyshev for- 
mulas are interpolatory. 


(3) Gauss Formulas. In the Gauss formulas 
both the weights W, and the abscissas x; are 
chosen so that we obtain the accurate value of 
the integral when the integrand is any poly- 
nomial whose degree does not exceed 2n— 1. If 
we put /7(x)=[T}-,(x —x,), an arbitrary poly- 
nomial of degree 2n — 1 can be expressed in the 
form 


truncation error (b — a)? f/180(2m)*. n II(x) mt 
; PEE : S x)= ) ———————— f, + I(x a,x", 
For the integral of a periodic analytic func I(x) 2 (x—x,) x) h ( P k 


tion over a single period the trapezoidal rule 
with equally spaced points gives the best result 
asymptotically, as the number of points tends 
to infinity. 

Newton-Cotes formulas can also be ob- 
tained by integrating over the interval [a, b] 
the interpolation polynomial for equally 
spaced points. The formulas mentioned so 
far are called closed formulas since they use 
values at the two endpoints of the interval. 
We can also use open formulas, which do 
not use values at the ends. For example, we 
have fž4 f(x)dx =4h(2f, — f, + 2f3)/3 with trun- 
cation error 14h5f/45. Open formulas are 
useful for numerical solution of differential 
equations. There are also formulas that in- 
clude values outside the interval, for example, 
(— f+ 13fot 13f, — f2)h/24 for f3: f(x) dx. 
Applying these formulas to the m subintervals 
of [a,b], we obtain a trapezoidal formula with 


where /(x,)=f, and the first term is the *La- 
grange interpolation polynomial. By the as- 
sumption that the integral of f(x) with weight 
w(x) equals D?_, W, fy, we obtain W, = 

[2 (w(x) T(x) — x, T(x) dx and the relations 
k=0,1,... 


b 
| w(x)IT(x)x* dx =0, wn. 


Accordingly, the abscissas x; are determined as 
the roots of the polynomial /7(x) of degree n 
that is orthogonal to x* (k=0,1,...,n—1) with 
respect to the weight function w(x). 

The following are typical examples of inte- 
gration formulas of Gaussian type. 

(1) Gauss integration formulas (in the 
narrow sense). For w(x)=1 and the interval 
{ —1, 1], M(x) is the tLegendre polynomial 
P,(x)=(1/2"n!)d"(x? — 1)"/dx". The error is 
(n!)427" +1 £2” (2n + 1)((2n)!)?. 
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(ii) Gauss-Laguerre formulas. For the 
weight w(x)=exp(— x) and the interval [0, 0), 
TI(x) is the tLaguerre polynomial L,(x)= 
(exp x)d"(x"exp(—x))/dx". 

(iii) Gauss-Hermite formulas. For the weight 
function w(x)=exp(— x°) and the interval 
(—, ©), /7(x) is the fHermite polynomial 
H,,(x)=(—1)"exp x? d"exp(—x*)/dx". 

(iv) Gauss-Chebyshev formulas. For the 
weight function w(x)=(1 —x?)~!? and the 
interval [—1, 1], we use the Chebyshev poly- 
nomial T,(x)=27~)cos(narccos x). In this 
case, the W, are all equal to z/n. 

From the definition of W, we see that the 
Gauss formulas are interpolatory. 


(4) Clenshaw-Curtis Formulas. Although the 
Gauss formula is in general more accurate 
than the Newton-Cotes formula with the same 
number of points of interpolation, the points 
of interpolation for a Gauss formula of any 
order are distinct from those of any other 
order except the point zero, which appears in 
all formulas of odd order. The Clenshaw-Curtis 
formulas are interpolatory, with the points 
chosen so that the distribution of the points is 
similar to that of the Gauss formula and such 
that, in proceeding from a computation of 
order n to that of order 2n, all the function 
values evaluated in the former computation 
be used in the latter. For w(x)= 1, the inter- 
val [~1, 1], and even n, the points x, and the 
weights W, are given by 


T 
X,=cos—, k=0,1,...,n; 
n 
mM = W, ; 
o n Wn’ 
4n2 | 2njs 
W= Wes z COS— —, 
nj=0 1-4 n 
here 
s=1,2,...,x. 
2 


>” means that the first and the last terms in 
the sum are to be halved. There are some 
different types of Clenshaw-Curtis formulas 
depending on the selection of the points x, [1]. 


B. Integration Formulas Based on Variable 
Transformation 


If the integrand has some singularity at the 
endpoint, any interpolatory formula based on 
interpolation with a polynomial does not give 
a good result. In such cases integration for- 
mulas based on variable transformation are 
quite effective. 
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(1) IMT Formula. The tEuler-Maclaurin 
formula is given by 


b 1 n~-1 
| aa A flak +5fby 


a 


m a . E 


pe + Rm 


pA2mrh 


1 n-1 
= (2m) 
Rn Gm! I. B,,,(t) f5 fC™a+kh+ m) dt, 


where the B„(t) are tBernoulli polynomials 

of degree n and the B, are tBernoulli num- 
bers (By = 1, B, = —1/2, B, = 1/6, B, =0, B = 
—1/30,...). This formula suggests that if the 
higher derivatives of the integrand vanish at 
the both endpoints, the error of the trape- 
zoidal rule with equally spaced points be- 
comes very small. The IMT formula is based 
on the idea of transforming the variable x of 
f3 f(x)dx in such a way that all the deriva- 
tives of the new integrand vanish at both end- 
points by taking x= p(t), p(t)=K ~ fo y(t)dr, 
K =[§y(a)de, W(t) =exp(—t-? —(1—1)"1). 
Then the trapezoidal rule with h=1/n is 
applied to the transformed integral to obtain 
(1/Kn) Ej w/a) f(@(j/n)) [1]. The asymptotic 
expression of the error for the IMT formula is 
proportional to exp(— Cn) with a positive 
constant C. 


(2) Double Exponential Formula. The trape- 
zoidal rule with equally spaced points ap- 
plied to the integral of an analytic function 
over (—00, 00) gives in general a result of 

high accuracy. The double exponential for- 
mula is based on the idea of transforming 

JL, f(x)dx to [2 fle())o'(Hdt with x= 
y(t)=tanh(;zsinht) and applying the trape- 
zoidal rule with a mesh size h, which results in 
hE —.f(e(nh))e'(nh) [4]. The name of the 
double exponential formula is attributed to the 
decay of y(t) at t> +o, which is approxi- 
mately proportional to exp(—Cexp|t|) with a 
positive constant C. The transformations x = 
exp(zsinht) and x =sinh(}zsinht) give the 
double exponential formulas for the infinite 
integrals f f(x)dx and [%,, f(x)dx, respec- 
tively. In the actual computation the infinite 
summation is truncated at appropriate upper 
and lower bounds. The asymptotic expres- 
sion of the error for the double exponential 
formula in terms of the number N of the sam- 
pling points actually used is proportional to 
exp(—CN/log N) with a positive constant C. 
The IMT formula and the double exponential 
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formula are robust against the singularities at 
the endpoints. 


C. Automatic Integration 


By an automatic integration scheme we mean a 
computer program for numerical integration 
of f? f(x)dx in which the user gives the limits 
of integration a and b, a subroutine for com- 
puting f(x), and an error tolerance ¢. Then the 
program gives a value of the integral which is 
expected to be correct within the tolerance e. 
Usually in an automatic integration scheme 
the mesh size is halved until the desired ac- 
curacy is attained. Automatic integration 
schemes are classified into two groups, non- 
adaptive schemes and adaptive schemes. In a 
nonadaptive scheme, the sequence of integra- 
tion points is chosen according to some fixed 
rule independent of the shape of the inte- 
grand. Newton-Cotes formulas, Clenshaw- 
Curtis formulas, IMT formulas, and double 
exponential formulas can be used as base 
formulas for nonadaptive schemes. 

From the historical point of view, Romberg 
integration should be mentioned; it is a kind of 
nonadaptive automatic integrator. Consider 
an integral I= f} f(x)dx. Divide the interval 
[a,b] of integration into 2* subintervals and 
apply the trapezoidal rule with the mesh size 
h=(b~a)/2*, which we denote by Tf". Then, 
starting from the values obtained for T®, 
k=0,1,..., we compute the sequence 
pan Tt ,2,.... 

4”—1 
From the tEuler-Maclaurin formula the as- 
ymptotic error for the trapezoidal rule N® can 
be expressed as I — Tf =c,h*+c,h*+...+ 
Cmh?™ + ..., where c,,=const x (f 2” (b) — 
f'?""'(a)) does not depend on h. If we com- 
pute T® =(4Tf"*) — T®)/3 using the values 
Tf) and T®, then we see that the asymp- 
totic error expression of T® becomes | — T{ = 
cyh*+c,h®+...+c,h?"+ .... Romberg inte- 
gration is based on the idea of eliminating 
the term with h?™ in the expression of the error 
by successive application of 7,”. This is an 
application of Richardson’s extrapolation 
procedure. 

In the adaptive scheme, the points are 
chosen in a way that depends on the shape of 
the integrand. The Newton-Cotes formula of 
order 8, for example, is used as the base for- 
mula for the adaptive scheme. 


D. Approximate Multiple Integration 


If a region is a product region, such as a rect- 
angular parallelepiped, a product rule ob- 
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tained by forming the product of 1-dimen- 
sional formulas is useful. For integrals over a 
cube or a sphere there are monomial rules 
which are exact for a certain family of mono- 
mials. These rules can be used for integrals of 
dimension lower than 5 or 6. For integrals of 
higher dimension only methods based on 
sampling make sense. 


E. Numerical Differentiation 


In order to find the numerical value of the 
derivative f, = f'(x,) at a point x, from the 
tabulated values f, = f(x,), we usually use the 
derivative of the tLagrange interpolation 
formula. This gives 

IT'(X,) 1 
f F & (x, = xT)? ii oe Xp— a” 
where M(x) = Pg- (x — x). 

When we compute the derivative of a func- 

tion which can be evaluated at any point in a 
given interval, the approximation 


F(x +h)— f(x—h) 


I= F; 


is useful; similarly, we can use 


IO+A)=ASO+IO—h) 


f"(x)= ie 


It must be noted that, as h tends to zero, the 
difference f(x +h)— f(x —h) comes to contain 
fewer significant digits, so that it is meaning- 
less to carry out { f(x +h)—f(x—h)}/2h be- 
yond a reasonable value of h. 
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300 (XV.1) 
Numerical Methods 


In the earlier history of mathematics, the de- 
velopment of methods of numerical calcula- 
tion was one of the main purposes of research. 
Until the beginning of this century, logarith- 
mic calculation played a central role in numer- 
ical calculation, and the main topic of this field 
was to make tables of values of functions. The 
digital electronic computer (— 75 Computers), 
which made its first appearance in the 1940s 
and has been developing at an exponential 
rate, has caused drastic changes in numerical 
technique. Problem solving by numerical 
methods has now become one of the fundamen- 
tal means of research in the physical sciences 
and engineering, and also in the social sciences 
and humanities. In this article we give exam- 
ples of the changes in numerical methods 
brought about by the availability of digital 
computers and portable calculators. 

Computers may be effectively utilized for 
calculating individual values of functions. This 
has led to the reexamination and, in some 
cases, modification of approximate formulas 
for evaluating functions (— 142 Evaluation of 
Functions). For familiar functions, such as 
tlogarithmic, texponential, and ttrigonometric 
functions, tables have been almost completely 
replaced by function keys on electronic cal- 
culators. Microprogramming algorithms for 
obtaining values of these functions have also 
been devised [1]. A complex function-theoretic 
error-estimation method for use with numer- 
ical integration formulas is given in [2]. In 
this method, graphical outputs of the com- 
puter are utilized. For problems in which the 
existence and uniqueness of solutions have 
been established, as for algebraic equations 
and ordinary differential equations, fairly good 
numerical calculation methods and their error 
estimates have been established (— 301 Nu- 
merical Solution of Algebraic Equations; 

303 Numerical Solution of Ordinary Differen- 
tial Equations; [3,4]). A method for estimating 
arithmetic taccumulation errors for operations 
involving finite numbers of digits has been 
systematized (— 138 Error Analysis; [5]). 

It is not unusual nowadays for linear equa- 
tions with 10,000 or more unknowns to be 
solved. As new computing systems, such as the 
virtual memory system and the vector oper- 
ation system, come into existence, new al- 
gorithms are examined; a numerical method 
that is optimal for today’s technology may 
well be suboptimal for the next generation 
of computers. General-purpose program 
packages of linear problems, including eigen- 
value problems, have been developed (— 298 
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Numerical Computation of Eigenvalues; 
[6]). 

Partial differential equations seem to have 
become more familiar because of the visualiza- 
tion of their numerical solutions in graphical 
computer outputs. In [7], which appeared 
much earlier than computers, partial difference 
equations for the fundamental linear prob- 
lems in mathematical physics were discussed 
with a suggested variational treatment (— 304 
Numerical Solution of Partial Differential 
Equations; [8]). The ffinite element method, 
which started as a calculation technique in 
structural mechanics and is based on the tcal- 
culus of variations, is widely accepted as an 
efficient approximation method for partial 
differential equations [9, 10]. The term tsimu- 
lation is used often to describe procedures in 
which partial differential equations describing 
time-dependent phenomena are discretized; 
the resulting difference system can be solved 
for long periods of time [11]. 

Numerical analysis has heretofore long been 
considered to be the only numerical method 
(for error analysis in particular), and has been 
carried out mainly by means of the discretiza- 
tion of equations. Nowadays, however, mathe- 
matical modeling, taking into account both the 
phenomena to be described and the capabi- 
lities of the computers to be used, has become 
an important numerical method. 
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A. General Remarks 


Methods for finding roots of an equation 

FS (x)=0 (where f(x) is not necessarily a poly- 
nomial, but is assumed to be a function with 
some regularity) by numerical calculation can 
in general be divided into the following two 
types: (1) The first has as its goal the finding 
of good approximate values of the roots; 
examples are the Bernoulli method (Section J) 
and the Graeffe method (Section N). (ii) The 
second improves the accuracy of estimates of 
the roots; an example is the Newton-Raphson 
method (Section D). The methods belonging to 
(i) and (ii) can be used separately. However, in 
(i) convergence of the approximations may be 
excessively slow when a pair of nearly equal 
roots is present, and the size of numbers in- 
volved in the calculation may grow exponen- 
tially as we proceed through the iterations. In 
(ii) convergence of the approximate values is 
not assured unless the initial approximate 
value is suitably chosen. Accordingly, when an 
electronic computer is utilized, it is advisable 
to combine both types of method. Because of 
the development of computers, solutions with 
global convergence have become important 
Pi 2.15, 19,21 |; 


B. Successive Substitutions 


When an equation f(x)=0 is transformed into 
x = F(x) and the roots are obtained by iterative 
calculation of x;,, = F(x,) (i=0,1,2,...), suffi- 
cient conditions for its convergence are as 
follows: Let one of the roots of the equation be 
a. Then x;,, —%*=F(x;)—«, and therefore (xX;+; 
—a)/(x;—a)= F'(E) (x;<E<a or x> č >a). 
Accordingly, x converges monotonically if 0 < 
F’(€)< 1, while it converges with oscillation 

if 0> F'(&)> —1. If |F(6)|>1 and F~!(x) de- 
notes the inverse function of F(x), the iteration 
X;+1ı =F '(x;) converges. To define the speed of 
convergence by iterative processes the follow- 
ing notion of order is utilized: When lim; x; 
=a, the speed of convergence of the sequence 
{x,} is said to be of the kth order if lim, ,,,(x;4, 
— «)/(x; a) =c #0. Necessary and sufficient 
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conditions for the speed of convergence of 
{x,;} to be of the kth order are a= F(a), F’(a)= 
F"(a)=...= F(a) =0, F(a) 40. 

The main iterative processes used in numer- 
ical calculation are described in the following 
sections. 


C. Regula Falsi 


Regula falsi (or the method of false position) is 
a process of obtaining the real root « of an 
equation {(x)=0 by approaching the root 
from both sides. The calculation procedure is 
as follows: Assume that f(x,)>0 and f(x,) <0, 
where x, and x, are approximate values of « 
such that « lies between them. A new approxi- 
mate value x is then obtained from x=(x,f(x,) 
=x pf XQ) fp) —f(%q)). If) >0, then x> 
Xp and if f(x)<0, then x—x,, and the pro- 
cedure is repeated (the symbol > means re- 
placement). The conditions and speed of con- 
vergence of regula falsi are as follows: Let F(x) 
=(x f) Xf (XQ) F(x) —f(xq)); then F’(a) = 
(f(x_)+(a—x,)f'(«))/f(x,). If f and f” exist 
and are continuous near q, then f(x,)=f(«)+ 
(x,—a)f (x) +(1/2)(x;— 2) f" (E), where «> 

Č > X4 Or & < < x, Accordingly, assume 

that F’(a)=(1/2)(x,—«)f"(@/f(x,) #0. Then 
|F'(x)|<1 can be satisfied if x and x, are near 
a. Therefore, if the initial value is appropriate, 
the convergence is of the first order. In regula 
falsi, only calculation of f(x) is necessary, while 
that of f'(x) is unnecessary. Furthermore, if 
J(x)=0 has nearby real roots, i.e., «, B, ... close 
to each other and f'(x) small near the roots, 
there can be no mistake such as the neighbor- 
ing root ß being obtained in the process of 
finding the root « by this method. This method 
belongs to the inverse linear interpolation 
method. This kind of inverse interpolation 
includes Muller’s method [3], which uses 
tLagrange’s interpolation formula, the Torii- 
Miyakoda method [4], which uses tHer- 
mite’s interpolation formula, and Whittaker’s 
method [5], which uses *Stirling’s interpola- 
tion formula. These methods converge more 
rapidly but have more complicated formulas 
than inverse linear interpolation. The Sturm 
method (in which the interval where the roots 
exist is narrowed by tSturm’s theorem) and the 
Horner method (which obtains the decimal 
part digit by digit), both used to obtain the 
real roots of a high-order algebraic equation 
J(x)=0, are also of this type. 


D. The Newton-Raphson Method 


In obtaining the real roots of an equation f(x) 
=0, the Newton-Raphson method (or the New- 
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ton iterative process), which converges rapidly, 
is used when f(x) #0 is computable. Let x; be 
a sufficiently close ith approximation of the 
root a; then x,;,,=x;—f(x;)/f'(x;) is closer than 
x; to the true solution. The process is repeated 
until |x;,,—x,| is small enough. Conditions 
and the speed of convergence are as follows 
[6,7]: Let F(x) =x —f(x)/f'(x); then F’(x) = 
SO'O F and F(a) =0. Accordingly, 
F’(a) #0 if f(a) 40 and f’(a) 40, and the 
convergence is of the second order if it is pos- 
sible to determine an appropriate approxi- 
mate value x; that is close to « and satisfies 

| F’(x;)|< 1. In particular, if f(x) f (xo) 40, ho = 
= f(xo)/f (Xo), IF"COLSM, and | f(x) > 
2|ho|M, then every Newton approximation 

x; Starting with x, is contained in the interval 
IT=[x,—|ho|, x, +|Aol], the equation has only 
one root « in I, and x,->«. Besides, 


la— xin S Mla xia 2S GD), i= 1,2... 


With regard to convergence and evaluation of 
the error, including roundoff error in practical 
computations, M. Urabe’s studies should be 
consulted [8,9]. In general, the convergence 
is of the third order in the Newton-Raphson 
method, which uses not only f'(x) but also 
J”) L10]. 

When f'{x)=0, we must assume f”(x)#0 
and use the value f"(x). With regard to this 
case, the study by S. Hitotumatu should be 
consulted [11]. In addition, W. Kizner (SIAM 
J. Appl. Math., 12 (1964)) reported an itera- 
tive process in which the convergence is of the 
fifth order without using any derived function 
higher in order than f”(x). The essential part 
of his method lies in a numerical integration of 
the integral part by the Runge-Kutta formula, 
based on the fact that if x, is the first approxi- 
mate value of a root X of f(x)=0, then 


° dx 
x= —df +x. 
B l 


There may be several iterative processes for 
solving f(x)=0, even if the order k of the con- 
vergence is fixed. For example, in obtaining 
the positive root of f(x)=x? —a=0 (a>0), i.e., 
the square root x =a", both iterative pro- 
cesses X;41 = (x, +a/x))/2 and x;y = 2x7 /(3x? — 
a) give convergence of the second order. In 
obtaining the real root of f(x) =x? —a=0, i.e., 
the cube root x=a", x;,, =x;+(a/x? — x;)/3 
converges of the second order, while x;,, = 
x;/2+(a+ a/2)/(2x? + a/x;) converges of the 
third order. 

The Newton-Raphson method is applicable 
also to holomorphic functions of complex 
variables. 

Take simultaneous equations of two vari- 
ables, f(x, y)=0 and g(x, y)=0. If it is possible 
to transform the equations into x = ~(x, y) and 
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y=w(x, y), and if |6p/dx| + |oy/ex| <1, |6@/dy| 
+ |dw/dy|<1, then the iterative processes x;,, 
=¢(X; yi) and y;,, =W(x;, yi) are applicable. In 
the Newton-Raphson method, let the correc- 
tions to the ith approximate values x; and y; 
be Ax; and Ay,, and solve 


Ax yy Ax; + HX, y) Ay; = = f(X; Vas 


GAs yd Axi + G(X, y)Ay;= —g(X;, yi). 


Then we can take x;,, = xX;+ Ax; and yi; = 

y; + Ay; as the next approximate values. This 
process is repeated until both |Ax,| and |Ay,| 
are sufficiently small. More generally, the 
Newton-Raphson method for solving a system 


of n nonlinear equations f;(x,,...,X,)=0, i= 
1,2,...,n, is defined by 
xD = xO J(x) “1 F(x), k= 0,1,2,..., 


where x = (x, ...,x™), F(x)=(f10%1,...5%,) 
EmA s SalXi> res s Xn)’, I(x) =0f: x4, eet »Xp)/OX;) 
(the ‘Jacobian matrix of F), and where x is 
chosen appropriately [23]. 


E. The Bairstow Method 


Corresponding to a pair of complex roots «+ 
if of an algebraic equation f(x)=aox" +a, x" 
+...+a,=0 with real coefficients, there is a 
real quadratic factor x” + p*x +q*. The Bair- 
stow method (or Hitchcock method) obtains 
the coefficients p* and q* of this quadratic 
factor. 

To do this, first choose an appropriate qua- 
dratic factor x? + px +q as a candidate. Then 
through synthetic division by the quadratic 
factor, b; and c; are computed by means of 
the formulas 


1 


b;=a;— pb, —qb;-2, j=0,1, h, 
and 
cj=b;—pej-1—qej-2, j=9,1,...,n-1, 


where 
bp =b29=¢., =z. 
By solving the stmultaneous equations 
Cr-2Ap +¢,-3Aq=b,-1, 
Cr-1 Ap +c,-2Aqg=b,, 


where 


Cn-1 = Cn-1 = Diyas 


the quantities Ap and Aq are obtained. Then 
taking p=p+ Ap, G=q+ Aq, we have x? + 
px + q as a new candidate. This operation is 
repeated until Ap and Aq are sufficiently small. 
Since this method corresponds to the Newton- 
Raphson method in the case of two variables, 
and, accordingly, the convergence is of the 
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second order, a choice of suitable initial values 
of p and q leads to rapid convergence. The key 
to this method lies in choosing p and q so that 
R,(p,q)=0 and R,(p,q)=0, where R, and R, 
are such that R,x+R, is the remainder of f(x) 
divided by the trial quadratic factor x? + px + 
q. This method was generalized by A. A. Grau 
(SIAM J. Appl. Math., 11 (1963)). Namely, 
when f(x) =(x? + px + q)g(x)+r(x) with r(x) = 
r,x**! +7,x*, the functions r; and r, can be 
used instead of R, and R,. The process cor- 
responds to the Bairstow method when k =0, 
and to the McAuley method (SIAM J. Appl. 
Math., 10 (1962)) when k=n—2. 


F. The Durand-Kerner Method 


The Durand-Kerner (DK) method [12] for 
solving an algebraic equation f(z)=z"+a,z" ! 
+...4+a,=0 (a, 40) with complex coefficients 
is an iterative method defined by 


(k) 
zhk+1) — 7h) - fz ) a 
Wat, j41 (2 »— zh ) 
Plies k=0,1,2,.... (1) 


A feature of this method is that it can deter- 
mine simultaneously all the roots of f(z)=0. 
Let ¢,,(Z1,---5Z,), M=1,2,...,n, be the mth 

elementary symmetric functions with respect 


tO Zisa Zy 


A E E ne a ore Te 


š K M m 
llh... im 


Set f,,(Z)=(—1)" Pm(Z1> ---, Zn) — âm. Then «= 
(x,,...,%,)', a set of the roots of f(z)=0, is a 
solution of a system of the n equations f,,(z)= 
0, m= 1,2,...,n. I. O. Kerner [12] showed 
that the DK method is the Newton-Raphson 
method applied to the system of nonlinear 
equations f,,(z)=0, m=1,2,...,n. Therefore 
the speed of convergence of (1) is of the second 
order, provided that it converges. The initial 
values 2%, ..., 2) for the DK method are 
usually chosen as follows: Let g(z) = f(z—a,/n) 
=2z"+¢,2" *+...+¢, and h(z)=z"—|c,|z" 7 
—...—|c,|. If (cg, ...,¢,)#(0, ...,0) (n> 2), then 
it can be shown that h(z)=0 has exactly one 
positive root r, and all the roots of f(z)=0 lie 
in the disk |z+a,/n|<r. Now, with a positive 
ro 2r and 0=n7/(2n), put 


2(i— 1 
P= -Trel ( ë Zo] 
n n 


E m 0) 


Such a choice was proposed by O. Aberth 
[13]. Hence the process (1) together with (2) 
can be called the Durand-Kerner-Aberth 
(DKA) method. It is shown for the DKA 
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method that, if rọ is large enough, then 


a 1\* a 
+ %-(1-2) (2+5), i=1,2,...,n, 
n n n 


hold nearly for the first several steps. Thus 
the DKA method has a certain kind of global 
convergence property that renders it one of the 
most powerful methods for solving algebraic 
equations. 

A variant of the Durand-Kerner method is 
the Ehrlich-Aberth method [13, 14]; it is de- 
fined for i=1, 2, ..., n, k=0, 1, 2, ... by 


geet) 


aii. f(a) 
TF PYF) Ea jes Me — 2)" 


The speed of convergence of this method 

is of the third order. Further variants of 

the Durand-Kerner and the Ehrlich-Aberth 
methods have been proposed by M. Iri et al. 
[15] and A. W. M. Nourein [16]. 


G. The Dejon-Nickel Method 


Let f(z) be a polynomial of degree n with 
complex coefficients, and take zy such that 
J (Zo) #9. Then we can write 


f(Zo +h) =f(Zo) 14+ bhi +b hit! +... +b,h™ 
(b; #9) 


= f(zo){1+b;hi(1 +0}, @=o(h). 


Now, choose h sufficiently small so that |b,h'| 
<1, |0|<1, and arg(b;h’)= r. Then |1+),h‘|= 
1 —|b;h'| and |b;h'0|<|b;h'| so that | f(z> +h)| 
<If(Zo)|{[1 +b;h*l+lb:h'0]} <| f(zo)]. Hence, 
if zp is chosen so that min| f(z)|=|f(z5)|, 

then we must have f(z,)=0. This is the out- 
line of Cauchy’s existence proof for the roots 
of algebraic equations. The Dejon-Nickel 
method [17] is a method in which h is chosen 
as follows: 


h=( —1/b,)""*, 
where 


min {|1/b,{'9|b;40,i<j<n} =|1/b,|"* 
(=r, Say). 


If several such k exist, take the smallest one. 
The branch of the multivalued function 
(—1/b,)"* is chosen such that z™* is positive for 
z positive. If | f(z) +)| <(1 —¢)| f(z )| with a 
preassigned constant e such that 0<e<1, then 
put z;=2,)+h. If the inequality does not hold, 
then for each integer m > 1 choose the small- 
est integer !=/(m) such that max {|b,|(r/2”) | 
i<j<n\=|b|(r/2"), and put 2m = Zo +(r/2”) 
(—|b,|/b)'". Find the smallest integer m>1 such 
that | f(zo)| < —2 *(r/2"V'|b |) f(Zo)|, and put 
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z,=Z,,. By continuing this process, a sequence 
{z;} is constructed such that | f(z)|>|f(z,)| > 
| f(z)|> .... It converges toward some root 

of the equation f(z)=0. S. Hirano has pro- 
posed a similar method. 


H. Methods for Finding Good First Estimates 
of Roots 


Some of the principal methods for obtaining 
good first approximate values of the roots are 
given in the following sections I-N. 


I. Matrix Methods 


The problem of solving an algebraic equation 
f(z)=2"+a,2" '+...4+a,=0 with complex 
coefficients is equivalent to that of finding the 
eigenvalues of the companion matrix 


74) ~a —ay — apn- Tan 
l 0 0 0 0 
A=| 0 1 0 0 0 
0 0 0 1 0 


Therefore numerical methods for solving the 
eigenvalue problems for nonsymmetric ma- 
trices are applicable (— 298 Numerical Com- 
putation of Eigenvalues). However, it should 
be remarked that this might be inefficient 
because most of the elements of the matrix A 
are zero [18]. 


J. The Bernoulli Method 


In the Bernoulli method the iterative formulas 
S1 = — 41, S= —(4,S,-, +4.5,-2 +... H ar-a Si 
+ka,) (k=2,3,...,n), S= — a1 Sk-1 — 42 Sk-2 — 
1 Sy, (K=n+t 1,n4 2,...) are repeatedly 
applied to an algebraic equation f(x)=x"+ 
a,x” }+a,x" ?+...+a,=0. When the roots 
of f(x)=0 are «,,45,...,%, (x> l|... > 
lanl) Sk/Sk-1 >, If a, is a real and simple 
root and |a,|>|a,|. When «,, «, are complex 
roots, we put a, = Re”, a, = Re. If |a3|<R, 
then we have 


Se — Spat Ska 


2 —R?, 
Sk-1 — S,Sy-2 


SkSk-1 — Sk 41 Se~2 


z ~2Rcos 0, 
Sk-1 — SeSk—2 


and hence the two roots of x? — (2R cos 0)x + 
R?=Oare «, and a. S, is the sum of the kth 
powers of the n roots of the equation, «,, >, 
s+, æ. C. Lanczos used xf"(x)/f(x)=n+S,/x+ 
S,/x?+...4+8,/x*+ ... to compute S, [10, 

pp. 26-30]. Let the eigenvalues of the com- 
panion matrix A be A,,43,...,4, (4, ]2 1/4212 
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...=[A,|). Then the Bernoulli method is the 
same as the tpower method for obtaining the 
maximum characteristic value 4, (— 298 Nu- 
merical Computation of Eigenvalues C). 


K. The Lehmer Method 


The Lehmer method [9] is a general method 
for finding the n roots of an algebraic equation 
with complex coefficients 


f(z)=a9 +4, 244,27 +...4+4,2"=0 


in the complex plane by repeating the follow- 
ing procedure: First, draw a circle whose cen- 
ter is at the origin and whose radius is R= 
r2-°, where r is an arbitrary given number 
and @ an arbitrary given integer. Then, utiliz- 
ing the Lehmer theorem, observe whether a 
root x of f(z)=0 lies inside the circle. If there is 
no such root, replace R with 2R, whereas if 
such a root exists, replace R with R/2. Con- 
tinue this process until an R for which a root 
x exists within the annulus R <|z|<2R is 
obtained. 

Second, draw circles with the centers $, = 
(S5R/3)exp(i2zk/8) (k =0, 1,...,7) and com- 
mon radius p=(5R/3)/2 and cover the annulus 
obtained by the first step. Then find out which 
of the circles has the root « in its interior. If 
a is in the interior of the circle for k=j, the 
origin is shifted to $, and the operation is 
started again from the first step. If R, satisfy- 
ing R,<|z—f,|<2R, is obtained by the first 
step, we have R, <(5/12)R. Therefore, when 
the first step is repeated N times, the root « 
is confined in a small circle whose radius is 
smaller than 2(5/12)"R. Then the center ß of 
this small circle gives a good approximate 
value of «. 


L. The Downhill Method 


The downhill method is a method for obtain- 
ing the extreme values of a function of many 
variables. It is also applicable in obtaining 
approximate solutions of a system of equa- 
tions. Let us consider the downhill method in 
the case of two variables. The problem of 
obtaining the real roots of simultaneous equa- 
tions f(x, y)=0 and g(x, y)=0 can be reduced 
to the problem of obtaining the coordinates 
(a, 8) which give the extreme value 0 of D(x, y), 
where (x, y)= f?+g?. The values of ®(x, y) 
are calculated at 3? points obtained by the 
combination of x=x;, x; +h; y=y,,y¥, +h, 
where (x,, yı) are arbitrary approximate values 
of (x, 8) constituting the centers of those sets of 
points, and h is the given step size. Utilizing 
the values of the function at the 3? points, 
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(x, y) is approximated by a quadratic surface 
by +b,x+b,y +b, (3x? —2) 
+ b,5(3y? —2)+b, xy. 


The values of bp, bi, b2, by, 622, D12 are cal- 
culated by the ‘method of least squares, and 
the center (x*, y*) of the approximate quadra- 
tic surface is obtained. Then the first approxi- 
mations x, and y, are replaced by the sec- 
ond approximations x; +x¥ and y; + y¥. This 
process is repeated while the step size h is 
made appropriately smaller. This method is 
an improvement on the method of successive 
experimental planning used by G. E. P. Box 
and K. B. Wilson (1954) to obtain optimum 
conditions in the exploration of response 
curved surfaces. 

By using another minimization technique, 
J. A. Grant and G. D. Hitchins [21] gave an 
ever-convergent algorithm for determining 
initial approximations to the roots of algebraic 
equations with real coefficients. The prac- 
tical implementation of this method is given in 
J. A. Grant and G. D. Hitchins (Comput. J., 
18 (1975)). 


M. Continuation Methods 


Let F(x)=(f,(%), ---, CO) =0 (x =(%1, -.-, X,))! 
be a system of n equations. Suppose that no 
reasonable approximation for a solution exists. 
Then, take arbitrary x and define a one- 
parameter family of equations 


H(x,0=F(x)-(1—-pF(x)=0, O<r<i. 


(3) 


Suppose that for each t the equation has a 
solution x(t) which depends continuously on 

t. Observe then that x(0)=x and x(1) is a 
solution for the equation F(x)=0. Partition 
the interval [0, 1] by the points O=t)<t, <... 
<ty=1. First, solve H(x,t,)=0 by some itera- 
tive method using x as a first approximation. 
Let x be a solution thus obtained. Next, 
solve H(x, t,)=0 by some iterative meiiuu 
using x" as a first approximation, and so on. 
Finally, solve H(x, ty)=0 by some iterative 
method which uses x“~") as a first approxima- 
tion. Then x“, a solution thus obtained, can 
be used as a first approximation in an itera- 
tive method applied to the equation F(x)=0. 
This method is called a continuation method 
[22-24]. 

As another approach, differentiate (3) with 
respect to t. Then J(x(t))x’(t) + F(x®)=0, 
where J(x(t)) is the Jacobian matrix of F, 
evaluated at x= x(t). Hence x(t) is the solution 
of the system of ordinary differential equations 


x (th= —J(x(t)) TF), O<t<], 
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subject to the initial condition x(0)= x, pro- 
vided that J(x(t)) is nonsingular. Therefore 
numerical solution at t= 1 gives a good ap- 
proximation for a solution of F(x)=0. This is 
called Davidenko’s method of differentiation 
with respect to a parameter. These methods 
can be used to find initial approximations of 
the Durand-Kerner and the Ehrlich-Aberth 
methods. They are also applicable to a single 
algebraic equation f(x)=0, which is the special 
case n= 1. 


N. Other Methods 


To obtain the first approximate values of the 
roots of an algebraic equation, the Graeffe 
method has been used, in which the roots of 
the equation are separated by successively 
forming an equation whose roots are the 
squares of the roots of the preceding equation. 
In computer calculations, however, the other 
methods described in previous sections are 
more convenient than the Graeffe method. 

The Lanczos method [10] is a method in 
which g(x), an approximate function of f(x), 
is obtained by the process of approximating 
functions, and the roots of g(x)=0 are taken 
as approximate values of the roots of f(x)=0. 
The Garside-Jarratt-Mack method [25] is a 
modification of the Lanczos method and ap- 
proximates f(x)/f’(x) by a rational function 
g(x) =(x —a)/(b + cx). 


O. Error Bounds for Computed Solutions 


Let z,,...,Z, be computed solutions of an 
algebraic equation {(z)=0 which were ob- 
tained by some method. If z,,...,z,, are dis- 
tinct, then the following result due to B. T. 
Smith [26] is quite useful for estimating the 
errors of z;: Let 


[z—z1<y,= n| f(z] 


j=1,j4: 121-21 


Then the union of the disks T; contains all 

the roots «,,...,«, of f(z) =0. Any connected 
component of |)”, I, which consists of just 
m disks T;, contains exactly m roots of f(z)= 
0. Hence if F;1,=@ for every j 4i, then |a;— 
z;|<y,;. Smith obtained a more general result 
for the case where z,, ...,2Z, are not necessarily 
distinct. 
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A. Condition of Linear Systems 


The solution of the system of linear algebraic 
equations 


b; real, (1) 


n 
> ayx,=b; i=1,...,n, dijs 
j= 


which may be written in the matrix form 


Ax=b; A = (a;;), x =(x)), b=(b)), (1) 


is expressed as quotients of determinants by 
*Cramer’s rule. In practice, however, this form 
of the solution ts of little value for numerical 
computation, because the direct evaluation of 
the determinants involves (n+ 1)!(n— 1) multi- 
plications which, even for a moderate-sized 
system, amount to a prohibitive number of 
arithmetic operations to be executed, even 
with modern high-speed computers; besides, 
it requires high-precision calculation. There 
are a variety of practical methods of solving 
efficiently the system (1) with finite-precision 
calculation. These numerical methods are 
generally divided into two classes: direct meth- 
ods and iterative methods. 
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Whatever method is used, inherent diffi- 
culties are encountered when the solution of 
the system is unstable. The instability is usu- 
ally measured by the condition number of the 
coefficient matrix, which is defined by 


condA=o,/a, (=|All||A7'I 21), (2) 


where the g; (0, >0,>... 20, 20) are the non- 
negative square roots of the eigenvalues of 
A'A (called the singular values of A), || A|| = 
max {|| Ax||/l|x||:x #0}, and lix =,/x'x. This 
definition is valid also for an m x n matrix A 
(m>n). The condition number cond A satisfies 


oxi cond A (a el) 
Ixl ôA | Al tb 
— ~ CO 
All 


ndA 


where Ax =b, (A+ 6A)(x + 6x)=b+ ôb, and 
|S Ajj | A! || <1. As the condition number 
increases, solution processes become more 
susceptible to errors. If cond A is large, the 
system is called ill-conditioned. 


B. Direct Methods 


A direct method is one that yields an exact 
solution tn a finite number of arithmetic oper- 
ations if they are performed without roundoff 
error. Among the existing direct methods, the 
one known as Gaussian elimination with pivot- 
ing, which ts based on systematic elimination 
of unknowns of the equation (1), is found to be 
the best with respect to time or accuracy. Its 
dependability was re-established by means of 
backward error analysis (— 138 Error Analy- 
sis; [1—3]). The method generates successive 
vectors b® =(b*) and matrices A® = (aj), typi- 
cally of the form 


ADM = 


ocoo * 
ooo * * 
* * * & k 
* * * * 
* * * & * 


* 


when n=5 and k=2. At the kth stage, a pivo- 
tal element aj, ' 40 (i, j> k) is chosen in one 
way or another. Then the ith and kth rows 
and the jth and kth columns of A*® are 
interchanged so that af"! becomes aj,. The 
ith and kth elements of b*~" are also inter- 
changed. The pivot aj, is then used to elimi- 
nate all the nonzero entries in its column 
below the diagonal as follows: 

k k-1 


jT lij > 


bt=bf, i=1,...,k; 


a i=1,...,k, j=1,...;7; 


= (ai, */aK ag, 
i=k+1,...,n, Jaki oats 


bk=be'—(ak/ak)bk, i=k+1,...n. 
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Starting with A® = A and b® =b, a system 
of linear equations with an upper-triangular 
coefficient matrix, 


=p -1 
APD x* = per D. 


is produced at the stage k=n—1, where x* is 
a permutation of x, caused by interchanging 
columns. This part of the process is called 
forward elimination. If ak~' =0 for all i, j>k at 
some stage, then A is a singular matrix of 
trank k—1 and the system (1) admits infinitely 
many solutions if b‘~! =0 for i=k,...,n, and 
no solution otherwise. If this is not the case, A 
is a nonsingular matrix with det A=a7,'a53' 
...a"™ +, and the solution of (1) is given by 


n 
k n-1 n-l lx* n-1 
x# =| bo — Y aik Xk |j Gi 
k=i+1 


for i=n,n—1,...,1, which is called back sub- 
stitution. Taking the pivot to be an element 

of the largest absolute value among column 
elements a$ t (i> k) at each stage is called the 
partial pivoting strategy. In complete pivoting, 
the pivot is taken to be an element of the 
largest absolute value among aj”? (i,j >). 
These pivotings are introduced to prevent loss 
of accuracy due to rapid growth of elements of 
successive A“. Although a smaller bound for 
the growth factor is obtained for complete 
pivoting, in practice partial pivoting appears 
to be entirely adequate. 

If rows or columns are not interchanged in 
the process, forward elimination effectively 
produces a factorization of A into the product 
of a lower triangular matrix L and an upper 
triangular matrix U, i.e., 


A=LU, (3) 


where L has unit diagonal elements and U = 
A“), This factorization is computed directly 
by the Doolittle method without calculating 
the intermediate A“. The Crout method also 
produces a similar factorization (3) in which 
U has unit diagonal elements. The Cholesky 
method determines a similar factorization (3) 
of a tpositive definite matrix A, in which U = 
L'. Once the triangular factorizations (3) are 
formed, the solution of the system (1) is deter- 
mined by solving the two triangular systems 
Ly=b and Ux=y successively. 

The number of multiplicative operations 
required for these factorizations are about n?/3 
for forward elimination, Dootittle’s method, 
and Crout’s method, and about n?/6 for Cho- 
lesky’s method. The solution of each triangu- 
lar system requires about n?/2 multiplicative 
operations. Special properties of A, such as a 
banded structure, can be exploited to reduce 
the number of operations and memory re- 
quirements considerably [4]. Gauss-Jordan 
elimination is similar to Gaussian elimina- 
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tion, but the elements above the diagonal 

are also eliminated to dispense with back sub- 
stitution. However, it requires about n*/2 
multiplications. Modern computers can easily 
handle problems of size n= 100 by these direct 
methods. 


C. Iterative Methods 


An iterative method is a dynamical process 
that generates a sequence of approximations 
{x*! converging to the exact solution. At each 
step of the iteration, an improved approxima- 
tion is obtained from the previous ones. The 
accuracy of the solution depends on the num- 
ber of iterations performed. Most iterative 
methods retain the coefficient matrix in its 
original form throughout the process and 
hence have the advantage of requiring minimal 
memory. They are suitable for solving large 
sparse systems arising in finite-dimensional 
approximations to ‘partial differential equa- 
tions, where A is sparse if most of its elements 
are zero. 

Linear stationary iterative processes are the 
most frequently used iterative methods. A 
method in this class is written in the form 


xkaxk14 R(b—Ax*) k= 1,2,..., (4) 


where R is chosen to approximate the inverse 
of A. Usually, R and b— Ax*~! are not ex- 
pressed explicitly in the actual algorithms. If 
the tspectral radius of the iteration matrix | — 
RA is less than one, the method converges 

to the solution for an arbitrary initial ap- 
proximation x° [5]. The key matrix R can be 
chosen quite freely as long as thts condition 
is satisfied. In the Richardson method, R = 
aA‘, 0<a<2/||A\|?. R=(A+E) * is the usual 
choice, in which E is a perturbing matrix that 
makes A + E easily invertible. In the Gauss- 
Seidel method and the Jacobi method, R is 
chosen to be the inverse of the lower triangular 
and the diagonal submatrices of A, respec- 
tively, i.e, R=(L+D) ' and R=D ', where 
L=(a,) (i>j) and D =(a;)). 

Direct methods can be combined with the 
iterative method (4) to obtain an approxi- 
mate inverse R. In the course of factorization 
by a direct method, an artificial perturbation 
E is introduced to produce an incomplete 
factorization, 


LO=A+E, (5) 


so that £ and U are of low tcomplexity of 
computation. A combined method is then 
constructed by putting R= U0 '£7'. Even if 
E is not added intentionally, each of the di- 
rect methods actually produces an incomplete 
factorization (5), in which E is a matrix of 
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small elements accounting for the effect of 
roundoff errors [1—4]. In this case, the com- 
bined method is called the iterative improve- 
ment of the direct method. If applied to a first 
solution x? = Õ ~'E~'b, it produces more accu- 
rate solution in a few steps with only a mod- 
est increase in computation time, when the 
system ts not too ill-conditioned [4]. It is 
essential, however, that the residual b — Ax? be 
computed with higher precision. 

Various convergence criteria have been 
established for a variety of methods [5,6]. For 
example, the Gauss-Seidel method ts conver- 
gent for a symmetric positive definite matrix A. 
The smaller the spectral radius, the faster the 
convergence of the method. In general, a larger 
amount of computation is required in each 
iteration to get faster convergence. If the spec- 
tral radius p(J — RA) is close to one, the con- 
vergence is slow, and an acceleration of the 
process is needed. SOR (successive over- 
relaxation) is an accelerated version of the 
Gauss-Seidel method, in which R,,=(L+ 
w 'D)~! and the optimum acceleration 
parameter œ (0 <œ < 2) is chosen to minimize 
p(l — R,,A)(— 304 Numerical Solution of 
Partial Differential Equations; [6]). There is 
an adaptive acceleration method [7], which, 
if applied to a scalar sequence, reduces to 
Aitken’s 6?-method. 


D. The Conjugate Gradient Method 


The conjugate gradient (CG) method is a non- 
linear stationary iterative method for solving 
a system with a symmetric positive definite 
coefficient matrix. The method generates {x"}, 
{r*}, and {p*} by means of the formulas 


Oy = (rs F) /(p*, Ap), 

xtti = xt Op’ 

rtt =b— Ax" =r*— g, Ap*, (6) 
B, = (r+, re (rk, r”, 

pt =r 4g Bk 


where x° is arbitrary and p? =r° =b — Ax°, 
(r,s)=s*fr. 

The CG method shares a feature with the 
direct method. In theory, {x*} converges to 
the solution in less than n steps, and the p' 
are mutually conjugate, i.e., (p/)'Ap'=0 (iA/). 
When Hestenes and Stiefel [9] proposed the 
method in 1952, they created a great sensation 
because of the method’s theoretical elegance. 
The CG method turned out, however, to be 
highly sensitive to roundoff errors. In practice, 
nice theoretical properties, such as finite ter- 
mination, do not hold in the presence of error. 
Recently, the CG method has regained its 
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popularity as an iterative method for solving 
large sparse systems. The iteration (6) is usu- 
ally restarted periodically to accelerate the 
convergence. It may converge in a small num- 
ber of steps, as it takes advantage of the distri- 
bution of eigenvalues of A adaptively. The CG 
method is most effective when it is used to 
accelerate linear stationary iterative methods 
or when it is applied to the preconditioned 
system 


EAE Y y=Lb, x=(L +}y, (7) 


where Č is computed by the incomplete Cho- 
lesky factorization LE'= A + E. The matrix 
£~'A(£~')' is not to be formed explicitly in the 
CG method. 


E. Linear Least Squares Problem 


Let Ax =b be an overdetermined system of 
linear equations, where A is an m x n matrix 
(m> n). The linear least squares problem is 

to find the x that minimizes the Euclidean 
norm ||b— Ax||. We assume here that A is of 
rank n. The most straightforward method for 
solving the problem is to apply the Cholesky 
method to the normal equation 


(A‘A)x = A'b. (8) 


This may result in ill-conditioning of the sys- 
tem, since cond A'A =(cond A)”. Ill-conditioning 
can sometimes be avoided by forming the 
factorization A= LU by Gaussian elimination, 
where L is an m x n lower trapezoidal matrix 
and U is an upper triangular matrix. The 

least squares solution is then obtained by 
solving successively the systems L'Ly=L'b 
and Ux=y. 

Another approach to avoiding ill- 
conditioning is based on orthogonalization. 
The modified Gram-Schmidt orthogonaliza- 
tion method produces in an effective manner 
the factorization 


A=QU, (9) 


where Q is an m x n matrix whose columns 
are a set of forthonormal vectors and U is an 
upper triangular matrix [10, 11]. The least 
squares solution is obtained by solving Ux = 
Q'b. A sequence of Householder transforma- 
tions, P, = I —2w,w{/||w,||?, can produce the 
factorization 


U 
PPri PiP A=), (10) 


where U is an n x n upper triangular matrix 
[12]. The least squares solution is obtained by 
solving Ux=b, where b is formed by the first 
n elements of the vector P,P,_,...P,P,b. A 
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similar factorization can also be produced by 
using a sequence of Givens transformations, 
which have the advantage of exploiting the 
sparseness of A. The matrices U in (9) and (10) 
are unique and coincide with the transpose L' 
of the lower triangular matrix L of the Cho- 
lesky factorization, A'A = LLt, of A'A. The 
multiplicative operations required for the 
factorizations (9) and (10) are about mn? and 
mn? —n3/3, respectively. 

When the rank of the matrix is unknown, 
we can determine an “effective rank” p of A, 
based on the singular value decomposition 
(SVD): 


A=UDV', 


where U and V are torthogonal matrices of 
dimensions m and n, respectively, and D is an 
m xn diagonal matrix whose diagonal ele- 
ments d;; are the singular values o; of A [13]. If 
singular values smaller than ø, can be ignored, 
the approximate least squares solution is given 
by 2? = VD; U'b, where the ith element d? of 
the diagonal matrix DB, is o; | ifi<p and 0 
otherwise. 

These methods certainly solve the system (1) 
but generally require more computational 
work than those based on the triangular fac- 
torization. The least squares solution is com- 
puted also by applying the CG method (6) to 
the normal equation (8), in which A'A is not to 
be formed explicitly [9]. Iterative methods for 
solving linear systems with singular and rect- 
angular coefficient matrices are characterized 
in terms of the trange of RA and the null 
space of AR [15]. 

Besides the methods discussed here, numer- 
ous other methods have been proposed for 
solving the system (1) [16—18]. Recently, vari- 
ous sparse techniques have been developed for 
direct methods to control the growth of non- 
zero entries (fill-in) in the process of matrix 
factorization [19]. In general, direct methods 
have an advantage over iterative methods; 
however, their relative computational effi- 
ciency varies according to the scale, sparsity, 
and type of the coefficient matrix and to the 
available computational devices. 
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Numerical Solution of 
Ordinary Differential 
Equations 


A. General Remarks 


Applications of the numerical solution of 
tordinary differential equations include ‘initial 
value problems, tboundary value problems, 
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and teigenvalue problems. Although solution 
by an tanalog computer is among the numer- 
ical methods in the wide sense, we usually 
mean by “numerical solution” a solution ob- 
tained by approximating a problem of infinite 
degrees of freedom and a continuous variable 
by one of finite degrees of freedom. The ap- 
proximation of an arbitrary function by a 
linear combination of a given finite system of 
functions is an example (— Section I). Succes- 
sive substitution and power series expansion 
are also used but have no particular advan- 
tages over other methods. Among various 
methods of approximation, the so-called dif- 
ference methods (or discrete variable methods) 
are the most flexible and have the largest field 
of application. A difference method reduces a 
given problem to an approximate problem 

in which we deal only with a systematically 
chosen set of discrete values of the variable 
and with values of the unknown functions 

at the chosen points. A difference method is 
usually carried out by a simple iterative com- 
putation, so that it is suitable for fdigital com- 
puters. We confine ourselves almost exclu- 
sively to difference methods [1-7]. 


B. Initial Value Problems 


To solve numerically general initial value 
problems, it suffices to consider the problem 
of determining numerically the values on an 
interval [a,b] of m functions y'(x) (i=1,...,m) 
that satisfy a system of ordinary differential 
equations of the form y’(x)= f{x, y(x), ..., 
y™(x)) and the initial conditions y‘(a)=n' (i= 
1,...,m), where the f? are given functions with 
appropriate smoothness, and a, b, and y? are 
given constants. We define the mesh points 

x, =a+nh (n=0,1,...), call h the step size, and 
determine numerically the values y} approxi- 
mating y'(x,). 

Assuming that the y} were computed with 
infinite accuracy, i.e., without troundoff error, 
we call ei = y} — y'(x,) the global truncation 
error (or global discretization error), while 
we call ri = ĵi — yi the global roundoff error, 
where the ĵi are the values we actually have 
by means of finite-accuracy computation. On 
the other hand, if the assumption holds locally 
that the solutions at the previous steps are 
exact, we call the ej and rj the local truncation 
error (or local discretization error) and the 
local roundoff error, respectively. To avoid 
confusion, we denote the local truncation error 
by ti. The numerical solution must have the 
property that, for every xe[a, b], lim, .o,.,-.€n 
=0, which is called the condition of conver- 
gence. In particular, if ti = O(h?) (x,=x,h> 
0, where O(h?*') is Landau’s symbol), then p 
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is called the order of the solution process. 
We shall show the typical methods of numer- 
ical solution, using the abbreviation ff for 


S'n yi). 


C. Overall Approximation 


If we rewrite the differential equations on 
the interval [X,,Xn+p] as Y'(Xn4)—Y'(X) = 
fama f'(x, yi(x)) dx, q=1,..., P, and substitute 
suitable ‘numerical integration formulas for 
the integrals on the right-hand side, then we 
have the overall approximation formulas: 


P 
Vira VSA’ Cae NYES q=1,...,P, 
r=0 


from which we can obtain yi,,,..., vi.p when 
the yi are given. Since f,, contains y!,,, these 
equations are nonlinear in y,,,, so that we 
have to use, for example, the iterative proce- 
dure of starting from suitable Oth approxima- 
tions and adopting as the /th approximations 
of the yi +4 values computed from the overall 
approximation formulas by substituting the 
(l—1)th approximations for the y/,, in the fi, , 
on the right-hand sides. These formulas are 
often used to determine a set of starting values 
for a multistep method. The truncation error 
of the formulas depends on that of the numer- 
ical integration formula substituted for the 
integral. A few examples: P= 1, (c,9,€1)= 
(1/2, 1/2) (error h? yiP(x,)/12 + O(h4)); P= 

2, (C105 C119 C12) =(5/12, 8/12, —1/12) (error 
—h* y(x) + O(AP)), (C20; C2122) = (1/3, 4/3, 
1/3) (error hë yxp) + O(h%)); P=3, (C10; C11 
C12, C13) =(9/24, 19/24, — 5/24, 1/24), (C20; €21, 
C22, €23)=(1/3, 4/3, 1/3,0), (C305 €31,€325€33) = 
(3/8, 9/8, 9/8, 3/8). 


D. Runge-Kutta Methods 


By a general Runge-Kutta method we mean a 
method of determining yj=n', yi, yj, ... SUC- 
cessively by means of a formula yi,,—yi= 
h®'(x,, yj; h), where the functions @ are de- 
fined in terms of parameters «, and f,, as Ọ' 
(x, yi; h)= Tho a, ki, kà = f'(x, y), ki= fi(x+ 
Boh, y? + A( Bro — Zs=1 Bys)kb +h Es-1 Prskò) (r= 
1,...,P). A Runge-Kutta method is called ex- 
plicit if £, =0 for r <s, and implicit otherwise. 
In the latter case, if f,,=0 for r <s, the method 
is called semi-explicit (or semi-implicit). Unless 
otherwise stated, the Runge-Kutta methods 
considered below are assumed to be explicit. 
In order to proceed one step from yi to yi, 
we have to compute each function f' (P+ 1) 
times. Hence this is called the (P + 1)-stage 
method. If we denote by z'(r) the solutions 
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of z”(t)= f(t, z/(t)) with the initial condi- 
tions z‘(x)=y' and put hA‘(x, y/;h)=2z'(x +h) — 
z'(x), then we have ®'(x, y/;h)—A'(x, yi; h)= 
h? p(x, y+ O(h?**), where the ~'(x, y} are 
expressible in terms of the f‘(x, y’) and their 
(partial) derivatives. Various Runge-Kutta 
methods have been devised by searching for 
values of the «, and £,, that make p as large 

as possible for a given p (p is called the order 
of the method). When searching for these 
values of «, and f,,, we usually impose at least 
one of the following conditions: (1) «, and B,, 
are simple; (2) the truncation error is small; (3) 
the region of absolute stability (— Section G) 
is large; (4) a, and f,, give smaller roundoff 
errors; (5) only a small computer memory is 
required. The ordered pairs (P, p), where p is 
the highest order that can be attained by the 
(P + 1)-stage method, are (P <3, P+ 1), (4, 4), 
(5, 5), (6, 6), (7, 6), (8, 7), (9-10, 8), (p+ 2<(P + 
1) <(1/2) (p? —2p +4), p> 9). The following 
formulas are frequently used since they are 
accurate and have simple parameters: for P = 
1 and p=2, the formulas with a) =0, «, =1, 
Bio =1/2 (modified Euler method), with a, = 
xı = 1/2, fio = 1 (improved Euler method) and 
with xo = 1/4, «, =3/4, Bio =2/3 (Ralston’s 
second-order method); for P=2 and p=3, 

the formulas with a) =2/9, «, = 1/3, «,=4/9, 
Bio =1/2, Bro =3/4, B21 =3/4 (Ralston’s third- 
order method), and with xo = 1/4, x; =0, a, = 
3/4, Bro = 1/3, Bro = 2/3, B21 =2/3 (Heun’s 
third-order method). For P =3, p=4, the for- 
mula with xo =a; = 1/6, 7, =a, = 1/3, Big =f 
= B,,=1/2, B30 = 832 =1, B3,=0 is well known 
and is frequently referred to as “the fourth- 
order Runge-Kutta method” or “the Runge- 
Kutta method.” It has various desirable fea- 
tures [8]. Gill’s modification of the classical 
Runge-Kutta method (sometimes called the 
Runge-Kutta-Gill method) has some advantage 
in regard to roundoff errors and to the neces- 
sary computer memory size, while Ralston’s 
second-, third-, and fourth-order methods have 
minimum error bounds in Lotkin’s sense for 
the local truncation error [7]. There exist 
“substantially fifth-order” methods for P=4 
[9]. Today we frequently prefer to use higher- 
order methods, especially fifth-order methods, 
instead of lower-order methods, because the 
former require less computation time for solu- 
tions of given accuracy [10]. 

For the estimation of the local truncation 
error, the following two practical techniques 
are well known: (i) one-step—two-half-steps 
error estimate. If we let yi}, and y$}, denote 
the approximations that are computed with 
an mth-order method by taking two half-steps 
and one full step (=h,), respectively, then 
an estimate of the local truncation error per 


half-step associated with yi? is given by the 
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formula 
thar © Valh D" lyi — yi.4)/2(1 —2”, 


where we assume that y$, does not vary much 
over the interval [x„, x,., ]. (ii) The formula 
of embedding form. The following formula of 
embedding form is more effective than the 
traditional method above. By use of the pth- 
order method yi,,=yi+L?_5a,k! and the 
(p+ 1)th-order method yi*, =y +EP oat ki, 
we obtain an estimate of the local truncation 
error in yt,, as ti, x yi,,—yi*, [11-13]. At 
present, formulas of this type are known for p 
= 2-8, where the method for p=5 is said to be 
the most efficient. Recently, a similar method 
for the estimation of the global truncation 
error has been investigated [14]. In addition, 
multistep generalizations of explicit Runge- 
Kutta methods, which are known as pseudo— 
Runge-Kutta methods, have also been inves- 
tigated [15]. 


E. Multistep Methods 


A linear multistep method approximates given 
differential equations by difference equations 
p(E)y} =ho(E)f, where E is the operator of 
increasing n by 1, p(Q)=a,C* + apC! + 

s HACH A, OE) = Pek + By P+ + 
Bič + Bo, % #0, and |a | +|Bo| #0. (If p and 

o are of degree k in ¢, we speak of a linear k- 
step method.) By means of these difference 
equations we can determine yj, from yi.,_,, 
Viek-29 +++» Y. Since the y!,, are determined 
explicitly if £,=0 and implicitly if 8, #0, the 
difference equations are called explicit and 
implicit, respectively. In the implicit case, if |A| 
<|(a,/B,)L|, where L is a tLipschitz constant 
for f'(x, yò, then the y!,, can be calculated by 
successive substitutions. In order to obtain the 
y, successively by means of a k-step method 
with k >0, it is necessary, to begin with, to give 
the k— 1 sets of m values y{,..., yi_, (called the 
starting values) in addition to the initial values 
yy=n'. To determine the starting values, 
Runge-Kutta methods, overall approximation 
formulas, etc., are ordinarily used. The number 
of times we have to compute the f! to pro- 
ceed one step from yi to yi}; is only 1 for an 
explicit case, while for an implicit case it is 
equal to the number of iterations required to 
secure the convergence of the successive substi- 
tutions (ordinarily, the step size h as well as the 
Oth approximation for y}, are chosen so that 
the convergence is attained after a few itera- 
tions). We have lim,_.o,,,-.e, =0 (xe[a, b]) 

for any f‘ and y’ and any starting values such 
that lim, „oyi =n" (u=0,1,...,k—1), if and 
only if the polynomials p and ø satisfy the 
following two conditions: (i) consistency: p(1)= 
0, p’(1) = o(1); (ii) zero stability: Every root 
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of p(f)=0 lies on or inside the unit circle, and 
every root lying on the unit circle is simple. 
We call t, =(p(E)y'(x,) — ho(B)y"(x,))/a(1) the 
local truncation error. For a consistent and 
stable method determined by (p, ø) there exist 
a real constant C,,,, (#0) and an integer p 
(> 1) such that th=h?*'C,,, yl? (x,)/o(1) + 
O(h?*?): p is called the order of the method 
and C=C,,,,/o(1) the error constant. For a 
given p satisfying (it), ø is chosen to make the 
order as small as possible and then to make 
the error constant as small as possible. How- 
ever, p cannot exceed k + 1 if k is odd or k+2 
if k is even; moreover, p can be equal to k +2 
only if all the roots of p(£)=0 lie on the unit 
circle. 

The following are examples of multistep 
methods. 


(1) Explicit Methods. 
(a) Adams-Bashforth methods. 


k=1, p()=C-1, o(=1t, p=, 
C=1/2 (Euler method); 

k=2,  pQ=CC-1), o()=(3E—-1)/2, 
p=2, C=5/12; 

k=3,  p(Q=C(C—1), 
o(6)=(2307-166+5)/12,  p=3, 
C=3/8; 

etc. 

(b) Midpoint rule. 

k=2,  pQ=C?—-1, o(f)=26, 
p=2, C=1/6. 


(c) Milne’s predictor. 


k=4, p(Q=C4-1, 
a(t) =(8¢° — 4t? + 80)/3, 
p=4, C=7/90. 


(2) Implicit Methods. 
(a) Adams-Moulton methods. 


k=1, p®=¢-1, o(f=(C+1)/2, 

p=2, C=-—1/12 (trapezoidal rule); 
k=2, p(Q=C(C—-1), 

(0) = (SC? + 86 — 1)/12, 

p=3, C=—1/24 
etc. 


(b) Milne’s corrector (or the Milne-Simpson 
formula). 


k=2,  p(Q=C-1, o(M=(C +4041)/3, 
p=4, C= -—1/90. 
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When an implicit formula (p, g) is used to 
obtain y!,,, the Oth approximation yi*, for 
yi,, is usually determined by an explicit for- 
mula (p*,o*) of the same order as (p, a) (where 
(p*,o*) itself may not necessarily be stable). 
This kind of combination of implicit and ex- 
plicit methods is called a predictor-corrector 
method (or PC method), where the formula 
(p*,a*) is called the predictor and (p, a) the 
corrector. Typical combinations are the mid- 
point rule and the trapezoidal rule, Milne’s 
predictor and Milne’s corrector (this combina- 
tion is called Milne’s method), and an Adams- 
Bashforth method and an Adams-Moulton 
method of the same order. A predictor- 
corrector method has the advantage that the 
local truncation error (of the corrector) can be 
estimated in the course of calculation without 
extra computation. In fact, if the orders of the 
predictor (p*, a*) and the corrector (p, a) are 
equal to p and their error constants to C* and 
C, respectively, then the local truncation error 
ti can be estimated by ti = KD! + O(h?*?) (n= 
0,1,...), where Di=yi*, —y!,, is the differ- 
ence between the value yi*, at x, ,, obtained 
by the predictor and the y}, obtained by the 
corrector, and K = o*C/[(C —C*)o*(1)] [16]. 

The term multistep methods originates from 
the fact that these use the values of the depen- 
dent variables at more than two different mesh 
points in order to proceed one step. These are 
also called multivalue methods since they use 
more than one value of the dependent vari- 
able. The multivalue method is, however, 

a more general concept than the multistep 
method. 

Linear multistep methods are not only ex- 
amples of the PC method; they are also exam- 
ples of the variable-step variable-order algo- 
rithms (VSVO algorithms), where the order of 
the formula as well as the step size are auto- 
matically chosen according to the behavior of 
the solution. In practical VSVO algorithms, 
the Adams-Bashforth-Moulton family of PC 
pairs of order | to 13 are usually used, and 
for solving stiff systems (— Section G) those 
correctors with large regions of absolute sta- 
bility are used. In these algorithms we use the 
multivalue method, which saves the informa- 
tion at different steps in a form convenient 
for the change of order and of step size. The 
multivalue method also facilitates error esti- 
mation and stability, and results in an efficient 
use of memory and reduction of the computa- 
tional cost. For example, in Gear’s algorithm 
the information required for computing y,,, 
is saved in the following form: y, =(y,,A4Y;; -> 
(h*/k!)y), where y'(x) is the polynomial P, ,(x) 
interpolating fa, fy-1s---5fn-ki1, Where y® = 
d'* P, ,(x)/dx'"|,-, hold. All the local trun- 
cation error estimators are based essentially 
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on Milne’s device. In the VSVO algorithms 
heuristics play an important role [2, 17, 18]. 


F. Extrapolation Methods 


Let us assume that the numerical solution with 
the step size h at some fixed point x has the 
form 


y(x, h)=y(x)+ 5 qh"! + O(n YD), 
i=1 


where r is a positive integer. Then we can 
approximate y(x, h) by a function R,,(x, h) with 
(m+ 1) unknowns determined by the require- 
ment that R,,(x, hj) = y(x, hj), j=0, 1,2, ...,m, 
h,>h,; (i<j) in order to approximate y(x) by 
R,, (x, 0). If R,,(x, h) is a polynomial of degree 
mr in h, we call the foregoing method a poly- 
nomial extrapolation method, where R,,(x, h) 

= y(x)+ o0(h""*) as h tends to 0. Defining 

Ri = y(x, h) and R,,(x,0)= Ri, for j=i,i+1, 
...,4+m, we can calculate the Ra by the re- 
cursion relation Ri, = Rit! +(Rift, — Ri -1)/ 
((Ai/hj+m) — 1), m= 1. It is well known that, for 
the following method (Gragg’s method), there 
exists an asymptotic expansion with r=2 [19]: 
X,= nh, n(0, h)= yO), n(%1.h)= Yo + hf(vo. 9), 
n(Xn+1 A =N (Xn -1 A) + 2Af (n(x, h), X,), n= 1, 2, 
...sN—1, where xy = x, y(x, h)=1/2[4(xy_1,4) 
+ (xy, h)+ hf (y(xy, h), xy) ]. If R,,(x, h) is a 
rational function 


R,,.(X, hj) =(Po + Pi h+ pyh? + tpn") 
Kaot+ayh+qzh? +...+q,h’), 


where u= [m/2], v=m—p=[(m-+ 1)/2], j=i, 
it+1,...,i+m, we call this method a rational 
extrapolation method. For r= 2, the following 
formulas were derived by Bulirsch and Stoer 
[20]: 


R= Rit, + (Ra — Rin- ((Ai/ his md)” 


x [1 —(RĘ2 — Ri, DAR —Ri)] =1), 


m>1,R_,=0, Ri=y(t,h). 


G. Stability 


Consider the application of the general k-step 
method at the nth point (which is consistent 
and 0-stable) 


k k 
È OYn =h F, Bilari 
j=0 j=0 
to y= f(x, y), y(xo)= yo. Let §,, be the numer- 
ical solution at x=x,. Let &,= y(x,)— J, be the 


global error, and let @, be the total error at the 
nth application. Then we find 


k k of z 
X Xn =h > ay mt Cnr) Bien+j + Pnoks 
j=0 j=0 0Y 
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where ¢,,4; lies in the open interval whose 
endpoints are y„+; and y(x,,,). If we make two 
assumptions, 0f/dy =A (const), g, = ¢ (const), 
the above equation reduces to Df-9(a;— 
hAf)é,+;=@, whose general solution is given 


by 


where the d, are arbitrary constants and the r, 
are the roots, assumed distinct, of the poly- 
nomial equation 


(a,—hAB)r’ =0. 
j=0 


J 


We call the linear k-step method absolutely 
stable for a given ha if |r| <1, s=1,2,...,k. 
On the other hand, we call the linear k-step 
method relatively stable for a given hå if |r| < 
iri (or |r| <e, s=2,3,...,k, where r, is the 
root corresponding to the theoretical solu- 
tion. We call the region S,={hA||r,|<1, s= 
1,2,...,k} (or S,= {hå | |r| <e* (or [ry |), s= 
2,3,...,k}) in the complex plane the region 
of absolute stability (or the region of relative 
stability) of the linear k-step method. Also we 
call $, R (or S.N R) the interval of absolute 
stability (or the interval of relative stability) of 
the linear multistep method, where R is the 
real line. The explicit methods and the PC 
methods have a finite interval of absolute 
stability. The implicit methods usually have 
larger intervals of absolute stability than the 
corresponding explicit methods. The higher- 
order PC methods have smaller intervals, 
while Runge-Kutta methods do not. The 
PECE methods usually have larger intervals 
of stability than the corresponding PEC or 
P(EC)* methods, where P indicates an appli- 
cation of predictor, C an application of cor- 
rector, and E an evaluation of f [21, 22]. 

Let us consider the linear system y'= Ay + 
g(x), where A is an m x m constant matrix 
and y(x), y(x), ø(x)e R". If A possesses m dis- 
tinct eigenvalues 2, = u, + iv, t=1,2,...,m, the 
theoretical solution of this system is given by 


3 


l 


10) = K,exp((y, + iv,)x)C, + w(x), 

where K, and C,, =1,2,...,m, are, respec- 
tively, arbitrary constants and the eigenvectors 
corresponding to å, and y(x) is a particu- 

lar solution. We call the linear system stiff 

if (1) 44 <0, t=1,2,...,m, (i) s >> 1, where s = 
MAX, <r<mltul/ MIN; <r<mlkHl- We call the non- 
linear system y'= f(x, y), y(x), y"(x), f(x, yje R” 
stiff in an interval Z of x if, for every xe I, the 
eigenvalues /,(x) of the ‘Jacobian matrix of f 
satisfy (i) and (ii). The ratio s is called the stiff- 
ness ratio. To solve stiff systems effectively 

a variety of methods with infinite stability 


1138 


regions have been proposed. Some of them 

are: 

(1) A-stability (Dahlquist): S,2 {hé| Re(hd) <0} 

(2) Stiff-stability (Gear): S, >S,US,, Sı = 

{hd|Re(ha)< —a <0}, S,={ha| —a<Re(hd)< 

b, —c<Im(hA)<c,b20,c>0} 

(3) A(a)-stability (Widlund): $, 2 {hå | —a<a— 

arg(ha) <x, «€(0, 2/2)} 

(4) A,-stability (Cryer): S, > {hA| Re(hd)< 

0, Im(h2) = 0}. 

The order p and the step number k of linear 

multistep methods are restricted by the follow- 

ing stability requirements: 

(1) A-stability: implicit, p <2; trapezoidal rule is 

the most accurate method. 

(i) Stiff-stability: implicit, p< k; backward 

differentiation formulas Lh 9 %G;Yn+; = fir, are 

stiffly stable for p=k=1,2,...,6, O-unstable 

for k=6. 

(111) A(a)-stability: implicit; there exist high- 

order A(0)-stable linear multistep methods. 

(A method is said to be A(0)-stable if it is A(«)- 

stable for some sufficiently small «€(0, 2/2).) 
For the Runge-Kutta (P, p) methods, we can 

write 


Baas = +h +(hd)?/2! +4... +(hd)?/p! 


P+1 


ag X nir] ent Ortis 
q=p+1 

where ĝf/ĉy = å (const) and y, are functions of 
the coefficients of the method in use. We call a 
region R={hA||1+hA+(hd)?/2!+...+(hd)?/p! 
+ ph Ya(hA)|<1} a region of absolute 
stability of the Runge-Kutta (P, p) method. 

The implicit Runge-Kutta methods have a 
larger region of absolute stability than the cor- 
responding explicit Runge-Kutta methods. 
Therefore the implicit Runge-Kutta methods 
are suitable for stiff equations, and the explicit 
Runge-Kutta methods are suitable for nonstiff 
or mildly stiff equations. 

Recently, Yamaguti and others have pointed 
out that the instabilities occurring in the nu- 
merical solution of ordinary differential equa- 
tions are closely connected to the phenomena 
of chaos studied by Li, Yorke, and others 
[23, 24]. 


H. Boundary Value Problems 


A boundary value problem is generally for- 
mulated as the problem of obtaining functions 
y'(x) that satisfy the differential equations 
y'(x)= fi(x, p(x), ..., y"(x)) and boundary 
conditions Bay Xn) es V (Xn) V Xa) ee 
Vn V Ond os V nE ik=, a, 
m), where the f' and B, are given functions. 

If the f' and B, are linear in the yf, we can 
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first calculate the solutions yi (x) of the differ- 
ential equations under an appropriate initial 
condition, as well as a set of m independent 
solutions yj(x) (I= 1, ...,m) of the homogene- 
ous differential equations (say, under the ini- 
tial conditions y;(a)=6; at some point x =a), 
and then substitute the expression for the 
desired solution y‘(x)=yi(x)+ D™, a, yi(x) in 
the boundary conditions to obtain simulta- 
neous linear equations for the unknowns «, 
(— 302 Numerical Solution of Linear Equa- 
tions). No universally powerful method is 
known for the case where the f' or the B, or 
both are nonlinear. Ordinarily, we resort to 
the trial-and-error method of solving the 
differential equations iteratively under differ- 
ent initial conditions until the solution fully 
satisfies the boundary conditions, or to ap- 
proximating the differential operators by 
suitable difference operators to obtain a set of 
(generally nonlinear) simultaneous equations 
approximating at the same time the differential 
equations and the boundary conditions. For 
related topics — 298 Numerical Computa- 
tion of Eigenvalues; 315 Ordinary Differen- 
tial Equations (Boundary Value Problems); 
[25-27]. 


I. Methods Other than Difference Methods 


Besides difference methods there are frequently 
used methods for finding in a given finite- 
dimensional function space a function that 
best satisfies the differential equations as well 
as the initial or boundary conditions. Denote 
the equations by L[y(x)]=0 and assume the 
conditions to be linear. We choose a function 
Yo(x) that satisfies the conditions and that is 
considered to approximate the exact solution, 
and also functions y,(x) (/=1,...,g) that are 
considered to represent the typical deviations 
of yo{x) from the exact solution and each of 
which satisfies the homogeneous conditions. 
We then determine the 2, in such a way that 
y(X) = Vo(x) + X7-., xı y(x) best satisfies the 
differential equations. Corresponding to differ- 
ent interpretations of the words “best satisfy” 
there are different methods. The collocation 
method determines the «, so as to nullify the 
values of L[y(x)] at some prescribed points 
x;; the method of least squares minimizes the 
integral of |L[ y(x)]|? over the considered 
region; the Galerkin method makes L[ y(x)] 
orthogonal to every y(x) (l= 1, ...,q); and the 
Ritz method considers the variational problem 
ôJ Ly(x)] =0 whose tEuler equation is L[ y(x)] 
=0 (— 46 Calculus of Variations F), if such a 
problem exists, and determines the a, so as to 
have J[ y(x)] take an extremum value with 
V(X) = Vo(X) + Xf- y(x). 
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A. General Remarks 


The numerical solution of partial differen- 

tial equations became practical in the 1950s 
with the advent of automatic digital compu- 
ters. Nowadays, by means of modern high- 
performance computers, the numerical solu- 
tion of partial differential equations is carried 
out extensively and often on a very large scale 
for problems in physics, engineering, and 
other fields of applied analysis, in order to 
obtain approximate solutions of rigorous 
equations or to simulate real phenomena by 
means of numerical experiments. Various 
numerical-solution schemes have been pro- 
posed, applied, and studied, and programming 
techniques required to implement numerical 
solutions are being invented regularly; some 
are of a general nature, and others are de- 
vised for particular problems. The variational 
method, which includes the Ritz-Galerkin 
method and the finite element method, and 


1140 


the difference method deserve special mention 
here because of their generality and scope of 
application. 


B. The Ritz-Galerkin Method 


Having been applied mainly to boundary 
value problems for elliptic partial differen- 

tial equations, the Ritz method is a classical 
method, which has been used since the old era 
of mechanical calculators, and is mathemati- 
cally nothing other than the fdirect method in 
the calculus of variations (— 46 Calculus of 
Variations F) applied to tvariational problems 
that are equivalent to the original boundary 
value problems. In order to exemplify the 
method, let Q be a bounded domain in R“ 
with ptecewise smooth boundary S, and con- 
sider the Dirichlet boundary value problem 
comprised of the Poisson equation 


Au=~—f (1) 
and the boundary condition 
ul ,=B. 2 


Here f and $ are given functions on Q and S, 
respectively. (In the following, given functions 
are assumed to be sufficiently smooth.) The 
boundary value problem (1), (2) is equivalent 
to the variational problem of minimizing the 
tfunctional 


1 
Jlu]=5a u) -u f) (3) 


within the set D(J) of admissible functions u 
subject to (2). In (3) the following notation is 
used: 


a(u,v)=(Vu, Vo); o= | Vu: Vvdx 
Q 
and 


(u, v)=(u, da= | uvdx, 


Q 


while 
D(J)= {ue L,(Q)|VueL,(Q), ul, =8}. 


This variational problem is further reduced to 
the condition: 


ue D(J), 
a(u, p)=(f, 9) 


where V stands for the tSobolev space Hg (Q) 
(— 168 Function Spaces B), that is, 


(VpeV), (4) 


V= Hi (Q)= {ue L,(Q)| Vue L(Q), u|, =0}. 


Note that V is equal to D(J) with 6=0. The 
condition (4) is often called a weak form of the 
boundary value preblem. In applying the Ritz 
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method to this problem, we introduce un- 
known parameters «,,%>,...,%, and set the 
approximate solution u, in the form 


u, = F(x; 0,,%,...,4,)€D(J) 


and minimize J [u„] as a function of the n vari- 
ables «,,%,,...,%,- A standard way to form 

u, 1s to choose first a be D(J) subject to the 
inhomogeneous boundary condition and ¢,, 
3, ---,Q,€ V subject to homogeneous bound- 
ary condition, and then to set 


Un =D+ 0, P1 +0, Q. +... +4, Py: (5) 


The functions ø; (j= 1,2,...,n) are called basis 
functions or coordinate functions in the Ritz 
method. Denoting by D, the set of all possible 
functions appearing on the right-hand side of 
(5) and by V, the linear space generated by 

Q1, P2, ---, Øn, we can state the condition to 
determine the approximate solution u,, as: 


u,€ D, 


alun, P)=(f 9) 


Thus the Ritz method is a kind of projective 
approximation method which approximates the 
weak equation (4) by its projection on a finite- 
dimensional space V,. The condition (6) is 
equivalent to the equations 


(j=1,2,...,7). (7) 


(Vee V,). (6) 


a(n, p;) = (f, p) 


Particularly when 8} =0 in (2), D, coincides 
with V,, and the equations (7) that deter- 
mine the coefficients «; are reduced to a linear 
equation 


Ka=y (8) 


for a= (a1, 0% ,...,%,), where the matrix K = 
(K,,) and the vector y='(y,, Y2, --.,y,,) are given 
respectively by K,,=a(@;, p;) and y,;=(f, p). 
Solving (8) numerically by the fGauss elimina- 
tion method or by titeration, one eventually 
obtains the approximate solution u,, by the 
Ritz method. The Ritz method is also appli- 
cable, for instance, to the eigenvalue problem 
Au = du, u|,=0, since this eigenvalue problem 
is equivalent to the variational problem of 
finding a stationary Rayleigh’s quotient R[u] = 
a(u, u)/(u, u) with V as the set of admissible 
functions. In the approximation to restrict the 
admissible functions to V,, the approximate 
eigenvalue 1 is determined through the ma- 
trix eigenvalue problem 


Ka=4Ma, (9) 


where the matrix K is as above and the matrix 
M =(M,,) is given by M; =(P;, p). Sometimes, 
optimization of J[u,] or R[u,] is carried out 
more directly, e.g., by the tgradient method, 
particularly when u, contains free parameters 
a; in some nonlinear way. 
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Many studies have been made of conver- 
gence and error estimation for the Ritz method 
in general (— e.g., [1-4]). For instance, it is 
known that, if B=0, fe L?(Q), and {¢,, @, 

.-,Q,,--.} Spans a linear subspace dense in 
V =H, (Q), then the approximate solution u, 
obtained through (6) (or through (12) below) 
converges to the rigorous solution in the H!- 
topology (and hence in the L?-topology). 
Nevertheless, success in using the method in 
practical applications can be gained only by 
a clever choice of trial functions. A typical 
but systematic example of such a choice is 
made by the finite element method (— Sec- 
tion C). 

We proceed to the Galerkin method [1]. 
Suppose that we are to solve the equation 


L[uj=Au+(b-V)u= —f, (10) 


which is the equation in (1) perturbed by a 
lower-order term (the convection term). Here b 
=(b,(x), ...,5,(«)) and (b: V)u= X; b;(x)(3u/3x;). 
For simplicity, the boundary condition (2) 

is assumed to be homogeneous (i.e., 6 =0). 
Although the boundary value problem (10), (2) 
cannot be reduced to a variational problem 
since (10) is not symmetric, it is equivalent to 
its weaker form 


((b-V)u, 0) =(F p) 


provided that ue V. Note that (11) is ob- 
tained from (L[u]+ f, p), =0 by transforming 
(L[u], ọ) through integration by parts. Ac- 
tually, this boundary value problem has a 
unique solution u for any fe L,(Q) if |div b| is 
sufficiently small. According to the Galerkin 
method as applied to the present problem, one 
replaces V by V, in (11) and determines the 
approximate solution u,=«,9,+...+4,9,€V, 
by the condition 


(b: Vun =F 9) 


which is equivalent to the equations 


(Sop) (j=1,2,...,n). 
(13) 


a(u, p)— (VgeV), (11) 


alun @)— (Veer), 02 


alun, P) —((b: V)un p;)= 


Sometimes it is convenient to project (L[u] + 
J, p)=0 onto the finite-dimensional space and 
to deal directly with (L[u,]+ fg) =0 (j= 
1,2,...,n). When the coefficients of the equa- 
tion and the given function f are periodic in 
space variables and when sine or cosine func- 
tions are chosen as basis functions, the Galer- 
kin method turns out to be the same as the 
so-called Fourier approximation method and 
can be efficiently implemented with the aid of 
a fast Fourier transform (FFT) to yield the 
required Fourier coefficients numerically [5]. 
The Galerkin method can be applied to 
the numerical solution of evolution equa- 
tions, namely, in order to approximate time- 
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dependent solutions. This is exemplified 
through the following initial-boundary value 
problem for the diffusion equation (heat 
equation): 

= = Au (t>0, xeQ) (14) 


with the homogeneous boundary condition 
u|,=0 (15) 
and the initial condition 

uly-p =a(x) = (x EQ). (16) 


We first set the required approximate solution 
u, =u,(t, x) in terms of the basis functions 


P13 Pas +++ Pn OF V, as 
Uy = 21 (tP, +2 2(t)P2 +... + nlt) Pn, (17) 


and then determine the coefficients «, (t), «,(¢), 
...,%,(t) from the conditions 


£4, 0)= — ality, 9) (20, VpeV,) (18) 


dt 

and 

(u,(0), p) = (a, p) 
Since (18) is equivalent to the equations 


d 


an p)= —a(uy, p;) (j=1, 2, ida A), : 


(Vo E Va). (19) 


the vector function a(t) ='(x; (£), z(t), ...,a,(t)) 
is obtained by solving the ordinary differential 
equation 


M oy = — Ka, (20) 

dt 
where the matrices M and K are those in (8) 
and (9). The initial value «(0) of « should be 
given in accordance with (19). Sometimes the 
procedure above is called a semidiscrete ap- 
proximation based on the Galerkin method. 
Obviously, this method can be applied to more 
general evolution equations, for instance, to 
the diffusion-convection equation 
Ou 
g L15 Au +ib: V)u, (21) 
which governs time-dependent states corre- 
sponding to stationary states subject to (10). 
As a concrete example, we can refer to the 
finite element approximation for evolution 
equations described in Section D. If the geom- 
etry of Q is simple enough, e.g., for 1-dimen- 
sional Q, one can adopt eigenfunctions of L[ ] 
as the basis functions; the resulting version of 
the semidiscrete Galerkin method is called a 
spectral method. 

In order to obtain u,(t) one has to carry out 
numerical integration for (20), discretizing 
the time variable. The Galerkin approxima- 


1142 


tion with discretization of the space and time 
variables is called a full discrete approximation. 
Finally, an advantage of the Ritz-Galerkin 
method is that if the boundary condition 
imposed on rigorous solutions is a natural 
boundary condition, then the admissible func- 
tions and particularly the basis functions need 
not satisfy it [4]. 


C. Finite Element Methods for Boundary 
Value Problems 


In recent years the method of numerical solu- 
tion of partial differential equations most 
extensively employed in structural mechanics 
and many other fields of engineering has been 
the finite element method. In its standard form, 
the finite element method can be regarded, 

at least mathematically, as a type of Ritz- 
Galerkin method that adopts as its basis func- 
tions piecewise polynomials of low degree 
and with narrow supports. Although the idea 
leading to the finite element method can be 
traced back to a paper by R. Courant in 1943 
(Bull. Amer. Math. Soc.), the method acquired 
its popularity in the late 1950s when it was 
rediscovered by engineers on the basis of me- 
chanical considerations [6]. Here we apply 
the method to the boundary value problem 
(1), (2), assuming that 8 =0 and Q a polygonal 
domain in R? [7-9]. First, Q=QUS is divided 
into small triangles of which the length of any 
side does not exceed h >0. Each triangle T 
appearing in this decomposition of Q is called 
a triangular element, or simply an element 
(following the terminology in structural me- 
chanics), and we denote the set of all triangular 
elements by 7,,, h being equal to max;,z, {the 
longest side of T}. The set of all nodal points, 
i.e., the vertices of the triangular elements, is 
denoted by N, and we put 


N,={PEN|PeQ}={P,,Py,...,P,}. (22) 


Then a standard choice of admissible functions 
is to adopt as V, in (6) the following V, cV= 
Hy (Q): 


V, = {v,€ C(Q) |v, is linear on each element 
and v,|,=0}, (23) 


namely, V, is the set of all elementwise linear 
continuous functions satisfying the boundary 
condition. Now the approximate solution 


u, € V, is determined by the condition 
alUn, P =(P) (VE Va). (24) 


A basis of V, is formed by pyramidal functions 
g; (j=1,2,...,n)e V, such that 


e(P)=1 and @(Q)=0 (QENN{P}) (25) 
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In order to obtain u,, one has to solve (8) 
numerically for «. Since the support of o; is 
confined to elements adjacent to P,, the matrix 
K turns out to be a sparse matrix of multi- 
diagonal type, and hence particular elimination 
procedures can be applied with high efficiency. 
In the finite element method, the matrix K is 
often called the stiffness matrix. The conver- 
gence of u, to u as h-0 is guaranteed if 7, 
(0<h< ho) satisfies a certain regularity con- 
dition. For instance, if any angle of a triangle 
T is not smaller than 6, >0, then u, converges 
in the H!(Q)-topology. Moreover, when Q is 
convex, it holds that 


llu— Uy lls < Chl Sl, 
lu— ulle, S Ch Sl, (26) 


[7,9]. Uniform convergence of u, to u and L,- 
estimation of the error have also been estab- 
lished (e.g., [7, chap. 3]). u, is subject to the 
maximum principle, provided that the trian- 
gulation Z, is of acute type. Here, Z, is said to 
be of acute type (or of strongly acute type) if 
any angle 0 of Te J, is in0<0<(n/2) (0< 
8<60, <(x/2). To acquire higher accuracy, 
more sophisticated admissible functions that 
are piecewise polynomials of higher degree are 
used. To take account of curved boundaries, 
several devices, including the isoparametric 
method, have been proposed. 

In dealing with biharmonic equations or 
other equations of higher order, one can con- 
veniently apply a modified version of the finite 
element method of mixed type or of noncon- 
forming type; in the latter the approximate 
solution is sought within a class of functions 
which are of less regularity at the interface 
of elements and hence do not belong to the 
domain of the original variational problem. i.e., 
which are not admissible in the classical sense 
[10, 11]. 


D. Finite Element Methods for Initial Value 
Problems 


The most basic type of finite element method 
applicable to evolution problems is described 
here, and concerns the initial boundary value 
problem (14)—(16), assuming that Q is the same 
as in Section B [8,9]. In the semidiscrete finite 
element approximation one uses the functions 
g; of the preceding section as the basis for the 
approximate solution, now denoted by u,, and 
then determines u, through (17)-(19) with V, 
replaced by V,. Then u, converges to u as h0, 
provided that the triangulation 7, is regular. 
Furthermore, under the same condition that 
makes (26) hold, the error is given by the fol- 
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lowing estimate [12]: 


lu— url, <Ch7llallp2/t  (t>0), (27) 


which reflects the smoothing property of the 
diffusion equation. In the terminology of the 
finite element method, the matrix M in (20) is 
called the mass matrix. In order to gain a fully 
discrete finite element approximation u, , for 
u, one discretizes the time variable with the 
tmesh length t and solves the difference analog 
of (20), i.e., either 


M(a(t+1)—a(t))/t = — Kalt) (28) 
or 
M(a(t+t)—a(t))/t = — Ka(t+7), (29) 


where t is restricted to t=kt (k=0, 1,2,...). 
The difference schemes (28) and (29) are of 
forward type and of backward type, respec- 
tively. From (29) follows ||a(kt)|| < ||a(0)||, and 
consequently Us aH ll, <Clla||,,. In the gener- 
ally accepted terminology, the approximation 
is stable if for each T>0 there exists a positive 
constant Cy depending on T such that 


lu wl <Czllall (<t=kt<T). (30) 


Thus the approximation through (29) is un- 
conditionally stable, while the one through 
(28) is stable only under certain conditions on 
the triangulation 7, and on the time mesh zt. 
For instance, the forward scheme (28) is stable 
if 7, is regular and satisfies the inverse 
assumption 

h 


su Maxs a +00, 
peers re 7, (longest side of T) 


and if the ratio t/h? is sufficiently small 
[13, 14]. 

Sometimes, the mass matrix M in (28) and 
(29) is modified by the following procedure, 
called mass lumping: For each P,, one joins 
alternately the center of gravity of the ele- 
ments with P, as one of their vertices and the 
middle point of the sides with P, as one of their 
endpoints, thus forming a closed broken line 
I}; surrounding P;. We denote by g; the char- 
acteristic function of the polygonal domain 
B, bounded by I;. When mass lumping is ap- 
plied, the matrix M is replaced by the matrix 
M =((@;, @)). Generally, mass lumping re- 
laxes the conditions for the stability and the 
maximum principle to hold, although it may 
lower the accuracy to some extent (H. Fujii, 
Proc. US-Japan Seminar, Univ. Tokyo Press, 
1973). Usually, schemes without mass lumping 
are called consistent mass schemes. 

In principle, there is no difficulty in applying 
the finite element method to the diffusion- 
convection equation (21). However, when ||b|| 
is large, the stability condition becomes strin- 
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gent. To meet these difficulties, M. Tabata, 

T. Ikeda [15], and others have devised some 
schemes that enjoy better stability, the maxi- 
mum principle, and even the law of conserva- 
tion of mass by improving mass lumping and 
introducing the idea of an upstream approxi- 
mation or an artificial viscosity for discretizing 
the convection term. Moreover, finite element 
methods are currently being applied to wave 
equations, the Navier-Stokes equations, and 
various other evolution equations. 


E. Difference Methods for Boundary Value 
Problems 


In difference methods for solving partial dif- 
ferential equations we reduce the original 
equation to its difference analog, replacing 
tdifferential quotients of the unknown function 
by the corresponding difference quotients. The 
difference analog, usually a system of algebraic 
equations, is then solved numerically, yielding 
the desired approximate solution. Since the 
last part of the method, i.e., the numerical 
solution of the difference analog, requires 
a large amount of computation, difference 
methods were not feasible until the develop- 
ment of automatic computers. 

In terms of the variable x, difference ap- 
proximations for the derivative df/dx are the 
forward difference 


(Def) (x) =(f(x + h)— f0))/h, 

the backward difference 

(Def (x) =S) — f(x — h)V/h, 

and the central difference (symmetric difference) 
(SES (x) = (f(x + h) — f(x —h)/2h. 


In order to exemplify the difference method 
applied to boundary value problems, we re- 
turn to equations (1) and (2), supposing that Q 
is a 2-dimensional square Q = {(x, y|O0<x<1, 
O<y<t}[1, 16-18]. Then we cover QUS by 
a square net (lattice, grid) with mesh length 
h=1/N, for N a large integer. The set of net 
points P,,,=(mh, nh) lying in Q is denoted by 
Q,, and S, is the set of net points on S. We 
want to have a net function u, on R,=Q,US, 
that approximates the solution u of (1) and 
(2), where u, is determined by the difference 
equation 


Apuntx, Y= = f(xy) (C y)€Q,), (31) 


and the boundary condition 


((x, ye S,). (32) 


Here A, is a difference operator, the 5-point 


u(x, y)= B(x, y) 
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difference analog of A, which is defined by 


A, p = DEDi gp + DÌ D; o 


=(p(x +h, y)+ e(x—Af, y) + (x,y +h) 
+ (x, y—h)—4 (x, y))/h?. 


In view of the maximum principle for net 
functions on R,, it is seen that (31) and (32) 
admit only u,=0 as a solution if f= 6 =0, 
which implies that the solution u, of (31) and 
(32) exists uniquely for all f and f. When we 
are concerned with the convergence of u, to 

u as h-0, we have to consider (31) and (32) 
for all small h>0. Such a family of difference 
equations is called a difference scheme. Actu- 
ally, u, obtained through (31) and (32) con- 
verges to u as h-0, and we can say that the 
difference scheme (31) and (32) is convergent. 
In general, the order of the error u— u, de- 
pends on the smoothness of u. For instance, if 
ue C*(Q), then |u—u,|<Ch? holds uniformly 
on R,. When the asymptotic behavior of the 
error is known to satisfy u — u, = wh? + o(h’) 
for some w= w(x, y), it is possible to attain a 
higher accuracy by eliminating the h?-order 
term if we compute u, for two different values 
of h and form an appropriate linear combi- 
nation of the u, thus obtained (Richardson’s 
extrapolation; — e.g., [17]). The convergence 
of difference schemes applied to boundary 
value problems is not affected if rectangular 
nets are used instead of square nets. Many 
techniques have been devised to treat curved 
boundaries. There have been attempts recently 
to map Q into a domain Q of simpler geom- 
etry, say, a rectangular one, and then discretize 
the arising partial differential equations with 
variable coefficients in Q (e.g., J. F. Thompson 
et al., J. Comput. Phys., 1982.) Moreover, 
when information regarding the location and 
nature of singularities of solutions is available, 
one can refine the net near these points. 

The difference analog of elliptic equations 
described above is a system of linear equa- 
tions with the values of the u, as its unknowns. 
Since the coefficient matrix of this system is of 
multidiagonal type, direct elimination methods 
can be efficiently applied to solve it. Partic- 
ularly, some elimination algorithms employ- 
ing vector computations have been invented 
[19]. Iteration methods of the Gauss-Seidel 
type can also be applied, where acceleration 
of convergence by SOR is effective (— 302 
Numerical Solution of Linear Equations C). 


F. Difference Methods for Initial Value 
Problems 


To demonstrate characteristics of the dif- 
ference method applied to evolution equa- 
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tions [17, 20, 21], we first consider the initial 
boundary value problem (14)~(16) for the dif- 
fusion equation supposing that Q=(0, 1), a 
1-dimensional interval. Then we cover Q = 
[0, co) x Q by a rectangular net t=nt (n= 
0,1,2,...) and x =jh (j=0,1,...) with mesh 
lengths At=t and Ax =h. The value of the 
approximate function u, , at the net point 
(nt, jh) is denoted by U?. Then the simplest 
difference scheme of forward type for (14) is 
written as 


Uf*'—Up _ Uf + Uf, —2U? 


T h? 


(33) 


Namely, we adopt the difference operator 
L. aC ]=D,~D"D* as the difference analog of 
(6/6t) —(67/6x?). Equation (33) is consistent in 
the sense that L, ,[u]=“the residual of u”—0 
(t,h-0) for any smooth solution u of (14). This 
is the case also with the difference scheme of 
backward type 
up = Ur Un + Un —2U;"! 
T = h? 


(34) 


To compute the approximate solution U” = 
{U?|0<j<N}, (33) or (34) must be combined 
with the approximate initial condition cor- 
responding to (16), say U? =a(jh), and with 
the boundary condition Uj} = Uj =0. Then one 
can proceed from U® to U!, from U! to U?, 
and so on. Actually, (33) is rewritten in the 
form 


UP! = AUP, +AUP +(1—21)Up, (32) 


with 2 =1/h?, which gives U”*! explicitly in 
terms of U”. Hence (33) is an explicit scheme. 
On the other hand, in order to acquire U"*! 
from U” according to (34), it is necessary to 
solve a system of linear equation with U"*! as 
its unknown. Hence (34) is an implicit scheme. 
Since the coefficient matrix of the system of 
linear equations to determine U"*’ according 
to (34) is tridiagonal, one can employ a partic- 
ularly efficient elimination algorithm. 

We introduce the maximum norm ||¢9,||,= 
max,z,|P(jh)| for net functions p, on R,=Q,,U 
S,={x=jh|O<j<N}. Then, as is obvious from 
(35), the approximate solution U"=u, ,(nt) 
obtained through (33) satisfies || U"||,<||U°|, 
(n=0, 1,...), provided that 


ok | 
O<4= TSS (36) 


Generally, a difference scheme approximating 
an initial value problem is said to be stable if 
the approximate solution u, , for the initial 


value a, = u, ,(0) satisfies, for small t and A, 
e(Olla<Mrlldyl, (O<t=nt<T) (37) 


for any T>0, M, being a constant depending 
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on T. Lax’s equivalence theorem asserts that 

if the original initial value problem is twell- 
posed and if the approximating difference 
scheme is consistent, then the stability of the 
difference scheme is necessary and sufficient 
for the convergence of the approximate solu- 
tion u,,, to the rigorous solution u as t, h0. 
For instance the explicit difference scheme (33) 
for the 1-dimensional diffusion equation is 
convergent under the mesh condition (36). On 
the other hand, the implicit scheme (34) is 
unconditionally stable and hence is conver- 
gent. Many difference schemes more sophisti- 
cated than (33) and (34), of higher accuracy or 
with other favorable properties, have been 
proposed and studied. All these difference 
schemes can be generalized to the case of 
many-dimensional Q. For instance, if Qc R?, 
then by means of the difference operator A, in 
(31), the forward scheme and the backward 
scheme are given by D,U"=A,U" and D,U"= 
A, U"*!, respectively. The former is stable if 
0<A=t/h? <1/4, while the latter is uncon- 
ditionally stable. Moreover, in a method called 
the ADI method (alternating direction implicit 
method) one introduces U"*!? on fractional 
steps t=nt+ 1/2 and discretizes (14) as 


y” +12 — U" 


= D} DŁ U" + DEDIU"*2, 
1/2 are ae 


unt peti 
= DEDEU"*! 4 DEDEUT*1?_ (38) 
t/2 ‘a me 
Then, starting from U°, the computation goes 
as U°>U!? 4U! 3... +U">U"*!? |. by 
solving at each step alternatively a difference 
equation implicit with respect to x and y, for 
which a particular elimination method can be 
used with good efficiency. Furthermore, the 
ADI method is unconditionally stable. Various 
types of fractional-step method that generalize 
the ADI method have been proposed [12, 17]. 

We now proceed to hyperbolic equations. 
First consider the Cauchy problem where the 
spatial domain is the whole R!. Noting that 
the wave equation v,,=v,.,., for instance, can be 
rewritten as 


_{9 1\ĉu 
TU O/éx 


by putting u="(v,, v,), we here deal with a 
hyperbolic system of the form 


Ll S 


ĝu Ou 
— = A— (39) 


where the unknown u is an m-vector u = 
'(u,,U>,...,U,,), and A is a constant m x m 
matrix that is supposed to be real and sym- 
metric. For hyperbolic equations, explicit 
schemes are generally preferred, and as a typ- 
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ical one we mention the Friedrichs scheme. 
In the Friedrichs scheme applied to (39), the 


approximate solution U="(U,, U,,..., Um) is 
determined by 
U(t+t,x)—3(U(t, x +h) + U(t, x —h)) 
T 
= At U(t, x). (40) 


Here t is restricted to t=nt (n=0, 1,...), while 
x is regarded as a variable ranging over R!. 
Introducing translation operators T, and T_, 
by 


T,:Q(x)>@(x+h) and T_,:e(x)>¢(x—h), 


we obtain from (40) 
U(t+t,-)=S,U(t,), (41) 


where, with the mesh ratio 1=7/h, S, is given 
by 


s-(ta44)ne(1_ta\r 42 
n=l ats ht > 7 -h (42) 


Generally, if a difference scheme is reduced to 
the form of (41), then the operator S,, which 
yields evolution of the solution of the dif- 
ference equation for one time step, is called 
the amplification operator of the scheme. The 
stability of the difference scheme is implied 
by the uniform boundedness of the operator 
norm ||S;|| of S" acting in the Hilbert space 
(L,(R'))” for O0<n< T/t. Consequently, the 
scheme is stable if ||S,|| < 1+ Crt, C being a 
constant. The tsymbol §,(¢) of S, with respect 
to the Fourier transform is called the amplifi- 
cation matrix of the scheme. The L,-stability 
of the scheme mentioned above is equivalent 
to the uniform boundedness of the matrix 
norm ||$?()|| for O0<nt<T and ëeR!. There- 
fore, denoting by r,(&) the spectral radius of 
§,(¢), one can give a necessary condition for 
the stability by |r,(€)|<1+ Ct, which is called 
the von Neumann condition. Concerning gen- ` 
eral critera for the uniform boundedness of 
powers of §,(£) when §,(£) is not necessarily 
normal, a fundamental theorem was given by 
H. O. Kreiss in 1962 [21]. 

If the largest-modulus eigenvalue of A is 
denoted by uo, then the Friedrichs scheme is 
stable and hence is convergent under the con- 
dition on the mesh ratio 2=t/h given by 


A<1/|tol- (43) 


In particular, (43) implies that for the stable 
Friedrichs scheme 


propagation speed of the difference scheme 


> propagation speed of the original equation. 
(44) 


Condition (44) is a necessary condition for 
general difference schemes approximating 
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hyperbolic equations to be stable, and is 
known as the CFL condition (Courant- 
Friedrichs-Lewy condition) [22]. For a scheme 
more accurate than the Friedrichs scheme we 
refer to the Lax-Wendroff scheme, which pro- 
ceeds by way of the following amplification 
operator $: 


2 
T 
Sp=1+tAdr +> ADD; 


2 


À A 
=1+5A( +T) + Ty) 


The Lax-Wendroff scheme is stable under (43). 
There are many other difference schemes 
to approximate (39), most of which are appli- 
cable to higher-dimensional spaces. Some of 
these schemes can be conveniently employed 
to solve nonlinear hyperbolic equations, for 
instance, those arising in the gas dynamics of 
compressible fluids, for which the Friedrichs 
scheme and the Lax-Wendroff scheme were 
originally intended [23]. Concerning difference 
schemes with variable coefficients which ap- 
proximate a tregularly hyperbolic şystem with 
an x-dependent principal part, criteria for the 
stability of the scheme in terms of the symbol 
S,(x, é) of S, have been obtained, making use of 
the theory of tpseudodifferential operators, by 
P. D. Lax and L. Nirenberg (Comm. Pure Appl. 
Math., 1966), M. Yamaguti and T. Nogi (Publ. 
Res. Inst. Math. Sci., 1967), R. Vaillancourt 
(Math. Comput., 1970), Z. Koshiba (J. Math. 
Anal. Appl., 1981), and others. 
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